[1] |
中国互联网络信息中心第41次《中国互联网络发展状况统计报告》[R]. 2018.
|
|
China Internet Network Information Center The 41th statistical report on Internet development in China[R]. 2018.
|
[2] |
DHILLON I S , MODHA D S . Concept decompositions for large sparse text data using clustering[C]// Machine Learning. 2001: 143-175.
|
[3] |
KUMMAMURU K , DHAWALE A , KRISHNAPURAM R . Fuzzy co-clustering of documents and keywords[C]// The IEEE International Conference on Fuzzy Systems. 2003: 772-777.
|
[4] |
ZHAO Y , KARYPIS G . Soft clustering criterion functions for partitional document clustering:a summary of results[C]// Thirteenth ACM International Conference on Information & Knowledge Management. 2004: 246-247.
|
[5] |
MAKKONEN J , AHONENMYKA H , SALMENKIVI M . Topic detection and tracking with spatio-temporal evidence[C]// European Conference on Ir Research. 2003: 251-265.
|
[6] |
WU C , WANG B . Extracting topics based on Word2Vec and improved jaccard similarity coefficient[C]// IEEE Second International Conference on Data Science in Cyberspace. 2017: 389-397.
|
[7] |
HOFMANN T , . Probabilistic latent semantic indexing[C]// International ACM SIGIR Conference on Research and Development in Information Retrieval. 1999: 50-57.
|
[8] |
BLEI D M , NG A Y , JORDAN M I . Latent dirichlet allocation[J]. J Machine Learning Research Archive, 2003,3: 993-1022.
|
[9] |
STEYVERS M , GRIFFITHS T . Probabilistic topic models[J]. Handbook of Latent Semantic Analysis, 2007,427(7): 424-440.
|
[10] |
BLEI D , CARIN L , DUNSON D . Probabilistic topic models[C]// ACM SIGKDD International Conference Tutorials. 2011:1.
|
[11] |
BERNHARD S , JOHN P , THOMAS H . A collapsed variational bayesian inference algorithm for latent dirichlet allocation[C]// The Twentieth Conference on Neural Information Processing Systems. 2006: 1353-1360.
|
[12] |
GRIFFITHS T L , STEYVERS M . Finding scientific topics[J]. National Academy of Sciences of the United States of America, 2004: 5228-5235.
|
[13] |
RAMAGE D , . Characterizing microblogs with topic models[C]// International AAAI Conference on Weblogs and Social Media. 2010: 130-137.
|
[14] |
CHEN Z , LIU B . Mining topics in documents:standing on the shoulders of big data[C]// ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2014: 1116-1125.
|
[15] |
LIN T , TIAN W , MEI Q ,et al. The dual-sparse topic model:mining focused topics and focused terms in short text[C]// International Conference on World Wide Web. 2014: 539-550.
|
[16] |
ZHAI K , BOYD G J , ASADI N ,et al. MrLDA:a flexible large scale topic modeling package using variational inference in MapReduce[C]// International Conference on World Wide Web. 2012: 879-888.
|
[17] |
ARONSSON F . Large scale cluster analysis with Hadoop and Mahout[J]. Technology & Engineering, 2015.
|
[18] |
MENG X R , BRADLEY J , BURAK Y ,et al. MLlib:machine learning in apache spark[J]. Journal of Machine Learning Research, 2015,17(1): 1235-1241.
|