Research Practice and Progress of Models and Algorithms Applied in Topic Identification and Prediction Based on the Analysis of CNKI
Abstract
:1. Introduction
2. Literature Review
2.1. Research on Topic Identification
2.2. Research on Topic Prediction
3. Research Design
3.1. Research Questions
3.2. Research Method and Process
3.2.1. Research Method
- Systematic literature review (SLR). This is a specific type of literature review that is characterized as methodical, comprehensive, transparent, and replicable. It consists of raising research questions on a specific topic, searching and obtaining all relevant literature in a comprehensive way, and conducting a systematic assessment and integration of analysis to address the research question. This method contains five steps: defining research scope, developing selection criteria, planning the search, collecting and screening the literature, and presenting results [31];
- Bibliometric analysis. This is a quantitative method used to explore and describe previous studies, and is helpful for evaluating academic studies in a specific field [32,33]. It offers a systematic, transparent, and reproducible review process, and thus enhances the reliability and quality of the review [34]. In this study, VOSviewer software has been used as a tool to perform the co-occurrence analysis, and then to realize the visualization of the intellectual structure;
- Classification method. This study uses the classification method to delineate the types of models and algorithms used in the literature for topic classification and prediction. The essence of classification is the grouping together of things or concepts that are similar in some way and, in doing so, comparing a given object with the object that the assessor believes best represents the category [35]. Here, we have used EXCEL to record the algorithms and models used in each article and the domain to which they belong, and to classify them accordingly.
3.2.2. Research Process
4. Overview of Models’ and Algorithms’ Applications in Topic Identification and Prediction
4.1. Chronological Distribution
4.2. Number of Types
4.3. Keywords Analysis
5. Specific Application of Topic Identification Models and Algorithms
5.1. Application of LDA Theme Model and Its Derivative Models
5.1.1. LDA Theme Model
5.1.2. The Derivative Models of LDA Theme Model
5.2. Application of Machine Learning and Deep Learning Models and Algorithms
5.2.1. Word2Vec
5.2.2. K-Means
5.2.3. TF-IDF Algorithm
5.2.4. BERT
5.3. Application of Citation Analysis and Its Associated Models and Algorithms
5.4. Application of Text Mining and Its Associated Models and Algorithms
5.4.1. Co-Word Analysis and Co-Occurrence Matrix
5.4.2. SAO Structure
6. Specific Application of Topic Prediction Models and Algorithms
6.1. Deep Learning or Machine Learning Analysis Models and Algorithms Based on Time Sequence
6.1.1. LSTM Model
6.1.2. The Markov Model
6.1.3. SVM Model
6.1.4. Exponential Smoothing Method
6.1.5. ARIMA Model
6.1.6. Other Models and Algorithms
6.2. Application of Link Prediction Model
7. Indicator System and Effectiveness Verification
7.1. Indicator System
7.2. Validation of Method Effectiveness
- Time series models for prediction based on quantitative indicators.
- Link prediction model based on complex networks
- Classification model of topic recognition based on deep learning.
8. Conclusions
8.1. Academic and Practical Contributions
8.2. Limitations and Suggestions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Appendix A
No. | Topic Identification Models and Algorithms | Topic Prediction Models and Algorithms | Domain | References | No. | Topic Identification Models and Algorithms | Topic Prediction Models and Algorithms | Domain | References |
---|---|---|---|---|---|---|---|---|---|
1 | LDA (latent Dirichlet allocation) model, keywords | TPP-LSTM (topic popularity prediction model based on long short-term memory) topic hot prediction model | library and information science | Huo et al. (2021) [9] | 49 | ETM (embedding theme model) | epidemic policy of China and US | Zhou and Wu (2021) [98] | |
2 | LDA model | ARIMA (autoregressive integrated moving average model) model | information construction | Yue et al., (2019) [3] | 50 | Relim algorithm | machine learning algorithm | animal genetics and breeding | Nie (2019) [99] |
3 | LDA model | LSTM (long short-term memory) model | privacy research | Zhu et al., (2020) [36] | 51 | LDA model | digital journalism | Chao et al., (2021) [38] | |
4 | LDA model | Link prediction | deep learning | Liu et al., (2019) [47] | 52 | BERT-LDA model | agricultural robot | Wang and Gao (2021) [100] | |
5 | LDA model. Word2vec (word to vector) | LSTM model | technical topic | Li (2019) [53] | 53 | Leiden community discovery algorithm | additive manufacturing | Li, Guo et al., (2021) [101] | |
6 | LDA model, TP-JIF (topic popularity computing model based on journal impact factor) model | TPP-LSTM (topic popularity computing model based on journal impact factor) topic hot prediction model | library and information science | Huo et al. (2021) [76] | 54 | Generative topological mapping, BERT-LSTM model | new energy vehicles | Xu and Gui (2021) [89] | |
7 | VOLDA (variable online-LDA) model | ESG (EEMD-SVM-GMDH), EEMD (ensemble empirical mode decomposition), SVM (support vector machine), GMDH (group method of data handing) | topic popularity | Pei (2018) [50] | 55 | Word2Vec | TRIZ (theory of the solution of inventive problems) | patent text | Liu (2017) [81] |
8 | keywords analysis | grey prediction model | information science | Xu et al., (2016) [102] | 56 | LDA model | network public opinion | Cui (2010) [88] | |
9 | Kelim algorithm | time series integration method | animal genetics and breeding | Nie et al., [80] | 57 | SAO | hematopoietic stem cells | Ma et al., (2021) [74] | |
10 | Co-LDA (word co-occurrence analysis combined with LDA) model | link prediction | diabetes drugs | Wu et al., (2021) [6] | 58 | link prediction of weighted network, neural network | application of perovskite materials | Huang et al., (2019) [83] | |
11 | LDA model | time series analysis, SVM model | graphene field | Li et al., [37] | 59 | LDA model | patent | Chen (2015) [39] | |
12 | HDP (Hierarchical Dirichlet Processes) | Naive Bayes, SVM, XGBoost (extreme gradient boosting) classification algorithm | massive news data source | Zheng (2019) [4] | 60 | LDA-kmeans | EMD (empirical mode decomposition)-LSTM technology prediction model | robot technology | Xu and Gui (2020) [89] |
13 | causal model | panel data | Du et al., (2016) [103] | 61 | conditional random field, domain vocabulary | link prediction technology, opportunity identification | gold nanoparticles | Ma (2016) [82] | |
14 | citation curve | stem cells | Cao et al., (2020) [85] | 62 | LDA model | Hidden Markov Model | marine diesel engine | Chen et al., (2018) [43] | |
15 | PLDA (probabilistic LDA) model | SVM, improved particle swarm optimization algorithm | graphene | Xu and Wang (2019) [28] | 63 | analytical hierarchy process | industrial robot | Cao et al., (2022) [104] | |
16 | LDA model | Markov chain, HDP | NIPS papers | Mao (2016) [44] | 64 | text classification text clustering | 3D printing | Zhao (2017) [105] | |
17 | LDA model | curve fitting prediction method | fuel cell | Bai et al. (2020) [106] | 65 | SAO, similarity calculation, clustering analysis | artificial intelligence | Fan (2020) [72] | |
18 | LDA model | HMM (hidden markov model) | 3D printing | Lin (2020) [77] | 66 | patent citation, Bass model | solar photovoltaic power generation | Yu et al., (2021) [63] | |
19 | LDA model, controlled vocabulary | AR (auto regressive) model | NLP | Xue (2013) [42] | 67 | fuzzy consistent matrix | smart and traditional cellphone, quantum and optical fiber communication, additive manufacturing | Li et al., (2021) [87] | |
20 | Co-word analysis, community discovery algorithm, | Emergence scores (EScores) indicator | synthetic biology | Meng (2018) [67] | 68 | patent citation, questionnaire | speech recognition | Wang et al., (2022) [64] | |
21 | IOLDA (improved online LDA) model | ESA (EEMD-SVR-Adaboost) EEMD (ensemble empirical mode decomposition), SVR (support vector regression) model | scientific and technological information | Luo (2018) [51] | 69 | citation, genetic backward forward path, technical discontinuity theory | surgical robot | Wu et al., (2022) [65] | |
22 | LDA model | Emerging index, amount index | artificial intelligence | Liu et al., (2018) [107] | 70 | LDA model, mean value, linear regression fitting | ARIMA model, word2vec | stem cell field | Yue et al., (2020) [3] |
23 | LDA model | structure hole, Delphi method | chip | Xuan (2020) [108] | 71 | subject words relation recognition, BERT model, TRIZ theory | data mining | Tan (2019) [60] | |
24 | Co-word matrix | Blondel partition algorithm, node coincidence degree | information science | Kui et al. (2016) [66] | 72 | SAO, catastrophe theory | stem cells | Ma et al., (2022) [75] | |
25 | LDA model | Markov, Hidden Markov Models | cloud computing | Tian (2021) [1] | 73 | DSE (discovery–select–evaluation) identification ideas | Deng et al., (2022) [109] | ||
26 | Co-occurrence matrix | clinical medicine | Chen et al. (2017) [110] | 74 | TF-IDF, K-means++ | unmanned combat platform in ship field | Ren et al., (2022) [58] | ||
27 | LDA model, K-means | exponential smoothing method | artificial intelligence | Song and Zhu (2021) [30] | 75 | LDA model, Word2vec | blockchain | Chen et al., (2022) [39] | |
28 | LDA model | BP (back propagation) neural network, SVR machine learning algorithm | emerging topics | Ye et al., (2022) [111] | 76 | LDA model, emotion analysis method, entropy method, CRITIC (criteria importance though intercriteria correlation) method | intelligent connected vehicle | Tang and Qiu (2021) [112] | |
29 | topic prediction using basic research results | Wu et al. (2022) [113] | 77 | Chunk-LDAvis toolkit | nano agriculture | Liu et al., (2019) [46] | |||
30 | PLDA, product technical attribute words-technical feature vocabulary | comment topic identification, multi-dimensional analysis of technical attributes | smart phones | Wu et al. (2021) [114] | 78 | CSToT (content similarity–topics over time) | domestic information science research | He et al., (2018) [115] | |
31 | LDA model | topic model, Sen’s slope estimation method, Mann Kendall, exponential smoothing method | smart library | Song and Ran (2022) [78] | 79 | LDA model, Rao Stirling index | nanotechnology | Han et al., (2018) [116] | |
32 | network topology evolution model, link prediction | Liu (2016) [117] | 80 | hLDA (hierarchical Latent dirichlet allocation) | library science and information science | Wang (2014) [49] | |||
33 | Knowledge–technology–environment three-dimensional framework | perovskite solar battery | Xie (2019) [118] | 81 | Word2Vec, Fast Unfolding algorithm, Page Rank algorithm | governor mailbox data | Teng et al., (2022) [55] | ||
34 | LDA model, Word2vec | topic intensity and content evolution analysis, expert evaluation | artificial intelligence | Yang et al. (2022) [54] | 82 | PathSelClus (integrating meta-path selection with user-guided object clustering) algorithm | genetically engineered vaccine | Xu et al., (2019) [89] | |
35 | co-occurrence matrix | time series evolution | volatile organic compounds | Chen et al. (2020) [68] | 83 | LDA model | artificial intelligence | Li et al., (2022) [48] | |
36 | SAO (subject– action–object) | technology evolution model of patent feature mining | slow and controlled release fertilizer | Li (2020) [73] | 84 | CO-LDA model | online medical review | Gao et al., (2019) [27] | |
37 | patent classification code co-occurrence analysis | multidimensional technology correlation trend evolution model | medicine | Wang et al. (2021) [69] | 85 | scientific metrological characteristics of IDR theme | artificial intelligence | Dong et al., (2022) [86] | |
38 | multidimensional scale analysis, K-means (K-means clustering algorithm) | artificial intelligence | Gao et al. (2020) [56] | 86 | LDA model, science and technology citation and text relevance | Bruton’s tyrosine kinase inhibitor | Li (2020) [62] | ||
39 | Multi-level comprehensive index evaluation system | solid oxide fuel battery | Hou and Zhu (2014) [119] | 87 | Vos (VOSviewer) clustering, strategic coordinate analysis | ARIMA model, exponential smoothing method | underwater information perception technology | Cui et al., (2022) [88] | |
40 | polynomial regression model | home appliance industry | Wu et al. (2022) [120] | 88 | AP (affinity propagation) nearest neighbor propagation clustering algorithm | time series clustering | innovation management | Li and Wu (2019) [79] | |
41 | TF-IDF (term frequency–inverse document frequency) algorithm, K-means | keywords patent map prediction method | industrial wastewater treatment | Huang (2019) [57] | 89 | OVL (overlap function) superposition algorithm, linear weighting method, K-means algorithm | industrial robot | Tian and Zhang (2021) [121] | |
42 | topic identification on knowledge flow perspectives | quantitative technology prediction frame | harmonic reducer | Liao (2017) [122] | 90 | citation content analysis | carbon nanotube fiber | Zhu and Leng (2014) [61] | |
43 | SAO | SAO, morphological analysis prediction model | dye-sensitized solar battery | Guo (2016) [71] | 91 | K-means, LDA model, GSDMM (Gibbs sampling algorithm for the Dirichlet multinomial mixture model) model | time series analysis | artificial intelligence | Zhang et al., (2022) [97] |
44 | BERT (bidirectional encoder representation from transformers)-LSTM identification model | EEMD-LSTM technology prediction model | lithium-ion battery for new energy vehicles | Gui (2021) [59] | 92 | LDA model, Rao Stirling index | solar photovoltaic | Han et al., (2021) [123] | |
45 | LDA2vec (LDA combined with word2vec), text similarity theory and calculation method | antidepressants | Zhang (2019) [52] | 93 | LDA model, co-word analysis | intelligent technology | He (2021) [70] | ||
46 | LDA-GS model | graphene industry | Wu et al. (2021) [124] | 94 | LDA model, K-means, co-word analysis | library and information science and pedagogy | Ruan and Xia (2018) [29] | ||
47 | technology prediction model based on knowledge evolution | molecular breeding | Li (2021) [125] | 95 | calculation method of frontier topic characteristic index | treatment and prognosis of cardiovascular disease | Fan et al., (2018) [126] | ||
48 | link prediction weighted co-occurrence matrix, technology life cycle theory | nucleic acid detection technology | Zhang et al. (2021) [84] | 96 | LDA model, patent–paper hybrid co-citation analysis, expert interview | intelligent security technology | Zhu (2020) [92] |
References
- Tian, Y.D. Research on the Evolution and Prediction of Knowledge Topic. Inf. Sci. 2021, 6, 123–133. [Google Scholar] [CrossRef]
- Yang, J.Q.; Wei, Y.H.; Huang, S.Z.; Luo, W.; Lu, W. Research Review on Emerging Topic Identification Based on Scientific Literatures. Inf. Sci. 2020, 8, 159–163, 177. [Google Scholar] [CrossRef]
- Yue, L.X.; Liu, Z.Q.; Hu, Z.Y. Evolution Analysis of Hot Topics with Trend-Prediction. Data Anal. Knowl. Discov. 2020, 6, 22–34. Available online: https://kns.cnki.net/kcms2/article/abstract?v=3uoqIhG8C44YLTlOAiTRKibYlV5Vjs7i8oRR1PAr7RxjuAJk4dHXov3eiaK-qy0V0MEwvWDcF540nJEGMnun7si7FR8zZ3ek&uniplatform=NZKPT (accessed on 10 June 2023).
- Zheng, X.Q. Research on System Construction Based on Emerging Topic Detection and Prediction Method; Xiamen University: Xiamen, China, 2019; pp. 43–54. [Google Scholar]
- Xu, X.G.; Gui, M.Z. Identifying Technology Innovation Opportunities Based on GTM Reverse Mapping. Inf. Stud. Theory Appl. 2021, 6, 146–153, 198. [Google Scholar] [CrossRef]
- Wu, S.N.; Tian, R.N.; Pu, H.J.; Liang, W.Q.; Zhang, Y.F.; Yu, Q.; He, P.F. Research on the Prediction Method of Related Topics in the Medical Field Based on Social Media. Data Anal. Knowl. Discov. 2021, 12, 98–109. Available online: https://kns.cnki.net/kcms2/article/abstract?v=3uoqIhG8C44YLTlOAiTRKibYlV5Vjs7iJTKGjg9uTdeTsOI_ra5_Xf80tvfaHzPNRCyn4BN7A8H7prX-1A3ec2RlY9xqy9u6&uniplatform=NZKPT (accessed on 10 June 2023).
- Wang, C.Y.; Koo, L.J.; Yuan, J.F. A study on the classification of universities based on discipline characteristics: An example of “double first-class” universities. China High. Educ. Res. 2022, 351, 38–44. [Google Scholar] [CrossRef]
- Khaire, U.M.; Dhanalakshmi, R. Stability of feature selection algorithm: A review. J. King Saud Univ. Comput. Inf. Sci. 2022, 34, 1060–1073. [Google Scholar] [CrossRef]
- Huo, C.G.; Dong, K.; Si, X.Y. Evolution Analysis and Prediction of Scientific Topic Popularity in the Field of LIS. Doc. Inf. Knowl. 2021, 2, 35–47, 57. [Google Scholar] [CrossRef]
- Wang, Y.P. Research Progress of Scientific and Technical Literature Topic Detection and Evolution Based on Topic Model in China. Libr. Inf. Serv. 2016, 3, 130–137. [Google Scholar] [CrossRef]
- Kai, Q.; Dou, Y.X. Survey of Disruptive Technology Identification. J. Intell. 2021, 11, 31–38. Available online: https://kns.cnki.net/kcms2/article/abstract?v=3uoqIhG8C44YLTlOAiTRKibYlV5Vjs7iy_Rpms2pqwbFRRUtoUImHdOR7NvLKjZ7LPyq3EM1bLXtZ4TSi4gYzC4yFBDYNafB&uniplatform=NZKPT (accessed on 10 June 2023).
- Zhang, J.W.; Dong, Y. Research Progress of Disruptive Technical Identification Indicators. Inf. Stud. Theory Appl. 2020, 6, 194–199. [Google Scholar] [CrossRef]
- Qiao, Y.L.; Huang, Y.; Zhang, S.; Wang, X.F. The Identification of Disruptive Technology from a Multi-dimensional Perspective: Research Progress and Future Prospects. J. Intell. 2022, 8, 45–52. Available online: https://kns.cnki.net/kcms2/article/abstract?v=3uoqIhG8C44YLTlOAiTRKibYlV5Vjs7iJTKGjg9uTdeTsOI_ra5_XYdCGNxWtU8QthnBzxOQtLhYpHolN8oHXTTQIxuntxNM&uniplatform=NZKPT (accessed on 10 June 2023).
- Park, J.; Lee, H.J.; Cho, S. Hot topic detection in central bankers’ speeches. Expert Syst. Appl. 2023, 230, 120563. [Google Scholar] [CrossRef]
- Savin, I.; Ott, I.; Konop, C. Tracing the evolution of service robotics: Insights from a topic modeling approach. Technol. Forecast. Soc. Chang. 2022, 174, 121280. [Google Scholar] [CrossRef]
- Li, L.P.; Zhao, X.B. Review on Topic Discovery Methods Based on Text Clustering. Inf. Res. 2020, 11, 121–127. Available online: https://kns.cnki.net/kcms2/article/abstract?v=3uoqIhG8C44YLTlOAiTRKibYlV5Vjs7i8oRR1PAr7RxjuAJk4dHXoirdakr6rX4kUmaZQhq2PnSRWn4lxXUdckURk3QC9M-d&uniplatform=NZKPT (accessed on 10 June 2023).
- Liu, X.L.; Tang, Z.Y. Progress on Methods of Emerging Technology Topics Identification. Libr. Inf. Serv. 2020, 11, 145–152. [Google Scholar] [CrossRef]
- Luo, P.; Luo, M.; Li, F.; Qi, X.; Huo, A.; Wang, Z.; He, B.; Takara, K.; Nover, D.; Wang, Y. Urban flood numerical simulation: Research, methods and future perspectives. Environ. Model. Softw. 2022, 156, 105478. [Google Scholar] [CrossRef]
- Shelton, R.C.; Philbin, M.M.; Ramanadhan, S. Qualitative research methods in chronic disease: Introduction and opportunities to promote health equity. Annu. Rev. Public Health 2022, 43, 37–57. [Google Scholar] [CrossRef]
- Xu, S.; Nowamooz, H.; Lai, J.; Liu, H. Mechanism, influencing factors and research methods for soil desiccation cracking: A review. Eur. J. Environ. Civ. Eng. 2023, 3, 1–25. [Google Scholar] [CrossRef]
- Zhang, X.; Fang, S.; Wang, C.H. Review on Technology Evolution Research from Patent Citation Perspective. Sci. Sci. Manag. Sci. Technol. 2016, 3, 58–67. Available online: https://kns.cnki.net/kcms2/article/abstract?v=3uoqIhG8C44YLTlOAiTRKibYlV5Vjs7ijP0rjQD-AVm8oHBO0FTadljcow_eXGgKMEvDsj9YhRltpkI3SFHHp3VmYIua-TbX&uniplatform=NZKPT (accessed on 10 June 2023).
- Zhou, Y.; Liu, H.L.; Liao, L.; Xue, L. Literature Review of Quantitative Technology Foresight Methods Based on Topic Modeling. Sci. Technol. Manag. Res. 2017, 11, 185–196. Available online: https://kns.cnki.net/kcms2/article/abstract?v=3uoqIhG8C44YLTlOAiTRKibYlV5Vjs7iAEhECQAQ9aTiC5BjCgn0RmB4NtpdPbcvLESyvJs_62iRWk01nAtHn56clFBDlm3M&uniplatform=NZKPT (accessed on 10 June 2023).
- Liu, Q.Y.; Wu, X.N. Review on Disruptive Technology Discovery Methods. Libr. Inf. Serv. 2017, 7, 127–136. [Google Scholar] [CrossRef]
- Wang, C.; Xu, H.Y.; Fang, S. Progress of Approaches for Identification and Forecasting of Disruptive Technologies. Sci. Technol. Prog. Policy 2018, 9, 152–160. Available online: https://kns.cnki.net/kcms2/article/abstract?v=3uoqIhG8C44YLTlOAiTRKibYlV5Vjs7i0-kJR0HYBJ80QN9L51zrPz0d_hXdda6BS1nZFoPQi2jI7Ig1FkXPfFMsz_DuKaae&uniplatform=NZKPT (accessed on 10 June 2023).
- Coelho, L.B.; Zhang, D.; Van Ingelgem, Y.; Steckelmacher, D.; Nowé, A.; Terryn, H. Reviewing machine learning of corrosion prediction in a data-oriented perspective. NPJ Mater. Degrad. 2022, 6, 8. [Google Scholar] [CrossRef]
- Hond, A.A.H.; Leeuwenberg, A.M.; Hooft, L.; Kant, I.M.J.; Nijman, S.W.J.; van Os, H.J.A.; Aardoom, J.J.; Debray, T.P.A.; Schuit, E.; van Smeden, M.; et al. Guidelines and quality criteria for artificial intelligence-based prediction models in healthcare: A scoping review. NPJ Digit. Med. 2022, 5, 2. [Google Scholar] [CrossRef]
- Gao, H.Y.; Liu, J.W.; Yang, S.X. Identifying Topics of Online Healthcare Reviews Based on Improved LDA. Trans. Beijing Inst. Technol. 2019, 4, 427–434. [Google Scholar] [CrossRef]
- Xu, L.L.; Wang, F. Scientific Frontier Prediction Model Based on Support Vector Machine and Improved Particle Swarm Optimization. Inf. Sci. 2019, 8, 22–28. [Google Scholar] [CrossRef]
- Ruan, G.C.; Xia, L. Research on Interdisciplinary Topics Identification. Inf. Sci. 2020, 12, 152–157. [Google Scholar] [CrossRef]
- Song, K.; Zhu, Y.J. Patent Frontier Technology Topic Identification and Trend Prediction. J. Intell. 2021, 1, 33–38. Available online: https://kns.cnki.net/kcms/detail/detail.aspx?dbcode=CJFD&dbname=CJFDLAST2021&filename=QBZZ202101005&uniplatform=NZKPT&v=fDjsTQ_pkUc34IT6lc0BsMuRPAfV82K8s9TjMuvX273LH7EwJeLtgNhpZ2jL4g7G (accessed on 10 June 2023).
- Siddaway, A.P.; Wood, A.M.; Hedges, L.V. How to do a systematic review: A best practice guide for conducting and reporting narrative reviews, meta-analyses, and meta-syntheses. Annu. Rev. Psychol. 2019, 70, 747–770. [Google Scholar] [CrossRef]
- Rey-Martí, A.; Ribeiro-Soriano, D.; Palacios-Marqués, D. A bibliometric analysis of social entrepreneurship. J. Bus. Res. 2016, 69, 1651–1655. [Google Scholar] [CrossRef]
- Small, H. Co-citation in the scientific literature: A new measure of the relationship between two documents. J. Am. Soc. Inf. Sci. 1973, 24, 265–269. [Google Scholar] [CrossRef]
- Bellis, N.D. Bibliometrics and Citation Analysis: From the Science Citation Index to Cyber-Metrics; Scarecrow Press: Lanham, ML, USA, 2009. [Google Scholar]
- Sarker, I.H. Deep Learning: A comprehensive overview on techniques, taxonomy, applications and research directions. SN Comput. Sci. 2021, 2, 420. [Google Scholar] [CrossRef] [PubMed]
- Zhu, G.; Liu, L.; Li, F.J. Research on Topic Relation and Prediction Based on LDA and LSTM. J. Mod. Inf. 2020, 8, 38–50. Available online: https://kns.cnki.net/kcms2/article/abstract?v=3uoqIhG8C44YLTlOAiTRKibYlV5Vjs7i8oRR1PAr7RxjuAJk4dHXoge9KT-AMPGtQmxVZzBNW4KlYBpdAL03RUjJN4cNQfFw&uniplatform=NZKPT (accessed on 10 June 2023).
- Li, J.; Xu, L.L.; Zhao, S.J. Prediction and Visualization of Emerging Topics of Fund Sponsored Projects Based on Time Series Analysis and SVM Model. Inf. Stud. Theory Appl. 2019, 1, 118–123, 152. [Google Scholar] [CrossRef]
- Chao, N.P.; Han, S.Q.; Wu, X.T. Topic Discovery and Evolution Analysis of Digital Journalism. J. Mass Commun. 2021, 9, 4–13. [Google Scholar] [CrossRef]
- Chen, W.; Lin, C.R.; Li, J.Q.; Yang, Z.L. Analysis of the Evolutionary Trend of Technical Topics in Patents Based on LDA and HMM. J. China Soc. Sci Technol. Inf. 2018, 7, 732–741. [Google Scholar]
- Yue, L.X.; Zhou, X.Y.; Chen, Y.N. Thematic Trend Prediction of Information Architecture Based on the ARIMA Model. Doc. Inf. Knowl. 2019, 5, 54–63, 72. [Google Scholar] [CrossRef]
- Cui, K. The Research and Implementation of Topic Evolution Based on LDA. Nati. Univ. Def. Technol. 2010, 5, 18–39. Available online: https://kns.cnki.net/kcms2/article/abstract?v=3uoqIhG8C475KOm_zrgu4lQARvep2SAkhskYGsHyiXlyV6jw0YcPLAKe5A8Jxr64DzLnGUNqeloGZjehXwbAzHL9PlAzVpfL&uniplatform=NZKPT (accessed on 10 June 2023).
- Xue, Y.B. Topic Discovery and Trend Forecasting in the Science and Technology Literature. Harbin Inst. Technol. Univ. 2014, 3, 20–29. Available online: https://kns.cnki.net/kcms2/article/abstract?v=3uoqIhG8C475KOm_zrgu4lQARvep2SAk8URRK9V8kZLG_vkiPpTeIb4GvxjDyNTXVh9b-4cZ5dIgkzg75Tfct0iLdXZyEBGS&uniplatform=NZKPT (accessed on 10 June 2023).
- Chen, H.S. Research on Topic Model Based Patent Mining and Its Applications. Beijing Inst. Technol. 2016, 4, 71–88. Available online: https://kns.cnki.net/kcms2/article/abstract?v=3uoqIhG8C447WN1SO36whLpCgh0R0Z-iv9r0YoQXiId4v9BfOE9rDiv39gx3RLlwMF6A-xU1fhweMSwWcdDHO04FFWlwEWPJ&uniplatform=NZKPT (accessed on 10 June 2023).
- Mao, L.F. Study of text evolution analysis and prediction based on topic model. Nanjing Univ. Posts Telecommun. 2017, 2, 12–43. Available online: https://kns.cnki.net/kcms2/article/abstract?v=3uoqIhG8C475KOm_zrgu4lQARvep2SAkkyu7xrzFWukWIylgpWWcEgwSw85v4Y4_8hlSQb4pGaghtZSQvRWf7CLlQO1wCp0d&uniplatform=NZKPT (accessed on 10 June 2023).
- Chen, H.S.; Song, Y.H.; Jin, Q.Q.; Wang, X.F. Radical Innovative Topic Identification from a Perspective of Dynamic Topic Network. Libr. Inf. Serv. 2022, 10, 45–58. [Google Scholar] [CrossRef]
- Liu, Z.Q.; Xu, H.Y.; Yue, L.X.; Fang, S. Research on Lagging Effect of Topic Diffusion Evolution Face to Prediction of Research Front. J. China Soc. Sci. Technol. Inf. 2018, 10, 979–988. [Google Scholar]
- Liu, J.W.; Long, Z.X.; Wang, F.F. Finding Collaboration Opportunities from Emerging Issues with LDA Topic Model and Link Prediction. Data Anal. Knowl. Discov. 2019, 1, 104–117. Available online: https://kns.cnki.net/kcms2/article/abstract?v=3uoqIhG8C44YLTlOAiTRKibYlV5Vjs7iLik5jEcCI09uHa3oBxtWoB3Uo_fHZa_LSTlXA0PMbyBDEXz9KCLopwPKpVKMK4_O&uniplatform=NZKPT (accessed on 10 June 2023).
- Li, W.S.; Tan, L.M.; Zhang, G.L.; Wen, X.F.; Liao, T.; Shi, M. Research on Topic Recognition of Key Core Technology in Industrial Chain Based on Multi-source Information Fusion. J. Inf. Resour. Manag. 2022, 1, 116–126. [Google Scholar] [CrossRef]
- Wang, P. Topic Extraction and Evolution for Scientific Literature Based on Hierarchical Probabilistic Topic Model. Libr. Inf. Serv. 2014, 22, 70–77. [Google Scholar] [CrossRef]
- Pei, K.F. Research on Topic Heat Prediction Based on VOLDA Theme Model and ESG Prediction Model. Nanjing Univ. Aeronaut. Astronaut. 2019, 2, 11–31. Available online: https://kns.cnki.net/kcms2/article/abstract?v=3uoqIhG8C475KOm_zrgu4lQARvep2SAkWfZcByc-RON98J6vxPv10VISgQ7zi55MhQcfqP16cdwQUaQEW3OxxXcGsmFc7YK7&uniplatform=NZKPT (accessed on 10 June 2023).
- Luo, Y. Research on Topic Discovery and Evolutive Prediction and Its Application Based on Scientific and Technical Literature. Univ. Electron. Sci. Technol. China 2021, 1, 15–22. [Google Scholar] [CrossRef]
- Zhang, H. Research on Technology Forecasting Method from the Perspective of Data Fusion. Jilin Univ. 2020, 2, 69–78. [Google Scholar] [CrossRef]
- Li, Y.J. Technology Topic Prediction Research Based on LSTM. Xiangtan Univ. 2020, 2, 21–25. [Google Scholar] [CrossRef]
- Yang, H.; Wang, R.F.; Zhang, L. Technology Prediction Based on Core Patents Technology Topic Recognition and Evolution Analysis. J. Intell. 2022, 7, 49–56. Available online: https://kns.cnki.net/kcms2/article/abstract?v=3uoqIhG8C44YLTlOAiTRKibYlV5Vjs7iJTKGjg9uTdeTsOI_ra5_XYdCGNxWtU8Qv57pnMYKPbXWo9jcdJG_AZN3v9SZv61k&uniplatform=NZKPT (accessed on 10 June 2023).
- Teng, J.; Hu, G.W.; Wang, T. Topic Identification and Evolution Path Analysis of Social Appeal Based on Dynamic Semantic Dependency Network. Inf. Doc. Serv. 2022, 3, 20–33. Available online: https://kns.cnki.net/kcms2/article/abstract?v=3uoqIhG8C44YLTlOAiTRKibYlV5Vjs7iJTKGjg9uTdeTsOI_ra5_XdZ9ekciUQMV-5xDMlhiMVCtMvA9BpHyD424dblG52gE&uniplatform=NZKPT (accessed on 10 June 2023).
- Gao, N.; Peng, D.Y.; Fu, J.Y.; Zhao, Y.H. Research on Technology Fronts Prediction Based on Patent IPC Classification and Text Information. Inf. Stud. Theory Appl. 2020, 4, 123–129. [Google Scholar] [CrossRef]
- Huang, N. Research on the Evolution Path of Industrial Wastewater Treatment Technology Based on Patent Mining. Tianjin Univ. 2022, 1, 31–57. [Google Scholar] [CrossRef]
- Ren, H.C.; Huang, Q.L.; Zhang, Z.G.; Yu, W.D. Research on Topic Identification Technology of Emerging Technology in the Ship Field. Inf. Stud. Theory Appl. 2022, 11, 103–106. [Google Scholar] [CrossRef]
- Gui, M.Z. Research on Key Technology Forecasting Based on Intelligent Methods. Shanghai Univ. 2018, 6, 32–63. Available online: https://kns.cnki.net/kcms2/article/abstract?v=3uoqIhG8C447WN1SO36whLpCgh0R0Z-iTEMuTidDzndci_h58Y6ouQoOYLrrcHvDLa4hVZVgdQbHG__kFiWIAu3hXgrmFB1e&uniplatform=NZKPT (accessed on 10 June 2023).
- Tan, T.T. Research Technology Evolution Analysis Method Oriented on Patent. Nanjing Univ. Sci. Technol. 2022, 1, 35–55. [Google Scholar] [CrossRef]
- Zhu, Q.S.; Leng, F.H. Topic Identification of Highly Cited Papers Based on Citation Content Analysis. J. Libr. Sci. China 2014, 1, 39–49. [Google Scholar] [CrossRef]
- Li, P.X. Identifying Innovation Topic Within the Relevancy Between Texts and Citations. Peking Union Med. Coll. 2021, 5, 23–47. [Google Scholar] [CrossRef]
- Yu, G.H.; Ning, Z.; Li, H.F. Research on Identification Method of Disruptive Technology Based on Patent and Bass Model. Stud. Sci. Sci. 2021, 39, 1467–1473, 1536. [Google Scholar] [CrossRef]
- Wang, K.; Chen, Y.; Wang, Y.Q.; Song, C. Research on Disruptive Technology Identification Based on Patent Citation Changes. J. Intell. 2022, 1, 74–80, 169. Available online: https://kns.cnki.net/kcms2/article/abstract?v=3uoqIhG8C44YLTlOAiTRKibYlV5Vjs7iJTKGjg9uTdeTsOI_ra5_XYdCGNxWtU8QLn9VUsU5lRkyBz4xKUYcfNWC3W9x6G3k&uniplatform=NZKPT (accessed on 10 June 2023).
- Wu, K.F.; Wang, W.; Zhang, S.Y.; Che, H.X.; Cai, L.; Chen, X. Research on Disruptive Technology Identification Methods from the Perspective of Technology Discontinuities. Inf. Stud. Theory Appl. 2022, 10, 125–131. [Google Scholar] [CrossRef]
- Kui, L.; Xu, H.Y.; Hu, Z.Y.; Dong, K.; Wang, C.; Pang, H.S. Multiple-pattern Analysis and Prediction of Topic Evolution Path Based on Topic Correlation. Libr. Inf. Serv. 2016, 13, 71–81. [Google Scholar] [CrossRef]
- Meng, J.Y. Research on Scientific Topic Evolution and Forecasting. Beijing Eng. Technol. Univ. 2021, 7, 26–45. [Google Scholar] [CrossRef]
- Chen, R.; He, C.C.; Sun, J.Q.; Yan, S.M.; Liu, Y. Research on Technology Forecasting Based on Trend Evolution Analysis. Sci. Technol. Manag. Res. 2020, 24, 47–53. Available online: https://kns.cnki.net/kcms2/article/abstract?v=3uoqIhG8C44YLTlOAiTRKibYlV5Vjs7iy_Rpms2pqwbFRRUtoUImHYrFaEO8QzFsfoWWIb05eWYtkwmQEJ49RFfVRisoi8i0&uniplatform=NZKPT (accessed on 10 June 2023).
- Wang, S.S.; Yan, S.M.; Chen, R.; Li, J.X. Research on the Evolution of Technological Relatedness Trends Based on Patent Codes Co-Occurrence. J. Intell. 2021, 40, 53–61. Available online: https://kns.cnki.net/kcms2/article/abstract?v=3uoqIhG8C44YLTlOAiTRKibYlV5Vjs7iy_Rpms2pqwbFRRUtoUImHdOR7NvLKjZ755L5uOAdibkX9XuTPd8hgP4bKZxr431A&uniplatform=NZKPT (accessed on 10 June 2023).
- He, Y.J. Topic Recognition and Feature Analysis about the Researches in the Field of Intelligent Technology. Nanchang Univ. 2022, 8, 36–62. [Google Scholar] [CrossRef]
- Guo, J.F. A Semantic Mining-Based Method to Analyze and Evaluate Technology Innovation Pathway. Beijing Inst. Technol. 2018, 6, 67–92. [Google Scholar]
- Fan, M.J. Early Identification of Disruptive Technology Based on Multi-Source Heterogeneous Data. Beijing Univ. Technol. 2021, 6, 23–31. [Google Scholar] [CrossRef]
- Li, X.M. Technology Evolution Analysis Based on Patent Elements Features. Chin. Acad. Agric. Sci. 2020, 1, 35–53. [Google Scholar] [CrossRef]
- Ma, M.; Wang, C.; Zhou, Y.; Xu, H.Y.; Hu, Z.Y.; Xiong, G.H. Research on Core Technology Topic Identification and Evolution Trend Analysis Based on Semantic Information. Inf. Stud. Theory Appl. 2021, 9, 106–113. [Google Scholar] [CrossRef]
- Ma, M.; Wang, C.; Zhang, W.R.; Xu, H.Y.; Zhou, Y. Research on the Methods of Identifying and Analyzing Potential Disruptive Technologies from the Perspective of Catastrophe. Inf. Stud. Theory Appl. 2022, 3, 157–164, 156. [Google Scholar] [CrossRef]
- Huo, C.G.; Huo, F.F.; Dong, K. The Popularity Prediction of Scientific Topics Based on LSTM. Doc. Inf. Knowl. 2021, 2, 25–34. [Google Scholar] [CrossRef]
- Lin, C.R. Research on Key Generic Technology Identification and Foresight Based on Patent Data Mining. Harbin Eng. Univ. 2021, 4, 104–127. [Google Scholar] [CrossRef]
- Song, K.; Ran, C.J. A Method for Development Hierarchy Division and Trend Prediction of Subject Research Topic. Inf. Sci. 2022, 7, 136–144. [Google Scholar] [CrossRef]
- Li, H.L.; Wu, X.L. Research on Topic Discovery and Evolution Based on Time Series Clustering. J. China Soc. Sci. Technol. Inf. 2019, 10, 1041–1050. Available online: https://kns.cnki.net/kcms2/article/abstract?v=3uoqIhG8C44YLTlOAiTRKibYlV5Vjs7iLik5jEcCI09uHa3oBxtWoDWrTnpDfVAo-fIwIFrlvmtwZcyJNu6yBrTT_vR1ksws&uniplatform=NZKPT (accessed on 10 June 2023).
- Nie, X.; Xie, N.F.; Wu, S.S.; Li, X.Y.; Wang, H.J. Prediction of Hot Trends in Animal Genetics and Breeding Based on Machine Learning. Agric. Outlook 2020, 1, 101–105. Available online: https://kns.cnki.net/kcms2/article/abstract?v=3uoqIhG8C44YLTlOAiTRKibYlV5Vjs7i8oRR1PAr7RxjuAJk4dHXoh-KV_Gm6cjyLpUKC8fcuXARynP-1YTKDCXq-yd-cKyz&uniplatform=NZKPT (accessed on 10 June 2023).
- Liu, H. Research on Technology Trend Prediction Method Based on Word Vector. Beijing Univ. Technol. 2018, 7, 29–43. Available online: https://kns.cnki.net/kcms2/article/abstract?v=3uoqIhG8C475KOm_zrgu4lQARvep2SAkZIGkvqfmUZglMdu7fCR481K0pCE4mjcx4y3CC_afkpF2rrQ8Xcih-IibouS2UCTu&uniplatform=NZKPT (accessed on 10 June 2023).
- Ma, J. Applying Text Mining to Technology Opportunities Analysis in Biomedical Field. Beijing Inst. Technol. 2019, 9, 79–89. Available online: https://kns.cnki.net/kcms2/article/abstract?v=3uoqIhG8C447WN1SO36whLpCgh0R0Z-iDdIt-WSAdV5IJ_Uy2HKRASVjT2vCJYzcCU8cA7s4Y38B3DRVz6yzPls0j7biIx2k&uniplatform=NZKPT (accessed on 10 June 2023).
- Huang, L.; Zhu, Y.H.; Zhang, Y. Research on Identification of Emerging Topics Based on Link Prediction with Weighted Networks. J. China Soc. Sci Technol. Inf. 2019, 4, 335–341. Available online: https://kns.cnki.net/kcms2/article/abstract?v=3uoqIhG8C44YLTlOAiTRKibYlV5Vjs7iLik5jEcCI09uHa3oBxtWoDWrTnpDfVAojkdgbYFroK9HWgn_Nf0F5Ysnw3lY3pgl&uniplatform=NZKPT (accessed on 10 June 2023).
- Zhang, Y.; Lin, Y.H.; Hou, J.H. Technology Prediction Method Based on Data Fusion and Life Cycle. J. China Soc. Sci Technol. Inf. 2021, 5, 462–470. Available online: https://kns.cnki.net/kcms2/article/abstract?v=3uoqIhG8C44YLTlOAiTRKibYlV5Vjs7iy_Rpms2pqwbFRRUtoUImHZHj7Gr3AZ3G1gh3PVbAJnQrfDM9i5UaoyVaWdqAjZQ7&uniplatform=NZKPT (accessed on 10 June 2023).
- Cao, Y.W.; Xu, H.Y.; Wu, H.w.; Luo, R.J. Study on Radical Innovation Prediction to Emerging Technology Topics Based on Citation Curve Fitting. Libr. Inf. Serv. 2020, 5, 100–113. [Google Scholar] [CrossRef]
- Dong, K.; Xu, H.Y.; Luo, R.; Fang, S. Research on Multi-dimensional Interdisciplinary Topics Identification Method Based on Scientific Literature Contents Analysis. Inf. Stud. Theory Appl. 2018, 5, 131–136. [Google Scholar] [CrossRef]
- Li, Q.R.; Guo, J.F.; Huang, Y.; Wang, X.F. Research on the Method of Disruptive Technology Identification Based on Patent Bibliometrics. Stud. Sci. Sci. 2021, 39, 1166–1175. [Google Scholar] [CrossRef]
- Cui, J.; Zhang, J.P.; Bao, Z.; Ding, S.C. Development Forecast of Core Theme in Science and Technology Field Based on Trend Analysis. Data Anal. Knowl. Discov. 2022, 9, 1–13. Available online: https://kns.cnki.net/kcms2/article/abstract?v=3uoqIhG8C44YLTlOAiTRKibYlV5Vjs7iJTKGjg9uTdeTsOI_ra5_Xf80tvfaHzPNpC0tB6QvIH8O60mGM8cg0RoB–CdGM0wu&uniplatform=NZKPT (accessed on 10 June 2023).
- Xu, X.G.; Gui, M.Z. Technology Forecast Based on Deep Learning. J. Intell. 2020, 8, 53–62. Available online: https://kns.cnki.net/kcms2/article/abstract?v=3uoqIhG8C44YLTlOAiTRKibYlV5Vjs7i8oRR1PAr7RxjuAJk4dHXoh96Y1LJOLtJy0g5PBpP–g0i6yrybnHvJK–_LTVplKsC&uniplatform=NZKPT (accessed on 10 June 2023).
- Raeesi Vanani, I. Text analytics of customers on twitter: Brand sentiments in customer support. J. Inf. Technol. Manag. 2019, 11, 43–58. [Google Scholar] [CrossRef]
- Caruso, F.P.; Scala, G.; Cerulo, L.; Ceccarelli, M. A review of COVID-19 biomarkers and drug targets: Resources and tools. Brief. Bioinform. 2021, 22, 701–713. [Google Scholar] [CrossRef]
- Zhu, J.H. Research on the Technical Theme and its Technical Principle of Intelligent Security Technology from the Perspective of Patent Literature. Dalian Univ. Technol. 2021, 6, 19–40. [Google Scholar] [CrossRef]
- Zhang, J.; Tan, L.; Tao, X.; Pham, T.; Chen, B. Relational intelligence recognition in online social networks—A survey. Comput. Sci. Rev. 2020, 35, 1–33. [Google Scholar] [CrossRef]
- Kleminski, R.; Kazienko, P.; Kajdanowicz, T. Analysis of direct citation, co-citation and bibliographic coupling in scientific topic identification. J. Inf. Sci. 2022, 48, 349–373. [Google Scholar] [CrossRef]
- Quille, R.V.E.; Barros, J.M.; Barbado, M.; De Almeida, F.V.; Correa, P.L.P. Detecting Favorite Topics in Computing Scientific Literature via Dynamic Topic Modeling. IEEE Access 2023, 11, 41535–41545. [Google Scholar] [CrossRef]
- Ebrahimi, F.; Dehghani, M.; Makkizadeh, F. Analysis of Persian Bioinformatics Research with Topic Modeling. BioMed Res. Int. 2023, 2023, 3728131. [Google Scholar] [CrossRef] [PubMed]
- Zhang, J.Z.; Qiu, M.M.; Wang, Q.Y. Citation Topic Identification and Evolution Based on Citation Content Clustering. Inf. Sci. 2023, 3, 1–21. Available online: http://kns.cnki.net/kcms/detail/22.1264.g2.20220617.1314.022.html (accessed on 9 January 2023).
- Zhou, X.Q.; Wu, P. Research on Topic Detecting of Pandemic Policies of China and the United State of America Based on Embedded Topic Model. Inf. Stud. Theory Appl. 2022, 5, 173–180. [Google Scholar] [CrossRef]
- Nie, X.P. Hot Topic Prediction Based on Time Series. Chin. Acad. Agric. Sci. 2019, 9, 13–47. Available online: https://kns.cnki.net/kcms2/article/abstract?v=3uoqIhG8C475KOm_zrgu4lQARvep2SAkOsSuGHvNoCRcTRpJSuXuqRwJWPPGVr5l_Dijs2KuLmIwYu5CQpWnAyD8GOhDBDfA&uniplatform=NZKPT (accessed on 10 June 2023).
- Wang, X.H.; Gao, M. The Key Technology Identification Method Based on BERT-LDA and Its Empirical Research. Libr. Inf. Serv. 2021, 22, 114–125. [Google Scholar] [CrossRef]
- Li, Q.R.; Guo, J.F.; Huang, Y.; Wang, X.F. Topic Evolution Research of Disruptive Technology Based on Mutation and Fusion Perspective. Stud. Sci. Sci. 2021, 39, 2129–2139. [Google Scholar] [CrossRef]
- Xu, Y.; Meng, W.X.; Li, G.J. Forecasting Hot Topics of Information Science Based on Grey Prediction Model. Inf. Sci. 2016, 7, 3–6, 30. Available online: https://kns.cnki.net/kcms2/article/abstract?v=3uoqIhG8C475KOm_zrgu4lQARvep2SAkHr3ADhkADnVu66WViDP_3J4sNt5fDsyHNihxOdt36XBMPF1Qs4u2WnTW-wjYK8G6&uniplatform=NZKPT (accessed on 10 June 2023). [CrossRef]
- Du, H.; Guo, Y.; Fan, Y.X.; Zhang, J.; Yu, Z.H.; Cheng, X.Q. Calculation and Prediction of Topic Popularity Based on Causal Model. J. Chin. Inf. Process. 2016, 2, 50–55. Available online: https://kns.cnki.net/kcms/detail/detail.aspx?dbcode=CJFD&dbname=CJFDLAST2016&filename=MESS201602008&uniplatform=NZKPT&v=a2h6sDzgRXvJvrea6GXuMcpOWMqvKUh31vGJL2SQ5IrDwmD_-IzMUopZ3Obx8x_Q (accessed on 10 June 2023).
- Cao, R.; Bai, C.; Zhang, Y.J.; Zhang, X.X.; Zhang, Y.; Li, M.Y. Research on Disruptive Technology Recognition Model. China Sci. Technol. Resour. Rev. 2022, 2, 81–92. Available online: https://kns.cnki.net/kcms2/article/abstract?v=3uoqIhG8C44YLTlOAiTRKibYlV5Vjs7i0–kJR0HYBJ80QN9L51zrPzpwteHiqxLlCtMdfKvcFNSaAplnNxV2D4m8qbogkCZt&uniplatform=NZKPT (accessed on 10 June 2023).
- Zhao, G. Disruptive Technology Identification Based on Multi-source Heterogeneous Data. Huazhong Univ. Sci. Technol. 2019, 4, 16–27. Available online: https://kns.cnki.net/kcms2/article/abstract?v=3uoqIhG8C475KOm_zrgu4lQARvep2SAkWfZcByc-RON98J6vxPv10ZEYtqW41TXSx5zS6kgxUKZFQOK3fFN0Q5bJGVWEyNA4&uniplatform=NZKPT (accessed on 10 June 2023).
- Bai, J.J.; Yan, D.W.; Chen, Q. Trend Prediction of Emerging Topics Based on Topic Model and Curve Fitting. Inf. Stud. Theory Appl. 2020, 7, 130–136, 193. [Google Scholar] [CrossRef]
- Liu, Z.Q.; Xu, H.Y.; Yue, L.X.; Fang, S. Research on Core Technology Topic Identification Based on Chunk-LDAvis. Libr. Inf. Serv. 2019, 9, 73–84. [Google Scholar] [CrossRef]
- Xuan, H.S. Research on Technology Innovation Topic Discovery and Development Prediction for Patent Data. Xidian Univ. 2021, 5, 20–35. [Google Scholar] [CrossRef]
- Deng, J.J.; Liu, A.R.; Cao, X.Y.; Zhang, K.; Bai, G.Z.; Chen, R.; Gao, Z.J.; Dai, H.W.; Wang, W.N. Methodological Framework of Identifying Disruptive Technologies on Emerging Stage. Bull. Chin. Acad. Sci. 2022, 5, 674–684. [Google Scholar] [CrossRef]
- Chen, Y.Y.; Tian, W.F.; Wu, J.H. Visualization Analysis Methods of Subject Area Research Hotspots Tracking and Trend Prediction. Inf. Stud. Theory Appl. 2017, 6, 117–121. [Google Scholar] [CrossRef]
- Ye, G.H.; Wang, C.C.; Li, S.Y. Recognition and Prediction of Emerging Topics in Interdisciplinary Scientific Research Collaboration Based on SciTS Conference Text. Inf. Sci. 2022, 7, 126–135. [Google Scholar] [CrossRef]
- Tang, H.; Qiu, Y.W. Emerging Technology Topic Identification Based on Multi. J. Intell. 2021, 3, 81–88. Available online: https://kns.cnki.net/kcms2/article/abstract?v=3uoqIhG8C44YLTlOAiTRKibYlV5Vjs7iy_Rpms2pqwbFRRUtoUImHdOR7NvLKjZ7c92OY79rnd6ugzo0KvZMiHncCKAP0FH6&uniplatform=NZKPT (accessed on 10 June 2023).
- Wu, F.F.; Li, Y.W.; Miao, H. Prediction of Frontier Development and Research Topics of Essential Technology Fields in China Based on Linkage Between Basic Research and Technology Development. J. Intell. 2022, 1, 23, 60–66. Available online: https://kns.cnki.net/kcms/detail/detail.aspx?dbcode=CJFD&dbname=CJFDLAST2022&filename=QBZZ202201010&uniplatform=NZKPT&v=wgMWPIxfIaVJDknHfLmgfeuhtJ7obMNaCyrx6lkh2o9bsuvL9Nar8tMRiRV8ge-C (accessed on 10 June 2023).
- Wu, Y.P.; Bai, R.J.; Liu, M.Y.; Wang, X.Y. Research on Technology Opportunity Discovery Based on Comment Topic Identification and Multi Dimension Analysis of Technical Attributes. Libr. Inf. Serv. 2021, 10, 56–67. [Google Scholar] [CrossRef]
- He, W.L.; Feng, G.H.; Xie, H.L. Analyzing Scientific Literature with Content Similarity—Topics over Time Model. Data Anal. Knowl. Discov. 2018, 11, 64–72. Available online: https://kns.cnki.net/kcms2/article/abstract?v=3uoqIhG8C44YLTlOAiTRKibYlV5Vjs7i0-kJR0HYBJ80QN9L51zrPxcrlhWtdSrQcxBnEn4vHtCdFXUmn3kt1IvYWiCTrf6H&uniplatform=NZKPT (accessed on 10 June 2023).
- Han, Z.Q.; Liu, X.P.; Kou, J.J. Interdisciplinary Literature Discovery Based on Rao-Stirling Diversity Indices. Inf. Sci. 2020, 2, 116–124. [Google Scholar] [CrossRef]
- Liu, S.X. Research on Key Technologies of Link Prediction and Network Evolution of Complex Network. Inf. Eng. Univ. 2021, 1, 53–87. [Google Scholar] [CrossRef]
- Xie, Q.Q. Studying the Evolution Trajectory and Forcasting Development Trend of Emerging Technologies Based on Multi-Source Heterogeneous Data. Beijing Univ. Technol. 2019, 8, 63–79. [Google Scholar] [CrossRef]
- Hou, J.H.; Zhu, X.Q. Evaluation Indicators System of Technology Forecasting and its Empirical Study Based on the Patent. Libr. Inf. Serv. 2014, 18, 77–82, 116. [Google Scholar] [CrossRef]
- Wu, Y.W.; Ji, Y.J.; Gu, X.J. Industrial Generic Technology Prediction Based on Dynamic Complex Network of Patents. Comput. Integr. Manuf. Syst. 2020, 26, 3185–3194. [Google Scholar] [CrossRef]
- Tian, P.W.; Zhang, X. Research on the Patented Technology Topics Identification Based on Heterogeneous Information Network. J. Intell. 2021, 8, 45–52. Available online: https://kns.cnki.net/kcms2/article/abstract?v=3uoqIhG8C44YLTlOAiTRKibYlV5Vjs7iy_Rpms2pqwbFRRUtoUImHdOR7NvLKjZ7euRoAQ7qHQX9SWHW2eOjr–quJpcuuoFP&uniplatform=NZKPT (accessed on 10 June 2023).
- Liao, L. Method and Case Study on Text Mining and Main Path Analysis based Technological Tendency Forecasting. Huazhong Univ. Sci. Technol. 2017, 3, 16–47. Available online: https://kns.cnki.net/kcms/detail/detail.aspx?dbcode=CMFD&dbname=CMFD201901&filename=1018801970.nh&uniplatform=NZKPT&v=XXgOgghaYaM3NAoQzIjz6GGe_59ortQLlK_1LNOP4XTYDzm2EGsDBlDnvq5tqPRv (accessed on 10 June 2023).
- Han, F.; Zhang, S.T.; Feng, L.Z.; Yuan, J.P. Identifying Breakthrough Patent Topics by Measuring Technological Convergence. Data Anal. Knowl. Discov. 2021, 12, 137–147. Available online: https://kns.cnki.net/kcms2/article/abstract?v=3uoqIhG8C44YLTlOAiTRKibYlV5Vjs7iJTKGjg9uTdeTsOI_ra5_Xf80tvfaHzPNB9FXl2F0WSEPyA8FuYRGUN5HqZftefIL&uniplatform=NZKPT (accessed on 10 June 2023).
- Wu, C.; Wang, Q.H.; Li, Y.; Zhang, L.F. Forecast and Cooperation Potential of Frontier Technology Fields of Strategic Emerging Industries. Syst. Eng. 2021, 4, 151–158. Available online: https://kns.cnki.net/kcms/detail/detail.aspx?dbcode=CJFD&dbname=CJFDLAST2021&filename=GCXT202104015&uniplatform=NZKPT&v=-590lXLM7-XMe7FOxMEWJ_BHlLOWyF3i2AIitCElCS_qN3d-G6x_JU3V7rQ62UzM (accessed on 10 June 2023).
- Li, N. Research on Technology Foresight Method from the Perspective of Knowledge Evolution. Chin. Acad. Agric. Sci. 2022, 1, 26–55. [Google Scholar] [CrossRef]
- Fan, S.P.; An, X.Y.; Yan, G.L.; Li, Y. Study on the Recognition Method of Frontier Topic in the Medical Field. J. China Soc. Sci Technol. Inf. 2018, 7, 686–694. Available online: https://kns.cnki.net/kcms2/article/abstract?v=3uoqIhG8C44YLTlOAiTRKibYlV5Vjs7i0-kJR0HYBJ80QN9L51zrPzpwteHiqxLlLR76MCYwv8H3y2FYRYxuXOedF6BS0qn-&uniplatform=NZKPT (accessed on 10 June 2023).
Numbers of Modelsamd Algorithms | 1 | 2 | 3 | 4 | ||
---|---|---|---|---|---|---|
Number of Documents | ||||||
Category | ||||||
Topic identification | 45 | 32 | 9 | 1 | ||
Topic prediction | 33 | 19 | 2 | 1 |
Category | Secondary Category | Indicator | References |
---|---|---|---|
Basic scalar | Word frequency | Number of subject words in documents or patents | Nie et al., (2020) [80] |
Citation/Cited quantity | Citation frequency of documents or patents | Chen et al., (2020) [68] | |
Scale | Number of patented products/patentees | Zhu et al., (2020) [36] | |
Topic popularity and intensity | Topic popularity Topic intensity | Weighting of the number of documents issued and citations | Li et al., (2019) [37] |
Number or proportion of supporting documents | Huo et al., (2021) [76] | ||
Keyword weight | Huo et al., (2021) [9] | ||
Emerging degree | Topic novelty | Article/patent publication age | Cao et al., (2020) [85] |
Topic growth rate | Slope of curves of document/patent authorization | Ren et al., (2022) [58] | |
Cross diffusion | Crossing degree | Distribution of subject words and their cross interactions among disciplines | Dong et al., (2018) [86] |
Scientific relevance | Number of scientific documents cited by patents | Li Dong et al., (2021) [87] | |
Topic relevance | The correlation of topic adjacent time evolution | ||
Migration degree | Transition probability of the topic to the next time | Zhu et al., (2020) [36] | |
Network structure | Topic node centrality | Degree/intermediary/proximity/eigenvector centrality | Liu et al., (2019) [47] |
Local information similarity | Common Neighbor/Cosine Similarity/Jaccard/Sorenson/ Priority Link, etc. | Ma et al., (2021) [74] | |
Path similarity | Local path/Katz | Huang et al., (2019) [83] | |
Random walk similarity | Average commuting time/cosine similarity based on random walk/SimRank |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Guo, S.; Si, L.; Liu, X. Research Practice and Progress of Models and Algorithms Applied in Topic Identification and Prediction Based on the Analysis of CNKI. Appl. Sci. 2023, 13, 7545. https://doi.org/10.3390/app13137545
Guo S, Si L, Liu X. Research Practice and Progress of Models and Algorithms Applied in Topic Identification and Prediction Based on the Analysis of CNKI. Applied Sciences. 2023; 13(13):7545. https://doi.org/10.3390/app13137545
Chicago/Turabian StyleGuo, Sicheng, Li Si, and Xianrui Liu. 2023. "Research Practice and Progress of Models and Algorithms Applied in Topic Identification and Prediction Based on the Analysis of CNKI" Applied Sciences 13, no. 13: 7545. https://doi.org/10.3390/app13137545