Abstract
Technology knowledge elements play an important role in technology innovation. However, there is still challenges about their extraction and evaluation. Traditional methods exhibit limitations in precisely linking key technologies with functions, and they usually focus on measuring the overall novelty of patent documents rather than individual technology details, leading to poor interpretability and practicality of research outcomes. In this work, we present a framework that extracts technology knowledge triples and evaluates the novelty of triples based on deep learning model. This framework first identifies key sentences that reflect innovation from patent claims and then extracts technology knowledge elements from these sentences. A novelty index is then designed to evaluate the novelty of these technology knowledge elements based on the probability of their occurrence and the similarity to existing knowledge. The experimental results demonstrate the effectiveness of the proposed method. The extracted technology knowledge elements can use to construct an innovation knowledge graph, which provides practical applications in engineering knowledge retrieval, design and innovation support.
Similar content being viewed by others
References
Abbas, A., Zhang, L., & Khan, S. U. (2014). A literature review on the state-of-the-art in patent analysis. World Patent Information, 37, 3–13. https://doi.org/10.1016/j.wpi.2013.12.006
Allan, J., Carbonell, J. G., Doddington, G., et al. (1998). Topic detection and tracking pilot study final report. Carnegie Mellon University. https://doi.org/10.1184/R1/6626252.v1
Altuntas, S., Erdogan, Z., & Dereli, T. (2020). A clustering-based approach for the evaluation of candidate emerging technologies. Scientometrics, 124(2), 1157–1177. https://doi.org/10.1007/s11192-020-03535-0
Arts, S., Cassiman, B., & Gomez, J. C. (2018). Text matching to measure patent similarity. Strategic Management Journal, 39(1), 62–84. https://doi.org/10.1002/smj.2699
Cho, J. H., Lee, J., & Sohn, S. Y. (2021). Predicting future technological convergence patterns based on machine learning using link prediction. Scientometrics, 126, 5413–5429. https://doi.org/10.1016/j.wpi.2013.12.006
Contributors, P. (2021). Paddlenlp: An easy-to-use and high performance nlp library. https://github.com/PaddlePaddle/PaddleNLP.
Devlin, J., Chang, M. W. & Lee, K. et al. (2018). Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprintarXiv:1810.04805https://doi.org/10.48550/arXiv.1810.04805.
de Diego, I. M., González-Fernández, C. & Fernández-Isabel, A. et al. (2021). System for evaluating the reliability and novelty of medical scientific papers. Journal of Informetrics 15(4):101,188. https://doi.org/10.1016/j.joi.2021.101188.
Edwards-Schachter, M. (2018). The nature and variety of innovation. International Journal of Innovation Studies, 2(2), 65–79. https://doi.org/10.1016/j.ijis.2018.08.004
Geum, Y., & Kim, M. (2020). How to identify promising chances for technological innovation: Keygraph-based patent analysis. Advanced Engineering Informatics, 46(101), 155. https://doi.org/10.1016/j.aei.2020.101155
Guo, L., Yan, F., Li, T., et al. (2022). An automatic method for constructing machining process knowledge base from knowledge graph. Robotics and Computer-Integrated Manufacturing, 73(102), 222. https://doi.org/10.1016/j.rcim.2021.102222
Hain, D. S., Jurowetzki, R., Buchmann, T., et al. (2022). A text-embedding-based approach to measuring patent-to-patent technological similarity. Technological Forecasting and Social Change, 177(121), 559. https://doi.org/10.1016/j.techfore.2022.121559
Hasan, M.A., Spangler, W.S. & Griffin, T., et al. (2009). Coa: Finding novel patents through text analysis. In: Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 1175–1184, https://doi.org/10.1145/1557019.1557146.
Hong, S., Kim, J., Woo, H. G., et al. (2022). Screening ideas in the early stages of technology development: A word2vec and convolutional neural network approach. Technovation, 112(102), 407. https://doi.org/10.1016/j.technovation.2021.102407
Hu, R., Ma, W., Lin, W., et al. (2022). Technology topic identification and trend prediction of new energy vehicle using lda modeling. Complexity. https://doi.org/10.1155/2022/9373911
Hu, Z., Dong, Y., Wang, K., et al. (2020). Heterogeneous graph transformer. Proceedings of the web conference, 2020, 2704–2710. https://doi.org/10.1145/3366423.3380027
Jin, Y., Liu, J., Wang, X., et al. (2021). Technology recommendations for an innovative agricultural robot design based on technology knowledge graphs. Processes, 9(11), 1905. https://doi.org/10.3390/pr9111905
Kim, K. H., Han, Y. J., Lee, S., et al. (2019). Text mining for patent analysis to forecast emerging technologies in wireless power transfer. Sustainability, 11(22), 6240. https://doi.org/10.3390/su11226240
Klavans, R., & Boyack, K. W. (2006). Identifying a better measure of relatedness for mapping science. Journal of the American Society for Information Science and Technology, 57(2), 251–263. https://doi.org/10.1002/asi.20274
Korobkin, D., Shabanov, D., & Fomenkov, S., et al. (2019). Construction of a matrix “physical effects–technical functions” on the base of patent corpus analysis. In: Creativity in Intelligent Technologies and Data Science: Third Conference, CIT &DS 2019, Volgograd, Russia, September 16–19, 2019, Proceedings, Part II 3, Springer, pp 52–68, https://doi.org/10.1007/978-3-030-29750-3_5.
Lee, C. (2021). A review of data analytics in technological forecasting. Technological Forecasting and Social Change, 166(120), 646. https://doi.org/10.1016/j.techfore.2021.120646
Lee, C., Jeon, D., Ahn, J. M., et al. (2020). Navigating a product landscape for technology opportunity analysis: A word2vec approach using an integrated patent-product database. Technovation, 96(102), 140. https://doi.org/10.1016/j.technovation.2020.102140
Liu, Z., Lü, Z., Zheng, W., et al. (2019). Design of obstacle avoidance controller for agricultural tractor based on ros. International Journal of Agricultural and Biological Engineering, 12(6), 58–65. https://doi.org/10.25165/j.ijabe.20191206.4907
Lu, Y., Liu, Q., & Dai, D., et al. (2022). Unified structure generation for universal information extraction. arXiv preprint arXiv:2203.12277https://doi.org/10.48550/arXiv.2203.12277.
Luo, Z., Lu, W., Cai, L., et al. (2022). Application of lexical functions in novelty measurement of academic papers. Journal of the China Society for Scientific and Technical Information, 41(7), 720–732. https://doi.org/10.3772/j.issn.1000-0135.2022.07.006
Ma, T., Zhou, X., Liu, J., et al. (2021). Combining topic modeling and sao semantic analysis to identify technological opportunities of emerging technologies. Technological Forecasting and Social Change, 173(121), 159. https://doi.org/10.1016/j.techfore.2021.121159
Nakai, K., Nonaka, H., Hentona, A., et al. (2018). Community detection and growth potential prediction using the stochastic block model and the long short-term memory from patent citation networks. In: 2018 IEEE International Conference on Industrial Engineering and Engineering Management (IEEM), IEEE, pp 1884–1888, https://doi.org/10.1109/IEEM.2018.8607487.
Nakayama, H., Kubo, T., & Kamura, J., et al. (2018). doccano: Text annotation tool for human, software available from https://github.com/doccano/doccano.
Ng, A., & Jordan, M. (2001). On discriminative vs. generative classifiers: A comparison of logistic regression and naive bayes. Advances in neural information processing systems 14.
Porter, A. L., & Detampel, M. J. (1995). Technology opportunities analysis. Technological Forecasting and Social Change, 49(3), 237–255. https://doi.org/10.1016/0040-1625(95)00022-3
Ren, H., & Zhao, Y. (2021). Technology opportunity discovery based on constructing, evaluating, and searching knowledge networks. Technovation, 101(102), 196. https://doi.org/10.1016/j.technovation.2020.102196
Saleem, M. H., Potgieter, J., & Arif, K. M. (2021). Automation in agriculture by machine and deep learning techniques: A review of recent developments. Precision Agriculture, 22, 2053–2091. https://doi.org/10.1007/s11119-021-09824-9
Stek, P. E. (2021). Identifying spatial technology clusters from patenting concentrations using heat map kernel density estimation. Scientometrics, 126(2), 911–930. https://doi.org/10.1007/s11192-020-03751-8
Strumsky, D., & Lobo, J. (2015). Identifying the sources of technological novelty in the process of invention. Research Policy, 44(8), 1445–1461. https://doi.org/10.1016/j.respol.2015.05.008
Sun, Y., Wang, S., & Feng, S., et al. (2021). Ernie 3.0: Large-scale knowledge enhanced pre-training for language understanding and generation. arXiv preprint arXiv:2107.02137https://doi.org/10.48550/arXiv.2107.02137.
Tang, Q., Luo, Y. W., & Wu, X. D. (2023). Research on the evaluation method of agricultural intelligent robot design solutions. PLoS ONE, 18(3), e0281,554. https://doi.org/10.1371/journal.pone.0281554
Verhoeven, D., Bakker, J., & Veugelers, R. (2016). Measuring technological novelty with patent-based indicators. Research Policy, 45(3), 707–723. https://doi.org/10.1016/j.respol.2015.11.010
Wang, J., & Chen, Y. J. (2019). A novelty detection patent mining approach for analyzing technological opportunities. Advanced Engineering Informatics, 42(100), 941. https://doi.org/10.1016/j.aei.2019.100941
Wang, Z., Zhang, J., & Feng, J., et al. (2014). Knowledge graph embedding by translating on hyperplanes. In: Proceedings of the AAAI conference on artificial intelligence. https://doi.org/10.1609/aaai.v28i1.8870
Wang, Z., Wang, K., Liu, J., et al. (2022). Measuring the innovation of method knowledge elements in scientific literature. Scientometrics, 127(5), 2803–2827. https://doi.org/10.1007/s11192-022-04350-5
Wang, Z., Zhang, H., Chen, J., et al. (2023). Measuring the novelty of scientific literature through contribution sentence analysis using deep learning and cloud model. SSRN,. https://doi.org/10.2139/ssrn.4360535
Yoon, B., Park, I., Yun, D., et al. (2019). Exploring promising vacant technology areas in a technology-oriented company based on bibliometric analysis and visualisation. Technology Analysis & Strategic Management, 31(4), 388–405. https://doi.org/10.1080/09537325.2018.1516864
Yoon, J., Seo, W., Coh, B. Y., et al. (2017). Identifying product opportunities using collaborative filtering-based patent analysis. Computers & Industrial Engineering, 107, 376–387. https://doi.org/10.1016/j.cie.2016.04.009
Zanella, G., Liu, C. Z., & Choo, K. K. R. (2021). Understanding the trends in blockchain domain through an unsupervised systematic patent analysis. IEEE Transactions on Engineering Management. https://doi.org/10.1109/TEM.2021.3074310
Zhang, C. Z., Mayr, P., Lu, W., et al. (2023). Guest editorial: Extraction and evaluation of knowledge entities in the age of artificial intelligence. Aslib Journal of Information Management, 75(3), 433–437. https://doi.org/10.1108/AJIM-05-2023-507
Zhang, H., Daim, T., & Zhang, Y. P. (2021). Integrating patent analysis into technology roadmapping: A latent dirichlet allocation based technology assessment and roadmapping in the field of blockchain. Technological Forecasting and Social Change, 167(120), 729. https://doi.org/10.1016/j.techfore.2021.120729
Zhao, J., Yang, Y., Zheng, H., et al. (2020). Global agricultural robotics research and development: Trend forecasts. Journal of Physics: Conference Series. https://doi.org/10.1088/1742-6596/1693/1/012227
Funding
This work is supported by Humanities and Social Sciences program of Chinese Ministry of Education (20YJC740067), and the Young Scholar Project of Pazhou Lab (No. PZL2021KF0021), and Guangzhou Basic and Applied Basic Research (202201010184).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflicts of interest that are relevant to the content of this article.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Wei, T., Feng, D., Song, S. et al. An extraction and novelty evaluation framework for technology knowledge elements of patents. Scientometrics (2024). https://doi.org/10.1007/s11192-024-04990-9
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11192-024-04990-9