Adopting Text Mining for Patent Analysis to Determine the Attribute and Segment in Automotive Industries

Authors

  • Amir Syafiq Syamin Syah Amir Hamzah Department of Management of Technology (MoT), Malaysia–Japan International Institute of Technology (MJIIT), Universiti Teknologi Malaysia, Jalan Sultan Yahya Petra, 54100 Kuala Lumpur, Malaysia
  • Hafizah Farhah Saipan@Saipol Intellectual Property and Innovation Management (IPIM) iKohza, Malaysia-Japan International Institute of Technology (MJIIT), Universiti Teknologi Malaysia, Jalan Sultan Yahya Petra, 54100 Kuala Lumpur, Malaysia
  • Syarifah Zyurina Nordin Department of Management of Technology (MoT), Malaysia–Japan International Institute of Technology (MJIIT), Universiti Teknologi Malaysia, Jalan Sultan Yahya Petra, 54100 Kuala Lumpur, Malaysia
  • Zatul Alwani Shaffiei Department of Electronic Systems Engineering (ESE), Malaysia–Japan International Institute of Technology (MJIIT), Universiti Teknologi Malaysia, Jalan Sultan Yahya Petra, 54100 Kuala Lumpur, Malaysia
  • Naoki Ohshima Graduate School of Innovation and Technology Management, Yamaguchi University, Yamaguchi, Japan

DOI:

https://doi.org/10.37934/araset.37.2.94103

Keywords:

Text mining, word cloud, co-occurrence networks, correspondence analysis, NLP, pre-processing

Abstract

Analysing massive patent documents in heavy industries including automotive has become important in recent years as they contain a lot of information that is extremely difficult to deal with from huge numbers and various forms. Important documents such as patent data on a huge scale has become a major concern owing to time constraints and enormous costly work. In natural language processing (NLP), text mining is used to determine several features such as segmentations and attributes of data. The important step in data mining is pre-processing data information from massive text data. In this paper, the fundamental concept of pre-processing and data analysing is examined to provide accurate and meaningful information from the chosen data set. This study focuses on two automotive companies, namely Mazda and Mitsubishi to describe the similarities, distances and frequencies between several patent documents. To demonstrate the behaviour of selected patent documents, word cloud, co-occurrence networks and correspondence analysis are also presented.

Author Biographies

Amir Syafiq Syamin Syah Amir Hamzah, Department of Management of Technology (MoT), Malaysia–Japan International Institute of Technology (MJIIT), Universiti Teknologi Malaysia, Jalan Sultan Yahya Petra, 54100 Kuala Lumpur, Malaysia

syamin@utm.my

Hafizah Farhah Saipan@Saipol, Intellectual Property and Innovation Management (IPIM) iKohza, Malaysia-Japan International Institute of Technology (MJIIT), Universiti Teknologi Malaysia, Jalan Sultan Yahya Petra, 54100 Kuala Lumpur, Malaysia

hafizah.farhah@utm.my

Syarifah Zyurina Nordin, Department of Management of Technology (MoT), Malaysia–Japan International Institute of Technology (MJIIT), Universiti Teknologi Malaysia, Jalan Sultan Yahya Petra, 54100 Kuala Lumpur, Malaysia

szyurina@utm.my

Zatul Alwani Shaffiei, Department of Electronic Systems Engineering (ESE), Malaysia–Japan International Institute of Technology (MJIIT), Universiti Teknologi Malaysia, Jalan Sultan Yahya Petra, 54100 Kuala Lumpur, Malaysia

zatulalwani.kl@utm.my

Naoki Ohshima, Graduate School of Innovation and Technology Management, Yamaguchi University, Yamaguchi, Japan

naoki.ohshima@yumot.jp

Downloads

Published

2024-01-16

How to Cite

Amir Syafiq Syamin Syah Amir Hamzah, Hafizah Farhah Saipan@Saipol, Syarifah Zyurina Nordin, Zatul Alwani Shaffiei, & Naoki Ohshima. (2024). Adopting Text Mining for Patent Analysis to Determine the Attribute and Segment in Automotive Industries. Journal of Advanced Research in Applied Sciences and Engineering Technology, 37(2), 94–103. https://doi.org/10.37934/araset.37.2.94103

Issue

Section

Articles