skip to main content
10.1145/3607947.3608055acmotherconferencesArticle/Chapter ViewAbstractPublication Pagesic3Conference Proceedingsconference-collections
research-article

OptBertDCNN: A framework based on BERT and optimized Deep Convolutional Neural Network for MQA

Published:28 September 2023Publication History

ABSTRACT

Multilingual Question Answering (MQA) generates accurate answers to the user’s query despite the context language. MQA has gained popularity as individuals increasingly pose questions in both English and their native languages on social media. However, conventional Question Answering (QA) systems encounter difficulties in handling multiple languages. The development of MQA models is hindered by the lack of large-scale benchmark datasets, impeding the achievement of high performance compared to monolingual systems. An effective MQA system called OptBertDCNN is proposed to address this, leveraging multilingual cased Bidirectional Encoder Representations from Transformers (mBERT) and Optimized Deep Convolutional Neural Networks (CNN). OptBertDCNN employs a BERT tokenizer to segment sentences into tokens, enabling the extraction of features such as word embeddings from the pre-trained model, TF-IDF scores, SentiWordNet scores, and statistical features from both context and question tokens. These informative features are then inputted into OptBertDCNN, which is trained using the EHMQuAD dataset. Notably, OptBertDCNN achieves outstanding performance metrics, including an exact match of 0.755, precision of 0.765, recall of 0.773, and F1-score of 0.769. These results unequivocally demonstrate the effectiveness of OptBertDCNN in addressing the challenges of multilingual question answering.

References

  1. Mikel Artetxe, Sebastian Ruder, and Dani Yogatama. 2020. On the Cross-lingual Transferability of Monolingual Representations. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 4623–4637.Google ScholarGoogle ScholarCross RefCross Ref
  2. VH Arul, VG Sivakumar, Ramalatha Marimuthu, and Basabi Chakraborty. 2019. An approach for speech enhancement using deep convolutional neural network. Multimedia Research (MR) 2, 1 (2019), 37–44.Google ScholarGoogle Scholar
  3. Mahesh B. Shelke, Daivat D Sawant, Chatrabhuj B Kadam, Kailas Ambhure, and Sachin N Deshmukh. 2023. Marathi SentiWordNet: A lexical resource for sentiment analysis of Marathi. Concurrency and Computation: Practice and Experience (2023), e7497.Google ScholarGoogle Scholar
  4. Stefano Baccianella, Andrea Esuli, Fabrizio Sebastiani, 2010. SentiWordNet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining. In Lrec, Vol. 10. 2200–2204.Google ScholarGoogle Scholar
  5. Dzmitry Bahdanau, Kyung Hyun Cho, and Yoshua Bengio. 2015. Neural Machine Translation by Jointly Learning to Align and Translate. In 3rd International Conference on Learning Representations, ICLR 2015.Google ScholarGoogle Scholar
  6. Emily M Bender. 2009. Linguistically Naïve != Language Independent: Why NLP Needs Linguistic Typology. In Proceedings of the EACL 2009 Workshop on the Interaction between Linguistics and Computational Linguistics: Virtuous, Vicious or Vacuous?26–32.Google ScholarGoogle ScholarCross RefCross Ref
  7. Casimiro Pio Carrino, Marta R Costa-jussà, and José AR Fonollosa. 2020. Automatic Spanish Translation of SQuAD Dataset for Multi-lingual Question Answering. In Proceedings of the 12th Language Resources and Evaluation Conference. 5515–5523.Google ScholarGoogle Scholar
  8. Muhao Chen and Carlo Zaniolo. 2017. Learning Multi-faceted Knowledge Graph Embeddings for Natural Language Processing. In Proceedings of the 26th International Joint Conference on Artificial Intelligence. 5169–5170.Google ScholarGoogle ScholarCross RefCross Ref
  9. Aarti Chugh, Vivek Kumar Sharma, Sandeep Kumar, Anand Nayyar, Basit Qureshi, Manjot Kaur Bhatia, and Charu Jain. 2021. Spider Monkey Crow Optimization Algorithm With Deep Learning for Sentiment Classification and Information Retrieval. IEEE Access 9 (2021), 24249–24262.Google ScholarGoogle ScholarCross RefCross Ref
  10. R Cristin, V Cyril Raj, and Ramalatha Marimuthu. 2019. Face Image Forgery Detection by Weight Optimized Neural Network Model. Multimedia Research 2, 2 (2019), 19–27.Google ScholarGoogle Scholar
  11. Amitava Das and Sivaji Bandyopadhyay. 2010. SentiWordNet for Indian languages. In Proceedings of the eighth workshop on Asian language resouces. 56–63.Google ScholarGoogle Scholar
  12. Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). 4171–4186.Google ScholarGoogle Scholar
  13. Deepak Gupta, Asif Ekbal, and Pushpak Bhattacharyya. 2019. A Deep Neural Network Framework for English Hindi Question Answering. ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP) 19, 2 (2019), 1–22.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Deepak Gupta, Surabhi Kumari, Asif Ekbal, and Pushpak Bhattacharyya. 2018. MMQA: A multi-domain multi-lingual question-answering framework for English and Hindi. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018).Google ScholarGoogle Scholar
  15. Momchil Hardalov, Todor Mihaylov, Dimitrina Zlatkova, Yoan Dinkov, Ivan Koychev, and Preslav Nakov. 2020. EXAMS: A Multi-subject High School Examinations Dataset for Cross-lingual and Multilingual Question Answering. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP). 5427–5444.Google ScholarGoogle ScholarCross RefCross Ref
  16. Zhiheng Huang, Marcus Thint, and Zengchang Qin. 2008. Question Classification using Head Words and their Hypernyms. In Proceedings of the 2008 Conference on empirical methods in natural language processing. 927–936.Google ScholarGoogle ScholarCross RefCross Ref
  17. Ammar Ismael Kadhim. 2019. Term Weighting for Feature Extraction on Twitter: A Comparison Between BM25 and TF-IDF. In 2019 international conference on advanced science and engineering (ICOASE). IEEE, 124–128.Google ScholarGoogle ScholarCross RefCross Ref
  18. Farhan Hassan Khan, Usman Qamar, and Saba Bashir. 2016. SentiMI: Introducing point-wise mutual information with SentiWordNet to improve sentiment polarity detection. Applied Soft Computing 39 (2016), 140–153.Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Patrick Lewis, Barlas Oguz, Ruty Rinott, Sebastian Riedel, and Holger Schwenk. 2020. MLQA: Evaluating Cross-lingual Extractive Question Answering. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 7315–7330.Google ScholarGoogle ScholarCross RefCross Ref
  20. Ekaterina Loginova, Stalin Varanasi, and Günter Neumann. 2021. Towards End-to-End Multilingual Question Answering. Information Systems Frontiers 23, 1 (2021), 227–241.Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Benjamin Muller, Luca Soldaini, Rik Koncel-Kedziorski, Eric Lind, and Alessandro Moschitti. 2021. Cross-Lingual GenQA: A Language-Agnostic Generative Question Answering Approach for Open-Domain Question Answering. arXiv preprint arXiv:2110.07150 (2021).Google ScholarGoogle Scholar
  22. Pranav Rajpurkar, Jian Zhang, Konstantin Lopyrev, and Percy Liang. 2016. SQuAD: 100,000+ Questions for Machine Comprehension of Text. In Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing. 2383–2392.Google ScholarGoogle ScholarCross RefCross Ref
  23. Machhirke Vinodkumar Sadhuram and Aparna Soni. 2020. Natural Language Processing based New Approach to Design Factoid Question Answering System. In 2020 Second International Conference on Inventive Research in Computing Applications (ICIRCA). IEEE, 276–281.Google ScholarGoogle Scholar
  24. Tasmiah Tahsin Mayeesha, Abdullah Md Sarwar, and Rashedur M Rahman. 2021. Deep learning based question answering system in Bengali. Journal of Information and Telecommunication 5, 2 (2021), 145–178.Google ScholarGoogle ScholarCross RefCross Ref
  25. S Thara, E Sampath, Phanindra Reddy, 2020. Code Mixed Question Answering Challenge using Deep Learning Methods. In 2020 5th International Conference on Communication and Electronics Systems (ICCES). IEEE, 1331–1337.Google ScholarGoogle Scholar
  26. Rui Yan, Weiheng Liao, Jianwei Cui, Hailei Zhang, Yichuan Hu, and Dongyan Zhao. 2021. Multilingual COVID-QA: Learning towards global information sharing via web question answering in multiple languages. In Proceedings of the Web Conference 2021. 2590–2600.Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Seyhmus Yilmaz and Sinan Toklu. 2020. A deep learning analysis on question classification task using Word2vec representations. Neural Computing and Applications 32, 7 (2020), 2909–2928.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. OptBertDCNN: A framework based on BERT and optimized Deep Convolutional Neural Network for MQA

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Other conferences
          IC3-2023: Proceedings of the 2023 Fifteenth International Conference on Contemporary Computing
          August 2023
          783 pages
          ISBN:9798400700224
          DOI:10.1145/3607947

          Copyright © 2023 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 28 September 2023

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article
          • Research
          • Refereed limited
        • Article Metrics

          • Downloads (Last 12 months)20
          • Downloads (Last 6 weeks)2

          Other Metrics

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        HTML Format

        View this article in HTML Format .

        View HTML Format