AI-Based Assistance for Management of Oral Community Knowledge in Low-Resource and Colloquial Kannada Language

Aparna, M.; Srivatsa, Sharath; Sai Madhavan, G.; Dinesh, T. B.; Srinivasa, Srinath

doi:10.1007/978-3-031-58502-9_1

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 14516))

Included in the following conference series:

International Conference on Big Data Analytics

32 Accesses

Abstract

Knowledge in rural communities is largely created, preserved, and is transferred verbally, and it is limited. This information is valuable to these communities, and managing and making it available digitally with state-of-the-art approaches enriches awareness and collective knowledge of people of these communities. The large amounts of data and information produced on the Internet are inaccessible to the population in these rural communities due to factors like lack of infrastructure, connectivity, and limited literacy. Knowledge internal to rural communities is also not conserved and made available in any global Big Data information systems. Artificial Intelligence (AI) technologies such as Automatic Speech Recognition (ASR) and Natural Language Processing (NLP) provide substantial assistance when vast quantities of data, like Big Data, are available to build solutions. In the case of low-resource languages like Kannada and rural colloquial dialects, publicly available corpora are significantly less. Building state-of-the-art AI solutions is challenging in this context, and we address this problem in this work. Knowledge management in rural communities requires a low-cost and efficient approach that social workers can use. This paper proposes an architecture for oral knowledge management for rural communities speaking colloquial Kannada. The proposed architecture has an interface for oral knowledge retrieval using text processing on transcripts generated from the smallest state-of-the-art ASR model. We propose three interfaces to search for content: an n-gram based fuzzy search to search for texts in audios, the most frequent entities search based on the Kannada Named Entity Recognition (NER) model, and question-answering with Large Language Model (LLM) using a community knowledge vector store.

This work was supported by the Mphasis F1 Foundation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 89.00; Price excludes VAT (USA)

Softcover Book: USD 74.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
https://www.statista.com/statistics/1232343/internet-literacy-index-by-category-india/.
2.
https://blog.janastu.org/covid-19-campaign-namma-halli-radio/.
3.
http://lisindia.ciil.org/Kannada/Kannada.html.
4.
https://www.audacityteam.org/.
5.
https://openslr.org/.
6.
Demo app: http://103.156.19.244:33035/,
username: guest, password: guest123.

References

Aksënova, A., et al.: Accented speech recognition: benchmarking, pre-training, and diverse data (2022)
Google Scholar
Baevski, A., Hsu, W.N., Conneau, A., Auli, M.: Unsupervised speech recognition. In: Ranzato, M., Beygelzimer, A., Dauphin, Y., Liang, P., Vaughan, J.W. (eds.) Advances in Neural Information Processing Systems, vol. 34, pp. 27826–27839. Curran Associates, Inc. (2021)
Google Scholar
Baevski, A., Zhou, Y., Mohamed, A., Auli, M.: Wav2vec 2.0: a framework for self-supervised learning of speech representations. In: Larochelle, H., Ranzato, M., Hadsell, R., Balcan, M., Lin, H. (eds.) Advances in Neural Information Processing Systems, vol. 33, pp. 12449–12460. Curran Associates, Inc. (2020)
Google Scholar
Cohn, D., Ghahramani, Z., Jordan, M.: Active learning with statistical models. In: Advances in Neural Information Processing Systems, vol. 7 (1994)
Google Scholar
Devlin, J., Chang, M., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. CoRR abs/1810.04805 (2018). http://arxiv.org/abs/1810.04805
Duarte, F.: Amount of data created daily (2023). https://explodingtopics.com/blog/data-generated-per-day. Accessed 08 Oct 2023
Goodmann, E., Matienzo, M.A., VanCour, S., Dries, W.V.: Building the national radio recordings database: a big data approach to documenting audio heritage. In: 2019 IEEE International Conference on Big Data (Big Data), pp. 3080–3086 (2019). https://doi.org/10.1109/BigData47090.2019.9006520
Kakwani, D., et al.: IndicNLPSuite: monolingual corpora, evaluation benchmarks and pre-trained multilingual language models for indian languages. In: Findings of EMNLP (2020)
Google Scholar
Levenshtein, V.I., et al.: Binary codes capable of correcting deletions, insertions, and reversals. In: Soviet Physics Doklady, vol. 10, pp. 707–710. Soviet Union (1966)
Google Scholar
Lewis, P., et al.: Retrieval-augmented generation for knowledge-intensive NLP tasks (2021)
Google Scholar
Mhaske, A., et al.: Naamapadam: a large-scale named entity annotated data for Indic languages. In: Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 10441–10456. Association for Computational Linguistics, Toronto (2023). https://doi.org/10.18653/v1/2023.acl-long.582, https://aclanthology.org/2023.acl-long.582
Najafabadi, M.M., Villanustre, F., Khoshgoftaar, T.M., Seliya, N., Wald, R., Muharemagic, E.: Deep learning applications and challenges in big data analytics. J. Big Data 2(1), 1–21 (2015)
Article Google Scholar
Pan, X., Zhang, B., May, J., Nothman, J., Knight, K., Ji, H.: Cross-lingual name tagging and linking for 282 languages. In: Annual Meeting of the Association for Computational Linguistics (2017)
Google Scholar
Pratap, V., et al.: Scaling speech technology to 1,000+ languages. arXiv (2023)
Google Scholar
Radford, A., Kim, J.W., Xu, T., Brockman, G., Mcleavey, C., Sutskever, I.: Robust speech recognition via large-scale weak supervision. In: Krause, A., Brunskill, E., Cho, K., Engelhardt, B., Sabato, S., Scarlett, J. (eds.) Proceedings of the 40th International Conference on Machine Learning. Proceedings of Machine Learning Research, vol. 202, pp. 28492–28518. PMLR (2023). https://proceedings.mlr.press/v202/radford23a.html
Verma, J.P., Agrawal, S., Patel, B., Patel, A.: Big data analytics: challenges and applications for text, audio, video, and social media data. Int. J. Soft Comput. Artif. Intell. Appl. (IJSCAI) 5(1), 41–51 (2016)
Google Scholar
Vryzas, N., Tsipas, N., Dimoulas, C.: Web radio automation for audio stream management in the era of big data. Information 11(4) (2020). https://doi.org/10.3390/info11040205, https://www.mdpi.com/2078-2489/11/4/205
Wang, P., Wang, X., Liu, X.: Selection of audio learning resources based on big data. Int. J. Emerg. Technol. Learn. (Online) 17(6), 23 (2022)
Article Google Scholar
Zhang, J., et al.: Managing and analysing big audio data for environmental monitoring. In: 2013 IEEE 16th International Conference on Computational Science and Engineering, pp. 997–1004 (2013). https://doi.org/10.1109/CSE.2013.146
Zhang, Q., Yang, L.T., Chen, Z., Li, P.: A survey on deep learning for big data. Inf. Fusion 42, 146–157 (2018). https://doi.org/10.1016/j.inffus.2017.10.006, https://www.sciencedirect.com/science/article/pii/S1566253517305328

Download references

Author information

Authors and Affiliations

International Institute of Information Technology, 26/C, Electronics City Phase 1, Bangalore, Karnataka, India
M. Aparna, Sharath Srivatsa, G. Sai Madhavan & Srinath Srinivasa
iruWay Rural Research Lab, Janastu, Durgadahalli, Tumkur Dist., India
T. B. Dinesh

Authors

M. Aparna
View author publications
You can also search for this author in PubMed Google Scholar
Sharath Srivatsa
View author publications
You can also search for this author in PubMed Google Scholar
G. Sai Madhavan
View author publications
You can also search for this author in PubMed Google Scholar
T. B. Dinesh
View author publications
You can also search for this author in PubMed Google Scholar
Srinath Srinivasa
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to M. Aparna .

Editor information

Editors and Affiliations

National Institute of Technology Delhi, New Delhi, Delhi, India
Shelly Sachdeva
University of Aizu, Aizu-Wakamatsu, Fukushima, Japan
Yutaka Watanobe

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Aparna, M., Srivatsa, S., Sai Madhavan, G., Dinesh, T.B., Srinivasa, S. (2024). AI-Based Assistance for Management of Oral Community Knowledge in Low-Resource and Colloquial Kannada Language. In: Sachdeva, S., Watanobe, Y. (eds) Big Data Analytics in Astronomy, Science, and Engineering. BDA 2023. Lecture Notes in Computer Science, vol 14516. Springer, Cham. https://doi.org/10.1007/978-3-031-58502-9_1

Download citation

DOI: https://doi.org/10.1007/978-3-031-58502-9_1
Published: 27 April 2024
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-58501-2
Online ISBN: 978-3-031-58502-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

AI-Based Assistance for Management of Oral Community Knowledge in Low-Resource and Colloquial Kannada Language