Abstract
Data analysis and processing allow for acquiring competitive advantages both in the business and academic and research worlds. One of the sciences that carries out this analysis is machine learning, which has evolved with greater emphasis in recent years due to its advantages and applicability in different areas. Aware of the importance and current relevance of data management for industries, especially in the banking sector, this study applies supervised learning techniques to generate classification and prediction models by treating a set of data from an Ecuadorian financial institution. Different algorithms are compared, and each of the steps to follow in constructing the models is explained in detail. This allows the financial entity to classify its clients as VIPs or not with greater certainty, as well as to predict the investment amounts of the potential clients based on variables such as age, occupation, and among others. The main results show that the K-nearest neighbor algorithm with k = 5 is optimal for classification, while for prediction, the multilayer perceptron algorithm is the most favorable.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Sivarajah U, Kamal MM, Irani Z, Weerakkody V (2017) Critical analysis of big data challenges and analytical methods. J Bus Res 70:263–286. https://doi.org/10.1016/j.jbusres.2016.08.001
Agarwal R, Dhar V (2014) Editorial—big data, data science, and analytics: the opportunity and challenge for IS research. Inform Syst Res 25:443–448. https://doi.org/10.1287/isre.2014.0546
Kubat M (2021) Ambitions and goals of machine learning. In: Kubat M (ed) An introduction to machine learning. Springer International Publishing, Cham, pp 1–15
Murdoch WJ, Singh C, Kumbier K, Abbasi-Asl R, Yu B (2019) Definitions, methods, and applications in interpretable machine learning. Proc Natl Acad Sci 116:22071–22080. https://doi.org/10.1073/pnas.1900654116
Crisci C, Ghattas B, Perera G (2012) A review of supervised machine learning algorithms and their applications to ecological data. Ecol Model 240:113–122. https://doi.org/10.1016/j.ecolmodel.2012.03.001
Ghahramani Z (2004) Unsupervised learning. In: Bousquet O, von Luxburg U, y Rätsch G (eds) Advanced lectures on machine learning: ML summer schools 2003, Canberra, Australia, February 2–14, 2003, Tübingen, Germany, August 4–16, 2003, Revised lectures. Springer, Berlin, Heidelberg, pp 72–112
Ota R, Yamashita F (2022) Application of machine learning techniques to the analysis and prediction of drug pharmacokinetics. J Control Release 352:961–969. https://doi.org/10.1016/j.jconrel.2022.11.014
Zhou B, Lapedriza A, Khosla A, Oliva A, Torralba A (2018) Places: a 10 million image database for scene recognition. IEEE Trans Pattern Anal Mach Intell 40:1452–1464. https://doi.org/10.1109/TPAMI.2017.2723009
Jaquart P, Dann D, Weinhardt C (2021) Short-term bitcoin market prediction via machine learning. J Finance Data Sci 7:45–66. https://doi.org/10.1016/j.jfds.2021.03.001
Conneau A, Kiela D, Schwenk H, Barrault L, Bordes A (2017) Supervised learning of universal sentence representations from natural language inference data. In: Proceedings of the 2017 conference on empirical methods in natural language processing. Association for Computational Linguistics, Copenhagen, pp 670–680
Alonso-Robisco A, Carbó JM (2022) Can machine learning models save capital for banks? Evidence from a Spanish credit portfolio. Int Rev Financial Anal 84:102372. https://doi.org/10.1016/j.irfa.2022.102372
Lagasio V, Pampurini F, Pezzola A, Quaranta AG (2022) Assessing bank default determinants via machine learning. Inform Sci 618:87–97. https://doi.org/10.1016/j.ins.2022.10.128
Khandani AE, Kim AJ, Lo AW (2010) Consumer credit-risk models via machine-learning algorithms. J Bank Finance 34:2767–2787. https://doi.org/10.1016/j.jbankfin.2010.06.001
Wang Y, Wang S, Lai KK (2005) A new fuzzy support vector machine to evaluate credit risk. IEEE Trans Fuzzy Syst 13:820–831. https://doi.org/10.1109/TFUZZ.2005.859320
Mainelli M, Yeandle M (2006) Best execution compliance: new techniques for managing compliance risk. J Risk Finance 7:301–312. https://doi.org/10.1108/15265940610664979
European Banking Federation (2020) AI in the banking industry: EBF position paper. https://www.ebf.eu/innovation-cybersecurity/ai-in-the-banking-industry-ebf-position-paper/
Borkin D, Nemethova A, Michalconok G, Maiorov K (2019) Impact of data normalization on classification model accuracy. Res Papers Faculty Mater Sci Technol Slovak Univ Technol 27:79–84. https://doi.org/10.2478/rput-2019-0029
Mohammed R, Rawashdeh J, Abdullah M (2020) Machine learning with oversampling and undersampling techniques: overview study and experimental results. In: Proceedings of the 2020 11th international conference on information and communication systems (ICICS), pp 243–248
Tharwat A (2020) Classification assessment methods. Appl Comput Inform 17:168–192. https://doi.org/10.1016/j.aci.2018.08.003
Acknowledgements
The authors would like to thank to “Vicerrectorado de Investigación” of the University of Cuenca, Ecuador, for the financial support given to the present research, development, and innovation work.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Flores-Siguenza, P., Espinoza-Saquicela, D., Moscoso-Martínez, M., Siguenza-Guzman, L. (2023). Applying Machine Learning Techniques to the Analysis and Prediction of Financial Data. In: Yang, XS., Sherratt, R.S., Dey, N., Joshi, A. (eds) Proceedings of Eighth International Congress on Information and Communication Technology. ICICT 2023. Lecture Notes in Networks and Systems, vol 694. Springer, Singapore. https://doi.org/10.1007/978-981-99-3091-3_69
Download citation
DOI: https://doi.org/10.1007/978-981-99-3091-3_69
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-3090-6
Online ISBN: 978-981-99-3091-3
eBook Packages: EngineeringEngineering (R0)