Skip to main content

Applying Machine Learning Techniques to the Analysis and Prediction of Financial Data

  • Conference paper
  • First Online:
Proceedings of Eighth International Congress on Information and Communication Technology (ICICT 2023)

Abstract

Data analysis and processing allow for acquiring competitive advantages both in the business and academic and research worlds. One of the sciences that carries out this analysis is machine learning, which has evolved with greater emphasis in recent years due to its advantages and applicability in different areas. Aware of the importance and current relevance of data management for industries, especially in the banking sector, this study applies supervised learning techniques to generate classification and prediction models by treating a set of data from an Ecuadorian financial institution. Different algorithms are compared, and each of the steps to follow in constructing the models is explained in detail. This allows the financial entity to classify its clients as VIPs or not with greater certainty, as well as to predict the investment amounts of the potential clients based on variables such as age, occupation, and among others. The main results show that the K-nearest neighbor algorithm with k = 5 is optimal for classification, while for prediction, the multilayer perceptron algorithm is the most favorable.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 229.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 299.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Sivarajah U, Kamal MM, Irani Z, Weerakkody V (2017) Critical analysis of big data challenges and analytical methods. J Bus Res 70:263–286. https://doi.org/10.1016/j.jbusres.2016.08.001

    Article  Google Scholar 

  2. Agarwal R, Dhar V (2014) Editorial—big data, data science, and analytics: the opportunity and challenge for IS research. Inform Syst Res 25:443–448. https://doi.org/10.1287/isre.2014.0546

    Article  Google Scholar 

  3. Kubat M (2021) Ambitions and goals of machine learning. In: Kubat M (ed) An introduction to machine learning. Springer International Publishing, Cham, pp 1–15

    Chapter  MATH  Google Scholar 

  4. Murdoch WJ, Singh C, Kumbier K, Abbasi-Asl R, Yu B (2019) Definitions, methods, and applications in interpretable machine learning. Proc Natl Acad Sci 116:22071–22080. https://doi.org/10.1073/pnas.1900654116

    Article  MathSciNet  MATH  Google Scholar 

  5. Crisci C, Ghattas B, Perera G (2012) A review of supervised machine learning algorithms and their applications to ecological data. Ecol Model 240:113–122. https://doi.org/10.1016/j.ecolmodel.2012.03.001

    Article  Google Scholar 

  6. Ghahramani Z (2004) Unsupervised learning. In: Bousquet O, von Luxburg U, y Rätsch G (eds) Advanced lectures on machine learning: ML summer schools 2003, Canberra, Australia, February 2–14, 2003, Tübingen, Germany, August 4–16, 2003, Revised lectures. Springer, Berlin, Heidelberg, pp 72–112

    Google Scholar 

  7. Ota R, Yamashita F (2022) Application of machine learning techniques to the analysis and prediction of drug pharmacokinetics. J Control Release 352:961–969. https://doi.org/10.1016/j.jconrel.2022.11.014

    Article  Google Scholar 

  8. Zhou B, Lapedriza A, Khosla A, Oliva A, Torralba A (2018) Places: a 10 million image database for scene recognition. IEEE Trans Pattern Anal Mach Intell 40:1452–1464. https://doi.org/10.1109/TPAMI.2017.2723009

    Article  Google Scholar 

  9. Jaquart P, Dann D, Weinhardt C (2021) Short-term bitcoin market prediction via machine learning. J Finance Data Sci 7:45–66. https://doi.org/10.1016/j.jfds.2021.03.001

    Article  Google Scholar 

  10. Conneau A, Kiela D, Schwenk H, Barrault L, Bordes A (2017) Supervised learning of universal sentence representations from natural language inference data. In: Proceedings of the 2017 conference on empirical methods in natural language processing. Association for Computational Linguistics, Copenhagen, pp 670–680

    Google Scholar 

  11. Alonso-Robisco A, Carbó JM (2022) Can machine learning models save capital for banks? Evidence from a Spanish credit portfolio. Int Rev Financial Anal 84:102372. https://doi.org/10.1016/j.irfa.2022.102372

    Article  Google Scholar 

  12. Lagasio V, Pampurini F, Pezzola A, Quaranta AG (2022) Assessing bank default determinants via machine learning. Inform Sci 618:87–97. https://doi.org/10.1016/j.ins.2022.10.128

    Article  Google Scholar 

  13. Khandani AE, Kim AJ, Lo AW (2010) Consumer credit-risk models via machine-learning algorithms. J Bank Finance 34:2767–2787. https://doi.org/10.1016/j.jbankfin.2010.06.001

    Article  Google Scholar 

  14. Wang Y, Wang S, Lai KK (2005) A new fuzzy support vector machine to evaluate credit risk. IEEE Trans Fuzzy Syst 13:820–831. https://doi.org/10.1109/TFUZZ.2005.859320

    Article  Google Scholar 

  15. Mainelli M, Yeandle M (2006) Best execution compliance: new techniques for managing compliance risk. J Risk Finance 7:301–312. https://doi.org/10.1108/15265940610664979

    Article  Google Scholar 

  16. European Banking Federation (2020) AI in the banking industry: EBF position paper. https://www.ebf.eu/innovation-cybersecurity/ai-in-the-banking-industry-ebf-position-paper/

  17. Borkin D, Nemethova A, Michalconok G, Maiorov K (2019) Impact of data normalization on classification model accuracy. Res Papers Faculty Mater Sci Technol Slovak Univ Technol 27:79–84. https://doi.org/10.2478/rput-2019-0029

    Article  Google Scholar 

  18. Mohammed R, Rawashdeh J, Abdullah M (2020) Machine learning with oversampling and undersampling techniques: overview study and experimental results. In: Proceedings of the 2020 11th international conference on information and communication systems (ICICS), pp 243–248

    Google Scholar 

  19. Tharwat A (2020) Classification assessment methods. Appl Comput Inform 17:168–192. https://doi.org/10.1016/j.aci.2018.08.003

    Article  Google Scholar 

Download references

Acknowledgements

The authors would like to thank to “Vicerrectorado de Investigación” of the University of Cuenca, Ecuador, for the financial support given to the present research, development, and innovation work.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Lorena Siguenza-Guzman .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Flores-Siguenza, P., Espinoza-Saquicela, D., Moscoso-Martínez, M., Siguenza-Guzman, L. (2023). Applying Machine Learning Techniques to the Analysis and Prediction of Financial Data. In: Yang, XS., Sherratt, R.S., Dey, N., Joshi, A. (eds) Proceedings of Eighth International Congress on Information and Communication Technology. ICICT 2023. Lecture Notes in Networks and Systems, vol 694. Springer, Singapore. https://doi.org/10.1007/978-981-99-3091-3_69

Download citation

  • DOI: https://doi.org/10.1007/978-981-99-3091-3_69

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-99-3090-6

  • Online ISBN: 978-981-99-3091-3

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics