Skip to main content

A Combined Data Analytics and Network Science Approach for Smart Real Estate Investment: Towards Affordable Housing

  • Chapter
  • First Online:
Smart Governance for Cities: Perspectives and Experiences

Abstract

Sophisticated tools for smart management and public services are crucial aspects of smart cities and especially affordable housing. In this context, a novel algorithm is introduced, which assists a user to identify locations for real estate investment. The methodology involves an application of data analytics for selection of top attributes of real estate for a user, and based on these attributes stacks of machine learning algorithms like decision trees, principal component analysis (PCA), and K-means clustering identify the location for investment. While data analytics comprising statistical modeling and machine learning techniques can compute the important attributes and thereby identify locations, it is nontrivial to get good insight at the scale of a large complex network consisting of hundreds of attributes and locations. This is mainly due to the underlying assumptions of i.i.d (independent and identically distributed) on random variables of many learning algorithms. Network science provides the necessary tools to analyze interactions and relations among entities in large networks considering the interdependencies of variables. In this chapter, a network created from the locations outputted by machine learning layers is described that utilizes network measures like eigen centrality that helps a user to determine the best location for investment, while providing deeper insight into the location identification problem. In addition, simulation of network dynamics provides the most influential and stable attribute of the designed real estate complex network, in the presence of the random link weight perturbations.

Real estate investment comprises many attributes that can be categorized into social, economic, governmental, and environmental. Of all these, only real estate factors are considered in this work. However, the same work can be extended to other factors as well.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 79.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 99.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 129.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Internet document—“Affordable Housing in India”, An Inclusive Approach to Sheltering the Bottom of the Pyramid

    Google Scholar 

  2. J.F. Schram, Real Estate Appraisal (Rockwell, Bellevue, 2006)

    Google Scholar 

  3. D.H. Carr, J. Lawson, J. Schultz, Dearborn Real Estate Education, Mastering Estate Appraisal (Dearborn Financial Publications, Chicago, 2003)

    Google Scholar 

  4. Y. Zhang, S. Liu, S. He, Z. Fang, Forecasting research on real-estate prices in Shanghai, in: 2009 International Conference on Grey Systems and Intelligent Services (GSIS 2009), Nanjing, 2009, pp. 625-629

    Google Scholar 

  5. W. Wei, T. Guang-ji, Z. Hong-rui, Empirical analysis on the housing price in Harbin City based on hedonic model, in: 2010 International conference on Management Science and Engineering 17th Annual Conference Proceeding, Melbourne, VIC, 2010, pp. 1659–1664

    Google Scholar 

  6. B. park, J.K. Bae, Using machine learning algorithms for housing price prediction: “The case of Fairfax County”, Virginia housing data. Expert Syst. Appl. 42(6), 2928–2934 (2015). ISSN: 0957-4174

    Article  Google Scholar 

  7. H. Xue, The prediction on residential real estate price based on BPNN, in: 2015 8th International Conference on Intelligent Computation Technology and Automation (ICICTA), Nanchang, 2015, pp. 1008–1013

    Google Scholar 

  8. B. Liu, B. Mavrin, D. Niu, L. Kong, House price modeling over heterogeneous regions with hierarchical spatial functional analysis, in: 2016 IEEE 16th International Conference on Data Mining (ICDM), Barcelona, 2016, pp. 1047–1052

    Google Scholar 

  9. C. Cheng, X. Cheng, M. Yuan, K. Chao, S. Zhou, J. Gao, L. Xu, T. Zhang, A novel architecture and machine learning algorithm for real estate. Signal Inf. Process. Netw. Comput. 473, 491–499 (2017). Springer, Singapore. Lecture Notes in Electrical Engineering

    Google Scholar 

  10. Kecheng Zhao, Wei Shen, Spatial characteristic with individual house properties and multilevel approach to hedonic models, in: 2011 International Conference on Computer Science and Service System (CSSS), Nanjing, 2011, pp. 2579–2582

    Google Scholar 

  11. T. Oladunni, S. Sharma, Hedonic housing theory—a machine learning investigation, in: 2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA), Anaheim, CA, 2016, pp. 522–527. doi: https://doi.org/10.1109/ICMLA.2016.0092

  12. B. Park, J.K. Bae, Using machine learning algorithms for housing price prediction. Expert Syst. Appl. 42, 2928–2934 (2015)

    Article  Google Scholar 

  13. I.D. Wilson, S.D. Paris, J.A. Ware, D.H. Jenkins, Residential property price time series forecasting with neural networks, in: The Twenty-First SGES International Conference on Knowledge Based Systems and Applied Artificial Intelligence, Cambridge, December 2001, pp. 17–28, Springer Publications

    Google Scholar 

  14. H. Xu, A. Gade, Smart real estate assessments using structured deep neural networks, in: 2017 IEEE SmartWorld, Ubiquitous Intelligence & Computing, Advanced & Trusted Computed, Scalable Computing & Communications, Cloud & Big Data Computing, Internet of People and Smart City Innovation (SmartWorld/SCALCOM/UIC/ATC/CBDCom/IOP/SCI), San Francisco, CA, 2017, pp. 1-7

    Google Scholar 

  15. S. Lu, Z. Li, Z. Qin, X. Yang, R.S.M. Goh, A hybrid regression technique for house prices prediction, in: 2017 IEEE International Conference on Industrial Engineering and Engineering Management (IEEM), Singapore, 2017, pp. 319–323

    Google Scholar 

  16. D. Sangani, K. Erickson, M. A. Hasan, Predicting zillow estimation error using linear regression and gradient boosting, in: 2017 IEEE 14th International Conference on Mobile Ad Hoc and Sensor Systems (MASS), Orlando, FL, 2017, pp. 530–534

    Google Scholar 

  17. W.T. Lim, L. Wang, Y. Wang, Q. Chang, Housing price prediction using neural networks, in: 2016 12th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNC-FSKD), Changsha, 2016, pp. 518–522

    Google Scholar 

  18. J. Demongeot, H. Pempelfort, J.M. Martinez, R. Vallejos, M. Barria, C. Taramasco, Information design of biological networks: application to genetic, immunologic, metabolic and social networks, in: 2013 27th International Conference on Advanced Information Networking and Applications Workshops, Barcelona, 2013, pp. 1533–1540

    Google Scholar 

  19. D.P. Cheung, M.H. Gunes, A complex network analysis of the United States air transportation, in: 2012 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, Istanbul, 2012, pp. 699–701

    Google Scholar 

  20. Q. Xuan, Z.Y. Zhang, C. Fu, H.X. Hu, V. Filkov, Social synchrony on complex networks. IEEE Trans. Cybernetics 48(5), 1420–1431 (2018)

    Article  Google Scholar 

  21. ESRI—Real estate website, https://www.esri.com/en-us/industries/real-estate/overview

  22. Black stone, https://www.blackstone.com/the-firm/asset-management/real-estate

  23. J. Wang, Pearson correlation coefficient, in Encyclopedia of Systems Biology, ed. by W. Dubitzky, O. Wolkenhauer, K. H. Cho, H. Yokota, (Springer, New York, NY, 2013)

    Google Scholar 

  24. The data for our work was taken from: www.terrafly.com/

  25. G. Skourletopoulos et al., Big data and cloud computing: a survey of the state-of-the-art and research challenges, in Advances in Mobile Cloud Computing and Big Data in the 5G Era. Studies in Big Data, ed. by C. Mavromoustakis, G. Mastorakis, C. Dobre, vol. 22, (Springer, Cham, 2017)

    Google Scholar 

  26. Alan Said, Data Science in Practice (Springer Publications, 2019)

    Google Scholar 

  27. E. Hart, Biological networks, in Encyclopedia of Astrobiology, ed. by R. Amils et al., (Springer, Berlin, Heidelberg, 2014)

    Google Scholar 

  28. V. Latora, V. Nicosia, G. Russo, Complex Networks: Principles, Methods and Applications (Cambridge University Press, Cambridge, UK, 2017)

    Book  Google Scholar 

  29. S. Han, Y. Ko, S. Kim, D.H. Shin, Home sales index prediction model based on cluster and principal component statistical approaches in a big data analytic concept. KSCE J. Civil Eng. 21(1), 67–75 (2017). Springer publications

    Article  Google Scholar 

  30. T. Oladunni, S. Sharma, Spatial dependency and hedonic housing regression model, in: 2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA), Anaheim, CA, 2016, pp. 553–558

    Google Scholar 

  31. V. Del Giudice, P. De Paola, G.B. Cantisani, Valuation of real estate investments through fuzzy logic. Buildings 7, 26 (2017)

    Article  Google Scholar 

  32. C. Bagnoli, H.C. Smith, The theory of fuzzy logic and its application to real estate valuation. J. Real Estate Res. 16(2), 169–200 (1998). American Real Estate Society

    Google Scholar 

  33. W. Didimo, G. Liotta, F. Montecchiani, Network visualization for financial crime detection. J. Vis. Lang. Comput. 25(4), 433–451 (2014). https://doi.org/10.1016/j.jvlc.2014.01.002

    Article  Google Scholar 

  34. S. Perera, M.G.H. Bell, M.C.J. Bliemer, Network science approach to modelling the topology and robustness of supply chain networks: a review and perspective. Appl. Netw. Sci. 2, 2–35 (2017). https://doi.org/10.1007/s41109-017-0053-0

    Article  Google Scholar 

  35. Z. Wang, J. Han, Visualization of the UK stock market based on complex networks for company’s revenue forecast, in Information and Knowledge Management in Complex Systems. ICISO 2015. IFIP Advances in Information and Communication Technology, ed. by K. Liu, K. Nakata, W. Li, D. Galarreta, vol. 449, (Springer, Cham, 2015)

    Google Scholar 

  36. Realdata, https://www.realdata.com/

  37. CREmodel, https://www.cremodel.com/

  38. Proapod, http://www.proapod.com/

  39. Y. Dong, Value ranges of Spearman’s Rho and Kendall’s Tau of a class of copulas, in: 2010 International Conference on Computational and Information Sciences, Chengdu, 2010, pp. 182-185. doi: https://doi.org/10.1109/ICCIS.2010.335

  40. S.E. Kumar, V. Talasila, N. Rishe, T.V.S. Kumar, S.S. Iyengar, Location identification for real estate investment using data analytics. Int. J. Data Sci. Analytics, 1–25 (2019)

    Google Scholar 

  41. M.J. Moshkov, Time complexity of decision trees, in Transactions on Rough Sets III, ed. by J. F. Peters, A. Skowron, (Springer, Berlin, 2005), pp. 244–459

    Chapter  Google Scholar 

  42. D. Hu, Q. Liu, Q. Yan, Decision tree merging branches algorithm based on equal predictability, in: 2009 International Conference on Artificial Intelligence and Computational Intelligence, Shanghai, 2009, pp. 214–218

    Google Scholar 

  43. O.Z. Maimon, R. Lior, Data Mining with Decision Trees: Theory and Applications, 2nd edn. (World Scientific, Singapore, 2015)

    MATH  Google Scholar 

  44. S. Sehgal, H. Singh, M. Agarwal, V. Bhasker and Shantanu, Data analysis using principal component analysis, in: 2014 International Conference on Medical Imaging, m-Health and Emerging Communication Systems (MedCom), Greater Noida, 2014, pp. 45–48

    Google Scholar 

  45. G.A. Wilkin, X. Huang, K-means clustering algorithms: implementation and comparison, in: Second International Multi-Symposiums on Computer and Computational Sciences (IMSCCS 2007), Iowa City, IA, 2007, pp. 133–136

    Google Scholar 

  46. I. Scholtes, Understanding complex systems: when big data meets network science. IT—Information Technology 57(4), 252–256 (2015). https://doi.org/10.1515/itit-2015-0012

    Article  Google Scholar 

  47. A. Bihari, M. K. Pandia, Eigenvector centrality and its application in research professionals’ relationship network, in: 2015 International Conference on Futuristic Trends on Computational Analysis and Knowledge Management (ABLAZE), Noida, 2015, pp. 510–514

    Google Scholar 

  48. F. Grando, D. Noble, L. C. Lamb, An analysis of centrality measures for complex and social networks, in: 2016 IEEE Global Communications Conference (GLOBECOM), Washington, DC, 2016, pp. 1–6

    Google Scholar 

Download references

Acknowledgements

Authors would like to thank Dr. Naphtali Rishe and Dr. S.S. Iyengar of School of Computing and Information Sciences, Florida International University, Miami, Florida, for providing the database and valuable suggestions throughout this work.

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Sandeep Kumar, E., Talasila, V. (2020). A Combined Data Analytics and Network Science Approach for Smart Real Estate Investment: Towards Affordable Housing. In: Lopes, N. (eds) Smart Governance for Cities: Perspectives and Experiences. EAI/Springer Innovations in Communication and Computing. Springer, Cham. https://doi.org/10.1007/978-3-030-22070-9_8

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-22070-9_8

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-22069-3

  • Online ISBN: 978-3-030-22070-9

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics