Abstract
Sophisticated tools for smart management and public services are crucial aspects of smart cities and especially affordable housing. In this context, a novel algorithm is introduced, which assists a user to identify locations for real estate investment. The methodology involves an application of data analytics for selection of top attributes of real estate for a user, and based on these attributes stacks of machine learning algorithms like decision trees, principal component analysis (PCA), and K-means clustering identify the location for investment. While data analytics comprising statistical modeling and machine learning techniques can compute the important attributes and thereby identify locations, it is nontrivial to get good insight at the scale of a large complex network consisting of hundreds of attributes and locations. This is mainly due to the underlying assumptions of i.i.d (independent and identically distributed) on random variables of many learning algorithms. Network science provides the necessary tools to analyze interactions and relations among entities in large networks considering the interdependencies of variables. In this chapter, a network created from the locations outputted by machine learning layers is described that utilizes network measures like eigen centrality that helps a user to determine the best location for investment, while providing deeper insight into the location identification problem. In addition, simulation of network dynamics provides the most influential and stable attribute of the designed real estate complex network, in the presence of the random link weight perturbations.
Real estate investment comprises many attributes that can be categorized into social, economic, governmental, and environmental. Of all these, only real estate factors are considered in this work. However, the same work can be extended to other factors as well.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Internet document—“Affordable Housing in India”, An Inclusive Approach to Sheltering the Bottom of the Pyramid
J.F. Schram, Real Estate Appraisal (Rockwell, Bellevue, 2006)
D.H. Carr, J. Lawson, J. Schultz, Dearborn Real Estate Education, Mastering Estate Appraisal (Dearborn Financial Publications, Chicago, 2003)
Y. Zhang, S. Liu, S. He, Z. Fang, Forecasting research on real-estate prices in Shanghai, in: 2009 International Conference on Grey Systems and Intelligent Services (GSIS 2009), Nanjing, 2009, pp. 625-629
W. Wei, T. Guang-ji, Z. Hong-rui, Empirical analysis on the housing price in Harbin City based on hedonic model, in: 2010 International conference on Management Science and Engineering 17th Annual Conference Proceeding, Melbourne, VIC, 2010, pp. 1659–1664
B. park, J.K. Bae, Using machine learning algorithms for housing price prediction: “The case of Fairfax County”, Virginia housing data. Expert Syst. Appl. 42(6), 2928–2934 (2015). ISSN: 0957-4174
H. Xue, The prediction on residential real estate price based on BPNN, in: 2015 8th International Conference on Intelligent Computation Technology and Automation (ICICTA), Nanchang, 2015, pp. 1008–1013
B. Liu, B. Mavrin, D. Niu, L. Kong, House price modeling over heterogeneous regions with hierarchical spatial functional analysis, in: 2016 IEEE 16th International Conference on Data Mining (ICDM), Barcelona, 2016, pp. 1047–1052
C. Cheng, X. Cheng, M. Yuan, K. Chao, S. Zhou, J. Gao, L. Xu, T. Zhang, A novel architecture and machine learning algorithm for real estate. Signal Inf. Process. Netw. Comput. 473, 491–499 (2017). Springer, Singapore. Lecture Notes in Electrical Engineering
Kecheng Zhao, Wei Shen, Spatial characteristic with individual house properties and multilevel approach to hedonic models, in: 2011 International Conference on Computer Science and Service System (CSSS), Nanjing, 2011, pp. 2579–2582
T. Oladunni, S. Sharma, Hedonic housing theory—a machine learning investigation, in: 2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA), Anaheim, CA, 2016, pp. 522–527. doi: https://doi.org/10.1109/ICMLA.2016.0092
B. Park, J.K. Bae, Using machine learning algorithms for housing price prediction. Expert Syst. Appl. 42, 2928–2934 (2015)
I.D. Wilson, S.D. Paris, J.A. Ware, D.H. Jenkins, Residential property price time series forecasting with neural networks, in: The Twenty-First SGES International Conference on Knowledge Based Systems and Applied Artificial Intelligence, Cambridge, December 2001, pp. 17–28, Springer Publications
H. Xu, A. Gade, Smart real estate assessments using structured deep neural networks, in: 2017 IEEE SmartWorld, Ubiquitous Intelligence & Computing, Advanced & Trusted Computed, Scalable Computing & Communications, Cloud & Big Data Computing, Internet of People and Smart City Innovation (SmartWorld/SCALCOM/UIC/ATC/CBDCom/IOP/SCI), San Francisco, CA, 2017, pp. 1-7
S. Lu, Z. Li, Z. Qin, X. Yang, R.S.M. Goh, A hybrid regression technique for house prices prediction, in: 2017 IEEE International Conference on Industrial Engineering and Engineering Management (IEEM), Singapore, 2017, pp. 319–323
D. Sangani, K. Erickson, M. A. Hasan, Predicting zillow estimation error using linear regression and gradient boosting, in: 2017 IEEE 14th International Conference on Mobile Ad Hoc and Sensor Systems (MASS), Orlando, FL, 2017, pp. 530–534
W.T. Lim, L. Wang, Y. Wang, Q. Chang, Housing price prediction using neural networks, in: 2016 12th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNC-FSKD), Changsha, 2016, pp. 518–522
J. Demongeot, H. Pempelfort, J.M. Martinez, R. Vallejos, M. Barria, C. Taramasco, Information design of biological networks: application to genetic, immunologic, metabolic and social networks, in: 2013 27th International Conference on Advanced Information Networking and Applications Workshops, Barcelona, 2013, pp. 1533–1540
D.P. Cheung, M.H. Gunes, A complex network analysis of the United States air transportation, in: 2012 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, Istanbul, 2012, pp. 699–701
Q. Xuan, Z.Y. Zhang, C. Fu, H.X. Hu, V. Filkov, Social synchrony on complex networks. IEEE Trans. Cybernetics 48(5), 1420–1431 (2018)
ESRI—Real estate website, https://www.esri.com/en-us/industries/real-estate/overview
Black stone, https://www.blackstone.com/the-firm/asset-management/real-estate
J. Wang, Pearson correlation coefficient, in Encyclopedia of Systems Biology, ed. by W. Dubitzky, O. Wolkenhauer, K. H. Cho, H. Yokota, (Springer, New York, NY, 2013)
The data for our work was taken from: www.terrafly.com/
G. Skourletopoulos et al., Big data and cloud computing: a survey of the state-of-the-art and research challenges, in Advances in Mobile Cloud Computing and Big Data in the 5G Era. Studies in Big Data, ed. by C. Mavromoustakis, G. Mastorakis, C. Dobre, vol. 22, (Springer, Cham, 2017)
Alan Said, Data Science in Practice (Springer Publications, 2019)
E. Hart, Biological networks, in Encyclopedia of Astrobiology, ed. by R. Amils et al., (Springer, Berlin, Heidelberg, 2014)
V. Latora, V. Nicosia, G. Russo, Complex Networks: Principles, Methods and Applications (Cambridge University Press, Cambridge, UK, 2017)
S. Han, Y. Ko, S. Kim, D.H. Shin, Home sales index prediction model based on cluster and principal component statistical approaches in a big data analytic concept. KSCE J. Civil Eng. 21(1), 67–75 (2017). Springer publications
T. Oladunni, S. Sharma, Spatial dependency and hedonic housing regression model, in: 2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA), Anaheim, CA, 2016, pp. 553–558
V. Del Giudice, P. De Paola, G.B. Cantisani, Valuation of real estate investments through fuzzy logic. Buildings 7, 26 (2017)
C. Bagnoli, H.C. Smith, The theory of fuzzy logic and its application to real estate valuation. J. Real Estate Res. 16(2), 169–200 (1998). American Real Estate Society
W. Didimo, G. Liotta, F. Montecchiani, Network visualization for financial crime detection. J. Vis. Lang. Comput. 25(4), 433–451 (2014). https://doi.org/10.1016/j.jvlc.2014.01.002
S. Perera, M.G.H. Bell, M.C.J. Bliemer, Network science approach to modelling the topology and robustness of supply chain networks: a review and perspective. Appl. Netw. Sci. 2, 2–35 (2017). https://doi.org/10.1007/s41109-017-0053-0
Z. Wang, J. Han, Visualization of the UK stock market based on complex networks for company’s revenue forecast, in Information and Knowledge Management in Complex Systems. ICISO 2015. IFIP Advances in Information and Communication Technology, ed. by K. Liu, K. Nakata, W. Li, D. Galarreta, vol. 449, (Springer, Cham, 2015)
Realdata, https://www.realdata.com/
CREmodel, https://www.cremodel.com/
Proapod, http://www.proapod.com/
Y. Dong, Value ranges of Spearman’s Rho and Kendall’s Tau of a class of copulas, in: 2010 International Conference on Computational and Information Sciences, Chengdu, 2010, pp. 182-185. doi: https://doi.org/10.1109/ICCIS.2010.335
S.E. Kumar, V. Talasila, N. Rishe, T.V.S. Kumar, S.S. Iyengar, Location identification for real estate investment using data analytics. Int. J. Data Sci. Analytics, 1–25 (2019)
M.J. Moshkov, Time complexity of decision trees, in Transactions on Rough Sets III, ed. by J. F. Peters, A. Skowron, (Springer, Berlin, 2005), pp. 244–459
D. Hu, Q. Liu, Q. Yan, Decision tree merging branches algorithm based on equal predictability, in: 2009 International Conference on Artificial Intelligence and Computational Intelligence, Shanghai, 2009, pp. 214–218
O.Z. Maimon, R. Lior, Data Mining with Decision Trees: Theory and Applications, 2nd edn. (World Scientific, Singapore, 2015)
S. Sehgal, H. Singh, M. Agarwal, V. Bhasker and Shantanu, Data analysis using principal component analysis, in: 2014 International Conference on Medical Imaging, m-Health and Emerging Communication Systems (MedCom), Greater Noida, 2014, pp. 45–48
G.A. Wilkin, X. Huang, K-means clustering algorithms: implementation and comparison, in: Second International Multi-Symposiums on Computer and Computational Sciences (IMSCCS 2007), Iowa City, IA, 2007, pp. 133–136
I. Scholtes, Understanding complex systems: when big data meets network science. IT—Information Technology 57(4), 252–256 (2015). https://doi.org/10.1515/itit-2015-0012
A. Bihari, M. K. Pandia, Eigenvector centrality and its application in research professionals’ relationship network, in: 2015 International Conference on Futuristic Trends on Computational Analysis and Knowledge Management (ABLAZE), Noida, 2015, pp. 510–514
F. Grando, D. Noble, L. C. Lamb, An analysis of centrality measures for complex and social networks, in: 2016 IEEE Global Communications Conference (GLOBECOM), Washington, DC, 2016, pp. 1–6
Acknowledgements
Authors would like to thank Dr. Naphtali Rishe and Dr. S.S. Iyengar of School of Computing and Information Sciences, Florida International University, Miami, Florida, for providing the database and valuable suggestions throughout this work.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Sandeep Kumar, E., Talasila, V. (2020). A Combined Data Analytics and Network Science Approach for Smart Real Estate Investment: Towards Affordable Housing. In: Lopes, N. (eds) Smart Governance for Cities: Perspectives and Experiences. EAI/Springer Innovations in Communication and Computing. Springer, Cham. https://doi.org/10.1007/978-3-030-22070-9_8
Download citation
DOI: https://doi.org/10.1007/978-3-030-22070-9_8
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-22069-3
Online ISBN: 978-3-030-22070-9
eBook Packages: EngineeringEngineering (R0)