Abstract
This paper presents a fast approximation method for Multidimensional Scaling (MDS)-based dimensionality reduction on large cartography datasets. Since MDS preserves data point distances, it is useful in application domains where geolocation data are critical. Typical relevant tasks include smartphone user behavioral pattern extraction, animal motion tracking over long distances, or distributed sensor data monitoring. The input to MDS is a data distance matrix employed for reducing data point dimensionality under distance constraints. Similar procedures are crucial for analyzing and revealing the original hidden data structure, as well as for data visualization, feature extraction, or compression. For N data points, MDS has a computational complexity that exceeds \(\varvec{O(N^{2})}\) which, may be excessive for a large \(\varvec{N}\), e.g., for several hundred thousands or millions of data points. The proposed method allows fast approximate MDS calculation on million-point datasets in less than a minute on a simple laptop, by sampling a small subset of the original dataset, performing regular MDS on it and training a neural regressor to learn the desired MDS mapping. Quantitative and qualitative empirical evaluation of the proposed fast MLP-MDS algorithm on a geospatial data mapping task, i.e., on reducing 3D Earth surface points (longitude, latitude, radius) to 2D maps, has resulted in promising findings and small approximation errors. The benefits are even greater in incremental settings, where new data points are obtained and projected over time. Unlike regular MDS or competing approximations, this is trivially supported in MLP-MDS due to the latter’s model-based nature.
Similar content being viewed by others
Data Availability
The datasets generated during and/or analysed during the current study are available from the corresponding author on reasonable request.
References
Borg I, Groenen PJF (2005) Modern multidimensional scaling: Theory and applications. Springer Science & Business Media.
Brummelen GV (2012) Heavenly mathematics: the forgotten art of spherical trigonometry. Princeton University Press
Capitani C, Mukama K, Mbilinyi B, Malugu IO, Munishi PKT, Burgess ND, Platts PJ, Sallu SM, Marchant R (2016) From local scenarios to national maps: a participatory framework for envisioning the future of Tanzania. Ecol Soc 21(3)
Choi JY, Bae S-H, Qiu X, Fox G (2010) High performance dimension reduction and visualization for large high-dimensional data analysis. In: Proceedings of the EEE/ACM international conference on cluster, cloud and grid computing
De Silva V, Tenenbaum JB (2004) Sparse multidimensional scaling using landmark points. Stanford University. Technical report
Espadoto M, Hirata NST, Telea AC (2020) Deep learning multidimensional projections. Inf Vis 19:247–269
Hinton GE, Salakhutdinov RR (2006) Reducing the dimensionality of data with neural networks. Science 313:504–507
Joia P, Coimbra D, Cuminato JA, Paulovich FV, Nonato LG (2011) Local affine multidimensional projection. IEEE Trans Visual Comput Graphics 17:2563–2571
Kakaletsis E, Symeonidis C, Tzelepi M, Mademlis I, Tefas A, Nikolaidis N, Pitas I (2021a) Computer vision for autonomous UAV flight safety: An overview and a vision-based safe landing pipeline example. ACM Comput Surv (CSUR) 54(9):1–37
Kakaletsis E, Mademlis I, Nikolaidis N, Pitas I (2021b) Multiview vision-based human crowd localization for UAV fleet flight safety. Sig Process Image Commun 99:116–484
Kakaletsis E, Mademlis I, Nikolaidis N, Pitas I (2021c) Bayesian fusion of multiview human crowd detections for autonomous UAV fleet safety. In: Proceedings of the european signal processing conference (EUSIPCO). IEEE
Kim E, Lee S, Kim C, Kim K (2010) Mobile beacon-based 3D-localization with multidimensional scaling in large sensor networks. IEEE Commun Lett 14:647–649
Kingma D, Ba J (2015) Adam: A method for stochastic optimization. In: Proceedings of the international conference on learning representations (ICLR)
Mahabir R, Croitoru A, Crooks AT, Agouris P, Stefanidis A (2018) A critical review of high and very high-resolution remote sensing approaches for detecting and mapping slums: Trends, challenges and emerging opportunities. Urban Sci 2(1):8
Nusrat S, Alam MJ, Kobourov S (2016) Evaluating cartogram effectiveness. IEEE Trans Visual Comput Graph 24(2):1077–1090
Passalis N, Tefas A (2018) PySEF: A python library for similarity-based dimensionality reduction. Knowl-Based Syst 152:186–187
Paulovich FV, Nonato LG, Minghim R, Levkowitz H (2008) Least square projection: A fast high-precision multidimensional projection technique and its application to document mapping. IEEE Trans Visual Comput Graphics 14:564–575
Platt J (2005) FastMap, MetricMap, and Landmark MDS are all Nystrom algorithms. In: Proceedings of the international conference on artificial intelligence and statistics (AISTATS)
Pohlheim H (2006) Multidimensional scaling for evolutionary algorithms: visualization of the path through search space and solution space using Sammon Mapping. Artif Life 12:203–209
Pressley A (2010)Gauss’ theorema egregium. In: Elementary differential geometry. Springer, pp 247–268
Sammon JW (1969) A nonlinear mapping for data structure analysis. IEEE Trans Comput 100:401–409
Spathis D, Passalis N, Tefas A (2019) Interactive dimensionality reduction using similarity projections. Knowl-Based Syst 165:77–91
Spathis D, Passalis N, Tefas A (2018) Fast, visual and interactive semi-supervised dimensionality reduction. In: Proceedings of the European conference on computer vision (ECCV) workshops
Symeonidis C, Kakaletsis E, Mademlis I, Nikolaidis N, Tefas A, Pitas I (2021) Vision-based UAV safe landing exploiting lightweight Deep Neural Networks. In: Proceedings of the international conference on image and graphics processing (ICIGP)
Xu X, Ester M, Kriegel H-P, Sander J (1998) A distribution-based clustering algorithm for mining in large spatial databases. In: Proceedings of the international conference on data engineering
Yang L (2006) Locally multidimensional scaling for nonlinear dimensionality reduction. In: Proceedings of the international conference on pattern recognition (ICPR)
Acknowledgements
The research leading to these results has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 951911 (AI4Media).
Funding
This study was funded by European Union’s Horizon 2020 research and innovation programme under grant agreement No 951911 (AI4Media).
Author information
Authors and Affiliations
Contributions
IM and IP developed the algorithm and designed the evaluation process. IM wrote the manuscript and submitted the paper. GV implemented the code, ran the experiments and prepared the figures. All authors reviewed and approved the manuscript.
Corresponding author
Ethics declarations
Informed consent
The research presented in this paper did not involve human or animal participants.
Competing interests
The authors declare they have no financial or non-financial interests.
Additional information
Communicated by: H. Babaie.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Mademlis, I., Voulgaris, G. & Pitas, I. Fast multidimensional scaling on big geospatial data using neural networks. Earth Sci Inform 16, 2241–2249 (2023). https://doi.org/10.1007/s12145-023-01004-9
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12145-023-01004-9