Abstract
The increasing need for data trading across businesses nowadays has created a demand for data marketplaces. However, despite the intentions of both data providers and consumers, today’s data marketplaces remain mere data catalogs. We believe that marketplaces of the future require a set of value-added services, such as advanced search and discovery, that have been proposed in the database research community for years, but are not yet put to practice. With this paper, we report on the effort to engineer and develop an open-source modular data market platform to enable both entrepreneurs and researchers to setup and experiment with data marketplaces. To this end, we implemented and extended existing methods for data profiling, dataset search & discovery, and data recommendation. These methods are available as open-source libraries. In this paper we report on how those tools were assembled together to build topio.market, a real-world web platform for trading geospatial data, that is currently in a beta phase.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
- 3.
- 4.
- 5.
- 6.
- 7.
- 8.
- 9.
- 10.
- 11.
- 12.
- 13.
References
Agarwal, A., Dahleh, M., Sarkar, T.: A marketplace for data: an algorithmic solution. In: EC 2019, pp. 701–726 (2019)
Ali, M., et al.: PyKEEN 1.0: a Python library for training and evaluating knowledge graph embeddings. J. Mach. Learn. Res. 22, 82:1–82:6 (2021)
Azcoitia, S.A., Laoutaris, N.: A survey of data marketplaces and their business models. SIGMOD Rec. 51(3), 18–29 (2022)
Batcheller, J.K., Reitsma, F.: Implementing feature level semantics for spatial data discovery: supporting the reuse of legacy data using open source components. Comput. Environ. Urban Syst. 34(4), 333–344 (2010)
Breddels, M.A., Veljanoski, J.: Vaex: big data exploration in the era of Gaia. Astron. Astrophys. 618, A13 (2018)
Chapman, A., et al.: Dataset search: a survey. VLDB J. 29(1), 251–272 (2020)
Chawla, S., Deep, S., Koutrisw, P., Teng, Y.: Revenue maximization for query pricing. Proc. VLDB Endow. 13(1), 1–14 (2019)
Chen, L., Koutris, P., Kumar, A.: Towards model-based pricing for machine learning in a data marketplace. In: SIGMOD, pp. 1535–1552 (2019)
Chu, Y., Yao, J., Zhou, C., Yang, H.: Graph Neural Networks in Modern Recommender Systems. Springer, Singapore (2022)
Fernandez, R.C., Subramaniam, P., Franklin, M.J.: Data market platforms: trading data assets to solve data problems. Proc. VLDB Endow. 13(12), 1933–1947 (2020)
Harrower, M., Bloch, M.: Mapshaper.org: a map generalization web service. IEEE Comput. Graph. Appl. 26(4), 22–27 (2006)
Hayashi, T., Ohsawa, Y.: TEEDA: an interactive platform for matching data providers and users in the data marketplace. Information 11(4), 218 (2020)
Huang, Y., Shekhar, S., Xiong, H.: Discovering colocation patterns from spatial data sets: a general approach. IEEE TKDE 16(12), 1472–1485 (2004)
Ionescu, A., et al.: Topio marketplace: search and discovery of geospatial data. In: EDBT (2023)
Ionescu, A., Hai, R., Fragkoulis, M., Katsifodimos, A.: Join path-based data augmentation for decision trees. In: IEEE ICDEW, pp. 84–88. IEEE (2022)
Koutras, C., et al.: Valentine: evaluating matching techniques for dataset discovery. In: IEEE ICDE, pp. 468–479. IEEE (2021)
Koutris, P., Upadhyaya, P., Balazinska, M., Howe, B., Suciu, D.: Query-based data pricing. J. ACM (JACM) 62(5), 1–44 (2015)
Kristiadi, A., Khan, M.A., Lukovnikov, D., Lehmann, J., Fischer, A.: Incorporating literals into knowledge graph embeddings. In: Ghidini, C., et al. (eds.) ISWC 2019. LNCS, vol. 11778, pp. 347–363. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-30793-6_20
Lacasta, J., Nogueras-Iso, J., Béjar, R., Muro-Medrano, P.R., Zarazaga-Soria, F.J.: A web ontology service to facilitate interoperability within a spatial data infrastructure: applicability to discovery. Data Knowl. Eng. 63(3), 947–971 (2007)
Li, Y., Yu, X., Koudas, N.: Data acquisition for improving machine learning models. Proc. VLDB Endow. 14(10), 1832–1844 (2021)
Liang, F., Yu, W., An, D., Yang, Q., Fu, X., Zhao, W.: A survey on big data market: pricing, trading and protection. IEEE Access 6, 15132–15154 (2018)
Miller, R.J., Nargesian, F., Zhu, E., Christodoulakis, C., Pu, K.Q., Andritsos, P.: Making open data transparent: data discovery on open data. IEEE Data Eng. Bull. 41(2), 59–70 (2018)
Mitropoulos, P., Patroumpas, K., Skoutas, D., Vakkas, T., Athanasiou, S.: BigDataVoyant: automated profiling of large geospatial data. In: EDBT/ICDT Workshops (2021)
Mucha, T., Seppala, T.: Artificial intelligence platforms–a new research agenda for digital platform economy (2020)
Nargesian, F., Zhu, E., Miller, R.J., Pu, K.Q., Arocena, P.C.: Data lake management: challenges and opportunities. Proc. VLDB Endow. 12(12), 1986–1989 (2019)
Naumann, F.: Data profiling revisited. ACM SIGMOD Rec. 42(4), 40–49 (2014)
Niculescu, M.F., Wu, D., Xu, L.: Strategic intellectual property sharing: competition on an open technology platform under network effects. Inf. Syst. Res. 29(2), 498–519 (2018)
de Reuver, M., Ofe, H., Agahari, W., Abbas, A.E., Zuiderwijk, A.: The openness of data platforms: a research agenda. In: Proceedings of the 1st International Workshop on Data Economy, pp. 34–41 (2022)
Sun, Z., Deng, Z., Nie, J., Tang, J.: Rotate: Knowledge graph embedding by relational rotation in complex space. In: 7th International Conference on Learning Representations, ICLR 2019, New Orleans, LA, USA, 6–9 May 2019 (2019)
Wang, Z., Zhang, J., Feng, J., Chen, Z.: Knowledge graph embedding by translating on hyperplanes. In: AAAI, pp. 1112–1119. AAAI Press (2014)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Ionescu, A. et al. (2023). Topio: An Open-Source Web Platform for Trading Geospatial Data. In: Garrigós, I., Murillo Rodríguez, J.M., Wimmer, M. (eds) Web Engineering. ICWE 2023. Lecture Notes in Computer Science, vol 13893. Springer, Cham. https://doi.org/10.1007/978-3-031-34444-2_25
Download citation
DOI: https://doi.org/10.1007/978-3-031-34444-2_25
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-34443-5
Online ISBN: 978-3-031-34444-2
eBook Packages: Computer ScienceComputer Science (R0)