Analytics over Probabilistic Unmerged Duplicates

Ioannou, Ekaterini; Garofalakis, Minos

doi:10.1007/978-3-319-11508-5_17

Analytics over Probabilistic Unmerged Duplicates

Ekaterini Ioannou²¹ &
Minos Garofalakis²¹

Conference paper

493 Accesses
1 Citations

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8720))

Abstract

This paper introduces probabilistic databases with unmerged duplicates (DB^ud), i.e., databases containing probabilistic information about instances found to describe the same real-world objects. We discuss the need for efficiently querying such databases and for supporting practical query scenarios that require analytical or summarized information. We also sketch possible methodologies and techniques that would allow performing efficient processing of queries over such probabilistic databases, and especially without the need to materialize the (potentially, huge) collection of all possible deduplication worlds.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Andritsos, P., Fuxman, A., Miller, R.: Clean answers over dirty databases: A probabilistic approach. In: ICDE (2006)
Google Scholar
Dalvi, N., Suciu, D.: Efficient query evaluation on probabilistic databases. VLDB 16(4) (2007)
Google Scholar
Dylla, M., Miliaraki, I., Theobald, M.: Top-k query processing in probabilistic databases with non-materialized views. In: ICDE (2013)
Google Scholar
Elmagarmid, A., Ipeirotis, P., Verykios, V.: Duplicate record detection: A survey. TKDE 19(1) (2007)
Google Scholar
Fink, R., Han, L., Olteanu, D.: Aggregation in probabilistic databases via knowledge compilation. PVLDB 5(5) (2012)
Google Scholar
Ioannou, E., Nejdl, W., Niederée, C., Velegrakis, Y.: On-the-fly entity-aware query processing in the presence of linkage. PVLDB 3(1) (2010)
Google Scholar
Olteanu, D., Wen, H.: Ranking query answers in probabilistic databases: Complexity and efficient algorithms. In: ICDE (2012)
Google Scholar
Ré, C., Dalvi, N., Suciu, D.: Efficient top-k query evaluation on probabilistic data. In: ICDE (2007)
Google Scholar
Sismanis, Y., Wang, L., Fuxman, A., Haas, P., Reinwald, B.: Resolution-aware query answering for business intelligence. In: ICDE (2009)
Google Scholar
Soliman, M., Ilyas, I., Chang, K.: Top-k query processing in uncertain databases. In: ICDE (2007)
Google Scholar
Wick, M., Rohanimanesh, K., Schultz, K., McCallum, A.: A unified approach for schema matching, coreference and canonicalization. In: KDD (2008)
Google Scholar

Download references

Author information

Authors and Affiliations

Technical University of Crete, Chania, Greece
Ekaterini Ioannou & Minos Garofalakis

Authors

Ekaterini Ioannou
View author publications
You can also search for this author in PubMed Google Scholar
Minos Garofalakis
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Istituto di Scienza e Tecnologie dell’Informazione (ISTI - CNR), Pisa, Italy
Umberto Straccia
Department of Computer Science and Information Systems, University of London, London, United Kingdom
Andrea Calì

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ioannou, E., Garofalakis, M. (2014). Analytics over Probabilistic Unmerged Duplicates. In: Straccia, U., Calì, A. (eds) Scalable Uncertainty Management. SUM 2014. Lecture Notes in Computer Science(), vol 8720. Springer, Cham. https://doi.org/10.1007/978-3-319-11508-5_17

Download citation

DOI: https://doi.org/10.1007/978-3-319-11508-5_17
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-11507-8
Online ISBN: 978-3-319-11508-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics