Abstract
Matching concepts that describe the meaning of data in heterogeneous, distributed information sources is one of the basic operations of semantic heterogeneity reconciliation. Such descriptions include database schemas and other metadata models, grouped here under the heading of an ontology. This chapter motivates the need for ontology matching, introduces its basics, and then discusses several promising themes in the area as reflected in recent research. In particular, we focus on uncertainty in ontology matching, matching ensembles, and matcher self-tuning. Finally, we outline some important directions for future research.
© 2008 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Gal, A., Shvaiko, P. (2008). Advances in Ontology Matching. In: Dillon, T.S., Chang, E., Meersman, R., Sycara, K. (eds) Advances in Web Semantics I. Lecture Notes in Computer Science, vol 4891. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-89784-2_6
DOI: https://doi.org/10.1007/978-3-540-89784-2_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-89783-5
Online ISBN: 978-3-540-89784-2
eBook Packages: Computer Science (R0)