skip to main content
article
Free Access

Storing semistructured data with STORED

Published:01 June 1999Publication History
Skip Abstract Section

Abstract

Systems for managing and querying semistructured-data sources often store data in proprietary object repositories or in a tagged-text format. We describe a technique that can use relational database management systems to store and manage semistructured data. Our technique relies on a mapping between the semistructured data model and the relational data model, expressed in a query language called STORED. When a semistructured data instance is given, a STORED mapping can be generated automatically using data-mining techniques. We are interested in applying STORED to XML data, which is an instance of semistructured data. We show how a document-type-descriptor (DTD), when present, can be exploited to further improve performance.

References

  1. 1 S. Abiteboul, D. Quass, J. McHugh, J. Widom, and J. Wiener. The Lorel query language for semistructured data. International Journal on Digital Libraries, 1(1):68- 88, April 1997.]]Google ScholarGoogle ScholarCross RefCross Ref
  2. 2 R. Agrawal, T. Imie}Linski, and A. Swami. Mining association rules between sets of" items in large databases. In Proceedings of A CM SIGMOD Conference on Management of Data, pages 207-216, Washington, DC, 1993.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. 3 Catriel Beeri and ~Ibva Milo. Schemas for integration and translation of structured and semi-structured data. In Proceedings of the International Conference on Database Theory, 1999. to appear.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. 4 Peter Buneman, Susan Davidson, Gerd Hillebrand, and Dan Suciu. A query language and optimization techniques for unstructured data. In Proceedings of A CM-SIGMOD International Conference on Management of Data, pages 505-516, 1996.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. 5 V. Christophtdes, S. Abiteboul, S. CIuet, and M. Scholl. From structured documents to novel query facilities. In Richard Snodgrass and Marianne Winslett, editors, Proceedings of 1994 A CM SIGMOD International Conference on Management of Data, Minneapolis, Minnesota, May 1994.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. 6 Mary Fernandez, Daniela Florescu, Jaewoo Kang, Alon Levy, and Dan Suciu. Catching the boat with Strudel: experience with a web-site management system. In Proceedings of A CM-SIGMOD International Conference on Management of Data, 1998.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. 7 Mary Fernandez, Daniela Florescu, Alon Levy, and Dan Suciu. A query language for a web-site management system. SIGMOD Record, 26(3):4-11, September 1997.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. 8 M. Garey and D. Johnson. Computers and Intractability: A Guide to the Theory of Af79-completeness. W. H. Freeman, San Francisco, 1979.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. 9 S. Ginsburg. The Mathematical Theory of Context-~.ee Languages. McGraw-Hill, 1966.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. 10 K.BShm, K.Aberer, E.Neuhold, and X.Yang. Structured document storage and refined declarative and navigational access mechanisms in HyperStorM. VLDB Journal, 6(4):296-311, November 1997.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. 11 Alon Levy, Alberto Mendelzon, Yehoshua Sagiv, and Dive sh Srivastava. Answering queries using views. In Proceedings of the 14th Symposium on Principles of Database Systems, San Jose, CA, June 1995.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. 12 M.Volz, K.Aberer, and K.BShm. Applying a flexible OODBMS-IRS-Coupling to structured document handling. In Internaltional Conference on Data Engineering, February 1996.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. 13 S. Nestorov, S. Abiteboul, and R. Motwani. Extracting schema from semistructured data. In Proceedings of ~he A CM Conference on Management of Data, pages 295-31)6, 1998.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. 14 Michael R. Genesereth Oliver M. Duschka. Answering recursive queries using views. In Proceedings of the ACM Symposium on Principles of Database Systems, pages 109- 116, 1997.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. 15 Y. Papakonstantinou, S. Abiteboul, and H. Garcia-Molina. Object fusion in mediator systems. In Proceedings of Very Large Data Bases, pages 413-424, September 1996.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. 16 Y. Papakonstantinou, H. Garcia-Molina, and J. Widom. Object exchange across heterogeneous information sources. In IEEE International Conference on Data Engineering, pages 251-260, March 1995.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. 17 D. Quass, A. Rajaraman, Y. Sagiv, J. Ullman, and J. Widorn. Querying semistructure heterogeneous information. In International Conference on Deductive and Object Oriented Databases, pages 319-344, 1995.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. 18 Dimitri Theodoratos and Timos Sellis. Data warehouse configuration. In Proceedings of the International Conference on Very Large Data Bases, pages 126-135, Athens, Greece, August 1997.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. 19 O. Tsatalos, M. Solomon, and Y. Ioannidis. The GMAP: a vesatile tool for physical data independence. In Proc. 20th International VLDB Conference, 1994.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. 20 Jeffrey D. Ullman. Principles of Database and Knowledgebase Systems H: The New Technologies. Computer Science Press, Rockvill, MD 20850, 1989.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. 21 Ke Wang and Huiqing Liu. Discovering typical structures of documents: a road map approach. In A CM SIGIR Conference on Research and Development in Information Retrieval, August 1998.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. 22 Tian Zhang, Raghu Ramakrishnan, and Miron Livny. BIRCH: an efficient data clustering method for very large databases. In Proceedings of A CM Conference on Management of Data, pages 103-114, 1996.]] Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Storing semistructured data with STORED

            Recommendations

            Comments

            Login options

            Check if you have access through your login credentials or your institution to get full access on this article.

            Sign in

            Full Access

            • Published in

              cover image ACM SIGMOD Record
              ACM SIGMOD Record  Volume 28, Issue 2
              June 1999
              599 pages
              ISSN:0163-5808
              DOI:10.1145/304181
              Issue’s Table of Contents
              • cover image ACM Conferences
                SIGMOD '99: Proceedings of the 1999 ACM SIGMOD international conference on Management of data
                June 1999
                604 pages
                ISBN:1581130848
                DOI:10.1145/304182

              Copyright © 1999 ACM

              Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

              Publisher

              Association for Computing Machinery

              New York, NY, United States

              Publication History

              • Published: 1 June 1999

              Check for updates

              Qualifiers

              • article

            PDF Format

            View or Download as a PDF file.

            PDF

            eReader

            View online with eReader.

            eReader