ABSTRACT
In general, the design of XML schemas involves translating conceptual schemas into XML schemas which aim to be: (i) normalized schemas, and (ii) connected structures in order to achieve good performance on queries. However, these requirements address a trade-off because highly connected XML structures allow data redundancy, and normalized schemas generate disconnected XML structures. This paper describes a workload-based approach which balances this trade-off on translating conceptual schemas into XML structures. An experimental study on an XML database shows that our XML schemas provide high query performance on the relevant elements for the workload and, at the same time, low cost of data redundancy on elements that are not relevant for update operations.
- M. Arenas and L. Libkin. A normal form for xml documents. In Symposium on Principles of Database Systems, pages 85--96. ACM, 2002. Google ScholarDigital Library
- D. Barbosa, A. Mendelzon, J. Keenleyside, and K. Lyons. Toxgene: A template-based data generator for xml. In Proc. WebDB, 2002.Google ScholarDigital Library
- C. Batini, S. Ceri, and S. Navathe. Conceptual Database Design: An Entity-Relationship Approach. The Benjamin/Cummings Publishing Company, 1992. Google ScholarDigital Library
- L. Bird, A. Goodchild, and T. A. Halpin. Object role modeling and xml-schema. In International Conference on Conceptual Modeling, pages 661--705. Springer, 2000. Google ScholarDigital Library
- T. Bradford, V. Gritsenko, and K. O'Neill. Apache xindice. http://xml.apache.org/xindice/, 2011.Google Scholar
- C. Curino, E. Jones, Y. Zhang, and S. Madden. Schism: a workload-driven approach to database replication and partitioning. Proc. VLDB Endow., 3:48--57, 2010. Google ScholarDigital Library
- R. Elmasri, J. Weeldreyer, and A. R. Hevner. The category concept: An extension to the entity-relationship model. In Data Knowledge Engineering, volume 1, pages 75--116, 1985. Google ScholarDigital Library
- D. Embley, S. Liddle, and S. Kamha. Enterprise modeling with conceptual xml. In International Conference on Conceptual Modeling, pages 150--165, 2004.Google ScholarCross Ref
- J. Fong and A. F. et. al. Translating relational schema with constraints into xml schema. In International Journal of Software Engineering and Knowledge Engineering, volume 16, pages 201--244, 2006.Google ScholarCross Ref
- T. Kudrass and T. Krumbein. Rule-based generation of xml schemas from uml class diagrams. In Advances in Databases and Information Systems. Springer, 2003.Google ScholarCross Ref
- M. Mani. Erex: A conceptual model for xml. In Internation XML Database Symposium, pages 128--142. Springer, 2004.Google Scholar
- W. Y. Mok and D. W. Embley. Generating compact redundancy-free xml documents from conceptual-model hypergraphs. In IEEE Transactions on Knowledge and Data Engineering, volume 18, pages 1082--1096, 2006. Google ScholarDigital Library
- M. M. Moro, L. Lim, and Y.-C. Chang. Schema advisor for hybrid relational-xml dbms. In SIGMOD '07: ACM SIGMOD international conference on Management of data, pages 959--970. ACM, 2007. Google ScholarDigital Library
- P. Pigozzo and E. Quintarelli. An algorithm for generating xml schemas from er schemas. In Italian Symposium on Advanced Database Systems, pages 192--199, 2005.Google Scholar
- N. Routledge, L. Bird, and A. Goodchild. Uml and xml schema. In Australian Database Conference, pages 157--166. IEEE, 2002. Google ScholarDigital Library
- H. Schöning. Tamino - a dbms designed for xml-schema. In International Conference on Data Engineering, pages 149--154. IEEE, 2001. Google ScholarDigital Library
- R. Schroeder and R. D. S. Mello. Designing xml documents from conceptual schemas and workload information. Multimedia Tools Appl., 43:303--326, July 2009. Google ScholarDigital Library
- R. Schroeder and R. S. Mello. Improving query performance on xml documents: A workload-driven design approach. In Symposium on Document Engineering, pages 177--186, 2008. Google ScholarDigital Library
- L. Stephens and A. G. et. al. Constructing consensus ontologies for the semantic web: A conceptual approach. In World Wide Web Journal, volume 7, 2004. Google ScholarDigital Library
- N. Wiwatwattana and H. J. et. al. Making designer schemas with colors. In ICDE, 2006. Google ScholarDigital Library
- C. Yu and H. V. Jagadish. Xml schema refinement through redundancy detection and normalization. In The VLDB Journal, volume 17, pages 203--223, 2008. Google ScholarDigital Library
Index Terms
- A workload-aware approach for optimizing the XML schema design trade-off
Recommendations
XML-based XML schema access
WWW '07: Proceedings of the 16th international conference on World Wide WebXML Schema's abstract data model consists of components, which are the structures that eventually define a schema as a whole. XML Schema's XML syntax, on the other hand, is not a direct representation of the schema components, and it proves to be ...
Constraint Preserving Transformation from Relational Schema to XML Schema
XML has become the standard for publishing and exchanging data on the Web. However, most business data is managed and will remain to be managed by relational database management systems. As such, there is an increasing need to efficiently and accurately ...
Optimized XPath evaluation for schema-compressed XML data
ADC '12: Proceedings of the Twenty-Third Australasian Database Conference - Volume 124XML has become the de facto standard for data exchange in enterprise information systems. But whenever XML data is stored or processed, e. g. in form of a DOM tree representation, the XML markup causes a huge blow-up of the memory consumption compared ...
Comments