Schema-aware XPath filtering on XML document streams

Journal of Intelligent Manufacturing

Abstract

XML stream filtering has gained widespread attention from the research community in recent years, and there have been many efforts to improve the performance of XML filtering systems by utilizing XML schema information. In this paper, we design and implement an XML stream filtering system, SFilter, which uses DTD or XML Schema information to improve performance. We propose a simplification step and two kinds of optimization: static and dynamic. Simplification and static optimization transform the XPath queries so that automata can be built from them as an index structure for filtering. Dynamic optimization is performed at runtime, during filtering. We developed five kinds of static optimization and two kinds of dynamic optimization. We present a novel filtering algorithm for the transformed XPath queries and the runtime optimization. Experimental results show that our system filters XML streams efficiently.
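To make the idea of using automata as an index structure for filtering concrete, the following is a minimal sketch of generic automaton-based XPath filtering over a stream of element events. It is not SFilter's algorithm and models none of the schema-based simplification or optimizations described above. It assumes linear path queries limited to the `/` and `//` axes with name tests or `*`, and the names `parse_path` and `filter_stream` are hypothetical, introduced only for illustration.

```python
import io
import re
import xml.etree.ElementTree as ET


def parse_path(query):
    """Split a linear path such as '/a//b/*' into (axis, name_test) steps."""
    return re.findall(r"(//|/)([^/]+)", query)


def filter_stream(xml_bytes, queries):
    """Return the set of queries matched by at least one element of the document.

    Each query is treated as a small NFA whose state i means "the first i
    steps have been matched". Illustrative assumption: no predicates.
    """
    steps = {q: parse_path(q) for q in queries}
    # One frame per open element; a frame maps each query to its active states.
    stack = [{q: {0} for q in queries}]
    matched = set()
    events = ET.iterparse(io.BytesIO(xml_bytes), events=("start", "end"))
    for event, elem in events:
        if event == "end":
            stack.pop()  # leaving the element restores the parent's states
            continue
        frame = {}
        for q, active in stack[-1].items():
            final = len(steps[q])
            nxt = set()
            for s in active:
                if s == final:
                    continue  # the accepting state has no outgoing transitions
                axis, test = steps[q][s]
                if axis == "//":
                    nxt.add(s)  # a descendant step may skip this element
                if test == "*" or test == elem.tag:
                    nxt.add(s + 1)  # the name test matched, so advance
            if final in nxt:
                matched.add(q)
            frame[q] = nxt
        stack.append(frame)
    return matched


doc = b"<a><c><b><d/></b></c></a>"
print(sorted(filter_stream(doc, ["/a//b/*", "/a/b", "//d", "/c"])))
```

In this sketch every query keeps its own state set. Systems in this line of work typically merge the queries into one shared automaton so that common prefixes are evaluated only once.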

Author information

Corresponding author

Correspondence to Daewook Lee.

About this article

Cite this article

Lee, D., Kwon, J., Yang, W. et al. Schema-aware XPath filtering on XML document streams. J Intell Manuf 20, 273–282 (2009). https://doi.org/10.1007/s10845-008-0218-x

  • DOI: https://doi.org/10.1007/s10845-008-0218-x
