Skip to main content

Improving the Development of Data Warehouses by Enriching Dimension Hierarchies with WordNet

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 4623))

Abstract

OLAP (On-Line Analytical Processing) operations, such as roll-up or drill-down, depend on data warehouse dimension hierarchies in order to aggregate information at different levels of detail and support the decision-making process required by final users. This is why it is crucial to capture adequate hierarchies in the requirement analysis stage. However, operational data could not be enough for supplying information to construct every level of these hierarchies. In this paper, we apply knowledge given by relationships among concepts from WordNet to overcome this problem. Therefore, richer dimension hierarchies will be specified in the data warehouse, and OLAP tools will be able to show proper information to improve decision-making process. Decision makers thus will be able to achieve their information needs for analysis. Finally, we will show the benefits of our approach by providing a case study in which a poor hierarchy is enriched with new levels of aggregation.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Abelló, A., Samos, J., Saltor, F.: Understanding Analysis Dimensions in a Multidimensional Object-Oriented Model. In: Int. Workshop on Design and Management of Data Warehouses (DMDW) (2001)

    Google Scholar 

  2. Akoka, J., Comyn-Wattiau, I., Prat, N.: Dimension Hierarchies Design from UML Generalizations and Aggregations. In: Kunii, H.S., Jajodia, S., Sølvberg, A. (eds.) ER 2001. LNCS, vol. 2224, pp. 442–455. Springer, Heidelberg (2001)

    Chapter  Google Scholar 

  3. Chandrasekaran, B., Josephson, J.R., Benjamins, V.R.: Ontologies: What are they? why do we need them? IEEE Intelligent Systems and Their Applications 14(1), 20–26 (1999)

    Article  Google Scholar 

  4. Gangemi, A., Guarino, N., Masolo, C., Oltramari, A.: Sweetening WORDNET with DOLCE. AI Magazine 24(3), 13–24 (2003)

    Google Scholar 

  5. Gangemi, A., Navigli, R., Velardi, P.: The OntoWordNet Project: Extension and Axiomatization of Conceptual Relations in WordNet. In: Meersman, R., Tari, Z., Schmidt, D.C. (eds.) CoopIS 2003, DOA 2003, and ODBASE 2003. LNCS, vol. 2888, pp. 820–838. Springer, Heidelberg (2003)

    Google Scholar 

  6. Horner, J., Song, I-Y., Chen, P.: An analysis of additivity in OLAP systems. In: 7th ACM Int. Workshop on Data Warehousing and OLAP (DOLAP), pp. 83–91. ACM Press, New York (2004)

    Chapter  Google Scholar 

  7. Hurtado, C.A., Mendelzon, A.O., Vaisman, A.A.: Maintaining Data Cubes under Dimension Updates. In: 15th International Conference on Data Engineering (ICDE), pp. 346–355. IEEE Computer Society Press, Los Alamitos (1999)

    Google Scholar 

  8. Inmon, W.: Building the Data warehouse. John Wiley & Sons, Chichester (1996)

    Google Scholar 

  9. Jagadish, H.V., Lakshmanan, L.V.S., Srivastava, D.: What can Hierarchies do for Data Warehouses? In: 25th VLDB Conference (1999)

    Google Scholar 

  10. Kedad, Z., Métais, E.: Ontology-Based Data Cleaning. In: Andersson, B., Bergholtz, M., Johannesson, P. (eds.) NLDB 2002. LNCS, vol. 2553, pp. 137–149. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  11. Kimball, R.: The Data Warehouse Toolkit: Practical Techniques For Building Dimensional Data Warehouse. John Wiley & Sons, Chichester (1996)

    Google Scholar 

  12. Luján-Mora, S., Trujillo, J., Song, I-Y.: A UML Profile for Multidimensional Modeling in Data Warehouses. Data & Knowledge Engineering 59(3), 725–769 (2006)

    Article  Google Scholar 

  13. Luján-Mora, S., Trujillo, J.: A Comprehensive Method for Data Warehouse Design. In: Proceedings of the 5th International Workshop on Design and Management of Data Warehouses (DMDW’03), Berlin, Germany, pp. 1.1–1.14 (September 2003)

    Google Scholar 

  14. Luján-Mora, S., Trujillo, J.: A Data Warehouse Engineering Process. In: Yakhno, T. (ed.) ADVIS 2004. LNCS, vol. 3261, pp. 14–23. Springer, Heidelberg (2004)

    Google Scholar 

  15. Malinowski, E., Zimányi, E.: OLAP Hierarchies: A Conceptual Perspective. In: Persson, A., Stirna, J. (eds.) CAiSE 2004. LNCS, vol. 3084, pp. 477–491. Springer, Heidelberg (2004)

    Google Scholar 

  16. Miller, G.A., Beckwith, R., Fellbaum, C., Gross, D., Miller, K.J.: WordNet: An on-line lexical database. International Journal of Lexicography 3(4) (1990)

    Google Scholar 

  17. Miller, G.A., Fellbaum, C.: Semantic networks of English. Lexical And Conceptual Semantics. Blackwell Cambridge and Oxford. England, pp. 197–229 (1992)

    Google Scholar 

  18. Montoyo, A., Palomar, M.: WSD Algorithm Applied to a NLP System. In: Bouzeghoub, M., Kedad, Z., Métais, E. (eds.) NLDB 2000. LNCS, vol. 1959, pp. 54–65. Springer, Heidelberg (2001)

    Chapter  Google Scholar 

  19. Morato, J., Marzal, M.A., Lloréns, J., Moreiro, J.: WordNet Applications. In: Proc. of the 2nd International WordNet Conference (GWC), pp. 270–278 (2004)

    Google Scholar 

  20. Object Management Group (OMG). Unified Modeling Language Specification 1.5 (2004), http://www.omg.org/cgi-bin/doc?formal/03-03-01

  21. Pourabbas, E., Rafanelli, M.: Characterization of Hierarchies and Some Operators in OLAP Environment. In: Proc. of the 2nd ACM Int. Workshop on Data Warehousing and OLAP (DOLAP), pp. 54–59. ACM Press, New York (1999)

    Chapter  Google Scholar 

  22. Schneider, M.: Well-formed Data Warehouses Structures. In: 5th International Workshop Design and Management of Data Warehouses (2003)

    Google Scholar 

  23. Smith, J.M., Smith, D.C.P.: Database Abstractions: Aggregations and Generalizations. ACM TODS 2(2) (1977)

    Google Scholar 

  24. Storey, V.: Understanding Semantic Relationships. VLDB Journal 2, 455–488 (1993)

    Article  Google Scholar 

  25. Sugumaran, V., Storey, V.: Ontologies for conceptual modeling: their creation, use, and management. Data & Knowledge Engineering 42(3), 251–271 (2002)

    Article  MATH  Google Scholar 

  26. Trujillo, J., Palomar, M., Gómez, J., Song, I.Y.: Designing Data Warehouses with OO Conceptual Models. IEEE Computer 34(12), 66–75 (2001)

    Google Scholar 

  27. Toivonen, S., Niemi, T.: Describing data sources semantically for facilitating efficient creation of OLAP cubes. In: McIlraith, S.A., Plexousakis, D., van Harmelen, F. (eds.) ISWC 2004. LNCS, vol. 3298, Springer, Heidelberg (2004)

    Google Scholar 

  28. Vossen, P.: EuroWordNet: building a multilingual database with wordnets for European languages, vol. 3(1), pp. 7–10. Published in: The ELRA Newsletter, Paris (February 1998), ISSN: 1026-8300

    Google Scholar 

  29. Wache, H., Vögele, T., Visser, U., Stuckenschmidt, H., Schuster, G., Neumann, H., Hübner, S.: Ontology-based integration of information – A survey of existing approaches. In: Proceedings of IJCAI-01. Workshop: Ontologies and Information Sharing, Seattle, WA, pp. 108–117 (2001)

    Google Scholar 

  30. Mazón, J.-N., Trujillo, J., Serrano, M., Piattini, M.: Designing Data Warehouses: from Business Requirement Analysis to Multidimensional Modeling. In: Int. Workshop on Requirements Engineering for Business Needs and IT Alignment, REBNITA (2005)

    Google Scholar 

  31. Yu, E.: Modeling Strategic Relationships for Process Reenginering, Ph.D. Thesis. University of Toronto (1995)

    Google Scholar 

  32. Mazón, J-N., Trujillo, J., Lechtenbörger, J.: A Set of QVT Relations to Assure the Correctness of Data Warehouses by Using Multidimensional Normal Forms. In: Embley, D.W., Olivé, A., Ram, S. (eds.) ER 2006. LNCS, vol. 4215, pp. 385–398. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  33. Moreda, P., Muñoz, R., Martínez-Barco, P., Cachero, C., Palomar, M.: A web information extraction system to DB prototyping. In: Andersson, B., Bergholtz, M., Johannesson, P. (eds.) NLDB 2002. LNCS, vol. 2553, pp. 13–26. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  34. Sugumaran, V., Storey, V.: An Ontology-Based Framework for Generating and Improving Database Design. In: Andersson, B., Bergholtz, M., Johannesson, P. (eds.) NLDB 2002. LNCS, vol. 2553, pp. 1–12. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  35. Kiyavitskaya, N., Zeni, N., Mich, L., Mylopoulos, J.: Experimenting with Linguistic Tools for Conceptual Modeling: Quality of the Models and Critical Features. In: Meziane, F., Métais, E. (eds.) NLDB 2004. LNCS, vol. 3136, pp. 135–146. Springer, Heidelberg (2004)

    Google Scholar 

  36. Guizzardi, G., Wagner, G., Guarino, N., van Sinderen, M.: An Ontologically Well-Founded Profile for UML Conceptual Models. In: Persson, A., Stirna, J. (eds.) CAiSE 2004. LNCS, vol. 3084, pp. 112–126. Springer, Heidelberg (2004)

    Google Scholar 

  37. Golfarelli, M., Maio, D., Rizzi, S.: The Dimensional Fact Model: A Conceptual Model for Data Warehouses. Int. J. Cooperative Inf. Syst. 7(2-3), 215–247 (1998)

    Article  Google Scholar 

  38. Cabibbo, L., Torlone, R.: A Logical Approach to Multidimensional Databases. In: Schek, H.-J., Saltor, F., Ramos, I., Alonso, G. (eds.) EDBT 1998. LNCS, vol. 1377, pp. 183–197. Springer, Heidelberg (1998)

    Chapter  Google Scholar 

  39. Tryfona, N., Busborg, F., Christiansen, J.: starER: A Conceptual Model for Data Warehouse Design. In: Proc. Of the ACM 2nd Intl. Workshop on Data Warehousing and OLAP (DOLAP 1999), Kansas City, USA, ACM Press, New York (1999)

    Google Scholar 

  40. Horner, J., Song, I-Y.: A Taxonomy of Inaccurate Summaries and Their Management in OLAP Systems. In: Delcambre, L.M.L., Kop, C., Mayr, H.C., Mylopoulos, J., Pastor, Ó. (eds.) ER 2005. LNCS, vol. 3716, pp. 433–448. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  41. Mazón, J-N., Trujillo, J.: Enriching Data Warehouse Dimension Hierarchies by Using Semantic Relations. In: Bell, D., Hong, J. (eds.) Flexible and Efficient Information Handling. LNCS, vol. 4042, pp. 278–281. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Martine Collard

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Mazón, JN., Trujillo, J., Serrano, M., Piattini, M. (2007). Improving the Development of Data Warehouses by Enriching Dimension Hierarchies with WordNet. In: Collard, M. (eds) Ontologies-Based Databases and Information Systems. ODBIS ODBIS 2006 2005. Lecture Notes in Computer Science, vol 4623. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-75474-9_6

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-75474-9_6

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-75473-2

  • Online ISBN: 978-3-540-75474-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics