Skip to main content

Graph-Based Data Warehousing Using the Core-Facets Model

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 6870))

Abstract

There are a growing number of data-mining techniques that model and analyze data in the form of graphs. Graphs can link otherwise disparate data to form a holistic view of the dataset. Unfortunately, it can be challenging to manage the resulting large graph and use it during data analysis. To facilitate managing and operating on graphs, the Core-Facets model offers a framework for graph-based data warehousing. The Core-Facets model builds a heterogeneous attributed core graph from multiple data sources and creates facet graphs for desired analyses. Facet graphs can transform the heterogeneous core graph into various purpose-specific homogeneous graphs, thereby enabling the use of traditional graph analysis techniques. The Core-Facets model also supports new opportunities for multi-view data mining. This paper discusses an implementation of the Core-Facets model, which provides a data warehousing framework for tasks ranging from data collection to graph modeling to graph preparation for analysis.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Aggarwal, C., Wang, H. (eds.): Managing and Mining Graph Data, vol. 40. Springer, Heidelberg (2010)

    MATH  Google Scholar 

  2. Chen, C., Yan, X., Zhu, F., Han, J., Yu, P.S.: Graph OLAP: Towards online analytical processing on graphs. In: Giannotti, F., Gunopulos, D., Turini, F., Zaniolo, C., Ramakrishnan, N., Wu, X. (eds.) Proceedings of the Eighth IEEE Intl. Conference on Data Mining, Pisa, Italy, pp. 103–112 (2008)

    Google Scholar 

  3. Martínez-Bazan, N., Muntés-Mulero, V., Gómez-Villamor, S., Nin, J., Sánchez-Martínez, M.-A., Larriba-Pey, J.-L.: DEX: High-performance exploration on large graphs for information retrieval. In: the 16th ACM Conference on Information and Knowledge Management (2007)

    Google Scholar 

  4. Tang, L., Liu, H., Zhang, J., Nazeri, Z.: Community evolution in dynamic multi-mode networks. In: The 14th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2008), pp. 677–685 (2008)

    Google Scholar 

  5. Gao, J., Liang, F., Fan, W., Wang, C., Sun, Y., Han, J.: On Community Outliers and their Efficient Detection in Information Networks. In: Proceedings of the 16th ACM SIGKDD Intl Conference on Knowledge Discovery and Data Mining, pp. 813–822. Association for Computing Machinery, Washington, D.C (2010)

    Google Scholar 

  6. Tang, L., Liu, H., Zhang, J.: Identifying evolving groups in dynamic multi-mode networks. IEEE Transactions on Knowledge and Data Engineering (2010)

    Google Scholar 

  7. Cai, D., Shao, Z., He, X., Yan, X., Han, J.: Community mining from multi-relational networks. In: Jorge, A.M., Torgo, L., Brazdil, P.B., Camacho, R., Gama, J. (eds.) PKDD 2005. LNCS (LNAI), vol. 3721, pp. 445–452. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  8. Tang, L., Liu, H.: Uncovering cross-dimension group structures in multi-dimensional networks. In: SIAM Intl Conference on Data Mining, Workshop on Analysis of Dynamic Networks, Sparks, NV (2009)

    Google Scholar 

  9. Han, J., Kamber, M.: Data Mining: Concepts and Techniques. Morgan Kaufmann Publisher, San Francisco (2006)

    MATH  Google Scholar 

  10. Bastian, M., Heymann, S., Jacomy, M.: Gephi: an open source software for exploring and manipulating networks. In: 3rd Intl AAAI Conference on Weblogs and Social Media (2009)

    Google Scholar 

  11. Inmon, W.H.: Building the Data Warehouse. Wiley Computer Publishing, Chichester (2002)

    Google Scholar 

  12. Kimball, R., Ross, M.: The Data Warehouse Toolkit: The Complete Guide to Dimensional Modeling. John Wiley & Sons, New York (2002)

    Google Scholar 

  13. http://www.kenorr.com/pg%2033%20d.w.%20whitepaper.htm

  14. Greene, D., Cunningham, P.: Multi-view clustering for mining heterogeneous social network data. In: Boughanem, M., Berrut, C., Mothe, J., Soule-Dupuy, C. (eds.) ECIR 2009. LNCS, vol. 5478. Springer, Heidelberg (2009)

    Google Scholar 

  15. Kang, U., Tsourakakis, C.E., Faloutsos, C.: PEGASUS: A peta-scale graph mining system - implementation and observations. In: Wang, W., Kargupta, H., Ranka, S., Yu, P.S., Wu, X. (eds.) Proceedings of the Ninth IEEE Intl Conference on Data Mining, Miami, FL, pp. 229–238 (2009)

    Google Scholar 

  16. http://www.wiki.apache.org/hadoop/

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Lam, D.N., Liu, A.Y., Martin, C.E. (2011). Graph-Based Data Warehousing Using the Core-Facets Model. In: Perner, P. (eds) Advances in Data Mining. Applications and Theoretical Aspects. ICDM 2011. Lecture Notes in Computer Science(), vol 6870. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23184-1_19

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-23184-1_19

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-23183-4

  • Online ISBN: 978-3-642-23184-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics