Skip to main content

Data Mining in Data-Intensive and Cognitively-Complex Settings: Lessons Learned from the Dicode Project

  • Chapter
  • First Online:
Mastering Data-Intensive Collaboration and Decision Making

Part of the book series: Studies in Big Data ((SBD,volume 5))

Abstract

This chapter reports on practical lessons learned while developing the Dicode’s data mining services and using them in data-intensive and cognitively-complex settings. Various sources were taken into consideration to establish these lessons, including user feedbacks obtained from evaluation studies, discussion in teams, as well as observation of services’ usage. The lessons are presented in a way that could aid people who engage in various phases of developing similar kind of systems.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

eBook
USD 16.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Marz, N., Warren, J.: Big Data—Principles and Best Practices of Scalable Real-Time Data Systems. Manning Publications, New York (2012)

    Google Scholar 

  2. Baron, P.: Big Data für IT-Entscheider. Riesige Datenmengen und moderne Technologien gewinnbringend nutzen, München (2013)

    Book  Google Scholar 

  3. Grosskreutz, H., Paurat D.: Fast and memory-efficient discovery of the top-k relevant subgroups in a reduced candidate space. In: Machine Learning and Knowledge Discovery in Databases. Lecture Notes in Computer Science vol. 6911, pp. 533–548. Springer, Heidelberg (2011)

    Google Scholar 

  4. Han, J., Pei, J., Yin, Y.: Mining frequent patterns without candidate generation. In: Proceedings of SIGMOD’00. pp. 1–12. ACM Press, New York http://doi.acm.org/10.1145/342009.335372 (2000)

  5. Büttcher, S., Clarke, C., Cormack, G.: Information Retrieval: Implementing and Evaluating Search Engines. MIT Press, Cambridge, Mass (2010)

    Google Scholar 

  6. Friesen, N., Rüping, S.: Distance Metric Learning for Recommender Systems in Complex Domains. In: Proceedings of dicoSyn 2012 (Mastering Data-Intensive Collaboration through the Synergy of Human and Machine Reasoning), a workshop at CSCW 2012, February 12, 2012, Seattle (2012)

    Google Scholar 

  7. Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)

    MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Natalja Friesen .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this chapter

Cite this chapter

Friesen, N., Kindermann, J., Maassen, D., Rüping, S. (2014). Data Mining in Data-Intensive and Cognitively-Complex Settings: Lessons Learned from the Dicode Project. In: Karacapilidis, N. (eds) Mastering Data-Intensive Collaboration and Decision Making. Studies in Big Data, vol 5. Springer, Cham. https://doi.org/10.1007/978-3-319-02612-1_10

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-02612-1_10

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-02611-4

  • Online ISBN: 978-3-319-02612-1

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics