skip to main content
10.1145/3539618.3591898acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
research-article
Open Access

MoocRadar: A Fine-grained and Multi-aspect Knowledge Repository for Improving Cognitive Student Modeling in MOOCs

Published:18 July 2023Publication History

ABSTRACT

Student modeling, the task of inferring a student's learning characteristics through their interactions with coursework, is a fundamental issue in intelligent education. Although the recent attempts from knowledge tracing and cognitive diagnosis propose several promising directions for improving the usability and effectiveness of current models, the existing public datasets are still insufficient to meet the need for these potential solutions due to their ignorance of complete exercising contexts, fine-grained concepts, and cognitive labels. In this paper, we present MoocRadar, a fine-grained, multi-aspect knowledge repository consisting of 2,513 exercise questions, 5,600 knowledge concepts, and over 12 million behavioral records. Specifically, we propose a framework to guarantee a high-quality and comprehensive annotation of fine-grained concepts and cognitive labels. The statistical and experimental results indicate that our dataset provides the basis for the future improvements of existing methods. Moreover, to support the convenient usage for researchers, we release a set of tools for data querying, model adaption, and even the extension of our repository, which are now available at https://github.com/THU-KEG/MOOC-Radar.

References

  1. Ghodai Abdelrahman and Qing Wang. 2019. Knowledge tracing with sequential key-value memory networks. In Proceedings of the 42nd international ACM SIGIR conference on research and development in information retrieval. 175--184.Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Ghodai Abdelrahman, Qing Wang, and Bernardo Pereira Nunes. 2022. Knowledge tracing: A survey. Comput. Surveys (2022).Google ScholarGoogle Scholar
  3. John R Anderson, C Franklin Boyle, and Brian J Reiser. 1985. Intelligent tutoring systems. Science, Vol. 228, 4698 (1985), 456--462.Google ScholarGoogle Scholar
  4. Sahan Bulathwela, Maria Perez-Ortiz, Emine Yilmaz, and John Shawe-Taylor. 2020. Truelearn: A family of bayesian algorithms to match lifelong learners to open educational resources. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 34. 565--573.Google ScholarGoogle ScholarCross RefCross Ref
  5. R Philip Chalmers. 2012. mirt: A multidimensional item response theory package for the R environment. Journal of statistical Software, Vol. 48 (2012), 1--29.Google ScholarGoogle ScholarCross RefCross Ref
  6. Mingzhi Chen, Quanlong Guan, Yizhou He, Zhenyu He, Liangda Fang, and Weiqi Luo. 2022. Knowledge Tracing Model with Learning and Forgetting Behavior. In Proceedings of the 31st ACM International Conference on Information & Knowledge Management. 3863--3867.Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Penghe Chen, Yu Lu, Vincent W Zheng, and Yang Pian. 2018. Prerequisite-driven deep knowledge tracing. In 2018 IEEE International Conference on Data Mining (ICDM). IEEE, 39--48.Google ScholarGoogle ScholarCross RefCross Ref
  8. Youngduck Choi, Youngnam Lee, Dongmin Shin, Junghyun Cho, Seoyon Park, Seewoo Lee, Jineon Baek, Chan Bae, Byungsoo Kim, and Jaewe Heo. 2020. Ednet: A large-scale hierarchical dataset in education. In Artificial Intelligence in Education: 21st International Conference, AIED 2020, Ifrane, Morocco, July 6-10, 2020, Proceedings, Part II 21. Springer, 69--73.Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Konstantina Chrysafiadi and Maria Virvou. 2013. Student modeling approaches: A literature review for the last decade. Expert Systems with Applications, Vol. 40, 11 (2013), 4715--4729.Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Albert T Corbett and John R Anderson. 1994. Knowledge tracing: Modeling the acquisition of procedural knowledge. User modeling and user-adapted interaction, Vol. 4 (1994), 253--278.Google ScholarGoogle Scholar
  11. Susan E Embretson and Steven P Reise. 2013. Item response theory. Psychology Press.Google ScholarGoogle Scholar
  12. Mingyu Feng, Neil Heffernan, and Kenneth Koedinger. 2009. Addressing the assessment challenge with an online system that tutors as it assesses. User modeling and user-adapted interaction, Vol. 19 (2009), 243--266.Google ScholarGoogle Scholar
  13. Wenzheng Feng, Jie Tang, and Tracy Xiao Liu. 2019. Understanding dropouts in MOOCs. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 33. 517--524.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Yuxian Gu, Xu Han, Zhiyuan Liu, and Minlie Huang. 2022. PPT: Pre-trained Prompt Tuning for Few-shot Learning. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 8410--8423.Google ScholarGoogle ScholarCross RefCross Ref
  15. Jacob Devlin Ming-Wei Chang Kenton and Lee Kristina Toutanova. 2019. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of NAACL-HLT. 4171--4186.Google ScholarGoogle Scholar
  16. Kenneth R Koedinger, Ryan SJd Baker, Kyle Cunningham, Alida Skogsholm, Brett Leber, and John Stamper. 2010. A data repository for the EDM community: The PSLC DataShop. Handbook of educational data mining, Vol. 43 (2010), 43--56.Google ScholarGoogle Scholar
  17. David R Krathwohl. 2002. A revision of Bloom's taxonomy: An overview. Theory into practice, Vol. 41, 4 (2002), 212--218.Google ScholarGoogle Scholar
  18. Wonsung Lee, Jaeyoon Chun, Youngmin Lee, Kyoungsoo Park, and Sungrae Park. 2022. Contrastive learning for knowledge tracing. In Proceedings of the ACM Web Conference 2022. 2330--2338.Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Irene Li, Alexander R Fabbri, Robert R Tung, and Dragomir R Radev. 2019. What should i learn first: Introducing lecturebank for nlp education and prerequisite chain learning. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 33. 6674--6681.Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Qi Liu, Zhenya Huang, Yu Yin, Enhong Chen, Hui Xiong, Yu Su, and Guoping Hu. 2019a. Ekt: Exercise-aware knowledge tracing for student performance prediction. IEEE Transactions on Knowledge and Data Engineering, Vol. 33, 1 (2019), 100--115.Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Qi Liu, Shuanghong Shen, Zhenya Huang, Enhong Chen, and Yonghe Zheng. 2021. A survey of knowledge tracing. arXiv preprint arXiv:2105.15106 (2021).Google ScholarGoogle Scholar
  22. Qi Liu, Shiwei Tong, Chuanren Liu, Hongke Zhao, Enhong Chen, Haiping Ma, and Shijin Wang. 2019b. Exploiting cognitive structure for adaptive learning. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 627--635.Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Yiming Mao, Bin Xu, Jifan Yu, Yifan Fang, Jie Yuan, Juanzi Li, and Lei Hou. 2021. Learning behavior-aware cognitive diagnosis for online education systems. In Data Science: 7th International Conference of Pioneering Computer Scientists, Engineers and Educators, ICPCSEE 2021, Taiyuan, China, September 17-20, 2021, Proceedings, Part II 7. Springer, 385--398.Google ScholarGoogle ScholarCross RefCross Ref
  24. Shailendra Palvia, Prageet Aeron, Parul Gupta, Diptiranjan Mahapatra, Ratri Parida, Rebecca Rosner, and Sumita Sindhi. 2018. Online education: Worldwide status, challenges, trends, and implications., 233--241 pages.Google ScholarGoogle Scholar
  25. Liangming Pan, Chengjiang Li, Juanzi Li, and Jie Tang. 2017. Prerequisite relation learning for concepts in moocs. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 1447--1456.Google ScholarGoogle ScholarCross RefCross Ref
  26. Shalini Pandey and Jaideep Srivastava. 2020. RKT: relation-aware self-attention for knowledge tracing. In Proceedings of the 29th ACM International Conference on Information & Knowledge Management. 1205--1214.Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Alexandros Paramythis and Susanne Loidl-Reisinger. 2003. Adaptive learning environments and e-learning standards. In Second european conference on e-learning, Vol. 1. 369--379.Google ScholarGoogle Scholar
  28. Zachary A Pardos, Ryan SJD Baker, Maria OCZ San Pedro, Sujith M Gowda, and Supreeth M Gowda. 2013. Affective states and state tests: Investigating how affect throughout the school year predicts end of year learning outcomes. In Proceedings of the third international conference on learning analytics and knowledge. 117--124.Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Minlong Peng, Xiaoyu Xing, Qi Zhang, Jinlan Fu, and Xuan-Jing Huang. 2019. Distantly Supervised Named Entity Recognition using Positive-Unlabeled Learning. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. 2409--2419.Google ScholarGoogle ScholarCross RefCross Ref
  30. Huy Phuong Phan. 2010. Students' academic performance and various cognitive processes of learning: An integrative framework and empirical analysis. Educational Psychology, Vol. 30, 3 (2010), 297--322.Google ScholarGoogle ScholarCross RefCross Ref
  31. Chris Piech, Jonathan Bassen, Jonathan Huang, Surya Ganguli, Mehran Sahami, Leonidas J Guibas, and Jascha Sohl-Dickstein. 2015. Deep knowledge tracing. Advances in neural information processing systems, Vol. 28 (2015).Google ScholarGoogle Scholar
  32. Chen Pojen, Hsieh Mingen, and Tsai Tzuyang. 2020. Junyi Academy Online Learning Activity Dataset: A large-scale public online learning activity dataset from elementary to senior high school students. Dataset available from https://www.kaggle.com/junyiacademy/learning-activity-public-dataset-by-junyi-academy (2020).Google ScholarGoogle Scholar
  33. Joseph Psotka, Leonard Daniel Massey, and Sharon A Mutter. 1988. Intelligent tutoring systems: Lessons learned. Psychology Press.Google ScholarGoogle Scholar
  34. Shuanghong Shen, Qi Liu, Enhong Chen, Zhenya Huang, Wei Huang, Yu Yin, Yu Su, and Shijin Wang. 2021. Learning process-consistent knowledge tracing. In Proceedings of the 27th ACM SIGKDD conference on knowledge discovery & data mining. 1452--1460.Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. Shuanghong Shen, Qi Liu, Enhong Chen, Han Wu, Zhenya Huang, Weihao Zhao, Yu Su, Haiping Ma, and Shijin Wang. 2020. Convolutional knowledge tracing: Modeling individualization in student learning process. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval. 1857--1860.Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. J Stamper, A Niculescu-Mizil, S Ritter, GJ Gordon, and KR Koedinger. 2010. Challenge data set from KDD Cup 2010 Educational Data Mining Challenge. (2010).Google ScholarGoogle Scholar
  37. Shiwei Tong, Qi Liu, Wei Huang, Zhenya Hunag, Enhong Chen, Chuanren Liu, Haiping Ma, and Shijin Wang. 2020. Structure-based knowledge tracing: An influence propagation view. In 2020 IEEE international conference on data mining (ICDM). IEEE, 541--550.Google ScholarGoogle ScholarCross RefCross Ref
  38. Shiwei Tong, Qi Liu, Runlong Yu, Wei Huang, Zhenya Huang, Zachary A Pardos, and Weijie Jiang. 2021. Item Response Ranking for Cognitive Diagnosis.. In IJCAI. 1750--1756.Google ScholarGoogle Scholar
  39. Kurt VanLehn. 1988. Student modeling. Foundations of intelligent tutoring systems, Vol. 55 (1988), 78.Google ScholarGoogle Scholar
  40. Shanshan Wan and Zhendong Niu. 2019. A hybrid e-learning recommendation approach based on learners' influence propagation. IEEE Transactions on Knowledge and Data Engineering, Vol. 32, 5 (2019), 827--840.Google ScholarGoogle ScholarCross RefCross Ref
  41. Fei Wang, Qi Liu, Enhong Chen, Zhenya Huang, Yuying Chen, Yu Yin, Zai Huang, and Shijin Wang. 2020b. Neural cognitive diagnosis for intelligent education systems. In Proceedings of the AAAI conference on artificial intelligence, Vol. 34. 6153--6161.Google ScholarGoogle ScholarCross RefCross Ref
  42. Pengfei Wang, Yu Fan, Long Xia, Wayne Xin Zhao, ShaoZhang Niu, and Jimmy Huang. 2020a. KERL: A knowledge-guided reinforcement learning model for sequential recommendation. In Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval. 209--218.Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. Meng Xia, Mingfei Sun, Huan Wei, Qing Chen, Yong Wang, Lei Shi, Huamin Qu, and Xiaojuan Ma. 2019. Peerlens: Peer-inspired interactive learning path planning in online question pool. In Proceedings of the 2019 CHI conference on human factors in computing systems. 1--12.Google ScholarGoogle ScholarDigital LibraryDigital Library
  44. Chun Kit Yeung and Dit Yan Yeung. 2018. Addressing two problems in deep knowledge tracing via prediction-consistent regularization. In Proceedings of the 5th ACM Conference on Learning @ Scale. ACM, 5:1--5:10.Google ScholarGoogle ScholarDigital LibraryDigital Library
  45. Jifan Yu, Gan Luo, Tong Xiao, Qingyang Zhong, Yuquan Wang, Wenzheng Feng, Junyi Luo, Chenyu Wang, Lei Hou, Juanzi Li, et al. 2020. MOOCCube: a large-scale data repository for NLP applications in MOOCs. In Proceedings of the 58th annual meeting of the association for computational linguistics. 3135--3142.Google ScholarGoogle ScholarCross RefCross Ref
  46. Jifan Yu, Yuquan Wang, Qingyang Zhong, Gan Luo, Yiming Mao, Kai Sun, Wenzheng Feng, Wei Xu, Shulin Cao, Kaisheng Zeng, et al. 2021. MOOCCubeX: a large knowledge-centered repository for adaptive learning in MOOCs. In Proceedings of the 30th ACM International Conference on Information & Knowledge Management. 4643--4652.Google ScholarGoogle ScholarDigital LibraryDigital Library
  47. Jifan Yu, Xiaohan Zhang, Yifan Xu, Xuanyu Lei, Xinyu Guan, Jing Zhang, Lei Hou, Juanzi Li, and Jie Tang. 2022. XDAI: A Tuning-free Framework for Exploiting Pre-trained Language Models in Knowledge Grounded Dialogue Generation. In Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 4422--4432.Google ScholarGoogle ScholarDigital LibraryDigital Library
  48. Jiani Zhang, Xingjian Shi, Irwin King, and Dit-Yan Yeung. 2017. Dynamic key-value memory networks for knowledge tracing. In Proceedings of the 26th international conference on World Wide Web. 765--774.Google ScholarGoogle ScholarDigital LibraryDigital Library
  49. Bowen Zhao, Jiuding Sun, Bin Xu, Xingyu Lu, Yuchen Li, Jifan Yu, Minghui Liu, Tingjian Zhang, Qiuyang Chen, Hanming Li, et al. 2022. EDUKG: a Heterogeneous Sustainable K-12 Educational Knowledge Graph. arXiv preprint arXiv:2210.12228 (2022).Google ScholarGoogle Scholar
  50. Qingyang Zhong, Jifan Yu, Zheyuan Zhang, Yiming Mao, Yuquan Wang, Yankai Lin, Lei Hou, Juanzi Li, and Jie Tang. 2022. Towards a General Pre-training Framework for Adaptive Learning in MOOCs. arXiv preprint arXiv:2208.04708 (2022).Google ScholarGoogle Scholar
  51. Yutao Zhu, Jian-Yun Nie, Kun Zhou, Pan Du, Hao Jiang, and Zhicheng Dou. 2021. Proactive retrieval-based chatbots based on relevant knowledge and goals. In Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval. 2000--2004.Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. MoocRadar: A Fine-grained and Multi-aspect Knowledge Repository for Improving Cognitive Student Modeling in MOOCs

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          SIGIR '23: Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval
          July 2023
          3567 pages
          ISBN:9781450394086
          DOI:10.1145/3539618

          Copyright © 2023 Owner/Author

          Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 18 July 2023

          Check for updates

          Qualifiers

          • research-article

          Acceptance Rates

          Overall Acceptance Rate792of3,983submissions,20%
        • Article Metrics

          • Downloads (Last 12 months)289
          • Downloads (Last 6 weeks)94

          Other Metrics

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader