Abstract
Knowledge acquisition is considered as an extraordinary issue concerning organizations and decision makers nowadays. Learning from previous failures and successes saves plenty of time in understanding the problems and visualizing data. Case-based Reasoning (CBR) as a process is one of the most used methods to solve the problem of knowledge capture and data understanding. In this paper we proposed an approach for clustering theses documents based on CBR combined with lexical similarity and k-means algorithm for cluster-dependent keyword weighting. The cluster dependent keyword weighting help in partitioning and categorizing the theses documents into more meaningful categories. The proposed approach yield to 91.95 % increase of using CBR in comparison to human assessments.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Pressman, R.S.: Software Engineering—A Practitioner’s Approach, 5th edn. McGraw-Hill International Edition, New York (chap. 5) (2001)
Kolodner, J.L.: An introduction to case-based reasoning. Artif. Intell. Rev. 6, 3–34 (1992)
Singh, S. K.: Database Systems: Concepts, Designs and Applications. Pearson Education, India (2006)
Han, J., Kamber, M.: Data Mining: Concepts and Techniques, 2nd edn. Elsevier Inc., Amsterdam (2006)
Rainer, R.K., Snyder C.A. et al.: Decision Support systems, pp. 333–341 (1992)
Ranjan, J.: Managing student data: a data mining-based framework for business schools. Int. J. Inf. Oper. Manage. Edu. 4(1), 83–98 (2011)
Pal, S.K., Shiu, S.C.K.: Foundations of Soft Case-Based Reasoning. Wiley, Hoboken (2004)
Park, M.-K., Lee, I., Shon, K.-M.: Using case based reasoning for problem solving in a complex production process. Expert Syst. Appl. 15, 69–75 (1998)
Grupe, F.H., Urwiler, R., Ramarapu, N.K., Owrang, M.: The application of case-based reasoning to the software development process. Inf. Softw. Technol. 40, 493–499 (1998)
Rezvana, M.T., Zeinal Hamadania, A., Shalbafzadehb, A.: Case-based reasoning for classification in the mixed data sets employing the compound distance methods. Eng. Appl. Artif. Intell. 26(9), 2001–2009 (2013)
Al-Mubaid, H., Nguyen, H.A.: Measuring semantic similarity between biomedical concepts within multiple ontologies. IEEE Trans. Syst. Man Cybern. Part C: Appl. Rev. 39, 389–398 (2009)
Metzler, D., Dumais, S., Meek, C.: Similarity measures for short segments of text. In: Proceeding ECIR’07. Proceedings of the 29th European Conference on IR Research, pp. 16–27 (2007)
Nelson, S.J., Johnston, W.D., Humphreys, B.L.: Relationships in medical subject headings. Relationships in the Organization of Knowledge. K.A. Publishers, New Delhi (2001)
Ayeldeen, H., Hassanien, A.E., Fahmy, A.A.: Evaluation of semantic similarity across MeSH ontology: a Cairo University thesis mining case study. In: 12th Mexican International Conference on Artificial Intelligence, pp. 139–144. Mexico City, Mexico (2013)
Jain, A.K., Murty, M.N., Flynn, P.J.: Data clustering: a review. ACM Comput. Surv. 31, 264–323 (1999)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer India
About this paper
Cite this paper
Ayeldeen, H., Hegazy, O., Hassanien, A.E. (2015). Case Selection Strategy Based on K-Means Clustering. In: Mandal, J., Satapathy, S., Kumar Sanyal, M., Sarkar, P., Mukhopadhyay, A. (eds) Information Systems Design and Intelligent Applications. Advances in Intelligent Systems and Computing, vol 339. Springer, New Delhi. https://doi.org/10.1007/978-81-322-2250-7_39
Download citation
DOI: https://doi.org/10.1007/978-81-322-2250-7_39
Published:
Publisher Name: Springer, New Delhi
Print ISBN: 978-81-322-2249-1
Online ISBN: 978-81-322-2250-7
eBook Packages: EngineeringEngineering (R0)