Abstract
E-governance plays a pivotal role in the domain of online services by ensuring round the clock accessibility of a wide spectrum of services. However, the huge amount of uploaded information and a vacillating user base makes it rather difficult to access the desired information from the portal. This requires a system which intelligently presents a personalized user interface. A challenging requirement in designing such a system is classifying the diversified users on the basis of their web experience. Traditional web usage mining techniques have been used to cluster similar users primarily on the basis of their page access patterns. In this paper, we veer our attention towards the level of user experience by introducing three parameters namely, page switching behavior, page probing behavior and session count which predominantly decide the level of experience acquired by e-governance users. We make an innovative use of Rough Set Theory to derive a rule-based classification system using three reduct optimization algorithms namely, Johnson Algorithm, Genetic Algorithm and Basic Minimal classification method. In order to test our system, we classified the user base that is publically available in the CTI dataset into two categories. The Basic Minimal method reports the highest accuracy of 74.90% with five fold cross validation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
E-Governance: Initiatives in India, pp 26–58. http://arc.gov.in/11threp/arcthreport_ch4.pdf. Accessed 20 July 2015
Chitra, V., Davamani, A.S.: A survey on preprocessing methods for web usage data. Int. J. Comput. Sci. Inf. Secur. (IJCSIS) 7(3), 78–83 (2010)
Lei, M., Fan, L.: A web personalization system based on users’ interested domains. In: IEEE 7th International Conference on Cognitive Informatics, ICCI 2008, 14–16 August 2008, pp. 153–159 (2008)
Khadangi, E., Bagheri, A.: Comparing MLP, SVM and KNN for predicting trust between users in Facebook. In: 3th International Conference in Computer and Knowledge Engineering (ICCKE), pp. 466–470 (2013)
Pawlak, Z.: Rough sets. Int. J. Comput. Inf. Sci. 2, 341–356 (1982)
Wang, S.: Algorithms for solving the reducts problem in rough sets. Master’s projects. San Jose State University, SJSU Scholar Works (2012)
Lingras, P., Peters, G.: Applying Rough Set Concepts to Clustering, Rough Sets: Selected Methods and Applications in Management and Engineering. Advanced Information and Knowledge Processing, pp. 23–37. Springer-Verlag London Limited, London (2012). https://doi.org/10.1007/978-1-4471-2760-4
Hannah Inbarni, H., Thangavel, K.: Rough set based user profiling for web personalization. Int. J. Recent Trends Eng. 2(1), 103–107 (2009)
Bal, M.: Rough sets theory as symbolic data mining method: an application on complete decision table. Inf. Sci. Lett. Int. J. Inf. Sci. Lett. 2, 35–47 (2013)
Rosetta: A Rough Set Tool. http://www.lcb.uu.se/tools/rosetta/. Accessed 20 July 2014
Rose 2.2: A Rough Set Tool. http://idss.cs.put.poznan.pl/site/rose.html. Accessed 20 June 2014
Khashashneh, A.E.A., et al.: Evaluation of discernibility matrix based reduct computation techniques. In: 5th International Conference in Computer Science and Information Technology (CSIT), 27–28 March 2013, pp. 76–81 (2013)
Elshazly, H.I., et al.: Rough sets and genetic algorithms: a hybrid approach to breast cancer classification. In: World Congress in Information and Communication Technologies (WICT), 30 October–2 November 2012, pp. 260–265 (2012)
Lattice Search. http://www.mpri.lsu.edu/textbook/Chapter5-a.htm. Accessed 12 Nov 2015
Shao, X., et al.: A Dynamic Lattice Searching Method for Fast Optimization of Lennard Jones Clusters, pp. 1693–1698. Published Online in Wiley Interscience (2004). www.interscience.wiley.com
Predki, B., Słowiński, R., Stefanowski, J., Susmaga, R., Wilk, S.: ROSE - software implementation of the rough set theory. In: Polkowski, L., Skowron, A. (eds.) RSCTC 1998. LNCS (LNAI), vol. 1424, pp. 605–608. Springer, Heidelberg (1998). https://doi.org/10.1007/3-540-69115-4_85
Inuiguchi, M., et al.: LEM2-based rule induction via clustering decision classes. In: IEEE International Conference in Systems, Man and Cybernetics, 10–12 October 2005, vol. 3, pp. 2781–2786 (2005)
Preprocessed DePaul CTI Web Usage Data. http://facweb.cs.depaul.edu/mobasher/classes/ect584/resource.html. Accessed 20 Oct 2013
Performance Evaluation. http://www.seas.gwu.edu/~bell.csci243.lectures/performance.pdf. Accessed 10 July 2014
Joshi, M., Vaidya, R., Lingras, P.: Automatic determination of learning styles In: 2nd International Conference on Education and Management Technology IPEDR, vol. 13, pp. 1–7. IACSIT Press, Singapore (2011)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Rani, G., Chakraverty, S. (2018). A Rough Set Based Approach for Web User Profiling. In: Panda, B., Sharma, S., Roy, N. (eds) Data Science and Analytics. REDSET 2017. Communications in Computer and Information Science, vol 799. Springer, Singapore. https://doi.org/10.1007/978-981-10-8527-7_45
Download citation
DOI: https://doi.org/10.1007/978-981-10-8527-7_45
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-8526-0
Online ISBN: 978-981-10-8527-7
eBook Packages: Computer ScienceComputer Science (R0)