ABSTRACT
To engage visitors to a Web site at a very early stage (i.e., before registration or authentication), personalization tools must rely primarily on clickstream data captured in Web server logs. The lack of explicit user ratings, together with the sparse nature and large volume of data in such a setting, poses serious challenges to standard collaborative filtering techniques in terms of scalability and performance. Web usage mining techniques, such as clustering, that rely on offline pattern discovery from user transactions can be used to improve the scalability of collaborative filtering; however, this often comes at the cost of reduced recommendation accuracy. In this paper, we propose effective and scalable techniques for Web personalization based on association rule discovery from usage data. Through detailed experimental evaluation on real usage data, we show that the proposed methodology can achieve better recommendation effectiveness while maintaining a computational advantage over direct approaches to collaborative filtering such as the k-nearest-neighbor strategy.
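The general idea of recommending pages from association rules mined offline can be illustrated with a small sketch. This is not the authors' algorithm: it is a simplified illustration that assumes sessions are already extracted from server logs as lists of page URLs, mines only single-page rules of the form a -> b, and uses hypothetical names (`mine_pair_rules`, `recommend`) and arbitrary thresholds chosen for the toy data.

```python
from collections import Counter
from itertools import combinations


def mine_pair_rules(sessions, min_support=0.3, min_confidence=0.6):
    """Mine single-antecedent rules a -> b from sessions (lists of page URLs).

    Returns a dict mapping (antecedent, consequent) to rule confidence.
    Thresholds are illustrative assumptions, not values from the paper.
    """
    n = len(sessions)
    item_counts = Counter()
    pair_counts = Counter()
    for session in sessions:
        pages = sorted(set(session))  # ignore repeat visits and ordering
        item_counts.update(pages)
        pair_counts.update(combinations(pages, 2))

    rules = {}
    for (a, b), count in pair_counts.items():
        if count / n < min_support:
            continue  # the pair is not frequent enough
        for lhs, rhs in ((a, b), (b, a)):
            confidence = count / item_counts[lhs]
            if confidence >= min_confidence:
                rules[(lhs, rhs)] = confidence
    return rules


def recommend(rules, active_session, top_n=3):
    """Score unvisited pages by the best confidence of any matching rule."""
    visited = set(active_session)
    scores = {}
    for (lhs, rhs), confidence in rules.items():
        if lhs in visited and rhs not in visited:
            scores[rhs] = max(scores.get(rhs, 0.0), confidence)
    # Highest confidence first; ties broken alphabetically for determinism.
    return sorted(scores, key=lambda page: (-scores[page], page))[:top_n]


# Toy sessions standing in for preprocessed clickstream data.
sessions = [
    ["/home", "/products", "/cart"],
    ["/home", "/products"],
    ["/home", "/about"],
    ["/products", "/cart"],
]
rules = mine_pair_rules(sessions)
print(recommend(rules, ["/cart"]))
```

Because the rules are mined offline, the online step reduces to a lookup over the rules that match the active session, which is where the computational advantage over a nearest-neighbor search across all stored sessions would come from in a setup like this.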
Index Terms
- Effective personalization based on association rule discovery from web usage data