Skip to main content
Log in

Using the Structure of Prelarge Trees to Incrementally Mine Frequent Itemsets

  • Published:
New Generation Computing Aims and scope Submit manuscript

Abstract

The frequent pattern tree (FP-tree. is an efficient data structure for association-rule mining without generation of candidate itemsets. It was used to compress a database into a tree structure which stored only large items. It, however, needed to process all transactions in a batch way. In the past, we proposed a Fast Updated FP-tree (FUFP-tree. structure to efficiently handle new transactions and to make the tree update process become easier. In this paper, we propose the structure of prelarge trees to incrementally mine association rules based on the concept of pre-large itemsets. Due to the properties of pre-large concepts, the proposed approach does not need to rescan the original database until a number of new transactions have been inserted. The proposed approach can thus achieve a good execution time for tree construction especially when a small number of transactions are inserted each time. Experimental results also show that the proposed approach has a good performance for incrementally handling new transactions.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Agrawal, R., Imielinksi T. and Swami, A., “Mining association rules between sets of items in large database,” in The ACM SIGMOD Conference, pp. 207-216, 1993.

  2. Agrawal, R., Imielinksi T. and Swami, A., “Database mining: a performance perspective,” IEEE Transactions on Knowledge and Data Engineering, 5, 6, pp. 914-925, 1993.

    Article  Google Scholar 

  3. Agrawal R. and Srikant, R., “Fast algorithm for mining association rules,” in The International Conference on Very Large Data Bases, pp. 487-499, 1994.

  4. Agrawal, R., Srikant R. and Vu, Q., “Mining association rules with item constraints,” in The Third International Conference on Knowledge Discovery in Databases and Data Mining, pp. 67-73, 1997.

  5. Chen, M. S., Han J. and Yu, P. S., “Data mining: An overview from a database perspective,” IEEE Transactions on Knowledge and Data Engineering, 8, 6, pp. 966-883, 1996.

    Google Scholar 

  6. Cheung, D. W., Han, J., Ng, V. T. and Wong, C. Y., “Maintenance of discovered association rules in large databases: An incremental updating approach,” in The Twelfth IEEE International Conference on Data Engineering, pp. 106-114, 1996.

  7. Cheung, D.W., Lee, S. D. and Kao, B., “A general incremental technique for maintaining discovered association rules,” in Database Systems for Advanced Applications, pp. 185-194, 1997.

  8. Ezeife, C. I., “Mining incremental association rules with generalized FP-tree,” in The 15th Conference of the Canadian Society for Computational Studies of Intelligence on Advances in Artificial Intelligence, pp. 147-160, 2002.

  9. Fukuda, T., Morimoto, Y., Morishita S. and Tokuyama, T., “Mining optimized association rules for numeric attributes,” Journal of Computer and System Sciences, 58, 1, pp. 182-191, 1996.

    MathSciNet  Google Scholar 

  10. Han J. and Fu, Y., “Discovery of multiple-level association rules from large database,” in The Twenty-first International Conference on Very Large Data Bases, pp. 420-431, 1995.

  11. Han, J., Pei, J. and Yin, Y., “Mining frequent patterns without candidate generation,” in The 2000 ACM SIGMOD International Conference on Management of Data , pp. 1-12, 2000.

  12. Hong, T. P., Wang, C. Y. and Tao, Y. H., “A new incremental data mining algorithm using pre-large itemsets,” Intelligent Data Analysis, 5, 2, pp. 111- 129, 2001.

    Google Scholar 

  13. Hong, T. P., Lin, C. W. and Wu, Y. L., “Incrementally fast updated frequent pattern trees,” Expert Systems with Applications 34, 4, pp. 2424-2435, 2008.

  14. Koh, J. L. and Shieh, S. F., “An effcient approach for maintaining association rules based on adjusting FP-tree structure,” in The Ninth International Conference on Database Systems for Advanced Applications, pp. 417-424, 2004.

  15. Lin, M. Y. and Lee, S. Y., “Incremental update on sequential patterns in large databases,” in The Tenth IEEE International Conference on Tools with Artificial Intelligence, pp. 24-31, 1998.

  16. Mannila, H., Toivonen, H. and Verkamo, A. I., “Efficient algorithm for discovering association rules,” in The AAAI Workshop on Knowledge Discovery in Databases, pp. 181-192, 1994.

  17. Park, J. S., Chen, M. S. and Yu, P. S., “Using a hash-based method with transaction trimming for mining association rules,” IEEE Transactions on Knowledge and Data Engineering, 9, 5, pp. 812-825, 1997.

    Google Scholar 

  18. Qiu, Y., Lan, Y. J. and Xie, Q. S., “An improved algorithm of mining from FP-tree,” in The Third International Conference on Machine Learning and Cybernetics, pp. 26-29, 2004.

  19. Sarda, N. L. and Srinivas, N. V., “An adaptive algorithm for incremental mining of association rules,” in The Ninth International Workshop on Database and Expert Systems, pp. 240-245, 1998.

  20. Srikant, R. and Agrawal, R., “Mining generalized association rules,” in The Twenty-first International Conference on Very Large Data Bases, pp. 407-419, 1995.

  21. Srikant, R. and Agrawal, R., “Mining quantitative association rules in large relational tables,” in ACM SIGMOD International Conference on Management of Data, pp. 1-12, 1996.

  22. Zaiane, O. R. and Mohammed, E. H., “COFI-tree mining: A new approach to pattern growth with reduced candidacy generation,” in IEEE International Conference on Data Mining, 2008.

  23. Zhang, S., “Aggregation and maintenance for database mining,” Intelligent Data Analysis, pp. 475-490, 1999.

  24. Zheng, Z., Kohavi, R. and Mason, L., “Real world performance of association rule algorithms,” in The International Conference on Knowledge Discovery and Data Mining, pp. 401-406, 2001.

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Chun-Wei Lin.

About this article

Cite this article

Lin, CW., Hong, TP. & Lu, WH. Using the Structure of Prelarge Trees to Incrementally Mine Frequent Itemsets. New Gener. Comput. 28, 5–20 (2010). https://doi.org/10.1007/s00354-008-0072-6

Download citation

  • Received:

  • Revised:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00354-008-0072-6

Keywords:

Navigation