Efficiently Mining Frequent Itemsets with Compact FP-Tree

Qin, Liang-Xi; Luo, Ping; Shi, Zhong-Zhi

doi:10.1007/0-387-23152-8_51

Liang-Xi Qin^2,3,4,
Ping Luo^2,3 &
Zhong-Zhi Shi²

Part of the book series: IFIP International Federation for Information Processing ((IFIPAICT,volume 163))

Included in the following conference series:

International Conference on Intelligent Information Processing

1609 Accesses

Abstract

FP-growth algorithm is an efficient algorithm for mining frequent patterns. It scans database only twice and does not need to generate and test the candidate sets that is quite time consuming. The efficiency of the FP-growth algorithm outperforms previously developed algorithms. But, it must recursively generate huge number of conditional FP-trees that requires much more memory and costs more time.

In this paper, we present an algorithm, CFPmine, that is inspired by several previous works. CFPmine algorithm combines several advantages of existing techniques. One is using constrained subtrees of a compact FP-tree to mine frequent pattern, so that it is doesn’t need to construct conditional FP-trees in the mining process. Second is using an array-based technique to reduce the traverse time to the CFP-tree. And an unified memeory management is also implemented in the algorithm. The experimental evaluation shows that CFPmine algorithm is a high performance algorithm. It outperforms Apriori, Eclat and FP-growth and requires less memory than FP-growth.

Download to read the full chapter text

Chapter PDF

Modified FP-Growth: An Efficient Frequent Pattern Mining Approach from FP-Tree

An improved frequent pattern tree: the child structured frequent pattern tree CSFP-tree

Article 26 September 2022

PUF-Tree: A Compact Tree Structure for Frequent Pattern Mining of Uncertain Data

Key words

References

Agarwal R C, Aggarwal C C, and Prasad V V V. A Tree Projection Algorithm for Generation of Frequent Itemsets. Journal of Parallel and Distributed Computing, 2001.
Google Scholar
Agrawal R, Imielinski T, Swami A. Mining association rules between sets of items in large database. In Proc of 1993 ACM SIGMOD Conf on Management of Data, 207–216, Washington DC, May 1993.
Google Scholar
Agrawal R, Srikant R. Fast algorithms for mining association rules. In Proc of the 20^th Int’l Conf on Very Large DataBases (VLDB’94). 487–499. Santiago, Chile, Sept. 1994.
Google Scholar
Brin S, Motwani R, Ullman J D, and Tsur S. Dynamic itemset counting and implication rules for market basket data. In SIGMOD Record (ACM Special Interest Group on Management of Data), 26(2):255, 1997
Google Scholar
FAN Ming, LI Chuan. Mining frequent patterns in an FP-tree without conditional FP-tree generation (In Chinese). Journal of computer research and development, 40(8): 1216–1222. 2003.
Google Scholar
Grahne G, Zhu J. Efficiently using prefix-trees in mining frequent itemsets. In: First Workshop on Frequent Itemset Mining Implementation (FIMI’03). Melbourne, FL
Google Scholar
Han J, Pei J, and Yin Y. Mining Frequent Patterns without Candidate Generation. In Proc of 2000 ACM-SIGMOD Int’l Conf on Management of Data (SIGMOD’00). 1–12. Dallas, TX, 2000.
Google Scholar
Park J S, Chen M-S and Yu P S. An Effective Hash-based Algorithm for Mining Association Rules. In: Proc of 1995 ACM-SIGMOD int’l Conf on Management of Data (SIGMOD’95). San Jose, CA, 1995. 175–186.
Google Scholar
Savasere A, Omiecinski E, Navathe S. An efficient Algorithm for Mining Association Rules in Large Databases, In Proc of 21^st Int’l Conf on Very Large Databases (VLDB’95), pages 432–443. Zurich, Switzerland, Sept. 1995.
Google Scholar
Toivonen H. Sampling Large Databases for Association Rules. In Proc of 22nd Int’l Conf on Very Large Databases (VLDB’96). pages 134–145. Bombay, India, Sept. 1996.
Google Scholar
Zaki M, Parthasarathy S, Ogihara M, and Li W. New algorithms for fast discovery of association rules. In Heckerman D, Mannila H, Pregibon D, and Uthurusamy R eds, Proc of the Third International Conference on Knowledge Discovery and Data Mining (KDD-97), page 283. AAAI Press, 1997. http://citeseer.ist.psu.edu/zaki97new.html
Google Scholar
http://fuzzy.cs.uni-magdeburg.de/~borgelt/
Google Scholar
http://www.cs.helsinki.fi/u/goethals/
Google Scholar
http://www.almaden.ibm.com/softwarequest/Resources/datasets/syndata.html
Google Scholar
http://www.ics.uci.edu/~mlearn/MLRepository.html
Google Scholar

Download references

Author information

Authors and Affiliations

Key Lab of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, 100080
Liang-Xi Qin, Ping Luo & Zhong-Zhi Shi
Graduate School of Chinese Academy of Sciences, Beijing, 100039
Liang-Xi Qin & Ping Luo
College of Computer and Information Engineering, Guangxi University, Nanning, 530004
Liang-Xi Qin

Authors

Liang-Xi Qin
View author publications
You can also search for this author in PubMed Google Scholar
Ping Luo
View author publications
You can also search for this author in PubMed Google Scholar
Zhong-Zhi Shi
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Instittue of Computing TechnologyKey, Laboratory of Int. Infor. Process., Chinese Academy of Sciences, Beijing, 100080, China
Zhongzhi Shi & Qing He &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Qin, LX., Luo, P., Shi, ZZ. (2005). Efficiently Mining Frequent Itemsets with Compact FP-Tree. In: Shi, Z., He, Q. (eds) Intelligent Information Processing II. IIP 2004. IFIP International Federation for Information Processing, vol 163. Springer, Boston, MA. https://doi.org/10.1007/0-387-23152-8_51

Download citation

DOI: https://doi.org/10.1007/0-387-23152-8_51
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-23151-8
Online ISBN: 978-0-387-23152-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Efficiently Mining Frequent Itemsets with Compact FP-Tree

Abstract

Chapter PDF

Similar content being viewed by others

Modified FP-Growth: An Efficient Frequent Pattern Mining Approach from FP-Tree

An improved frequent pattern tree: the child structured frequent pattern tree CSFP-tree

PUF-Tree: A Compact Tree Structure for Frequent Pattern Mining of Uncertain Data

Key words

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Efficiently Mining Frequent Itemsets with Compact FP-Tree

Abstract

Chapter PDF

Similar content being viewed by others

Modified FP-Growth: An Efficient Frequent Pattern Mining Approach from FP-Tree

An improved frequent pattern tree: the child structured frequent pattern tree CSFP-tree

PUF-Tree: A Compact Tree Structure for Frequent Pattern Mining of Uncertain Data

Key words

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation