Abstract
Spatial High Utility Itemset Mining (SHUIM) aims to discover all itemsets in a spatiotemporal database that satisfy the user-specified minimum utility (minUtil) and maximum distance (maxDist) constraints. The popular adoption and successful industrial application of SHUIM suffers from the following two limitations: (i) Since SHUIM determines the interestingness of an itemset without taking into account its support within the data, SHUIM facilitates sporadic itemsets with high utility to be generated as SHUIs. In particular, items in long transactions can combine with each other and be generated as SHUIs. (ii) SHUIM is a computationally expensive process because the generated itemsets do not satisfy the downward closure property. This paper introduces Spatial High Utility Frequent Itemset Mining (SHUFIM) to address these two issues. A SHUI in a spatiotemporal database is said to be a SHUFI if its support is no less than the user-specified minimum support (minSup) constraint. The usage of minSup not only facilitates the proposed model to be tolerant to the long transactions within the data but also facilitates us to employ additional pruning techniques to reduce the computational cost. A single scan fast algorithm has also been proposed to discover all SHUFIs in a spatiotemporal database. Experimental results demonstrate that the proposed algorithm is efficient. We also demonstrate the usefulness of the proposed model with a real-world application.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Agrawal, R., Srikant, R.: Fast algorithms for mining association rules in large databases. In: Proceedings of 20th International Conference on Very Large Data Bases, VLDB 1994, vol. 1215, pp. 487–499 (1994)
Ahmed, C.F., Tanbeer, S.K., Jeong, B., Lee, Y.: Efficient tree structures for high utility pattern mining in incremental databases. IEEE Trans. Knowl. Data Eng. 21(12), 1708–1721 (2009)
Gan, W., Lin, J.C., Fournier-Viger, P., Chao, H., Hong, T., Fujita, H.: A survey of incremental high-tility itemset mining. Wiley Interdisc. Rev.: Data Min. Knowl. Discov. 8(2), e1242 (2018)
Uday Kiran, R., Yashwanth Reddy, T., Fournier-Viger, P., Toyoda, M., Krishna Reddy, P., Kitsuregawa, M.: Efficiently finding high utility-frequent itemsets using cutoff and suffix utility. In: Yang, Q., Zhou, Z.-H., Gong, Z., Zhang, M.-L., Huang, S.-J. (eds.) PAKDD 2019. LNCS (LNAI), vol. 11440, pp. 191–203. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-16145-3_15
Kiran, R.U., Zettsu, K., Toyoda, M., Fournier-Viger, P., Reddy, P.K., Kitsuregawa, M.: Discovering spatial high utility itemsets in spatiotemporal databases. In: Proceedings of the 31st International Conference on Scientific and Statistical Database Management, SSDBM 2019, Santa Cruz, CA, USA, 23–25 July 2019, pp. 49–60 (2019)
Lan, G., Hong, T., Tseng, V.S.: An efficient projection-based indexing approach for mining high utility itemsets. Knowl. Inf. Syst. 38(1), 85–107 (2014)
Liu, J., Wang, K., Fung, B.C.M.: Direct discovery of high utility itemsets without candidate generation. In: 12th IEEE International Conference on Data Mining, pp. 984–989. IEEE (2012)
Liu, Y., Liao, W., Choudhary, A.: A Two-phase algorithm for fast discovery of high utility itemsets. In: Ho, T.B., Cheung, D., Liu, H. (eds.) PAKDD 2005. LNCS (LNAI), vol. 3518, pp. 689–695. Springer, Heidelberg (2005). https://doi.org/10.1007/11430919_79
Song, W., Liu, Y., Li, J.: Bahui: fast and memory efficient mining of high utility itemsets based on bitmap. Int. J. Data Warehous. Min. (IJDWM) 10(1), 1–15 (2014)
Tseng, V.S., Shie, B., Wu, C., Yu, P.S.: Efficient algorithms for mining high utility itemsets from transactional databases. IEEE Trans. Knowl. Data Eng. 25(8), 1772–1786 (2013)
Tseng, V.S., Wu, C., Shie, B., Yu, P.S.: Up-growth: an efficient algorithm for high utility itemset mining. In: Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, 25–28 July 2010, pp. 253–262 (2010)
Yao, H., Hamilton, H.J., Butz, C.J.: A foundational approach to mining itemset utilities from databases. In: Proceedings of the Fourth SIAM International Conference on Data Mining, pp. 482–486 (2004)
Zida, S., Fournier-Viger, P., Lin, J.C., Wu, C., Tseng, V.S.: Efim: a fast and memory efficient algorithm for high-utility itemset mining. Knowl. Inf. Syst. 51(2), 595–625 (2017)
Zida, S., Fournier-Viger, P., Wu, C.-W., Lin, J.C.-W., Tseng, V.S.: Efficient mining of high-utility sequential rules. In: Perner, P. (ed.) MLDM 2015. LNCS (LNAI), vol. 9166, pp. 157–171. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-21024-7_11
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Reddy, P.P.C., Kiran, R.U., Zettsu, K., Toyoda, M., Reddy, P.K., Kitsuregawa, M. (2019). Discovering Spatial High Utility Frequent Itemsets in Spatiotemporal Databases. In: Madria, S., Fournier-Viger, P., Chaudhary, S., Reddy, P. (eds) Big Data Analytics. BDA 2019. Lecture Notes in Computer Science(), vol 11932. Springer, Cham. https://doi.org/10.1007/978-3-030-37188-3_17
Download citation
DOI: https://doi.org/10.1007/978-3-030-37188-3_17
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-37187-6
Online ISBN: 978-3-030-37188-3
eBook Packages: Computer ScienceComputer Science (R0)