Abstract
In the process of rule generation from databases, the volume of generated rules often greatly exceeds the size of the underlying database. Typically only a small fraction of that large volume of rules is of any interest to the user. We believe that the main challenge facing database mining is what to do with the rules after having generated them. Rule post-processing involves selecting rules which are relevant or interesting, building applications which use the rules and finally, combining rules together to form a larger and more meaningful statements. In this paper we propose an application programming interface which enables faster development of applications which rely on rules. We also provide a rule query language which allows both selective rule generation as well as retrieval of selected categories of rules from the pre-generated rule collections.
Similar content being viewed by others
References
Agrawal, R., Imielinski, T., and Swami, A. 1993. Mining associations rules between sets of items in large databases. Proceedings of ACM SIGMOD Conference on Management of Data (SIGMOD'93), Washington D.C., pp. 207-216.
Agrawal, R. and Srikant, R. 1994. Fast algorithms for mining association rules. VLDB'94, Santiago, Chile, pp. 487-499.
Brin, S., Motwani, R., Ullman, J., and Tsur, S. 1997. Dynamic itemset counting and implication rules for market basket data. Proceedings of ACM SIGMOD Conference on Management of Data (SIGMOD'93), Tuscon, Arizona, pp. 255-264.
Han, J., Fu, Y., Koperski, K., Wang, W., and Zaiane, O. 1996. DMQL: A data mining query language for relational databases. DMKD-96 (SIGMOD-96 Workshop on KDD), Montreal, Canada.
Imielinski, T. and Mannila, H. 1996. A database perspective on knowledge discovery. Communications of the ACM, 39(11):58-64.
Imielinski, T. and Virmani, A. 1999. M-sql: A query language for database mining. In Data Mining and Knowledge Discovery, Companion paper.
International Organization for Standardization (ISO) and American National Standards Institue (ANSI). (ISO-ANSI working draft) database language SQL (SQL3). Technical report, 1994.
Mannila, H., Toivonen, H., and Verkamo, A.I. 1995. Discovering frequent episodes in sequences. In Proceedings of the First International Conference on Knowledge Discovery and Data Mining (KDD'95)., U.M. Fayyad and R. Uthurusamy (Eds.), Montreal, Canada, AAAI Press, pp. 210-215.
Meo, R., Psaila, G., and Ceri, S. 1996. A new sql-like operator for mining association rules. Proceedings of the 22nd International Conference on Very Large Data Bases (VLDB'96), Bombay, India, pp. 122-133.
Park, J.S., Chen, M.-S., and Yu, P.S. 1995. An effective hash based algorithm for mining association rules. Proceedings of ACM SIGMOD Conference on Management of Data (SIGMOD'95), San Jose, California, pp. 175-186.
Savasere, A., Omiecinski, E., and Navathe, S. 1995. An efficient algorithm for mining association rules in large databases. Proceedings of the 21st International Conference on Very Large Data Bases (VLDB'95), Zurich, Switzerland, pp. 432-444.
Shen, W.M., Ong, K., Mitbander, B., and Zaniolo, C. 1996. Metaqueries for data mining. In Advances in Knowledge Discovery and Data Mining, U.M. Fayyad, G.P.-Shapiro, P. Smyth, and R. Uthurusamy (Eds.), Menlo Park, CA: AAAI Press.
Tsur, D., Abbiteboul, S., Clifton, C., Motwani, R., and Nestrov, S. 1997. Query flocks: A generalization of association rules. Proceedings of ACM SIGMOD Conference on Management of Data (SIGMOD'98), Seattle, Washington.
Virmani, A. 1998. Discovery board: A query based approach to data mining. Ph.D. Thesis, Rutgers University.
Wrobel, S., Wettschereck, D., Sommer, E., and Emde, W. 1996. Extensibility in data mining systems. Proceedings of the Second International Conference on Knowledge Discovery and Data Mining (KDD'96), Portland, Oregon.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Imieliński, T., Virmani, A. & Abdulghani, A. DMajor—Application Programming Interface for Database Mining. Data Mining and Knowledge Discovery 3, 347–372 (1999). https://doi.org/10.1023/A:1009841028985
Issue Date:
DOI: https://doi.org/10.1023/A:1009841028985