Abstract
Empirical equations are an important class of regularities that can be discovered in databases. We concentrate on the role of equations as definitions of attribute values. Such definitions can be used in many ways in a single database and for transfer of knowledge between databases. We present a quest for equations that can be used as definitions of an attribute in a given database. That quest triggers a discovery mechanism that specializes in searching recursively a system of databases and returns a set of partial definitions. We introduce the notion of shared operational semantics. It is founded on an equation-based system of partial definitions and it gives necessary foundations for designing local query answering systems in a distributed two-layered information system (D2LIS). The knowledge exchange between two sites of D2LIS may only improve an equation-based system of partial definitions at each of these sites. At the same time the shared operational semantics will better interpret user queries. Operational semantics augments the earlier developed semantics for rules used as attribute definitions. To put the shared operational semantics on a firm theoretical foundation we give a formal interpretation of queries which justifies empirical equations in their definitional role.
Similar content being viewed by others
References
Batini, C., Lenzerini, M., and Navathe, S. (1986). A Comparative Analysis of Methodologies for Database Schema Integration, ACM Computing Surveys, 18(4), 325–364.
Bridgman, P.W. (1927). The Logic of Modern Physics. The Macmillan Company.
Carnap, R. (1936). Testability and Meaning. Philosophy of Science, Vol. 3.
Dzeroski, S. and Todorovski, L. (1993). Discovering Dynamics. In Proc. of 10th International Conference on Machine Learning (pp. 97–103).
Grzymala-Busse, J. (1992). LERS-A System for Learning from Examples Based on Rough Sets. In R. Slowinski (Ed.), Intelligent Decision Support, Handbook of Applications and Advances of the Rough Sets Theory (pp. 3–18). Kluwer Academic Publishers.
Klopotek, M., Michalewicz, M., Michalewicz, Z., Ras, Z., Wierzchon, S., and Zytkow, J. (1997). Discovering Knowledge in Distributed Databases. In Proc. of 6th InternationalWorkshop on Intelligent Information Systems (pp. 128–138).
Kryszkiewicz, M. and Rybinski, H. (1996). Reducing Information Systems with Uncertain Attributes. In ISMIS'96 Proceedings (pp. 285–294). LNCS/LNAI, Vol. 1079, Springer.
Maitan, J., Ras, Z., and Zemankova, M. (1989). Query Handling and Learning in a Distributed Intelligent System. In Z.W. Ras (Ed.), Methodologies for Intelligent Systems, Vol. IV (pp. 118–127). North Holland.
Maluf, D. and Wiederhold, G. (1997). Abstraction of Representation for Interoperation. In Proceedings of Tenth International Symposium on Methodologies for Intelligent Systems (pp. 441–455). LNCS/LNAI, Vol. 1325, Springer-Verlag.
Michalski, R.S., Mozetic, I., Hong, J., and Lavrac, N. (1986). The Multipurpose Incremental Learning System AQ15 and its Testing Application to Three Medical Domains. In Proceedings of the Fifth National Conference on Artificial Intelligence (pp. 1041–1045). Morgan Kaufmann.
Navathe, S. and Donahoo, M. (1995). Towards Intelligent Integration of Heterogeneous Information Sources. In Proceedings of the Sixth International Workshop on Database Re-engineering and Interoperability.
Nordhausen, B. and Langley, P. (1993). An Integrated Framework for Empirical Discovery, Machine Learning, 12, 17–47.
Pawlak, Z. (1984). Rough Classification, International Journal of Man-Machine Studies, 20, 469–483.
Prodromidis, A.L. and Stolfo, S. (1998). Mining Databases with Different Schemas: Integrating Incompatible Classifiers. In Proceedings of the Fourth Intern. Conf. on Knowledge Discovery and Data Mining (pp. 314–318). AAAI Press.
Ras, Z. (1997). Resolving Queries Through Cooperation in Multi-Agent Systems. In T.Y. Lin and N. Cercone (Eds.), Rough Sets and Data Mining (pp. 239–258). Kluwer Academic Publishers.
Ras, Z. and Joshi, S. (1997). Query Approximate Answering System for an Incomplete DKBS, Fundamenta Informaticae Journal, 30(3/4), 313–324.
Ras, Z. and Zemankova, M. (1990). Intelligent Query Processing in Distributed Information Systems. In Z.W. Ras and M. Zemankova (Eds.), Intelligent Systems: State of the Art and Future Directions (pp. 357–370). Ellis Horwood Series in Artificial Intelligence, London, England.
Ras, Z. and Żytkow, J. (1999). Discovery of Equations and the Shared Operational Semantics in Distributed Autonomous Databases. In PAKDD'99 Proceedings (pp. 453–463). LNCS/LNAI, Vol. 1574, Springer-Verlag.
Żytkow, J. (1982). An Interpretation of a Concept in Science by a Set of Operational Procedures. In W. Krajewski (Ed.), Polish Essays in the Philosophy of the Natural Sciences (pp. 169–185). Boston Studies in the Philosophy of Science, Vol. 68, Reidel.
Żytkow, J. and Zembowicz, R. (1993). Database Exploration in Search of Regularities, Journal of Intelligent Information Systems, Vol. 2, 39–81.
Żytkow, J.M., Zhu, J., and Zembowicz, R. (1992). Operational Definition Refinement: A Discovery Process. In Proceedings of the Tenth National Conference on Artificial Intelligence (pp. 76–81). The AAAI Press.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Ras, Z.W., Żytkow, J.M. Mining for Attribute Definitions in a Distributed Two-Layered DB System. Journal of Intelligent Information Systems 14, 115–130 (2000). https://doi.org/10.1023/A:1008779617939
Issue Date:
DOI: https://doi.org/10.1023/A:1008779617939