Deciphering Raw Data in Neuro-Symbolic Learning with Provable Guarantees

Authors

  • Lue Tao National Key Laboratory for Novel Software Technology, Nanjing University, China School of Artificial Intelligence, Nanjing University, China
  • Yu-Xuan Huang National Key Laboratory for Novel Software Technology, Nanjing University, China School of Artificial Intelligence, Nanjing University, China
  • Wang-Zhou Dai National Key Laboratory for Novel Software Technology, Nanjing University, China School of Intelligence Science and Technology, Nanjing University, China
  • Yuan Jiang National Key Laboratory for Novel Software Technology, Nanjing University, China School of Artificial Intelligence, Nanjing University, China

DOI:

https://doi.org/10.1609/aaai.v38i14.29455

Keywords:

ML: Neuro-Symbolic Learning, KRR: Diagnosis and Abductive Reasoning, ML: Classification and Regression, ML: Learning Theory, ML: Multi-instance/Multi-view Learning, ML: Statistical Relational/Logic Learning

Abstract

Neuro-symbolic hybrid systems are promising for integrating machine learning and symbolic reasoning, where perception models are facilitated with information inferred from a symbolic knowledge base through logical reasoning. Despite empirical evidence showing the ability of hybrid systems to learn accurate perception models, the theoretical understanding of learnability is still lacking. Hence, it remains unclear why a hybrid system succeeds for a specific task and when it may fail given a different knowledge base. In this paper, we introduce a novel way of characterising supervision signals from a knowledge base, and establish a criterion for determining the knowledge’s efficacy in facilitating successful learning. This, for the first time, allows us to address the two questions above by inspecting the knowledge base under investigation. Our analysis suggests that many knowledge bases satisfy the criterion, thus enabling effective learning, while some fail to satisfy it, indicating potential failures. Comprehensive experiments confirm the utility of our criterion on benchmark tasks.

Published

2024-03-24

How to Cite

Tao, L., Huang, Y.-X., Dai, W.-Z., & Jiang, Y. (2024). Deciphering Raw Data in Neuro-Symbolic Learning with Provable Guarantees. Proceedings of the AAAI Conference on Artificial Intelligence, 38(14), 15310-15318. https://doi.org/10.1609/aaai.v38i14.29455

Issue

Section

AAAI Technical Track on Machine Learning V