Robust and Fast Measure of Information via Low-Rank Representation

Authors

  • Yuxin Dong Xi'an Jiaotong University
  • Tieliang Gong Xi'an Jiaotong University
  • Shujian Yu Vrije Universiteit Amsterdam
  • Hong Chen Huazhong Agricultural University
  • Chen Li Xi'an Jiaotong University

DOI:

https://doi.org/10.1609/aaai.v37i6.25906

Keywords:

ML: Probabilistic Methods, ML: Dimensionality Reduction/Feature Selection, ML: Kernel Methods

Abstract

The matrix-based Rényi's entropy allows us to directly quantify information measures from given data, without explicit estimation of the underlying probability distribution. This intriguing property makes it widely applied in statistical inference and machine learning tasks. However, this information theoretical quantity is not robust against noise in the data, and is computationally prohibitive in large-scale applications. To address these issues, we propose a novel measure of information, termed low-rank matrix-based Rényi's entropy, based on low-rank representations of infinitely divisible kernel matrices. The proposed entropy functional inherits the specialty of of the original definition to directly quantify information from data, but enjoys additional advantages including robustness and effective calculation. Specifically, our low-rank variant is more sensitive to informative perturbations induced by changes in underlying distributions, while being insensitive to uninformative ones caused by noises. Moreover, low-rank Rényi's entropy can be efficiently approximated by random projection and Lanczos iteration techniques, reducing the overall complexity from O(n³) to O(n²s) or even O(ns²), where n is the number of data samples and s ≪ n. We conduct large-scale experiments to evaluate the effectiveness of this new information measure, demonstrating superior results compared to matrix-based Rényi's entropy in terms of both performance and computational efficiency.

Downloads

Published

2023-06-26

How to Cite

Dong, Y., Gong, T., Yu, S., Chen, H., & Li, C. (2023). Robust and Fast Measure of Information via Low-Rank Representation. Proceedings of the AAAI Conference on Artificial Intelligence, 37(6), 7450-7458. https://doi.org/10.1609/aaai.v37i6.25906

Issue

Section

AAAI Technical Track on Machine Learning I