Improved mutual information measure for clustering, classification, and community detection

M. E. J. Newman, George T. Cantwell, and Jean-Gabriel Young
Phys. Rev. E 101, 042304 – Published 23 April 2020

Abstract

The information theoretic measure known as mutual information is widely used as a way to quantify the similarity of two different labelings or divisions of the same set of objects, such as arises, for instance, in clustering and classification problems in machine learning or community detection problems in network science. Here we argue that the standard mutual information, as commonly defined, omits a crucial term which can become large under real-world conditions, producing results that can be substantially in error. We derive an expression for this missing term and hence write a corrected mutual information that gives accurate results even in cases where the standard measure fails. We discuss practical implementation of the new measure and give example applications.

  • Figure
  • Figure
  • Figure
  • Received 7 August 2019
  • Revised 13 February 2020
  • Accepted 25 March 2020

DOI:https://doi.org/10.1103/PhysRevE.101.042304

©2020 American Physical Society

Physics Subject Headings (PhySH)

NetworksGeneral PhysicsInterdisciplinary PhysicsStatistical Physics & Thermodynamics

Authors & Affiliations

M. E. J. Newman1,2, George T. Cantwell1, and Jean-Gabriel Young2

  • 1Department of Physics, University of Michigan, Ann Arbor, Michigan 48109, USA
  • 2Center for the Study of Complex Systems, University of Michigan, Ann Arbor, Michigan 48109, USA

Article Text (Subscription Required)

Click to Expand

References (Subscription Required)

Click to Expand
Issue

Vol. 101, Iss. 4 — April 2020

Reuse & Permissions
Access Options
CHORUS

Article Available via CHORUS

Download Accepted Manuscript
Author publication services for translation and copyediting assistance advertisement

Authorization Required


×
×

Images

×

Sign up to receive regular email alerts from Physical Review E

Log In

Cancel
×

Search


Article Lookup

Paste a citation or DOI

Enter a citation
×