skip to main content
10.1145/3589335.3651448acmconferencesArticle/Chapter ViewAbstractPublication PageswwwConference Proceedingsconference-collections
short-paper
Free Access

BoxCare: A Box Embedding Model for Disease Representation and Diagnosis Prediction in Healthcare Data

Published:13 May 2024Publication History

ABSTRACT

Diagnosis prediction is becoming crucial to develop healthcare plans for patients based on Electronic Health Records (EHRs). Existing works usually enhance diagnosis prediction via learning accurate disease representation, where many of them try to capture inclusive relations based on the hierarchical structures of existing disease ontologies such as those provided by ICD-9 codes. However, they overlook exclusive relations that can reflect different and complementary perspectives of the ICD-9 structures, and thus fail to accurately represent relations among diseases and ICD-9 codes. To this end, we propose to project disease embeddings and ICD-9 code embeddings into boxes, where a box is an axis-aligned hyperrectangle with a geometric region and two boxes can clearly "include" or "exclude" each other. Upon box embeddings, we further obtain patient embeddings via aggregating the disease representations for diagnosis prediction. Extensive experiments on two real-world EHR datasets show significant performance gains brought by our proposed framework, yielding average improvements of 6.04% for diagnosis prediction over state-of-the-art competitors.

Skip Supplemental Material Section

Supplemental Material

hdp0053.mp4

Supplemental video

mp4

22.3 MB

References

  1. T. Bai, S. Zhang, B. L. Egleston, and S. Vucetic. Interpretable representation learning for healthcare via capturing disease progression through time. In SIGKDD, pages 43--51, 2018.Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. E. Choi, M. T. Bahadori, J. A. Kulas, A. Schuetz, W. F. Stewart, and J. Sun. Retain: An interpretable predictive model for healthcare using reverse time attention mechanism. In NeurIPS, 2016.Google ScholarGoogle Scholar
  3. E. Choi, M. T. Bahadori, L. Song, W. F. Stewart, and J. Sun. Gram: graph-based attention model for healthcare representation learning. In SIGKDD, 2017.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. E. R. Hansen, T. Sagi, and K. Hose. Diagnosis prediction over patient data using hierarchical medical taxonomies. Workshop Proceedings of the EDBT/ICDT, 2023.Google ScholarGoogle Scholar
  5. S. Jiang, Q. Yao, Q. Wang, and Y. Sun. A single vector is not enough: Taxonomy expansion via box embeddings. In WWW, pages 2467--2476, 2023.Google ScholarGoogle Scholar
  6. A. E. Johnson, T. J. Pollard, L. Shen, H. L. Li-Wei, M. Feng, M. Ghassemi, B. Moody, P. Szolovits, L. A. Celi, and R. G. Mark. Mimic-iii, a freely accessible critical care database. Scientific data, 2016.Google ScholarGoogle Scholar
  7. C. Lu, C. K. Reddy, P. Chakraborty, S. Kleinberg, and Y. Ning. Collaborative graph learning with auxiliary text for temporal event prediction in healthcare. In IJCAI, 2021.Google ScholarGoogle ScholarCross RefCross Ref
  8. J. Luo, M. Ye, C. Xiao, and F. Ma. Hitanet: Hierarchical time-aware attention networks for risk prediction on electronic health records. In SIGKDD, pages 647--656, 2020.Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. F. Ma, R. Chitta, J. Zhou, Q. You, T. Sun, and J. Gao. Dipole: Diagnosis prediction in healthcare via attention-based bidirectional recurrent neural networks. In SIGKDD, pages 1903--1911, 2017.Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. F. Ma, Q. You, H. Xiao, R. Chitta, J. Zhou, and J. Gao. Kame: Knowledge-based attention model for diagnosis prediction in healthcare. In CIKM, 2018.Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Y. Onoe, M. Boratko, A. McCallum, and G. Durrett. Modeling fine-grained entity types with box embeddings. In ACL, 2021.Google ScholarGoogle ScholarCross RefCross Ref
  12. T. J. Pollard, A. E. Johnson, J. D. Raffa, L. A. Celi, R. G. Mark, and O. Badawi. The eicu collaborative research database, a freely available multi-center database for critical care research. Scientific data, 2018.Google ScholarGoogle Scholar
  13. Z. Qiao, Z. Zhang, X. Wu, S. Ge, and W. Fan. Mhm: Multi-modal clinical data based hierarchical multi-label diagnosis prediction. In SIGIR, 2020.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Z. Sun, X. Yang, Z. Feng, T. Xu, X. Fan, and J. Tian. Ehr2hg: Modeling of ehrs data based on hypergraphs for disease prediction. In BIBM, 2022.Google ScholarGoogle ScholarCross RefCross Ref
  15. Q. Suo, J. Chou, W. Zhong, and A. Zhang. Tadanet: Task-adaptive network for graph-enriched meta-learning. In SIGKDD, pages 1789--1799, 2020.Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Y. Tan, C. J. Yang, X. Wei, C. Chen, W. Liu, L. Li, J. Zhou, and X. Zheng. Metacare: Meta-learning with hierarchical subtyping for cold-start diagnosis prediction in healthcare data. In SIGIR, pages 449--459, 2022.Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. M. Usama, B. Ahmad, W. Xiao, M. S. Hossain, and G. Muhammad. Self-attention based recurrent convolutional neural network for disease prediction using healthcare data. Comput Methods Programs Biomed, 190:105191, 2020Google ScholarGoogle Scholar

Index Terms

  1. BoxCare: A Box Embedding Model for Disease Representation and Diagnosis Prediction in Healthcare Data

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      WWW '24: Companion Proceedings of the ACM on Web Conference 2024
      May 2024
      1928 pages
      ISBN:9798400701726
      DOI:10.1145/3589335

      Copyright © 2024 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 13 May 2024

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • short-paper

      Acceptance Rates

      Overall Acceptance Rate1,899of8,196submissions,23%
    • Article Metrics

      • Downloads (Last 12 months)26
      • Downloads (Last 6 weeks)26

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader