poster_nih_SH_Multi_Modal_Indexing.pptx
Heterogeneity is a hallmark of clinical Big Data: the clinical picture of a patient is documented in narratives, images, signals, and other modalities. Searching such data requires multi-modal indexes that capture the knowledge encoded across all forms of clinical documents. Taking advantage of state-of-the-art deep learning methods, we generated a multi-modal index over a vast collection of electroencephalography (EEG) signal recordings and EEG reports and employed it in a patient cohort retrieval system. To index the EEG reports, we identified their sections and performed medical language processing. To process the EEG signal recordings, we represented them as EEG signal fingerprints: low-dimensional vectors produced by deep learning methods applied to the Big Data of EEG signals. Moreover, we organized the EEG fingerprints into a similarity-based hierarchy, which was included in the multi-modal index.
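The two signal-side ideas above (fingerprints as low-dimensional vectors, and a similarity-based hierarchy over them) can be sketched as follows. This is a minimal illustration, not the poster's actual deep learning encoder: the random projection stands in for the learned fingerprint model, and the greedy agglomerative grouping stands in for the hierarchy construction, whose exact method the poster does not specify.

```python
import numpy as np

def eeg_fingerprint(signal, dim=16, seed=0):
    """Map a raw EEG signal window to a unit-length, low-dimensional
    fingerprint vector. A fixed random projection is used here only
    as a placeholder for the learned deep encoder."""
    rng = np.random.default_rng(seed)
    proj = rng.standard_normal((dim, signal.shape[0]))
    vec = proj @ signal
    return vec / np.linalg.norm(vec)

def similarity_hierarchy(fingerprints):
    """Greedy agglomerative grouping by cosine similarity between
    (renormalized) cluster centroids; returns the merge order as
    (cluster_i, cluster_j, new_cluster_id) tuples."""
    clusters = {i: [i] for i in range(len(fingerprints))}
    centroids = {i: f for i, f in enumerate(fingerprints)}
    merges = []
    next_id = len(fingerprints)
    while len(clusters) > 1:
        ids = list(clusters)
        best = None
        # Find the most similar pair of current clusters.
        for a in range(len(ids)):
            for b in range(a + 1, len(ids)):
                i, j = ids[a], ids[b]
                sim = float(centroids[i] @ centroids[j])
                if best is None or sim > best[0]:
                    best = (sim, i, j)
        _, i, j = best
        clusters[next_id] = clusters.pop(i) + clusters.pop(j)
        merged = centroids.pop(i) + centroids.pop(j)
        centroids[next_id] = merged / np.linalg.norm(merged)
        merges.append((i, j, next_id))
        next_id += 1
    return merges
```

A hierarchy like this lets a retrieval system prune dissimilar branches instead of comparing a query fingerprint against every recording.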
We used the multi-modal index in a patient cohort retrieval system called MERCuRY (Multi-modal EncephalogRam patient Cohort discoveRY), which relies on medical language processing to identify the inclusion and exclusion criteria in queries generated by neurologists. Using state-of-the-art relevance models adapted to the multi-modal index, we obtained very promising results, indicating that the multi-modal index bridges the gap between the electrode potentials recorded in the EEG signals and the clinical findings documented in the EEG reports.
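Adapting relevance models to a multi-modal index implies combining evidence from the textual and signal modalities into one patient ranking. As a hedged sketch, the snippet below fuses per-modality rankings with reciprocal rank fusion; this is a standard fusion technique used here for illustration, not necessarily the relevance model MERCuRY employs, and the patient IDs and `k` constant are hypothetical.

```python
def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several per-modality rankings (lists of patient IDs,
    best first) into a single cohort ranking. Each patient scores
    sum(1 / (k + rank)) over the rankings it appears in."""
    scores = {}
    for ranking in rankings:
        for rank, patient_id in enumerate(ranking, start=1):
            scores[patient_id] = scores.get(patient_id, 0.0) + 1.0 / (k + rank)
    return sorted(scores, key=scores.get, reverse=True)

# Hypothetical example: one ranking from EEG-report text relevance,
# one from EEG-fingerprint similarity.
text_ranking = ["p1", "p2", "p3"]
signal_ranking = ["p2", "p3", "p1"]
cohort = reciprocal_rank_fusion([text_ranking, signal_ranking])
```

Rank-based fusion sidesteps the problem that text relevance scores and signal similarity scores live on incomparable scales.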