Language impairment in adults with end-stage liver disease: application of natural language processing towards patient-generated health records

Dickerson, Lindsay K.; Rouhizadeh, Masoud; Korotkaya, Yelena; Bowring, Mary Grace; Massie, Allan B.; McAdams-Demarco, Mara A.; Segev, Dorry L.; Cannon, Alicia; Guerrerio, Anthony L.; Chen, Po-Hung; Philosophe, Benjamin N.; Mogul, Douglas B.

doi:10.1038/s41746-019-0179-9

Download PDF

Article
Open access
Published: 04 November 2019

Language impairment in adults with end-stage liver disease: application of natural language processing towards patient-generated health records

Lindsay K. Dickerson ORCID: orcid.org/0000-0001-6509-8393¹,
Masoud Rouhizadeh²,
Yelena Korotkaya³,
Mary Grace Bowring⁴,
Allan B. Massie^4,5,
Mara A. McAdams-Demarco⁴,
Dorry L. Segev^4,5,
Alicia Cannon⁶,
Anthony L. Guerrerio ORCID: orcid.org/0000-0003-2458-7160³,
Po-Hung Chen⁷,
Benjamin N. Philosophe⁵ &
…
Douglas B. Mogul³

npj Digital Medicine volume 2, Article number: 106 (2019) Cite this article

5177 Accesses
14 Citations
60 Altmetric
Metrics details

Subjects

Abstract

End-stage liver disease (ESLD) is associated with cognitive impairment ranging from subtle alterations in attention to overt hepatic encephalopathy that resolves after transplant. Natural language processing (NLP) may provide a useful method to assess cognitive status in this population. We identified 81 liver transplant recipients with ESLD (4/2013–2/2018) who sent at least one patient-to-provider electronic message pre-transplant and post-transplant, and matched them 1:1 to “healthy” controls—who had similar disease, but had not been evaluated for liver transplant—by age, gender, race/ethnicity, and liver disease. Messages written by patients pre-transplant and post-transplant and controls was compared across 19 NLP measures using paired Wilcoxon signed-rank tests. While there was no difference overall in word length, patients with Model for End-Stage Liver Disease Score (MELD) ≥ 30 (n = 31) had decreased word length in pre-transplant messages (3.95 [interquartile range (IQR) 3.79, 4.14]) compared to post-transplant (4.13 [3.96, 4.28], p = 0.01) and controls (4.2 [4.0, 4.4], p = 0.01); there was no difference between post-transplant and controls (p = 0.4). Patients with MELD ≥ 30 had fewer 6+ letter words in pre-transplant messages (19.5% [16.4, 25.9] compared to post-transplant (23.4% [20.0, 26.7] p = 0.02) and controls (25.0% [19.2, 29.4]; p = 0.01). Overall, patients had increased sentence length pre-transplant (12.0 [9.8, 13.7]) compared to post-transplant (11.0 [9.2, 13.3]; p = 0.046); the same was seen for MELD ≥ 30 (12.3 [9.8, 13.7] pre-transplant vs. 10.8 [9.6, 13.0] post-transplant; p = 0.050). Application of NLP to patient-generated messages identified language differences—longer sentences with shorter words—that resolved after transplant. NLP may provide opportunities to detect cognitive impairment in ESLD.

Large language models to identify social determinants of health in electronic health records

Article Open access 11 January 2024

Case-control study of neuropsychiatric symptoms in electronic health records following COVID-19 hospitalization in 2 academic health systems

Article 15 June 2022

Natural language processing for mental health interventions: a systematic review and research framework

Article Open access 06 October 2023

Introduction

End-stage liver disease (ESLD) is associated with a wide spectrum of neurocognitive impairment ranging from minimal alterations in attention, working memory, and psychomotor speed to coma and death.^1,2,3 Approximately 80% of patients with ESLD have neurocognitive changes associated with poorer quality of life, including deteriorating sleep and work performance. As many as 20% of adults with ESLD can develop the most severe form of cognitive impairment, overt hepatic encephalopathy (HE), which may portend up to 43% mortality at one year.^1,3,4,5,6

Despite the considerable burden caused by these cognitive deficits, diagnosis remains challenging. Common diagnostic modalities include blood tests (e.g., ammonia, amino acid profiles), neurocognitive assessments (e.g., Wechsler Adult Intelligence Scale-Fourth Edition, Delis–Kaplan Executive Function System, The Stroop Color and Word Test), computer-based test batteries, electroencephalogram, and imaging.^{1,7,8,9,10,11,12} While the Psychometric HE Score (PHES), a paper–pencil test battery, is considered to be the gold standard for diagnosis of minimal HE (MHE), it remains labor-intensive to administer and requires an in-person clinical visit.¹⁰ Other modalities such as blood ammonia levels correlate poorly with neurocognitive testing and severity of HE.^13,14 Given the absence of precise diagnosis and substantial mortality risk associated with late-stage encephalopathy, there exists a need for a real-time and determinate means to identify ESLD-related cognitive impairment.

Research on potential language impairment in ESLD is limited and inconclusive.¹⁵ One study demonstrated that patients with MHE had deficits in verbal fluency and phrase construction that resolved after transplantation, while other research has concluded that language remains largely intact.^16,17 Techniques developed from the field of natural language processing (NLP)—a subfield of computer science employing computational techniques to learn, understand, and produce human language content—offer an innovative approach for evaluating ESLD-related language alterations.¹⁸ NLP is increasingly common in non-medical contexts, with its myriad uses including virtual voice-based assistants (e.g. Siri from Apple, Alexa from Amazon), therapy chat bots (e.g. Woebot), and forensic linguistics methods to identify authors of anonymous publications.¹⁹ Despite the widespread utilization of electronic medical records (EMRs), application of NLP technology has only recently extended to healthcare and is typically exploited more often for analyzing provider documentation than assessing patients’ written text or transcribed spoken language. Examples include: identification of key words in patients’ electronic notes that indicate bleeding, use in an algorithm to stratify risk in patients with cirrhosis, and incorporation into a mortality prediction model in the intensive care unit (ICU).^20,21,22

The aim of this pilot study was to determine if NLP-based methods can identify and characterize language differences in patients with ESLD that may suggest the presence of neurocognitive impairment. Thus, we evaluated and compared the language patterns in patient-generated electronic messages written before and after transplant and by “healthy” controls with liver disease but who were not undergoing evaluation for transplantation. We hypothesized that NLP would detect language alterations that resolved after transplantation, with the post-transplant messages being similar to the language patterns of controls.

Results

Study population

The median age (interquartile range, IQR) of the 81 transplanted patients included in analysis was 53.8 years (19–69; Table 1). Among these, 40% were female, 73% were Caucasian, and 38% had a Model for End-stage Liver Disease (MELD) score ≥30 at transplant; controls matched demographically. The most frequent indication for transplant was hepatitis C virus (HCV) cirrhosis (33.3%), followed by nonalcoholic steatohepatitis (NASH) cirrhosis (17.3%). The median (IQR) number of messages per patient was 3 (1–10) in the pre-transplant period, 10 (4–24) post-transplant, and 3 (2–5) for controls. For patients with MELD ≥ 30, the median number of messages per patient was 5 (1–18) in the pre-transplant period, 15 (5–39) post-transplant, and 2 (1–4) for controls. A transjugular Intrahepatic portosystemic shunt was placed in 2% of cases and no controls. Among cases, 54% people used at least one drug that can treat HE, including 16% that used rifaximin, 7% that used lactulose, and 25% that used both. In contrast, 1% of controls (n = 1) used rifaximin and it was prescribed for bacterial overgrowth. Though cases and controls were not matched by use of drugs that may influence cognition (e.g., opioids), the frequency of their use was similar in each group.

Table 1 Demographics, Model for End-stage Liver Disease (MELD) score, message number, and transplant indication

Full size table

Lexical domain

Word length

Among all patients with ESLD (n = 81), word length was similar pre-transplant (median [IQR]: 4.06 [3.85, 4.35]) and post-transplant (4.12 [3.99, 4.23]; p = 0.9; Table 2). Word length in control messages (4.1 [3.9, 4.4]) was also similar to pre-transplant (p = 0.6) and post-transplant (p = 1.0). Among patients with MELD ≥ 30, words were on average shorter in pre-transplant messages (3.95 [3.79, 4.14]) compared to post-transplant (4.13 [3.96, 4.28]; p = 0.01) and controls (4.2 [4.0, 4.4]; p = 0.01) (Table 3). There was no difference in word length between post-transplant and control messages for MELD ≥ 30 (p = 0.4).

Table 2 Change in NLP measures between messages pre-transplant to post-transplant and vs. controls

Full size table

Table 3 Change in NLP measures among patients w/MELD ≥ 30 at transplant (n = 31)

Full size table

6+ letter words

Among all patients with ESLD, there were no differences in the percent of 6+ letter words pre-transplant compared to post-transplant or controls. However, patients with MELD ≥ 30 had a lower percentage of 6+ letter words in pre-transplant messages (19.5 [16.4, 25.9] compared to post-transplant (23.4 [20.0, 26.7]; p = 0.02) and controls (25.0 [19.2, 29.4]; p = 0.01). There was no difference in percent of 6+ letter words between post-transplant and control messages for MELD ≥ 30 (p = 0.6).

Numeral words, capitalized words

There were no differences in the percent of numeral words or capitalized words between pre-transplant and post-transplant messages or control messages, overall or for those with MELD ≥ 30.

Lexico-syntactic domain

Pronouns

Among all patients with ESLD, there was no difference in percent of pronouns between pre-transplant (11.5 [9.7, 14.5]) and post-transplant messages (12.0 [10.1, 13.2]; p = 0.9). Control messages contained a higher percent of pronouns (13.4 [11.1, 15.4]) when compared to pre-transplant (p = 0.048) and post-transplant (p = 0.02). Among patients with MELD ≥ 30, there was no difference in percent of pronouns between pre-transplant (11.5 [8.9, 14.1] and post-transplant messages (12.1 [10.3, 13.1]; p = 0.3). However, patients with MELD ≥ 30 had a lower percent of pronouns in pre-transplant messages compared to controls (13.5 [11.9, 15.4]; p = 0.02), and similar percent of pronouns in post-transplant and control messages (p = 0.1).

Function words, question words, verbs, adjectives, nouns

Among all patients with ESLD and those with MELD ≥ 30, there were no differences in these NLP measures between pre-transplant, post-transplant, and control messages.

Syntactic domain

Sentence length

Among all patients with ESLD, sentence length was increased in pre-transplant messages (12.0 [9.8, 13.7]) compared to post-transplant (11.0 [9.2, 13.3]; p = 0.046). Control messages had slightly decreased sentence length (10.5 [8.7, 13.8]), however, this was not statistically significant when compared to pre-transplant (p = 0.1) or post-transplant (p = 0.5) messages. Patients with MELD ≥ 30 had marginally increased sentence length in pre-transplant messages (12.3 [9.8, 13.7]) when compared to post-transplant (10.8 [9.6, 13.0]; p = 0.050). Control messages had slightly decreased sentence length (10.8 [8.6, 15.1]), however, this was not statistically significant when compared to pre-transplant (p = 0.3) or post-transplant (p = 0.9) messages.

Noun phrase ratio

Among all patients with ESLD, there was no difference in the noun phrase ratio pre-transplant and post-transplant, or when compared to controls. Among patients with MELD ≥ 30, the noun phrase ratio was greater in pre-transplant (1.63 [1.47, 1.82]) than in post-transplant messages (1.55 [1.45, 1.71; p = 0.03). However, there were no differences between control messages (1.6 [1.5, 1.7]) and pre-transplant (p = 0.4) or post-transplant (p = 0.2) among those with MELD ≥ 30.

Subject–verb–object ratio, Flesch–Kincaid grade readability score

Among all patients with ESLD and those with MELD ≥ 30, there were no differences in subject–verb–object ratio or Flesch–Kincaid grade readability score between pre- and post-transplant messages or control messages.

Lexico-semantic domain

Unique words, Brunét’s Index, Honoré’s statistic

Among all patients with ESLD and those with MELD ≥ 30, there were no differences observed in these lexico-semantic measures between pre-transplant, post-transplant, and control messages.

Sentiment domain

Polarity, subjectivity

Among all patients with ESLD and those with MELD ≥ 30, there were no differences in polarity or subjectivity between groups.

Discussion

In this retrospective, pilot study of patient-generated EMR messages, certain NLP measures identified linguistic differences in messages of adults with liver disease before as compared to after transplant. Most notably among patients with high MELD score, messages written in the pre-transplant period contained longer sentences consisting of shorter words and longer noun phrases, in contrast to shorter sentences with longer words and shorter noun phrases in the post-transplant period. After transplant, these differences largely resolved such that post-transplant messages were similar to controls. Thus, NLP identified differences primarily in the lexical and syntactic domains. Results were particularly pronounced in patients with MELD ≥ 30, indicating that NLP may be especially useful for identifying neurocognitive changes as liver disease severity worsens.

Although there are limited studies that have applied NLP tools to analyze cognitive impairment, to the best of our knowledge, this pilot study is the first analysis that applies these tools towards patient-generated EMR messages. Recently, Beltrami et al. compared recorded transcripts of people with known cognitive impairment and healthy controls and identified several lexical and syntactic differences between the groups.¹⁵ Presently about one quarter of people in the United States have engaged with their EMR, and about one in five people have sent a message to a provider, thus indicating the existence of troves of data that can be analyzed to better characterize changes in language in specific populations.³⁴ If findings from our study can be replicated in larger populations, both with ESLD, as well as other disorders associated with cognitive deficits, it may represent a tremendous opportunity to identify screen for these disorders in as well as monitor their disease as it progresses.

While ESLD is known to be associated with neurologic and psychiatric complications, few studies have investigated the impact of ESLD on language. In one study by Mooney et al. using a test battery to examine cognitive dysfunction in patients with ESLD, language appeared preserved.¹⁷ In contrast, Adekanle et al. reported that patients with cirrhosis scored lower than controls in a test’s language domain with respect to naming, comprehension, fluency, definition, and repetition.³⁵ Furthermore, Mattarozzi et al. found that word fluency (i.e. the ability to form and express words necessary for normal social and occupational function) and phrase construction (i.e. the grammatical arrangement of words in a phrase) improved 6 months after liver transplant in adults with cirrhosis and MHE.¹⁶ Furthermore, in children with chronic liver disease, de-Paula et al. noted delays in language development in children awaiting transplantation compared to those who had undergone transplantation.³⁶

In our study, the somewhat surprising finding that pre-transplant messages were actually longer than post-transplant messages may suggest that cognitive impairment leads to rambling sentences with simpler words prior to transplant, compared to more concise sentences with longer words after transplant and in control messages (Supplementary Table 2). For example, one patient wrote pre-transplant: “I was asked by my wife to request that when you send the fax to my disability insurance company that if you could include the following (word count: 26) [list of information to include]. Once I go back to work full time and be off work again I will have to wait an extended period of time before, and if I will receive any benefits (word count: 31).” The same patient wrote post-transplant: “I have scheduled a small vacation and will be out of town for 3 days (word count: 15). My wife wants to make sure this will be ok (word count: 10).”

Given that language alterations may be subtle in liver disease, technology from NLP could fill a current gap in diagnosis of MHE, and provide an opportunity for earlier and more consistent detection of linguistic differences indicative of cognitive decline. Using NLP-based methods is an appropriate choice in this context for multiple reasons. First, these methods have been used successfully for diagnostic and prognostic purposes in neurological and psychiatric disorders.³⁷ For example, Elvevag et al. employed a technique in NLP called latent semantic analysis (LSA) to quantify incoherence, a complex and nonconcrete measure, in the transcribed speech of patients with schizophrenia.³⁸ In addition, Beltrami et al. deemed NLP techniques promising for identifying cognitive decline in the transcribed speech of patients with preclinical dementia.¹⁵ Second, as discussed by Kreimeyer et al. NLP has been shown to be a suitable tool for analyzing and gathering data from unstructured free text such as that in the EMR.³⁹ For example, Amazon Web Services’ new NLP service Comprehend Medical takes advantage of this capability for clinical decision support, revenue cycle management, and other tasks.⁴⁰ Application of NLP to identify subclinical cognitive decline in the EMR messages of patients with ESLD builds on the above research.

Although these pilot data suggest the potential for NLP tools to identify important abnormalities in cognition, one important limitation of this study was that we were unable to correlate our results to validated measurements or clinical consequences of HE, such as neurocognitive testing or inability to perform activities of daily living, respectively. Indeed, the abnormalities we are detecting here may also be seen in other forms of dementia, such as uremic encephalopathy of end-stage kidney disease or Alzheimer’s disease. Future prospective studies should include standardized cognitive assessment that are most suggestive of HE to, among other aims, determine a threshold at which NLP can identify alterations in cognition. A second limitation is that pre-transplant and post-transplant messages were analyzed within a single specific time period rather than longitudinally. While we chose to analyze for this pilot study the period before and after transplant, since this interval is likely to show the biggest change in cognition, additional studies should incorporate NLP technology over the course of months to establish potential changes in language over time as patients progress to, and subsequently recover following, transplant. Studies could likewise correlate variations in language as people are receiving therapy for HE to determine if treatment influences language as detected using NLP.

Additional limitations concern the patients that were included in analysis. For example, it is possible that patients with the most severe language abnormalities were excluded from results as they needed a proxy to write to their provider. While this could indicate language differences among groups were underestimated, it also highlights the importance of confirming that the message author is the patient of interest, should NLP technology be incorporated into an automatic clinical decision support system in the future. Similarly, only a subset of patients transplanted during the study interval (81/234 or 35%) met the inclusion criteria of having at least one message in the pre-transplant and post-transplant periods and alcoholic cirrhosis was underrepresented in the study population (13.6%) as compared to the the group of people that were excluded (n = 151) due to lack of EMR messages (34.4%). These observations suggest the possibility that findings from our study might not be representative of the general population. It is worth noting that the matched analysis of our study, in which cases are both compared before and after transplant as well as to controls with similar diseases, strengthens the internal validity of the findings even as one should be cautious in extending the findings to other groups. At the same time, it is now understood that use of the internet and smartphones has reached “near saturation” such that individuals from all socioeconomic groups at least have access to platforms such as EPIC MyChart, thereby allowing for the possibility that our findings could be generalized to other groups.⁴¹

An important next step would be to use NLP tools to prospectively analyze patients’ language in comparison to a standardized neuropsychological testing measure or other established clinical or diagnostic indicators of neurocognitive impairment. Subsequent findings may allow for the creation of a clinical decision support system built into the EMR that automatically analyzes patient-to-provider messages, alerting providers when further evaluation of a patient’s cognitive status is warranted. The hope would be that this could change the care delivered to the patient, and potentially even justify advocacy for higher placement on the transplant waitlist.

As NLP use in medicine grows, careful consideration should be given to ensuring clinical effectiveness, seamless integration into care processes, and high-value application of the technology. We believe patient-generated, unstructured free text in the EMR is a largely untapped cache of data for which NLP analysis is uniquely suited. While NLP has shown promise in identifying cognitive impairment in patients with neurologic and psychiatric conditions through analysis of transcribed speech, the application of the technology to patient-generated EMR messages for the purpose of detecting language abnormalities is novel, particularly in evaluating ESLD-related cognitive decline.

Methods

Identification of transplanted patients

We identified 469 adults (>18 years) with ESLD who received a liver transplant at the Johns Hopkins Hospital (JHH) from April 1, 2013, when patient-generated electronic messages were incorporated into the EMR, to January 31, 2018. Participants were eligible if they had: (1) at least one patient-generated message in the “pre-transplant” period, defined as 6 months prior to the date of admission for transplant, (2) one patient-generated message in the “post-transplant” period, defined as between 30 days and 6 months after discharge, and (3) absence of an International Classification of Diseases, 10th revision (ICD-10) diagnosis code indicating a neurologic/psychiatric comorbidity or developmental disability (e.g. dementia, schizophrenia, or autism spectrum disorder; Supplemental Table 3). Among 469 transplanted patients, 205 were first excluded because they had an ICD-10 diagnosis code indicating neurologic/psychiatric comorbidity. Among the remaining 234 patients, 153 were excluded because they did not have at least one MyChart message in both the pre-transplant and post-transplant period. Eighty-one individuals with ESLD satisfied inclusion/exclusion criteria. The Institutional Review Board for each hospital in the Johns Hopkins Healthcare System approved this study. A waiver of informed consent was obtained by the Institutional Review Boards. The data are not publicly available given that they are not de-identifiable and could compromise patient privacy.

Identification of healthy controls

Transplanted patients were matched 1:1 to “healthy” controls who had at least one patient-generated message based on the following criteria: age at transplant (±8 years), gender, and race/ethnicity, and diagnosis similar to the indication for transplant. For example, a patient who was transplanted for “alcoholic cirrhosis” was matched to a control with “alcoholic liver disease” and a patient transplanted for “hepatocellular carcinoma” was matched to a control with “benign neoplasm of the liver.” In instances in which no controls matched all the necessary characteristics, controls with the diagnosis of “abnormal liver function tests” were selected (Supplementary Table 3). All charts were manually reviewed to make sure potential controls had no evidence of cirrhosis as indicated by clinical notes, laboratory, imaging, or biopsy reports. Potential controls who had been referred for liver transplant evaluation or who had evidence of severe medical comorbidities (e.g., cancer, congestive heart failure), frequent hospitalizations, neurologic/psychiatric comorbidities, or developmental disability were excluded.

Data extraction

The Johns Hopkins Healthcare System uses EPIC (Verona, WI), an EMR that allows for electronic messages to be sent through a portal called MyChart. The MyChart interface is similar to email, and patients access MyChart through a web-based portal on their computers, tablets, or smartphones. In addition to messages, demographic and clinical information was extracted including age, gender, race/ethnicity, indication for transplant, ICD-10 diagnosis codes, and allocation MELD score at transplant (i.e., score used for allocation, either the calculated score or the score with exception points, whichever is greater). A manual chart review was performed when a patient’s transplant indication was inconclusive based on ICD-10 codes. System-generated and redundant (i.e. copied) content was removed from the messages, and the text was segmented into individual sentences based on reasonable judgment of a native English-speaking reviewer (LD). Greetings, endings, and contact or identifying information, such as phone number, address, or date of birth was annotated and excluded from NLP results. Messages written by proxies, such as a spouse, were identified by the reviewer (LD) and removed from analysis.

NLP-based measures

Nineteen NLP measures across five domains were analyzed using Python NLP Libraries, including Natural Language Toolkit and spaCy (Explosion AI, Berlin, Germany).

Lexical domain

Lexical measures encompass the property of words and the vocabulary of language, and include word length (i.e. letters per word), percent of 6+ letter words, percent of numeral words, and percent of capitalized words.²³

Lexico-syntactic domain

Lexico-syntactic measures refer to the property of words in the context of a sentence’s grammatical structure (mainly parts of speech) and include percent of function words (i.e. conjunctions and prepositions such as “and,” “but,” “by,” “with”), question words, verbs, adjectives, nouns, and pronouns.²⁴

Syntactic domain

Syntactic measures characterize the grammatical structure of sentences and include sentence length (i.e. words per sentence), subject–verb–object ratio (i.e. number of subject–verb–object triples per sentence), noun phrase ratio (i.e. the sum of words in phrases functioning as the sentence’s subject, object, or prepositional object divided by the total number of these phrases), and Flesch–Kincaid grade level (i.e. the readability score of text expressed as a U.S. grade level based on calculations involving total words, sentences, and syllables).^25,26

Lexico-semantic domain

Lexico-semantic measures target the semantic use, or meaning, of words and lexical richness (i.e. the ratio of total words to unique words), such as percent of unique words (i.e. type–token ratio).²⁷ Brunét’s Index (W), also in the lexico-semantic domain, generates a measure of lexical richness from calculations involving the total number of words (N) and the total vocabulary used (V), such that W = N^v−0.165, with a lower value denoting richer text.^28,29 Honore’s statistic (R) generates a lexical richness measure from calculations involving the total number of words, total vocabulary, and words used only once (V₁), such that R = 100 log N/(1−V₁/V), with a higher value denoting richer text.^30,31

Sentiment domain

Sentiment measures include polarity and subjectivity.^32,33 Polarity quantifies the positive, negative, or neutral emotional content expressed in text while subjectivity quantifies the author’s personal feelings, opinion, or beliefs (polarity scale −1 to +1, subjectivity scale 0 to +1).

Statistical analysis

NLP measures were calculated for each message. If patients sent multiple messages pre-transplant or post-transplant, then the average of the NLP measures was calculated to represent their pre-transplant or post-transplant results, respectively. Paired Wilcoxon signed-rank tests were used to compare the NLP measures in messages written by patients with ESLD pre-transplant versus post-transplant, pre-transplant versus “healthy” controls, and post-transplant versus healthy controls (n = 81). Similar analyses were performed for the subgroup of ESLD patients with MELD ≥ 30 (n = 31). All analyses were assessed for statistical significance at the α = 0.05 confidence level and were not adjusted for multiple comparisons. All analyses were performed using Stata/SE 15 (StataCorp).

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The data are not publicly available since they contain identifiable information that could compromise patient privacy.

References

Vilstrup, H. et al. Hepatic encephalopathy in chronic liver disease: 2014 practice guidelines by the American Association for the Study of Liver Diseases and the European Association for the Study of the Liver. Hepatology 60, 715–735 (2014).
Article Google Scholar
Weissenborn, K. et al. Attention, memory, and cognitive function in hepatic encephalopathy. Metab. Brain Dis. 20, 359–367 (2005).
Article Google Scholar
Bajaj, J. S. Minimal hepatic encephalopathy matters in daily life. World J. Gastroenterol. 14, 3609–3615 (2008).
Article Google Scholar
Rakoski, M. O. et al. Burden of cirrhosis on older Americans and their families: analysis of the health and retirement study. Hepatology 55, 184–191 (2012).
Article Google Scholar
Kandiah, P. A. & Kumar, G. Hepatic encephalopathy—the old and the new. Crit. Care Clin. 32, 311–329 (2016).
Article Google Scholar
Jalan, R. et al. Role of predisposition, injury, response and organ failure in the prognosis of patients with acute-on-chronic liver failure: a prospective cohort study. Crit. Care 16, R227 (2012).
Article Google Scholar
Kaplan, P. W. & Rossetti, A. O. EEG patterns and imaging correlations in encephalopathy: encephalopathy part II. J. Clin. Neurophysiol. 28, 233–251 (2011).
Article Google Scholar
Amodio, P. et al. Prevalence and prognostic value of quantified electroencephalogram (EEG) alterations in cirrhotic patients. J. Hepatol. 35, 37–45 (2001).
Article CAS Google Scholar
Kato, A. et al. Regional differences in cerebral glucose metabolism in cirrhotic patients with subclinical hepatic encephalopathy using positron emission tomography. Hepatol. Res. 17, 237–245 (2000).
Article CAS Google Scholar
Weissenborn, K., Ennen, J. C., Schomerus, H., Rückert, N. & Hecker, H. Neuropsychological characterization of hepatic encephalopathy. J. Hepatol. 34, 768–773 (2001).
Article CAS Google Scholar
Krieger, S. et al. Neuropsychiatric profile and hyperintense globus pallidus on T1-weighted magnetic resonance images in liver cirrhosis. Gastroenterology 111, 147–155 (1996).
Article CAS Google Scholar
Holecek, M. Ammonia and amino acid profiles in liver cirrhosis: effects of variables leading to hepatic encephalopathy. Nutrition 31, 14–20 (2015).
Article CAS Google Scholar
Gundling, F. et al. How to diagnose hepatic encephalopathy in the emergency department. Ann. Hepatol. 12, 108–114 (2013).
Article Google Scholar
Ong, J. P. et al. Correlation between ammonia levels and the severity of hepatic encephalopathy. Am. J. Med. 114, 188–193 (2003).
Article CAS Google Scholar
Beltrami, D. et al. Speech analysis by Natural Language Processing techniques: a possible tool for very early detection of cognitive decline? Front. Aging Neurosci. 10, 369 (2018).
Article Google Scholar
Mattarozzi, K. et al. Minimal hepatic encephalopathy: longitudinal effects of liver transplantation. Arch. Neurol. 61, 242–247 (2004).
Article Google Scholar
Mooney, S. et al. Utility of the Repeatable Battery for the Assessment of Neuropsychological Status (RBANS) in patients with end-stage liver disease awaiting liver transplant. Arch. Clin. Neuropsychol. 22, 175–186 (2007).
Article Google Scholar
Hirschberg, J. & Manning, C. D. Advances in natural language processing. Science 349, 261–266 (2015).
Article CAS Google Scholar
Raskin, V., Hempelmann, C. F. & Triezenberg, K. E. Semantic forensics: an application of ontological semantics to information assurance. In Text Meaning ’04 Proceedings. 105–112 (Purdue University, 2004). https://www.aclweb.org/anthology/W04-0914/.
Taggart, M. et al. Comparison of 2 natural language processing methods for identification of bleeding among critically ill patients. JAMA Netw. Open 1, 1–11 (2018).
Article Google Scholar
Chang, E. K. et al. Defining a patient population with cirrhosis: an automated algorithm with natural language processing. J. Clin. Gastroenterol. 50, 889–894 (2016).
Article Google Scholar
Marafino, B. J. et al. Validation of prediction models for critical care outcomes using Natural Language Processing of Electronic Health Record data. JAMA Netw. Open 1 (2018).
Article Google Scholar
Baddeley, A. D., Thomson, N. & Buchanan, M. Word length and the structure of short-term memory. J. Verbal Learn. Verbal Behav. 14, 575–589 (1975).
Article Google Scholar
Thompson, C. K., Ballard, K. J., Tait, M. E., Weintraub, S. & Mesulam, M. Patterns of language decline in non-fluent primary progressive aphasia. Aphasiology 11, 297–321 (1997).
Article Google Scholar
Bird, H., Lambon Ralph, M. A., Patterson, K. & Hodges, J. R. The rise and fall of frequency and imageability: noun and verb production in semantic dementia. Brain Lang. 73, 17–49 (2000).
Article CAS Google Scholar
Kincaid, J. P., Fishburne, R. P. Jr., Rogers, R. L. & Chissom, B. S. Derivation of new readability formulas (Automated Readability Index, Fog Count, and Flesch Reading Ease Formula for Navy enlisted personnel. Research Branch Report 8-75 (Naval Technical Training, Millington, TN; U.S. Naval Air Station, Memphis, TN, 1975).
Blanken, G., Dittman, J., Christian Haas, J. & Wallesch, C. W. Spontaneous speech in senile dementia and aphasia: implications for a neurolinguistic model of language production. J. Cogn. 27, 247–274 (1987).
Article CAS Google Scholar
Brunét, E. Le Vocabulaire de Jean Giraudoux: Structure et Evolution. (Slatkine, Genéve, 1978).
Google Scholar
Asp, E. D. & Villiers, De, J. When Language Breaks Down: Analysing Discourse in Clinical Contexts(Cambridge University Press: Cambridge, 2010) p. 97.
Book Google Scholar
Honoré, A. Some simple measures of richness of vocabulary. Assoc. Lit. Linguist. Comput. Bull. 7, 172–177 (1979).
Google Scholar
Bucks, R. S., Singh, S., Cuerden, J. M. & Wilcock, G. K. Analysis of spontaneous conversational speech in dementia of Alzheimer’s type: evaluation of an objective technique for analyzing lexical performance. Aphasiology 14, 1–35 (2000).
Article Google Scholar
Pang, B. & Lee, L. Opinion mining and sentiment analysis. Found. Trends Inf. Retr. 2, 1–135 (2008).
Article Google Scholar
Gilbert, C.H.E. Vader: a parsimonious rule-based model for sentiment analysis of social media text. In Proc. Eighth International AAAI Conference on Weblogs and Social Media (2014).
Patel, V. & Johnson, C. Trends in Individuals’ Access, Viewing and Use of Online Medical Records and Other Technology for Health Needs: 2017–2018, Vol. 13 (2019). https://www.ruralcenter.org/resource-library/trends-in-individuals%26%23039%3B-access-viewing-and-use-of-online-medical-records-and.
Adekanle, O., Sumnon, T. A., Komolafe, M. & Ndububa, D. A. Cognitive functions in patients with liver cirrhosis: assessment using community screening interview for dementia. Ann. Afr. Med. 11, 222–229 (2012).
Article Google Scholar
De Paula, E. M., Porta, G., Tannuria, A. C. A., Tannuri, U. & Befi-Lopes, D. M. Language assessment of children with severe liver disease in a public service in Brazil. Clinics 72, 351–357 (2017).
Article Google Scholar
De Boer, J. N. et al. Clinical use of semantic space models in psychiatry and neurology: a systematic review and meta-analysis. Neurosci. Biobehav. Rev. 93, 85–92 (2018).
Article Google Scholar
Elvevag, B., Foltz, P. W., Weinberger, D. R. & Goldberg, T. E. Quantifying incoherence in speech: an automated methodology and novel application to schizophrenia. Schizophr. Res. 93, 304–316 (2007).
Article Google Scholar
Kreimeyer, K. et al. Natural language processing systems for capturing and standardizing unstructured clinical information: a systematic review. J. Biomed. Inf. 73, 14–29 (2017).
Article Google Scholar
Amazon Web Services. Amazon Comprehend—Natural Language Processing (NLP) and Machine Learning (ML). aws.amazon.com/comprehend/ (2019).
Hitlin, P. Use of Internet, Social Media, Digital Devices Plateau in US (Pew Research Center, 2018).

Download references

Acknowledgements

D.B.M. is supported by grant number 5K08HS023876-02 from the Agency for Healthcare Research and Quality. A.B.M. is supported by grant number 5K01DK10167704 from the National Institute of Diabetes, Digestive, and Kidney Disease (NIDDK); D.L.S. is supported by grant number K24DK101828 from the NIDDK. M.A.M.-D. is supported by R01DK114074 from the NIDDK. P.-H.C. is supported by 5KL2TR001077 from the National Center for Advancing Translational Sciences. The study’s design, collection, analysis, and interpretation were independent of these funding sources.

Author information

Authors and Affiliations

Johns Hopkins University School of Medicine, Baltimore, MD, USA
Lindsay K. Dickerson
Institute for Clinical and Translational Research, Johns Hopkins University School of Medicine, Baltimore, MD, USA
Masoud Rouhizadeh
Department of Pediatrics, Johns Hopkins University School of Medicine, Baltimore, MD, USA
Yelena Korotkaya, Anthony L. Guerrerio & Douglas B. Mogul
Department of Epidemiology, Johns Hopkins University Bloomberg School of Public Health, Baltimore, MD, USA
Mary Grace Bowring, Allan B. Massie, Mara A. McAdams-Demarco & Dorry L. Segev
Department of Surgery, Johns Hopkins University School of Medicine, Baltimore, MD, USA
Allan B. Massie, Dorry L. Segev & Benjamin N. Philosophe
Kennedy Krieger Institute, Baltimore, MD, USA
Alicia Cannon
Department of Internal Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, USA
Po-Hung Chen

Authors

Lindsay K. Dickerson
View author publications
You can also search for this author in PubMed Google Scholar
Masoud Rouhizadeh
View author publications
You can also search for this author in PubMed Google Scholar
Yelena Korotkaya
View author publications
You can also search for this author in PubMed Google Scholar
Mary Grace Bowring
View author publications
You can also search for this author in PubMed Google Scholar
Allan B. Massie
View author publications
You can also search for this author in PubMed Google Scholar
Mara A. McAdams-Demarco
View author publications
You can also search for this author in PubMed Google Scholar
Dorry L. Segev
View author publications
You can also search for this author in PubMed Google Scholar
Alicia Cannon
View author publications
You can also search for this author in PubMed Google Scholar
Anthony L. Guerrerio
View author publications
You can also search for this author in PubMed Google Scholar
Po-Hung Chen
View author publications
You can also search for this author in PubMed Google Scholar
Benjamin N. Philosophe
View author publications
You can also search for this author in PubMed Google Scholar
Douglas B. Mogul
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conception/design of the study—L.K.D., M.R., M.G.B., D.B.M. Acquisition/analysis/interpretation of data—all drafting/revising manuscript—all final approval of submitted version—all.

Corresponding author

Correspondence to Douglas B. Mogul.

Ethics declarations

Competing interests

D.L.S. receives speaking and advisory honoraria from Novartis, Sanofi, and CSL Behring. The remaining authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

supplementary tables

Reporting Summary Checklist

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Dickerson, L.K., Rouhizadeh, M., Korotkaya, Y. et al. Language impairment in adults with end-stage liver disease: application of natural language processing towards patient-generated health records. npj Digit. Med. 2, 106 (2019). https://doi.org/10.1038/s41746-019-0179-9

Download citation

Received: 10 May 2019
Accepted: 23 September 2019
Published: 04 November 2019
DOI: https://doi.org/10.1038/s41746-019-0179-9

This article is cited by

An artificial neural network approach for the language learning model
- Zulqurnain Sabir
- Salem Ben Said
- Qasem Al-Mdallal
Scientific Reports (2023)

Subjects

Abstract

Similar content being viewed by others

Large language models to identify social determinants of health in electronic health records

Case-control study of neuropsychiatric symptoms in electronic health records following COVID-19 hospitalization in 2 academic health systems

Natural language processing for mental health interventions: a systematic review and research framework

Introduction

Results

Study population

Lexical domain

Word length

6+ letter words

Numeral words, capitalized words

Lexico-syntactic domain

Pronouns

Function words, question words, verbs, adjectives, nouns

Syntactic domain

Sentence length

Noun phrase ratio

Subject–verb–object ratio, Flesch–Kincaid grade readability score

Lexico-semantic domain

Unique words, Brunét’s Index, Honoré’s statistic

Sentiment domain

Polarity, subjectivity

Discussion

Methods

Identification of transplanted patients

Identification of healthy controls

Data extraction

NLP-based measures

Lexical domain

Lexico-syntactic domain

Syntactic domain

Lexico-semantic domain

Sentiment domain

Statistical analysis

Reporting summary

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

supplementary tables

Reporting Summary Checklist

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

An artificial neural network approach for the language learning model

Search

Quick links