GastrointestinalElectronic medical record: research tool for pancreatic cancer?
Introduction
Electronic medical records are becoming progressively more important for both clinical care and research. In the current era, data warehouses and repositories have been established to serve the dual function of patient care and investigation. In theory, medical data warehouses may facilitate clinical research, including assessing and improving quality and outcomes.
Currently, outcomes and health services research can be based on a number of different data sources, including individual, department, or institutional databases; patient registries; administrative databases; and, more recently, data warehouses. All these have their own respective advantages and disadvantages. Large national databases, including the National Surgical Quality Improvement Program and the Nationwide Inpatient Sample, share some common ground with data warehouses as they are in part based on International Classification of Diseases, 9th Revision, Clinical Modification (ICD-9-CM) coding. Recently, at our institution, a newly established health care information system (HIS) data warehouse was established and made available to pilot investigators [1], [2].
Large computerized health care system databases have been touted as the research tools of the future [1], [3], [4]. However, manually maintained departmental databases still exist. Our research group, Surgical Outcomes Analysis & Research, has previously analyzed different types of data resources [5], [6]. Manasanch et al. [6], [7] demonstrated that a significant number of pancreatic neuroendocrine tumors are missed via tumor registry's use of ICD-9-CM diagnosis codes [6], [7]. Other previous studies have analyzed the effectiveness of data collection systems, such as tumor registries with varying results [8], [9], [10], [11]. The aim of the present study was to determine whether clinical research could be facilitated using a novel data warehouse, using pancreatic cancer as a model.
Section snippets
HIS-linked database
Encrypted records from a newly developed HIS-linked institutional clinical data warehouse, capturing inpatient and outpatient clinical and billing information from a pool of 2.05 million patients evaluated at an academic medical institution and its affiliates since 1995, were queried. This database, still currently under development, allows investigators to use a web-based query tool to determine the aggregate total number of patients at network hospitals who meet a given set of inclusion and
Results
From an initial HIS-linked database of 2.05 million patients and 121 million data points, 1198 were identified using our search query for pancreatic neoplasm. This number was subsequently winnowed to 1107 patients on limiting to the years 1999–2009. Along the same period, 444 patients were identified in the prospectively maintained surgical database. This was subsequently narrowed to 394 patients with pancreatic neoplasm over the years 1999–2009 (Fig. 1).
Of the 1107 HIS-linked patients, 254
Discussion
Analysis of a promising new institutional data warehouse, using pancreatic neoplasm as a prototype query, has demonstrated that 45.0% of patients from the HIS-linked database presumed to have pancreatic malignancy and not to have any identifiable pancreatic pathology. Additionally, 18.7% of patients resulting from that query were found to have non-neoplastic pancreatic pathology on meticulous manual review. Furthermore, 35.5% of all patients with pathologic documentation of pancreatic neoplasm
Acknowledgment
This work was supported by American Cancer Society (MRSG-10-003), a Howard Hughes Early Career Award (NIH 1 UL1RR031982), and Pancreatic Cancer Alliance.
References (24)
- et al.
Pancreatic cancer and comparison of a hospital-based tumor registry with a National Cancer Data Base
Am J Surg
(1997) - et al.
Potential misclassification of patients with psoriasis in electronic databases
J Am Acad Dermatol
(2008) - et al.
International classification of diseases codes showed modest sensitivity for detecting community-acquired pneumonia
J Clin Epidemiol
(2007) - et al.
The completeness and accuracy of health authority and cancer registry records according to a study of ovarian neoplasms
Public Health
(1991) - et al.
A comparison of administrative versus clinical data: coronary artery bypass surgery as an example. Ischemic Heart Disease Patient Outcomes Research Team
J Clin Epidemiol
(1994) - et al.
Integration of clinical and genetic data in the i2b2 architecture
AMIA Annu Symp Proc
(2006) - Massachusetts integrated clinical academic research database. [cited 8/8/11]; Available from:...
- et al.
Using the i2b2 hive for clinical discovery: an example
AMIA Annu Symp Proc
(2007) - et al.
Serving the enterprise and beyond with informatics for integrating biology and the bedside (i2b2)
J Am Med Inform Assoc
(2010) - et al.
Pancreatic neuroendocrine tumors: the impact of surgical resection on survival
Cancer
(2009)
Tumor registry versus physician medical record review: a direct comparison of patients with pancreatic neuroendocrine tumors
J Oncol Pract
The ninth revision of the International Classification of Diseases: nosology and the specialist in community medicine
Community Med
Cited by (6)
Diagnostic accuracy of administrative database for bile duct cancer by ICD-10 code in a tertiary institute in Korea
2020, Hepatobiliary and Pancreatic Diseases InternationalCitation Excerpt :Without the V code system, confirmation of cancer patients and collection of study subjects would be difficult. It needs to be carefully performed as shown in previous studies [10–26]. This NHIS database which also collects V code is very useful.
Building a lung and ovarian cancer data warehouse
2020, Healthcare Informatics ResearchAccuracy of an administrative database for pancreatic cancer by international classification of disease 10th codes: A retrospective large-cohort study
2019, World Journal of GastroenterologyLeveraging Graph Models to Design Acute Kidney Injury Disease Research Data Warehouse
2019, 2019 6th International Conference on Social Networks Analysis, Management and Security, SNAMS 2019Development of a large urban longitudinal HIV clinical cohort using a web-based platform to merge electronically and manually abstracted data from disparate medical record systems: Technical challenges and innovative solutions
2016, Journal of the American Medical Informatics Association