Artificial intelligence in colposcopic examination: A promising tool to assist junior colposcopists

Wu, Aiyuan; Xue, Peng; Abulizi, Guzhalinuer; Tuerxun, Dilinuer; Rezhake, Remila; Qiao, Youlin

doi:10.3389/fmed.2023.1060451

ORIGINAL RESEARCH article

Front. Med., 15 March 2023
Sec. Obstetrics and Gynecology
Volume 10 - 2023 | https://doi.org/10.3389/fmed.2023.1060451

Aiyuan Wu¹

Peng Xue²

Guzhalinuer Abulizi¹

Dilinuer Tuerxun¹

Remila Rezhake¹*†

Youlin Qiao¹,²*†

¹The Affiliated Cancer Hospital of Xinjiang Medical University, Urumqi, China
²School of Population Medicine and Public Health, Peking Union Medical College, Chinese Academy of Medical Sciences, Beijing, China

Introduction: Well-trained colposcopists are in short supply worldwide, especially in low-resource areas. Here, we aimed to evaluate the ability of the Colposcopic Artificial Intelligence Auxiliary Diagnostic System (CAIADS) to detect abnormalities on digital colposcopy images, focusing on its role in helping junior colposcopists correctly identify the lesion areas where biopsy should be performed.

Materials and methods: This hospital-based retrospective study recruited women who visited colposcopy clinics between September 2021 and January 2022. Of 1,146 women, 366 with complete medical information recorded by a senior colposcopist and valid histology results were included. Anonymized colposcopy images were reviewed by CAIADS and a junior colposcopist separately, and the junior colposcopist then reviewed the colposcopy images again with the CAIADS results (named CAIADS-Junior). The diagnostic accuracy and biopsy efficiency of CAIADS and CAIADS-Junior were assessed in detecting cervical intraepithelial neoplasia grade 2 or worse (CIN2+), CIN3+, and cancer in comparison with the senior and junior colposcopists. The factors influencing the accuracy of CAIADS were explored.

Results: For CIN2+ and CIN3+ detection, CAIADS showed a sensitivity of ~80%, which was not significantly lower than the sensitivity achieved by the senior colposcopist (for CIN2+: 80.6 vs. 91.3%, p = 0.061 and for CIN3+: 80.0 vs. 90.0%, p = 0.189). The sensitivity of the junior colposcopist increased significantly with the assistance of CAIADS (for CIN2+: 95.1 vs. 79.6%, p = 0.002 and for CIN3+: 97.1 vs. 85.7%, p = 0.039) and was comparable to that of the senior colposcopist (for CIN2+: 95.1 vs. 91.3%, p = 0.388 and for CIN3+: 97.1 vs. 90.0%, p = 0.125). In detecting cervical cancer, CAIADS achieved the highest sensitivity at 100%. For all endpoints, CAIADS showed the highest specificity (55–64%) and positive predictive values compared with both the senior and junior colposcopists. As the CIN grade increased, the average number of biopsies needed per case decreased for all subspecialists, and CAIADS required the fewest biopsies to detect each case (2.2–2.6 biopsies per case). Meanwhile, the biopsy sensitivity of the junior colposcopist was the lowest, but the CAIADS-assisted junior colposcopist achieved a higher biopsy sensitivity.

Conclusion: The Colposcopic Artificial Intelligence Auxiliary Diagnostic System could help junior colposcopists improve their diagnostic accuracy and biopsy efficiency, and might be a promising way to improve the quality of cervical cancer screening in low-resource settings.

1. Introduction

Cervical cancer remains the fourth most common malignant cancer among women, with an estimated 600,000 new cases and 340,000 deaths in 2020 (1). China has a large population and accounts for nearly 18% (106,000) of new cervical cancer cases and 14% (48,000) of deaths (2), and the morbidity and mortality of cervical cancer tended to increase from 2000 to 2016 in China (3). In 2018, the World Health Organization (WHO) called for global action to eliminate cervical cancer (4), but a considerable gap remains between the WHO goals and the real situation regarding cervical cancer prevention and control in China. Although different human papillomavirus (HPV) vaccines have been approved since 2016 in China, screening is still an indispensable prevention strategy in this post-vaccination era. HPV testing offers high sensitivity and reproducibility, provides long-term (at least 5 years) reassurance after a negative result, and has proved feasible on self-collected samples (5–7). Thus, HPV testing has been widely used in primary cervical cancer screening in many countries and is recommended as the main screening method in the latest WHO guidelines (8).

The application of such a highly sensitive screening method, if not appropriately triaged by another test, will inevitably lead to a much higher colposcopy referral rate. The colposcopic examination is the crucial step linking the primary screening and the histological diagnosis that determines the clinical decision about the optimal management of abnormal lesions (9). Colposcopy plays an irreplaceable role in the precise localization of the biopsy sites and in the early diagnosis of precancerous lesions to reduce the incidence of cervical cancer (10, 11). The accuracy of colposcopy is highly operator-dependent, resulting in low reproducibility and varied diagnostic performance between different resource settings (12). Many low- and middle-income countries lack experienced colposcopists, regular colposcopy training courses, a uniform diagnostic standard, and a strict quality control process, making colposcopy a bottleneck that restricts the benefits of cervical cancer screening programs (13).

In recent years, artificial intelligence (AI) has been rapidly developed and applied in different fields (14–16). In healthcare, AI has shown promising application value in enhancing diagnosis and personalizing treatment (17–20). There is an increasing interest in the use of deep learning-based AI technologies for the automatic assessment of medical images, which contributes to improving diagnostic accuracy and objectivity and reduces the workload of healthcare workers (21). Such advances also offer the opportunity to tackle the aforementioned challenges in colposcopic diagnosis in cervical cancer screening (22). Xue et al. developed a Colposcopic Artificial Intelligence Auxiliary Diagnostic System (CAIADS) that was trained, tuned, and validated using a large number of colposcopic images and clinical information from 19,435 patients, revealing its potential in improving the diagnostic quality of colposcopy and biopsy in the detection of cervical precancer/cancer (23). In 2022, Zhao et al. (24) concluded that the CAIADS had a higher sensitivity and similar specificity compared with colposcopists. However, the usefulness of the CAIADS in assisting less-experienced colposcopists in clinical practice is unclear.

In this study, we used hospital-based data to further evaluate the diagnostic performance of the CAIADS and its role in assisting junior colposcopists to identify the lesion areas and guide targeted biopsies.

2. Materials and methods

2.1. Study population

This was a hospital-based retrospective study. A total of 1,146 women visited the colposcopy clinics at the Affiliated Cancer Hospital of Xinjiang Medical University in Xinjiang, China, due to abnormal HPV or cytological results or other gynecological symptoms between September 2021 and January 2022. The study cohort comprised women who had standard colposcopic images consecutively taken at 0, 30, 60, 90, and 120 s during the colposcopic examination and had a valid histologic diagnosis. The exclusion criteria were radiotherapy or chemotherapy, lack of definitive pathology results, invalid colposcopic images, unknown HPV status, or unknown cytological information (Figure 1). The digital records of patients, including HPV and cytological information, colposcopic images, type of transformation zone, colposcopic diagnosis by a senior colposcopist, biopsy information (number and site), and histopathological diagnosis were collected from the hospital registry system. General information (age, smoking status, reproductive history, and HPV vaccination status) was collected from the electronic outpatient records. The study was approved by the Ethics Committee of the Affiliated Cancer Hospital of Xinjiang Medical University (approval number: K-2021055). The need for informed consent was waived because the study used anonymized data that were collected retrospectively.

Figure 1. Study flowchart. Non-imaging information comprised the clinical characteristics [human papillomavirus (HPV) status, cytological findings, colposcopic impression, and biopsy results] and demographic characteristics (age, educational level, reproductive history, and menopausal stage) obtained from the medical records. CIN, cervical intraepithelial neoplasia; HPV, human papillomavirus; CIN1, CIN grade 1; CIN2, CIN grade 2; CIN3, CIN grade 3.

2.2. Human papillomavirus testing and cytology

Human papillomavirus testing was performed using the Hybrid Capture 2 assay (Qiagen, Hilden, Germany). HPV was genotyped using the GenoArray Diagnostic Kit (HBGA-21PKG, Hybribio, China), which can identify 21 HPV subtypes, comprising 14 high-risk HPV types (HPV 16, 18, 31, 33, 35, 39, 45, 51, 52, 56, 58, 59, 66, and 68) and seven low-risk HPV types (HPV 6, 11, 42, 43, 44, 53, and 81). HPV results were classified as negative, HPV 16/18-positive, or positive for other high-risk subtypes.

Experienced cytologists from the Affiliated Cancer Hospital of Xinjiang Medical University performed liquid-based cytology (SurePath, BD Oncolarity, United States) and interpreted the results using the Bethesda 2001 classification system (25). Cytological results were classified as negative for intraepithelial lesion or malignancy (NILM); atypical squamous cells of undetermined significance (ASC-US); atypical squamous cells, cannot exclude high-grade squamous intraepithelial lesion (ASC-H); low-grade squamous intraepithelial lesion (LSIL); and high-grade squamous intraepithelial lesion (HSIL) or worse (including squamous cell carcinoma, adenocarcinoma in situ, and glandular abnormalities).

2.3. Colposcopic procedure and histological confirmation

A senior colposcopist with over 20 years of specialized experience in the colposcopy clinic used a high-resolution electronic colposcope (EDAN, China) to perform the colposcopic examination in accordance with standard clinical guidelines (26). In brief, 5% acetic acid was applied to the cervix, and the visibility of the squamocolumnar junction, presence of aceto-whitening, and colposcopic lesions were documented for each woman. The final colposcopic diagnosis was recorded as benign/normal, low-grade lesion, or high-grade lesion. The colposcopic images were saved in JPEG format (640 × 480 pixels). For each woman, the colposcopic images consisted of five sequential images, namely a pre-acid image (at 0 s) and four post-acid images with an approximate time interval of 30 s (i.e., at 60, 90, 120, and 150 s) (23). Direct biopsy was performed when the colposcopic impression was satisfactory and suspicious lesions were seen; if the colposcopy was unsatisfactory or the result was HPV 16/18-positive or cytology showed high-grade abnormalities, four random biopsies were taken at the 3, 6, 9, and 12 o’clock positions.

Senior pathologists in the Affiliated Cancer Hospital of Xinjiang Medical University performed the histologic diagnosis using hematoxylin and eosin-stained slides. When the lesions were equivocal, p16 and Ki67 immunohistochemical staining of the tissue specimens was performed and the final diagnosis was made after a conjunctive analysis of the slides stained with hematoxylin and eosin and p16/Ki67. All histopathological findings were categorized by the cervical intraepithelial neoplasia (CIN) classification system as benign, CIN grades 1, 2, and 3, and cancer, with the worst finding used as the final diagnosis.

In addition to the examinations described earlier, a junior colposcopist with 1 year of experience working in the gynecological department reviewed all colposcopic images. The junior colposcopist was aware of the HPV status and cytological findings but was blinded to the colposcopic diagnosis by the senior colposcopist and the histological diagnosis. The junior colposcopist categorized the colposcopic findings using the 2014 WHO Classification of Female Reproductive System Neoplasms (27) as normal/benign, LSIL, HSIL, and cancer.

2.4. Diagnosis by the CAIADS

The CAIADS, which was developed and initially validated by Xue et al. (23), was used to diagnose the cervical lesions. In brief, both the colposcopic images and the non-imaging information (cytology and HPV status) were input into the CAIADS to enable it to make a diagnostic judgment. The CAIADS algorithm mapped the input features (colposcopic images and non-imaging information) to the two target tasks (grading of the colposcopic impressions and guidance of biopsies) using four deep learning networks, namely cervix detection, feature encoding, graph convolutional network-based feature fusion, and lesion area segmentation networks (23, 28).

The findings of the CAIADS were categorized into three groups: benign, LSIL, and HSIL or worse (HSIL+), and the biopsy number and specific sites were indicated by the system with blue circles (Supplementary Figure 1). The CAIADS and the junior colposcopist received the same anonymized colposcopic images and non-imaging data (cytology and HPV status) to make an independent diagnosis while blinded to the senior colposcopist’s findings and the histological results. To evaluate the role of the CAIADS in assisting the junior colposcopist, the order of the colposcopic records was changed and the junior colposcopist performed a second review with the knowledge of the CAIADS results; these findings were defined as the CAIADS-assisted junior colposcopist (abbreviated as CAIADS-Junior in the subsequent text). The junior colposcopist also indicated the biopsy sites and number of biopsies on the original colposcopic images with and without the knowledge of the CAIADS.

2.5. Statistical analysis

The demographic and clinical characteristics were summarized using descriptive statistics. Taking the histological diagnosis as the gold standard, the diagnostic performances of the different subspecialists (CAIADS, senior colposcopist, junior colposcopist, and CAIADS-Junior) were evaluated separately for the different histology endpoints (CIN2+, CIN3+, and cancer). The Wilson score approach was used to calculate the 95% confidence intervals (95% CIs) of the sensitivity, specificity, positive predictive value, and negative predictive value. The sensitivity and specificity of the subspecialists were compared using McNemar's test. The areas under the curves (AUCs) were compared using the DeLong test (29). To evaluate the biopsy efficacy, the number of captured biopsies required per case (BNR) was calculated for each histology endpoint and the biopsy sensitivity was calculated (number of biopsies indicated by the subspecialists/the total number of diagnosed biopsies for specific endpoints). Binary logistic regression was used to estimate odds ratios and 95% CIs for the association of demographic and clinical characteristics with accurate diagnosis and underdiagnosis by the CAIADS. Age, ethnicity, BMI, educational level, parity, stage of menopause, cytological result, HPV status, and biopsy type were analyzed as the characteristics potentially influencing the diagnostic accuracy and underdiagnosis of the CAIADS. An accurate diagnosis was defined as a case in which the CAIADS indicated an abnormality (LSIL+) and histology confirmed CIN2+, or in which the CAIADS judged a lesion as normal without the need for biopsy and histology confirmed the lesion as < CIN2; all other cases were regarded as an inaccurate diagnosis. Among women diagnosed as normal by the CAIADS, histological confirmation of CIN2+ was defined as underdiagnosis, while a histological confirmation of normal was defined as no underdiagnosis.
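As an illustration of the Wilson score approach named above, the following minimal sketch computes the interval for a single proportion using only the Python standard library. The function name `wilson_interval` is hypothetical, and the count of 83 detected cases is an assumption back-calculated from the reported CAIADS sensitivity for CIN2+ (80.6% of the 103 CIN2+ cases); the article itself reports only percentages, and the authors ran their analyses in SPSS and MedCalc rather than in code like this.

```python
from math import sqrt
from statistics import NormalDist


def wilson_interval(successes: int, total: int, confidence: float = 0.95) -> tuple[float, float]:
    """Return the Wilson score interval for a binomial proportion."""
    if total <= 0:
        raise ValueError("total must be positive")
    # Two-sided critical value of the standard normal (about 1.96 for 95%).
    z = NormalDist().inv_cdf(0.5 + confidence / 2)
    p = successes / total
    denom = 1 + z**2 / total
    centre = (p + z**2 / (2 * total)) / denom
    half_width = z * sqrt(p * (1 - p) / total + z**2 / (4 * total**2)) / denom
    return centre - half_width, centre + half_width


# Assumption: 83 of 103 CIN2+ cases flagged, back-calculated from 80.6%.
low, high = wilson_interval(83, 103)
print(f"sensitivity = {83 / 103:.1%}, 95% CI {low:.1%} to {high:.1%}")
```

Under that assumed count, the interval agrees with the 71.9–87.1% reported for the CAIADS in the Results. Unlike the simple normal approximation, the Wilson interval stays within 0–100% even for proportions close to the boundary, such as the 100% cancer sensitivity.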

Statistical significance was defined as p < 0.05 (two-sided). All analyses were performed using IBM SPSS version 28 (IBM, New York, NY, United States) and MedCalc Statistical Software version 20 (MedCalc Software Ltd., Ostend, Belgium).
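The outcome definitions used for the logistic regression in this section can be written as two small rules. The sketch below is illustrative only: the function names and category labels are hypothetical, and the handling of CIN1 in the underdiagnosis rule is an assumption, because the text defines that outcome only for histology of CIN2+ or normal.

```python
from typing import Optional

# Histology grades that count as CIN2 or worse.
CIN2_PLUS = {"CIN2", "CIN3", "cancer"}


def is_accurate(caiads: str, histology: str) -> bool:
    """Accurate: abnormal call (LSIL+) with CIN2+, or normal call with < CIN2."""
    flagged = caiads in {"LSIL", "HSIL+"}
    return flagged == (histology in CIN2_PLUS)


def is_underdiagnosis(caiads: str, histology: str) -> Optional[bool]:
    """Defined only for cases the system called benign; None otherwise."""
    if caiads != "benign":
        return None
    if histology in CIN2_PLUS:
        return True
    if histology == "benign":
        return False
    return None  # CIN1: not covered by the definition in the text (assumption)


print(is_accurate("LSIL", "CIN3"), is_underdiagnosis("benign", "CIN2"))
```

Note that, under the accuracy rule as stated, an abnormal CAIADS call on a case with CIN1 or benign histology counts as inaccurate, while a benign call on a CIN1 case counts as accurate.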

3. Results

Figure 1 shows the flowchart of the selection of the study population. The medical records of 1,146 women with 7,646 colposcopic images were reviewed. Among them, 660 women with complete colposcopic images (five images per woman with an approximately 30-s interval between images) were identified, resulting in a total of 3,300 colposcopic images. A further 294 women were excluded due to incomplete clinical information. A total of 366 women with a median age of 44 years (range 22–85 years, interquartile range 36–52 years) and 1,830 colposcopic images were included in the final analysis (Figure 1). The detailed demographic information of the cohort is presented in Table 1.

Table 1. Demographic characteristics of the study population.

3.1. Clinical characteristics and colposcopic findings determined by the senior colposcopist

As shown in Table 2, 145 (39.6%) women had cytological results that showed atypical squamous cells of undetermined significance or worse, and 308 women (84.2%) tested positive for HPV, of which 164 (44.8%) were HPV 16/18-positive. The histology results were: 178 normal lesions (48.6%), 85 CIN1 cases (23.2%), 33 CIN2 cases (9.0%), 46 CIN3 cases (12.6%), and 24 cancer cases (6.6%). Most women (87.7%) were assessed as having an adequate cervical impression by the senior colposcopist. The senior colposcopist classified the colposcopic findings as benign (n = 132, 36.1%), low-grade lesions (n = 139, 38.0%), and high-grade lesions or worse (n = 95, 26.0%).
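The three study endpoints are cumulative, so their denominators follow directly from the histology counts above. The short sketch below derives them; the variable names are illustrative, and the counts are those reported in the text.

```python
# Histology counts as reported in the text (366 women in total).
histology = {"normal": 178, "CIN1": 85, "CIN2": 33, "CIN3": 46, "cancer": 24}
total = sum(histology.values())

# Cumulative endpoints: each includes the named grade and everything worse.
endpoints = {
    "CIN2+": histology["CIN2"] + histology["CIN3"] + histology["cancer"],
    "CIN3+": histology["CIN3"] + histology["cancer"],
    "cancer": histology["cancer"],
}

for name, n in endpoints.items():
    print(f"{name}: {n} of {total} ({n / total:.1%})")
```

These totals (103 CIN2+, 70 CIN3+, and 24 cancers) are the denominators behind every sensitivity estimate in the sections that follow.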

Table 2. Human papillomavirus (HPV), cytological, and colposcopic findings by histological diagnosis.

3.2. Colposcopic findings of the CAIADS and the junior colposcopist

The CAIADS indicated 131 LSIL cases (35.8%) and 48 HSIL+ cases (13.1%), whereas the junior colposcopist indicated 140 LSIL cases (38.3%) and 77 HSIL+ cases (21.0%). When assisted by the CAIADS, the detection rate of the junior colposcopist increased to 40.2% (n = 147) for LSIL and 23.8% (n = 87) for HSIL+.

3.3. Diagnostic performance of the CAIADS compared with different colposcopists

Table 3 and Supplementary Figure 2 show the diagnostic performance of the CAIADS in comparison with the junior and senior colposcopists, and its value in assisting the junior colposcopist. For CIN2+ and CIN3+ detection, the CAIADS showed a sensitivity of approximately 80%, which was not significantly lower than the sensitivity of the senior colposcopist (for CIN2+: 80.6%, 95% CI: 71.9–87.1% vs. 91.3%, 95% CI: 84.2–95.3%, p = 0.061; for CIN3+: 80.0%, 95% CI: 69.2–87.7% vs. 90.0%, 95% CI: 80.7–95.1%, p = 0.189). The sensitivity of the junior colposcopist was significantly increased with the assistance of the CAIADS (for CIN2+: 95.1%, 95% CI: 89.1–97.9% vs. 79.6%, 95% CI: 70.8–86.3%, p = 0.002; for CIN3+: 97.1%, 95% CI: 90.2–99.2% vs. 85.7%, 95% CI: 75.7–92.1%, p = 0.039). The sensitivity of the CAIADS-Junior was slightly higher than that of the senior colposcopist (for CIN2+: 95.1%, 95% CI: 89.1–97.9% vs. 91.3%, 95% CI: 84.2–95.3%, p = 0.388; for CIN3+: 97.1%, 95% CI: 90.2–99.2% vs. 90.0%, 95% CI: 80.7–95.1%, p = 0.125) and significantly higher than that of the CAIADS (for CIN2+: 95.1%, 95% CI: 89.1–97.9% vs. 80.6%, 95% CI: 71.9–87.1%, p = 0.003; for CIN3+: 97.1%, 95% CI: 90.2–99.2% vs. 80.0%, 95% CI: 69.2–87.7%, p = 0.004). In detecting cervical cancer, the CAIADS achieved the highest sensitivity at 100% and helped the junior colposcopist improve the sensitivity from 87.5 to 95.8%, although this difference was not statistically significant (p = 0.625).
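The paired comparisons above rely on McNemar's test, which uses only the discordant pairs, that is, cases detected by one reader but not the other. The sketch below implements the exact (binomial) form with the standard library. The function name is hypothetical, and the discordant counts are an assumption chosen for illustration: the article does not report them, and it does not state whether the exact or the chi-square form of the test was used.

```python
from math import comb


def mcnemar_exact(b: int, c: int) -> float:
    """Two-sided exact McNemar p-value from the two discordant-pair counts."""
    n = b + c
    if n == 0:
        return 1.0  # no discordant pairs, so no evidence of a difference
    k = min(b, c)
    # Probability of a split at least this uneven under Binomial(n, 0.5).
    tail = sum(comb(n, i) for i in range(k + 1)) / 2**n
    return min(1.0, 2 * tail)


# Hypothetical discordant pairs among the 24 cancers:
# b = detected only with CAIADS assistance, c = detected only without it.
p_value = mcnemar_exact(3, 1)
print(f"p = {p_value:.3f}")
```

A split of 3 versus 1 would be consistent with the reported rise in cancer sensitivity from 87.5% (21 of 24) to 95.8% (23 of 24), and it shows why so small a number of discordant cases cannot reach statistical significance.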

Table 3. Diagnostic performance of the subspecialists for different clinical endpoints.

For all endpoints, the CAIADS showed the highest specificity (55–64%) and the highest positive predictive values compared with the senior and junior colposcopists. Furthermore, the CAIADS had the highest overall accuracy for all endpoints. As shown in Figure 2 and Supplementary Figure 2E, there were significant differences between the AUC of the CAIADS and that of the junior colposcopist in detecting CIN2+ and cancer, although this difference was not significant for detecting CIN3+. The AUC of the CAIADS was significantly higher than that of the senior colposcopist in detecting cancer (0.773 vs. 0.648; p < 0.001).

Figure 2. ROCs and AUCs of the subspecialists for the different disease endpoints. (A) CIN2+, (B) CIN3+, and (C) cancer. CAIADS, colposcopic artificial intelligence auxiliary diagnostic system; CAIADS-Junior, CAIADS-assisted junior colposcopist; ROC curve, receiver operating characteristic curve; AUC, area under the curve; CIN, cervical intraepithelial neoplasia; CIN2+, CIN grade 2 or worse; CIN3+, CIN grade 3 or worse.

3.4. Biopsy efficacy and sensitivity of the CAIADS and CAIADS-junior

A total of 1,415 biopsies were taken from the 366 women by the senior colposcopist. To reflect the biopsy efficacy of the subspecialists, the BNRs (Figure 3A) and biopsy sensitivity (Figure 3B) were calculated for different histological lesions. As the lesion grade became more severe, the average number of biopsies required per case decreased for the subspecialists, among which the CAIADS required the lowest number of biopsies per case (2.6 for CIN2+, 2.4 for CIN3+, and 2.2 for cancer). The junior colposcopist demonstrated very similar BNRs (3–4) with or without the assistance of the CAIADS. The CAIADS had the highest biopsy sensitivity for CIN3 (56.5%, 95% CI: 45.8–66.7%) and for cancer (63.5%, 95% CI: 49.8–75.7%). The junior colposcopist showed the lowest biopsy sensitivity for all endpoints (for CIN2: 29.1%, 95% CI: 21.7–37.5%; for CIN3: 34.7%, 95% CI: 27.8–42.1%; for cancer: 35.4%, 95% CI: 24.5–47.5%); these sensitivities were increased with the assistance of the CAIADS (for CIN2: 33.8%, 95% CI: 26.1–42.3%; for CIN3: 40.0%, 95% CI: 33.1–47.2%; for cancer: 49.1%, 95% CI: 36.4–62.0%).

Figure 3. Biopsy efficacy and sensitivity of the subspecialists. (A) Number of biopsy sites required to detect each high-grade cervical lesion by the subspecialists. (B) Biopsy sensitivity of the subspecialists for each cervical lesion. CAIADS, colposcopic artificial intelligence auxiliary diagnostic system; CAIADS-Junior, CAIADS-assisted junior colposcopist; BNR, number of biopsies needed to detect each case for different endpoints (CIN2+/CIN3+/cancer); CIN, cervical intraepithelial neoplasia; CIN2+, CIN2 or worse; CIN3+, CIN3 or worse.

3.5. Factors influencing the accuracy of the CAIADS judgement

Table 4 and Supplementary Table 1 show the results of the univariate and multivariate logistic regression analyses of the factors influencing the diagnostic accuracy and underdiagnosis of the CAIADS. Multivariate logistic regression showed that parity (> 1) was the only demographic factor that decreased the chance of underdiagnosis by the CAIADS (OR: 0.21, 95% CI: 0.52–0.84; Table 4). No clinical factor was associated with the accuracy of the CAIADS (Supplementary Table 1).

Table 4. Logistic regression analysis of the demographic and clinical characteristics affecting CAIADS-related underdiagnosis in detecting cervical diseases.

4. Discussion

Colposcopy is the cornerstone of the cervical cancer screening program and is used in combination with pathology to determine the best management strategy. However, the accuracy of colposcopy is a worldwide concern due to its subjective nature as it is highly operator-dependent; this issue is compounded in low- to middle-income countries with a limited number of well-trained colposcopists. The inaccuracy of colposcopy is reflected by the large variation in the consistency rate between colposcopic findings and pathology, ranging from 37 to 66% (30–34). With the worldwide trend of using HPV testing as the primary screening method, which inevitably leads to a significant increase in colposcopy referrals, there is an increasing demand for high-quality colposcopic examination to precisely identify the cervical lesions and locate the biopsy sites to obtain the final pathological diagnosis. If the accuracy of colposcopy-directed biopsy cannot be guaranteed, the efficacy of the screening program will be limited.

The great advances in AI technology have brought the opportunity to improve medical practice in recent years. AI-based or deep learning-based colposcopic methods have shown promise in several studies (35–38). In these studies, AI-based colposcopy or deep learning-based colposcopy systems were trained and validated using more than 10,000 colposcopic images, and the performances of these systems were compared with colposcopists with different levels of experience. In diagnosing histologically confirmed HSIL+ cases, the reported sensitivities of AI-colposcopy, colposcopists, and AI-assisted colposcopists are 74.1–82.8%, 19.5–100%, and 66.7–84.5%, respectively. Overall, the diagnostic performance of colposcopists varies greatly, whereas the sensitivity of AI-colposcopy tends to be stable between studies. In our study, the CAIADS and CAIADS-Junior findings had a sensitivity of more than 80% for high-grade lesions. These findings further reflect the fact that, as an objective tool that is trained, set up, and validated using thousands of images, AI-colposcopy has great potential to ensure the quality of colposcopic examination, which is of particular importance in areas that lack well-trained colposcopists.

The major aim of the colposcopic examination is to precisely obtain biopsies to confirm a histological diagnosis of HSIL or cervical cancer. Most studies have only explored the diagnostic performance of AI-colposcopy (35, 36, 39, 40), and there is a lack of evidence regarding the role of AI in the last critical step (guiding biopsy), which makes AI-colposcopy less practical in areas lacking well-trained colposcopists. The CAIADS used in our study showed its advantages in colposcopic diagnosis, demonstrated its superiority in colposcopy-directed biopsy, and revealed its potential in assisting junior colposcopists to improve their targeted biopsy performance, achieving a higher efficacy and biopsy sensitivity than that of the junior colposcopist alone.

The accuracy of colposcopic diagnosis and targeted biopsy might be influenced by various factors, such as age, menopause status, cytological abnormalities, HPV infection status, and type of transformation zone (41). We performed univariate and multivariate logistic regression analyses to identify the factors associated with the accuracy of the CAIADS and CAIADS-related underdiagnosis. Our study revealed that parity was negatively correlated with underdiagnosis by the CAIADS, which is consistent with previous evidence that deliveries significantly maintain the transformation zone on the exocervix (42), making it easier for the CAIADS to identify the lesion areas. Overall, the role of the CAIADS is to assist colposcopists rather than supersede them in clinical practice and decision-making.

External validation of the CAIADS has provided powerful evidence for its accuracy in the colposcopic examination (43). The present study used an independent real-world dataset (used for neither training nor tuning) to evaluate the feasibility and effectiveness of the CAIADS, providing evidence for its clinical application in colposcopy clinics. The CAIADS was applied for the first time in Xinjiang, in an ethnically diverse population, supporting its geographical and ethnic generalizability. This external validation favored human–machine cooperation over human–machine confrontation. Previous studies have shown that humans and AI achieve similar outcomes and have suggested that humans will be replaced by AI (23, 44, 45). However, the present study revealed that the AI-assisted colposcopist achieved the best results, which is more in line with ethical, moral, and legal requirements than the use of AI alone.

The implementation of the CAIADS still faces the following problems in less-developed areas (13, 46). First, the quality of available cervical information (screening data, colposcopy images, etc.) may affect the colposcopic interpretation, and descriptive terms are not standardized in colposcopy practice due to the use of different types of colposcopic equipment, including cervical labeling, annotation, classification, and quality supervision (26, 31, 47). Thus, we aim to apply the CAIADS in various scenarios. Second, a wide area network may be difficult to achieve in less-developed areas due to the requirement for high-definition images and large running memory. Therefore, we aim to develop a software version of the CAIADS that is feasible using a local service network. Finally, colposcopists in low-resource areas may have incorrect notions about AI. For colposcopists to effectively use the CAIADS, it is important to understand that AI is a tool that assists the physician and does not take the place of a physician in making decisions.

The main strength of this study is that we externally validated AI-based colposcopy (using the CAIADS) in diagnosing cervical lesions and targeting biopsy sites in a hospital-based retrospective study in Xinjiang, China, providing important evidence on the performance and feasibility of the CAIADS in resource-limited areas. The major limitation is that, as this was a retrospective study, the CAIADS and the junior colposcopist made decisions by reviewing high-resolution colposcopic images. Therefore, some potentially malignant cases that were detected by either the CAIADS or the junior colposcopist might have been missed and thus not biopsied by the senior colposcopist. However, the senior colposcopist who performed the colposcopic examinations had more than 20 years of working experience in a colposcopy clinic, which may have reduced the risk of missed cases. Furthermore, only one junior colposcopist with 1 year of experience reviewed the colposcopic images, inevitably leading to observer bias. The current study is one of the very few to evaluate the role of AI-colposcopy in assisting a junior colposcopist in diagnosing and guiding the biopsy during cervical cancer screening, which might be the most practical way to use AI in the screening setting. Its promising findings provide the necessary evidence for future population-based, multicenter studies to further evaluate the use of AI in real-world settings.

5. Conclusion

The CAIADS may enhance the diagnostic and biopsy accuracy of junior colposcopists. Therefore, the CAIADS might be a promising solution to improve the colposcopy practice in low-resource areas with limited numbers of well-trained colposcopists.

Data availability statement

The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.

Ethics statement

The studies involving human participants were reviewed and approved by the Research Ethics Committee of Affiliated Tumor Hospital of Xinjiang Medical University. Written informed consent for participation was not required for this study in accordance with the national legislation and the institutional requirements.

Author contributions

RR and YQ designed the study. AW, PX, GA, and DT were involved in the administration of fieldwork, data collection, and assembly. AW, PX, and RR participated in manuscript writing, data analysis, and interpretation. YQ and GA provided constructive comments and revisions to the manuscript. All authors contributed to the article and approved the submitted version.

Funding

This study was supported by the State Key Laboratory of Pathogenesis, Prevention, and Treatment of High Incidence Diseases in Central Asia Fund (SKL-HIDCA-2020-GJ2), the Xinjiang Uygur Autonomous Region Postgraduate Research Innovation Project (XJ2022G162), the Postdoctoral Fund of the Affiliated Cancer Hospital of Xinjiang Medical University, and the Chinese Academy of Medical Sciences Innovation Fund for Medical Sciences (2021-I2M-1-004).

Acknowledgments

We acknowledge the great support of the staff from colposcopy clinics and the gynecologic department at the Affiliated Cancer Hospital of Xinjiang Medical University.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmed.2023.1060451/full#supplementary-material

References

1. Sung, H, Ferlay, J, Siegel, RL, Laversanne, M, Soerjomataram, I, Jemal, A, et al. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. (2021) 71:209–49. doi: 10.3322/caac.21660

2. Arbyn, M, Weiderpass, E, Bruni, L, de Sanjose, S, Saraiya, M, Ferlay, J, et al. Estimates of incidence and mortality of cervical cancer in 2018: a worldwide analysis. Lancet Glob Health. (2020) 8:e191–203. doi: 10.1016/S2214-109X(19)30482-6

3. Zheng, R, Zhang, S, Zeng, H, Wang, S, Sun, K, Chen, R, et al. Cancer incidence and mortality in China, 2016. J Natl Cancer Cent. (2022) 2:1–9. doi: 10.1016/j.jncc.2022.02.002

4. Canfell, K. Towards the global elimination of cervical cancer. Papillomavirus Res. (2019) 8:100170. doi: 10.1016/j.pvr.2019.100170

5. Zhao, F, and Qiao, Y. Cervical cancer prevention in China: a key to cancer control. Lancet. (2019) 393:969–70. doi: 10.1016/S0140-6736(18)32849-6

6. Arbyn, M, Smith, SB, Temin, S, Sultana, F, Castle, P, and Collaboration on Self-Sampling and HPV Testing. Detecting cervical precancer and reaching underscreened women by using HPV testing on self samples: updated meta-analyses. BMJ. (2018) 363:k4823. doi: 10.1136/bmj.k4823

7. Arbyn, M, Ronco, G, Anttila, A, Meijer, CJ, Poljak, M, Ogilvie, G, et al. Evidence regarding human papillomavirus testing in secondary prevention of cervical cancer. Vaccine. (2012) 30:F88–99. doi: 10.1016/j.vaccine.2012.06.095

8. Hu, SY, Zhao, XL, Zhang, Y, Qiao, YL, and Zhao, FH. Interpretation of "WHO guideline for screening and treatment of cervical pre-cancer lesions for cervical cancer prevention, second edition". Zhonghua Yi Xue Za Zhi. (2021) 101:2653–7. doi: 10.3760/cma.j.cn112137-20210719-01609

9. Green, LI, Mathews, CS, Waller, J, Kitchener, H, and Rebolj, M. Attendance at early recall and colposcopy in routine cervical screening with human papillomavirus testing. Int J Cancer. (2021) 148:1850–7. doi: 10.1002/ijc.33348

10. Brown, BH, and Tidy, JA. The diagnostic accuracy of colposcopy—a review of research methodology and impact on the outcomes of quality assurance. Eur J Obstet Gynecol Reprod Biol. (2019) 240:182–6. doi: 10.1016/j.ejogrb.2019.07.003

11. Schiffman, M, and Wentzensen, N. Issues in optimising and standardising the accuracy and utility of the colposcopic examination in the HPV era. Ecancermedicalscience. (2015) 9:530. doi: 10.3332/ecancer.2015.530

12. Hariprasad, R, Mittal, S, and Basu, P. Role of colposcopy in the management of women with abnormal cytology. Cytojournal. (2022) 19:40. doi: 10.25259/CMAS_03_15_2021

13. Xue, P, Ng, MTA, and Qiao, Y. The challenges of colposcopy for cervical cancer screening in LMICs and solutions by artificial intelligence. BMC Med. (2020) 18:169. doi: 10.1186/s12916-020-01613-x

14. Yang, S, Liu, K, Gai, J, and He, X. Transformation to industrial artificial intelligence and workers' mental health: evidence from China. Front Public Health. (2022) 10:881827. doi: 10.3389/fpubh.2022.881827

15. Guo, C, and Li, H. Application of 5G network combined with AI robots in personalized nursing in China: a literature review. Front Public Health. (2022) 10:948303. doi: 10.3389/fpubh.2022.948303

16. Tran, AQ, Nguyen, LH, Nguyen, HSA, Nguyen, CT, Vu, LG, Zhang, M, et al. Determinants of intention to use artificial intelligence-based diagnosis support system among prospective physicians. Front Public Health. (2021) 9:755644. doi: 10.3389/fpubh.2021.755644

17. Han, R, Cheng, G, Zhang, B, Yang, J, Yuan, M, Yang, D, et al. Validating automated eye disease screening AI algorithm in community and in-hospital scenarios. Front Public Health. (2022) 10:944967. doi: 10.3389/fpubh.2022.944967

18. Akazawa, M, and Hashimoto, K. Artificial intelligence in gynecologic cancers: current status and future challenges–a systematic review. Artif Intell Med. (2021) 120:102164. doi: 10.1016/j.artmed.2021.102164

19. Ben-Israel, D, Jacobs, WB, Casha, S, Lang, S, Ryu, WHA, de Lotbiniere-Bassett, M, et al. The impact of machine learning on patient care: a systematic review. Artif Intell Med. (2020) 103:101785. doi: 10.1016/j.artmed.2019.101785

20. Bhinder, B, Gilvary, C, Madhukar, NS, and Elemento, O. Artificial intelligence in cancer research and precision medicine. Cancer Discov. (2021) 11:900–15. doi: 10.1158/2159-8290.CD-21-0090

21. Jiang, Y, Yang, M, Wang, S, Li, X, and Sun, Y. Emerging role of deep learning-based artificial intelligence in tumor pathology. Cancer Commun (Lond). (2020) 40:154–66. doi: 10.1002/cac2.12012

22. Ito, Y, Miyoshi, A, Ueda, Y, Tanaka, Y, Nakae, R, Morimoto, A, et al. An artificial intelligence-assisted diagnostic system improves the accuracy of image diagnosis of uterine cervical lesions. Mol Clin Oncol. (2022) 16:27. doi: 10.3892/mco.2021.2460

23. Xue, P, Tang, C, Li, Q, Li, Y, Shen, Y, Zhao, Y, et al. Development and validation of an artificial intelligence system for grading colposcopic impressions and guiding biopsies. BMC Med. (2020) 18:406. doi: 10.1186/s12916-020-01860-y

24. Zhao, Y, Li, Y, Xing, L, Lei, H, Chen, D, Tang, C, et al. The performance of artificial intelligence in cervical colposcopy: a retrospective data analysis. J Oncol. (2022) 2022:4370851. doi: 10.1155/2022/4370851

25. Solomon, D, Davey, D, Kurman, R, Moriarty, A, O'Connor, D, Prey, M, et al. The 2001 Bethesda system: terminology for reporting results of cervical cytology. JAMA. (2002) 287:2114–9. doi: 10.1001/jama.287.16.2114

26. Chen, F, You, Z, Sui, L, Li, S, Li, J, Liu, A, et al. Chinese expert consensus on colposcopy application. Chin J Obstet Gynecol. (2020) 55:443–9. doi: 10.3760/cma.j.cn112141-20200320-00240

27. Kurman, RJ, Carcangiu, ML, Herrington, CS, and Young, RH. WHO classification of tumours of female reproductive organs, WHO classification of tumours, 4th edition. Int Agency Res Cancer. (2014) 6

28. Li, Y, Liu, ZH, Xue, P, Chen, J, Ma, K, Qian, T, et al. GRAND: a large-scale dataset and benchmark for cervical intraepithelial neoplasia grading with fine-grained lesion description. Med Image Anal. (2021) 70:102006. doi: 10.1016/j.media.2021.102006

29. Demler, OV, Pencina, MJ, and D'Agostino, RB Sr. Misuse of DeLong test to compare AUCs for nested models. Stat Med. (2012) 31:2577–87. doi: 10.1002/sim.5328

30. Li, J, Wang, W, Yang, P, Chen, J, Dai, Q, Hua, P, et al. Analysis of the agreement between colposcopic impression and histopathological diagnosis of cervical biopsy in a single tertiary center of Chengdu. Arch Gynecol Obstet. (2021) 304:1033–41. doi: 10.1007/s00404-021-06012-y

31. Fan, A, Wang, C, Zhang, L, Yan, Y, Han, C, and Xue, F. Diagnostic value of the 2011 International Federation for Cervical Pathology and Colposcopy Terminology in predicting cervical lesions. Oncotarget. (2018) 9:9166–76. doi: 10.18632/oncotarget.24074

32. Tatiyachonwiphut, M, Jaishuen, A, Sangkarat, S, Laiwejpithaya, S, Wongtiraporn, W, Inthasorn, P, et al. Agreement between colposcopic diagnosis and cervical pathology: Siriraj hospital experience. Asian Pac J Cancer Prev. (2014) 15:423–6. doi: 10.7314/apjcp.2014.15.1.423

33. Benedet, JL, Matisic, JP, and Bertrand, MA. An analysis of 84244 patients from the British Columbia cytology-colposcopy program. Gynecol Oncol. (2004) 92:127–34. doi: 10.1016/j.ygyno.2003.10.001

34. Massad, LS, and Collins, YC. Strength of correlations between colposcopic impression and biopsy histology. Gynecol Oncol. (2003) 89:424–8. doi: 10.1016/s0090-8258(03)00082-9

35. Zimmer-Stelmach, A, Zak, J, Pawlosek, A, Rosner-Tenerowicz, A, Budny-Winska, J, Pomorski, M, et al. The application of artificial intelligence-assisted colposcopy in a tertiary care hospital within a cervical pathology diagnostic unit. Diagnostics (Basel). (2022) 12:106. doi: 10.3390/diagnostics12010106

36. Kim, S, Lee, H, Lee, S, Song, JY, Lee, JK, and Lee, NW. Role of artificial intelligence interpretation of colposcopic images in cervical cancer screening. Healthcare (Basel). (2022) 10:468. doi: 10.3390/healthcare10030468

37. Liu, L, Wang, Y, Liu, X, Han, S, Jia, L, Meng, L, et al. Computer-aided diagnostic system based on deep learning for classifying colposcopy images. Ann Transl Med. (2021) 9:1045. doi: 10.21037/atm-21-885

38. Yuan, C, Yao, Y, Cheng, B, Cheng, Y, Li, Y, Li, Y, et al. The application of deep learning based diagnostic system to cervical squamous intraepithelial lesions recognition in colposcopy images. Sci Rep. (2020) 10:11639. doi: 10.1038/s41598-020-68252-3

39. Valasoulis, G, Pouliakis, A, Michail, G, Daponte, AI, Galazios, G, Panayiotides, IG, et al. The influence of sexual behavior and demographic characteristics in the expression of HPV-related biomarkers in a colposcopy population of reproductive age Greek women. Biology (Basel). (2021) 10:713. doi: 10.3390/biology10080713

40. Miyagi, Y, Takehara, K, Nagayasu, Y, and Miyake, T. Application of deep learning to the classification of uterine cervical squamous epithelial lesion from colposcopy images combined with HPV types. Oncol Lett. (2020) 19:1602–10. doi: 10.3892/ol.2019.11214

41. Fan, A, Zhang, L, Wang, C, Wang, Y, Han, C, and Xue, F. Analysis of clinical factors correlated with the accuracy of colposcopically directed biopsy. Arch Gynecol Obstet. (2017) 296:965–72. doi: 10.1007/s00404-017-4500-z

42. Autier, P, Coibion, M, Huet, F, and Grivegnee, AR. Transformation zone location and intraepithelial neoplasia of the cervix uteri. Br J Cancer. (1996) 74:488–90. doi: 10.1038/bjc.1996.388

43. Xue, P, Wang, J, Qin, D, Yan, H, Qu, Y, Seery, S, et al. Deep learning in image-based breast and cervical cancer detection: a systematic review and meta-analysis. NPJ Digit Med. (2022) 5:19. doi: 10.1038/s41746-022-00559-z

44. Asiedu, MN, Simhal, A, Chaudhary, U, Mueller, JL, Lam, CT, Schmitt, JW, et al. Development of algorithms for automated detection of cervical pre-cancers with a low-cost, point-of-care, pocket colposcope. IEEE Trans Biomed Eng. (2019) 66:2306–18. doi: 10.1109/TBME.2018.2887208

45. Hu, L, Bell, D, Antani, S, Xue, Z, Yu, K, Horning, MP, et al. An observational study of deep learning and automated evaluation of cervical images for cancer screening. J Natl Cancer Inst. (2019) 111:923–32. doi: 10.1093/jnci/djy225

46. Chen, M, Zhang, B, Cai, Z, Seery, S, Gonzalez, MJ, Ali, NM, et al. Acceptance of clinical artificial intelligence among physicians and medical students: a systematic review with cross-sectional survey. Front Med (Lausanne). (2022) 9:990604. doi: 10.3389/fmed.2022.990604

47. Khan, MJ, Werner, CL, Darragh, TM, Guido, RS, Mathews, C, Moscicki, AB, et al. ASCCP colposcopy standards: role of colposcopy, benefits, potential harms, and terminology for colposcopic practice. J Low Genit Tract Dis. (2017) 21:223–9. doi: 10.1097/LGT.0000000000000338

Keywords: artificial intelligence, cervical cancer, colposcopy, diagnostic accuracy, biopsy

Citation: Wu A, Xue P, Abulizi G, Tuerxun D, Rezhake R and Qiao Y (2023) Artificial intelligence in colposcopic examination: A promising tool to assist junior colposcopists. Front. Med. 10:1060451. doi: 10.3389/fmed.2023.1060451

Received: 14 November 2022; Accepted: 08 February 2023;
Published: 15 March 2023.

Edited by:

Li Dong, Shanxi University, China

Reviewed by:

Pei Yu, Monash University, Australia
Shanshan Du, Fujian Medical University, China

Copyright © 2023 Wu, Xue, Abulizi, Tuerxun, Rezhake and Qiao. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Remila Rezhake, remila@xjmu.edu.cn; Youlin Qiao, qiaoy@cicams.ac.cn

†These authors have contributed equally to this work and share last authorship
