Evaluation of Digital Drawing Tests and Paper-and-Pencil Drawing Tests for the Screening of Mild Cognitive Impairment and Dementia: A Systematic Review and Meta-analysis of Diagnostic Studies

Chan, Joyce Y. C.; Bat, Baker K. K.; Wong, Adrian; Chan, Tak Kit; Huo, Zhaohua; Yip, Benjamin H. K.; Kowk, Timothy C. Y.; Tsoi, Kelvin K. F.

doi:10.1007/s11065-021-09523-2

Evaluation of Digital Drawing Tests and Paper-and-Pencil Drawing Tests for the Screening of Mild Cognitive Impairment and Dementia: A Systematic Review and Meta-analysis of Diagnostic Studies

Review
Open access
Published: 16 October 2021

Volume 32, pages 566–576, (2022)
Cite this article

Download PDF

You have full access to this open access article

Neuropsychology Review Aims and scope Submit manuscript

Evaluation of Digital Drawing Tests and Paper-and-Pencil Drawing Tests for the Screening of Mild Cognitive Impairment and Dementia: A Systematic Review and Meta-analysis of Diagnostic Studies

Download PDF

Joyce Y. C. Chan¹,
Baker K. K. Bat²,
Adrian Wong¹,
Tak Kit Chan³,
Zhaohua Huo³,
Benjamin H. K. Yip³,
Timothy C. Y. Kowk¹ &
…
Kelvin K. F. Tsoi^2,3

5255 Accesses
14 Citations
1 Altmetric
Explore all metrics

Abstract

Digital drawing tests have been proposed for cognitive screening over the past decade. However, the diagnostic performance is still to clarify. The objective of this study was to evaluate the diagnostic performance among different types of digital and paper-and-pencil drawing tests in the screening of mild cognitive impairment (MCI) and dementia. Diagnostic studies evaluating digital or paper-and-pencil drawing tests for the screening of MCI or dementia were identified from OVID databases, included Embase, MEDLINE, CINAHL, and PsycINFO. Studies evaluated any type of drawing tests for the screening of MCI or dementia and compared with healthy controls. This study was performed according to PRISMA and the guidelines proposed by the Cochrane Diagnostic Test Accuracy Working Group. A bivariate random-effects model was used to compare the diagnostic performance of these drawing tests and presented with a summary receiver-operating characteristic curve. The primary outcome was the diagnostic performance of clock drawing test (CDT). Other types of drawing tests were the secondary outcomes. A total of 90 studies with 22,567 participants were included. In the screening of MCI, the pooled sensitivity and specificity of the digital CDT was 0.86 (95% CI = 0.75 to 0.92) and 0.92 (95% CI = 0.69 to 0.98), respectively. For the paper-and-pencil CDT, the pooled sensitivity and specificity of brief scoring method was 0.63 (95% CI = 0.49 to 0.75) and 0.77 (95% CI = 0.68 to 0.84), and detailed scoring method was 0.63 (95% CI = 0.56 to 0.71) and 0.72 (95% CI = 0.65 to 0.78). In the screening of dementia, the pooled sensitivity and specificity of the digital CDT was 0.83 (95% CI = 0.72 to 0.90) and 0.87 (95% CI = 0.79 to 0.92). The performances of the digital and paper-and-pencil pentagon drawing tests were comparable in the screening of dementia. The digital CDT demonstrated better diagnostic performance than paper-and-pencil CDT for MCI. Other types of digital drawing tests showed comparable performance with paper-and-pencil formats. Therefore, digital drawing tests can be used as an alternative tool for the screening of MCI and dementia.

Diagnosis of Early Alzheimer’s Disease: Clinical Practice in 2021

Article Open access 09 June 2021

Alzheimer’s Disease: Epidemiology and Clinical Progression

Article Open access 14 March 2022

Considerations for using the Wisconsin Card Sorting Test to assess cognitive flexibility

Article 22 March 2021

Background

There is a growing concern globally for mild cognitive impairment (MCI) and dementia due to the increasing aging population. Dementia is a syndrome in which there is progressive deterioration of cognitive function such as memory, visuospatial abilities, executive functions, and thinking (World Health Organization, 2019). MCI is recognized as the intermediate stage between normal aging and dementia (Petersen et al., 1999; Winblad et al., 2004). Early detection of MCI and dementia by cognitive screening tests can help patients and their family members to receive timely proper dementia-related care and support from health care professionals (Prince et al., 2011)

Drawing tests are quick, easy to use, cognitive screening tests that are commonly used for the screening of MCI and dementia. Drawing tests can be applied easily and can assess different neuropsychological functions, such as visuospatial ability and executive functions (Aprahamian et al., 2009; Ehreke et al., 2010). Deterioration of visuospatial abilities and executive functions are common cognitive symptoms in patients with MCI and dementia (Pal et al., 2016; Traykov et al., 2007). There are different types of drawing tests such as the clock drawing test (CDT) (Shulman et al., 1986; Sunderland et al., 1989), pentagon drawing test (Cormack et al., 2004), and cube drawing test (Ota et al., 2015). These tests can be either used alone or combined in a multi-domain cognitive test, such as Montreal Cognitive Assessment (MoCA) (Nasreddine et al., 2005). The CDT is the most extensively used drawing test (Aprahamian et al., 2009; Ehreke et al., 2010). In the early development of the CDT, it was mainly used to screen visuospatial and visual-constructional disorders associated with lesions in the parietal region of the brain, such as post-stroke dementia (Aprahamian et al., 2009). The usage of CDT is widened nowadays, and the CDT is widely used for the screening of MCI and dementia (Tsoi et al., 2015). There are different scoring methods of the CDT such as the 6-point scoring method (Shulman et al., 1986) and the 10-point scoring method (Sunderland et al., 1989). Studies indicate that the CDT is a good test for the screening of patients with dementia (Aprahamian et al., 2009; Park et al., 2018), but that MCI patient performance is fair and may overlook degenerative processes (Breton et al., 2019; Ehreke et al., 2010; Pinto & Peters, 2009; Tsoi et al., 2017).

Due to the weakness of paper-and-pencil drawing tests and the advancement of technology, digital drawing tests have evolved over the past decade. Digital drawing tests can record and assess drawing characteristics such as total time spent, contour, and drawing methods, which can be considered when discriminating between MCI and dementia (Heymann et al., 2018). Studies showed that on-air movements can enhance the sensitivity of identifying patients with MCI (Garre-Olmo et al., 2017; Müller et al., 2017). The pressure applied when drawing can be another indicator to discriminate elders with MCI and healthy aging (Faundez-Zanuy et al., 2013).

A meta-analysis found that the diagnostic performance of digital cognitive tests and paper-and-pencil cognitive tests are comparable (Chan et al., 2018). However, previous studies seldom compare the diagnostic performance of digital drawing tests and paper-and-pencil drawing tests specifically. Therefore, the objective of this study was to evaluate the diagnostic performance of different types of digital drawing tests and paper-in-pencil drawing tests for the screening of MCI and dementia.

Methods

The current study was performed according to the standard guidelines for the systematic review of diagnostic studies, including the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) (Moher et al., 2009) and the guidelines proposed by the Cochrane Diagnostic Test Accuracy Working Group (Leeflang et al., 2008; Macaskill et al., 2010). This study is registered as CRD42020166750 in PROSPERO.

Search Strategy

Literature searches were performed in OVID databases, included Embase, MEDLINE, CINAHL, and PsycINFO. Keywords included “cognitive impairment”, “MCI”, “dementia”, “draw test”, “digital draw”, “Clock drawing test”, “Pentagon test”, “Cube draw”, “Tree draw”, “House draw’ and “Rey-Osterrieth”, “ROCF”, “Spiral” and “infinity loops” (Supplementary Table 1). The search duration was from the earliest available dates in each database to the 31st of March 2020. Diagnostic studies comparing the accuracy of the drawing tests for MCI and dementia were identified from the title and abstract preview of all search records. Literature searches were also extended to the Digital Dissertation Consortium database and WorldCat for identification of unpublished theses or grey literature. Manual searches were extended to the bibliographies of the review articles and studies that were included in this meta-analysis. No language restriction was adopted.

Inclusion and Exclusion Criteria

Studies were included if they met the following inclusion criteria:

1.
The study used any type of drawing tests for the screening of MCI or dementia;
2.
The study recruited participants with MCI or dementia in any clinical or community settings and compared them with cognitively healthy controls;
3.
Participants with MCI or dementia were confirmed with standardized diagnostic criteria, including the Petersen criterion (Petersen et al., 1999), the report of International Working Group on Mild Cognitive Impairment (Winblad et al., 2004), the recommendations of the National Institute on Aging and the Alzheimer’s Association (Albert et al., 2011), the National Institute of Neurological and Communicative Disorders and Stroke and the Alzheimer’s Disease and Related Disorders Association (NINCDS-ADRDA) (McKhann et al., 1984), any version of the Diagnostic and Statistical Manual of Mental Disorder (DSM) (American Psychiatric Association, 2013), consensus by qualified clinicians using the Clinical Dementia Rating (CDR) (Morris, 1993) or standardized neuropsychological tests. Studies reported cases of very mild dementia, questionable dementia or cognitive impairments with no dementia (CIND), were further studied to confirm whether they included participants with MCI;
4.
The diagnostic performance of the drawing tests was summarized in terms of sensitivity and specificity, or data that could be used to derive those values were provided.

Studies were excluded if a study only evaluated different types of MCI or dementia, such as to identify patients with Parkinson’s disease dementia from Alzheimer’s disease.

Data Extraction

Two investigators (TKC, BKK) independently evaluated the relevance of search results and extracted the data into an Excel spreadsheet. The spreadsheet was used to record the demographic details of included articles, such as the year of publication, the study location, the number of MCI or dementia participants and controls, the mean age of participants, the percentage of male participants, and the diagnostic criteria and cutoff values used to define patients with MCI or dementia. We also recorded the sensitivity and specificity, or true-positive, false-positive, true-negative, and false-negative values, of each drawing test for result analysis. When a study presented different cutoff values to show the performance of a test, only the result from a recommended cutoff value in the article was chosen. When a study recommended more than one cutoff value, the cutoff value presented in the abstract of the article was selected. When there were discrepancies in the study eligibility or data extraction, the third investigator (JYC) would make the definitive decision. The Cohen’s kappa was used to evaluate the inter-rater variability.

Type of Drawing Tests

The digital drawing tests included the digital CDT, digital pentagon drawing test, digital Rey-Osterrieth complex figure (ROCF), digital tree drawing test, digital house drawing test, and digital spiral test. The paper-and-pencil drawing tests included the CDT, pentagon drawing test, cube drawing test, and ROCF. We further categorized paper-and-pencil CDT into the brief scoring method (i.e. ≤ 9 score), and the detailed scoring method (i.e. > 9 score).

Risk of Bias and Reporting Quality

The Quality Assessment of Diagnostic Accuracy Studies 2 (QUADAS-2) instrument (Whiting et al., 2011) was used to evaluate the potential risks of bias (ROB). The assessment areas included, 1. selection of patient, 2. execution of the screening tests, 3. execution of the reference standard, and 4. presentation on the patient flow and timing to have the reference standard and index tests. The methodology section of the STARD statement (Standards for Reporting of Diagnostic Accuracy) (Bossuyt et al., 2003) was used to evaluate the study quality. An 8-point scale was designed to evaluate the study quality, which included: 1. a clear definition on study population, 2. adequate details of recruitment of participants, 3. description of sampling of participant selection, 4. description of data collection plan, 5. description of reference standard and its rationale, 6. specifications of the drawing tests, 7. rationales for cutoff values, and 8. methods of calculation of diagnostic performance.

Outcomes

The primary outcome of this study was the diagnostic performance of the CDT for the screening of MCI and dementia. The secondary outcome was the diagnostic performance of other types of drawing tests.

Data Synthesis and Statistical Analysis

A bivariate random-effects model was used to combine the overall sensitivity and specificity of each drawing test (Reitsma et al., 2005). Forest plots were used as the graphical presentation for the pooled sensitivity and specificity. A diagnostic odds ratio was used as a single indicator of the test performance across different thresholds of cutoff values (Glas et al., 2003). A hierarchical summary receiver-operating characteristic (HSROC) curve was generated to present the summary estimates of sensitivity and specificity along with corresponding 95% confidence interval (95% CI) and prediction region (Rutter & Gatsonis, 2001). The area under the HSROC curve (AUC) was calculated. The approach of DerSimonian and Laird was applied when Hessian matrix was unstable (DerSimonian & Laird, 1986). Statistical heterogeneity among the trials was assessed by I². The diagnostic performances of digital and paper-and-pencil drawing tests were compared by meta-regression models, with P < 0.05 indicates a statistically significant difference. Publication bias was conducted by a regression of diagnostic log odds ratio against 1/sqrt (effective sample size), weighting by effective sample size, with P < 0.10 for the slope coefficient indicating significant asymmetry (Deeks et al., 2005). Sensitivity analysis was conducted according to the scoring methods of paper-and-pencil clock drawing tests. The statistical analyses were performed with the Midas procedures in STATA, version 11 (StataCorp) and meta-disc version 1.4.

Results

Literature Search and Study Selection

A total of 7,180 abstracts were identified in OVID databases, 37 papers were identified from the bibliography, and 363 abstracts were identified from WorldCat. After excluding the irrelevant or duplication of titles, 502 articles were further evaluated. A total of 412 articles were excluded (Cohen’s Kappa statistics at 85% between the investigators) due to the following reasons: 36 studies were systematic review or meta-analysis, 335 studies were not studied the diagnostic performance of a drawing test, 24 abstracts or presentation posters were lack of diagnostic result, 5 studies were not recruited patients with MCI or dementia, and 12 studies were not recruited participants with normal cognition as a control group. As a result, a total of 90 studies were eligible for this systematic review and meta-analysis (Fig. 1).

Study Characteristics

A total of 90 studies with 2,810 participants with MCI, 7,751 participants with dementia, and 12,006 controls were included in this systematic review and meta-analysis. The mean age of the participants ranged from 58 to 85 (Supplementary Table 2). The percentage of male participants ranged from 4 to 99%. Seventy-six studies recruited participants with MCI or dementia in an out-patient clinic or the community, whereas all other studies recruited participants in hospital or an old age home. Six studies used digital CDT, while other digital drawing tests included pentagon drawing test (n = 2), tree drawing test (n = 2), ROCF (n = 1), house drawing test (n = 1), and spiral drawing test (n = 1). Paper-and-pencil CDT was the most commonly used drawing test, as 45 studies used the detailed scoring method (i.e. > 9 score), and 35 studies used the brief scoring method (i.e. ≤ 9 score). Other paper-and-pencil drawing tests included the pentagon drawing test (n = 5), cube drawing test (n = 3) and ROCF (n = 1). In the assessment of ROB, nine studies were assessed as high risk of bias, including selection of patients (n = 6), execution of the index test (n = 1), execution of the reference standard (n = 3) and flow and timing (n = 4) (Supplementary Table 3).

Performance of Digital and Paper-and-pencil CDT in the Screening of MCI

Four studies used the digital CDT for the screening of MCI, and a total of 179 participants with MCI and 467 controls were included. The sensitivities ranged from 0.67 to 0.94 and specificities ranged from 0.79 to 0.94 across individual studies. The heterogeneity with I² statistic for sensitivity and specificity were 0.81 and 0.92, respectively. The pooled sensitivity and specificity of four studies using digital CDT were 0.86 (95% CI = 0.75 to 0.92) and 0.92 (95% CI = 0.69 to 0.98), respectively (Table 1a, Fig. 2, Supplementary Table 4). The pooled AUC was 87% (95% CI = 84% to 90%) (Supplementary Fig. 1). A non-significant P-value (0.42) for the slope coefficient suggested symmetry in the data and a low likelihood of publication bias (Supplementary Fig. 2). Nine studies used brief scoring method of paper-and-pencil CDT. The heterogeneity with I² statistic for sensitivity and specificity were 0.92 and 0.93, respectively. The pooled sensitivity and specificity were 0.63 (95% CI = 0.49 to 0.75) and 0.77 (95% CI = 0.68 to 0.84), respectively. The pooled AUC was 77% (95% CI = 74% to 81%). Twenty-one studies used detailed scoring method of paper-and-pencil CDT. The heterogeneity with I² statistic for sensitivity and specificity were 0.87 and 0.86, respectively. The pooled sensitivity and specificity were 0.63 (95% CI = 0.56 to 0.71) and 0.72 (95% CI = 0.65 to 0.78), respectively. The pooled AUC was 74% (95% CI = 69% to 77%). The diagnostic performance of digital CDT was significantly better than the brief scoring methods (P = 0.02) and detailed scoring methods (P < 0.001) of paper-and-pencil CDT in the meta-regression models.

Table 1 Sensitivity and Specificity of Digital and Paper-and-Pencil Clock Drawing Tests

Full size table

Performance of Digital and Paper-and-pencil CDT in the Screening of Dementia

Six studies used the digital CDT for the screening of dementia, and a total of 505 participants with dementia and 915 controls were included. The sensitivities ranged from 0.63 to 0.94 and specificities ranged from 0.77 to 1.00 across individual studies. The heterogeneity with I² statistic for sensitivity and specificity were 0.87 and 0.78, respectively. The pooled sensitivity and specificity with bivariate random-effects model were 0.83 (95% CI = 0.72 to 0.90) and 0.87 (95% CI = 0.79 to 0.92) (Table 1b, Fig. 3). The pooled AUC was 92% (95% CI = 89% to 94%) (Supplementary Fig. 3). A non-significant P-value (0.93) for the slope coefficient suggested symmetry in the data and a low likelihood of publication bias. Thirty studies used brief scoring method of the CDT. The heterogeneity with I² statistic for sensitivity and specificity were 0.86 and 0.83, respectively. The pooled sensitivity and specificity were 0.83 (95% CI = 0.77 to 0.87) and 0.80 (95% CI = 0.74 to 0.85), respectively. The pooled AUC was 88% (95% CI = 85% to 91%). Thirty-five studies used detailed scoring method of the CDT. The heterogeneity with I² statistic for sensitivity and specificity were 0.82 and 0.93, respectively. The pooled sensitivity and specificity were 0.80 (95% CI = 0.76 to 0.83) and 0.81 (95% CI = 0.75 to 0.86), respectively. The pooled AUC was 87% (95% CI = 84% to 90%). No significant difference was found between the diagnostic performance of digital CDT and brief scoring methods (P = 0.33) and detailed scoring methods (P = 0.35) of paper-and-pencil CDT in the meta-regression model.

Performance of Other Types of Digital and Paper-and-pencil Drawing Tests

In the screening of MCI, one study used digital ROCF (sensitivity: 0.76, specificity: 0.86, AUC: 85%) and one study used paper-and-pencil ROCF (sensitivity: 0.59, specificity: 0.96, AUC: 77%). In the screening of dementia, two studies used digital pentagon drawing test. The pooled sensitivity and specificity were 0.79 (95% CI = 0.74 to 0.85) and 0.74 (95% CI = 0.68 to 0.78), respectively (Table 2b). Four studies used paper-and-pencil pentagon drawing test. The pooled sensitivity and specificity were 0.85 (95% CI = 0.70 to 0.94) and 73% (95% CI = 0.52 to 0.87), respectively.

Table 2 Sensitivity and Specificity of Other types of Digital and Paper-and-Pencil Drawing Tests

Full size table

Sensitivity Analyses

Sensitivity analysis showed that the performance of different brief scoring method and detailed scoring method of paper-and-pencil CDT were comparable (Supplementary Table 5).

Discussion

This systematic review and meta-analysis included 90 studies and compared different types of digital drawing tests and paper-and-pencil drawing tests. The CDT is the most commonly used drawing test. In the screening of MCI, the digital CDT demonstrated better diagnostic performance than the paper-and-pencil CDT. Comparable performance was shown between the digital and paper-and-pencil CDT in the screeing of dementia. The diagnostic performance of other types of digital drawing tests and their paper-and-pencil formats was also comparable. Therefore, digital drawing tests can used as an alternative tool for the screening of MCI and dementia.

There are similarities between the digital CDT and paper-and-pencil CDT. Both methods require participants to draw the clock face as well as the hands of the clock that point to a specific time. The digital CDT uses a digital pen to draw on a tablet instead of drawing on a paper. Previous meta-analyses showed the diagnostic performance of paper-and-pencil CDT is fair in the screening of MCI, no matter the complexity of the scoring system (Ehreke et al., 2010; Pinto & Peters, 2009; Tsoi et al., 2017). This study showed similar results, however, the digital CDT showed better diagnostic performance than paper-and-pencil CDT in the screening of MCI. It may be due to the fact that deterioration of cognitive abilities such as executive function and visuospatial abilities found in patients with MCI are not yet clearly reflected in the final product of the paper drawing. However, decline in cognitive functions may be reflected in the drawing process captured in digital drawing tests (Müller et al., 2017; Garre-Olmo et al., 2017). Among different drawing characteristics, drawing time, pressure acceleration, and velocity are shown to be the behaviour markers for the discrimination of MCI from healthy aging (Garre-Olmo et al., 2017). Müller et al. (2019) further found that drawing time and velocity, such as time-in-air, total time strokes per minute are more sensitive maker than drawing pressure. Müller et al. (2017) suggested that time-in-air was a more sensitive marker than other time factors such as time-on-surface and total time (Muller et al., 2017). Additionally, the digital systems can automatically divide the drawing surface into different segments and sub-regions, and then analyze the strokes and angular differences of the drawing in the calculation of the final score (Davis et al., 2014; Shigemoria, et al., 2015). This combination of visual features and behavioural data can contribute to the identification of patients and enhance the accuracy of the digital CDT (Muller et al., 2019). Therefore, the use of digital CDT can improve the sensitivity and specificity in the screening of MCI. Past work suggests that machine-learning methods can enhance the ability to produce accurate predictive models of the drawing tests to classify MCI and dementia when the models trained on a large amount of data (Davis et al., 2014, Souillard-Mandar et al., 2016; Muller et al., 2017, 2019). Besides digital CDT, some other digital drawing tests have been suggested in the literature. The digital pentagon drawing test (Garre-Olmo et al., 2017; Tsoi et al., 2018) and digital ROCF (Cheah et al., 2019; Kokubo et al., 2018) are adapted from paper-and-pencil versions. The digital tree drawing test and digital house drawing test are adopted new approaches (Robens et al., 2019; Garre-Olmo et al., 2017). The digital tree drawing test and digital house drawing test are free-hand drawing tests which do not require the participants to draw any specific features. Thus, only drawing behaviour are used in the classification of disease.

A comprehensive evaluation of different types of drawing tests for the screening of MCI and dementia is the strength of this study. However, this study has some limitations. First, the comparisons in this study are not head-to-head comparisons, and patients’ engagement and performance on the drawing test is a confounding factor to result interpretation. There is a study of head-to-head comparison between a digital drawing test and a paper-and-pencil drawing test, which showed that digital CDT had a higher diagnostic accuracy than paper-and-pencil CDT (Müller et al., 2017). Second, the number of studies to compare diagnostic performance of drawing tests are limited. The benefits of digital drawing tests may be stronger if we can include more studies in this meta-analysis.

Conclusions

The current study revealed that digital CDT can enhance the identification of deficits in the screening of MCI. Digital and paper-and-pencil CDT have a comparable performance in the screening of dementia. Other types of drawing tests in digital formats showed comparable to paper-in-pencil formats. Therefore, digital drawing tests can be a potential tool to use as an alternative for the screening of MCI and dementia.

References

Albert, M. S., DeKosky, S. T., Dickson, D., Dubois, B., Feldman, H. H., Fox, N. C., & Phelps, C. H. (2011). The diagnosis of mild cognitive impairment due to Alzheimer’s disease: Recommendations from the national institute on Aging-Alzheimer’s association workgroups on diagnostic guidelines for Alzheimer’s disease. Alzheimer’s & Dementia, 7, 270–279.
Article Google Scholar
American Psychiatric Association. (2013). Diagnostic and statistical manual of mental disorders, 4th ed, revised. (DSM-IV-TR). Washington, DC: American Psychiatric Association.
Aprahamian, I., Martinelli, J. E., Neri, A. L., & Yassuda, M. S. (2009). The Clock Drawing Test A review of its accuracy in screening for dementia. Dement Neuropsychol, 3(2), 74–81.
Article Google Scholar
Bossuyt, P. M., Reitsma, J. B., Bruns, D. E., Gatsonis, C. A., Glasziou, P. P., Irwig, L. M., & Standards for Reporting of Diagnostic Accuracy. (2003). The STARD statement for reporting studies of diagnostic accuracy: Explanation and elaboration. Annals of Internal Medicine, 138, W1-12.
Article Google Scholar
Breton, A., Casey, D., & Arnaoutoglou, N. A. (2019). Cognitive tests for the detection of mild cognitive impairment (MCI), the prodromal stage of dementia: Meta-analysis of diagnostic accuracy studies. International Journal of Geriatric Psychiatry, 34(2), 233–242.
Article Google Scholar
Chan, J. Y. C., Kwong, J. S. W., Wong, A., Kwok, T. C. Y., & Tsoi, K. K. F. (2018). Comparison of Computerized and Paper-and-Pencil Memory Tests in Detection of Mild Cognitive Impairment and Dementia: A Systematic Review and Meta-analysis of Diagnostic Studies. Journal of the American Medical Directors Association, 19(9), 748-756.e5.
Article Google Scholar
Cheah, W. T., Chang, W. D., Hwang, J. J., Hong, S. Y., Fu, L. C., & Chang, Y. L. (2019). A Screening System for Mild Cognitive Impairment Based on Neuropsychological Drawing Test and Neural Network. 2019 IEEE International Conference on Systems, Man and Cybernetics (SMC), Bari, Italy.
Cormack, F., Aarsland, D., Ballard, C., & Tovée, M. J. (2004). Pentagon drawing and neuropsychological performance in Dementia with Lewy Bodies, Alzheimer’s disease, Parkinson’s disease and Parkinson’s disease with dementia. International Journal of Geriatric Psychiatry, 19(4), 371–377.
Article Google Scholar
Davis, R., Libon, D. J., Au, R., Pitman, D., & Penney, D. L. (2014). Think: Inferring Cognitive Status from Subtle Behaviors. Proceedings of the AAAI Conference on Artificial Intelligence, 2014, 2898–2905.
Article Google Scholar
Deeks, J. J., Macaskill, P., & Irwig, L. (2005). The performance of tests of publication bias and other sample size effects in systematic reviews of diagnostic test accuracy was assessed. Journal of Clinical Epidemiology, 58(9), 882–893.
Article Google Scholar
DerSimonian, R., & Laird, N. (1986). Meta-analysis in clinical trials. Controlled Clinical Trials, 7, 177–188.
Article CAS Google Scholar
Ehreke, L., Luppa, M., Konig, H. H., & Riedel-Heller, S. G. (2010). Is the Clock Drawing Test a screening tool for the diagnosis of mild cognitive impairment? A Systematic Review. International Psychogeriatrics, 22(1), 56–63.
Article Google Scholar
Faundez-Zanuy, M., Sesa-Nogueras, E., Roure-Alcobé, J., Garré-Olmo, J., Lopez-de-Ipiña, K., & Solé-Casals, J. (2013). Online Drawings for Dementia Diagnose: In-Air and Pressure Information Analysis. In: Roa Romero L. (eds) XIII Mediterranean Conference on Medical and Biological Engineering and Computing. IFMBE Proceedings, vol 41. Springer, Cham.
Garre-Olmno, J., Faundez-Zanuy, M., Lopez-de-Ipina, K., Calvo-Perxas, L., Turro-Garriga1, O. (2017). Kinematic and pressure features of handwriting and drawing: preliminary results between patients with mild cognitive impairment. Alzheimer Disease and Healthy Controls Current Alzheimer Research, 14(9), 960 968.
Glas, A. S., Lijmer, J. G., Prins, M. H., Bonsel, G. J., & Bossuyt, P. M. M. (2003). The diagnostic odds ratio: A single indicator of test performance. Journal of Clinical Epidemiology, 56(11), 1129–1135.
Article Google Scholar
Heymann, P., Gienger, R., Hett, A., Müller, S., Laske, C., Robens, S., & Elbing, U. (2018). Early Detection of Alzheimer’s Disease Based on the Patient’s Creative Drawing Process: First Results With a Novel Neuropsychological Testing Method. Journal of Alzheimer’s Disease, 63(2), 675–687.
Article CAS Google Scholar
Kokubo, N., Yoko, Y., Saitoh, Y., Murata, M., Maruo, K., Takebayashi, Y., & Horikoshi, M. (2018). A new device-aided cognitive function test, User eXperience-Trail Making Test (UX-TMT), sensitively detects neuropsychological performance in patients with dementia and Parkinson’s disease. BMC Psychiatry, 5;18(1):220.
Leeflang, M. M., Deeks, J. J., Gatsonis, C., Bossuyt, P. M. M., & Cochrane Diagnostic Test Accuracy Working Group. (2008). Systematic reviews of diagnostic test accuracy. Annals of Internal Medicine, 149, 889–897.
Article Google Scholar
Macaskill, P., Gatsonis, C., Deeks, J., Harbord, R., & Takwoingi, Y. (2010). Chapter 10: analysing and presenting results. In: Deeks, J.J., Bossuyt, P.M. & Gatsonis, C. (editors). Cochrane handbook for systematic reviews of diagnostic test accuracy version 1.0. the cochrane collaboration. Available from: http://srdta.cochrane.org/
McKhann, G., Drachman, D., Folstein, M., Katzman, R., Price, D., & Stadlan, E. M. (1984). Clinical diagnosis of Alzheimer’s disease: Report of the NINCDS-ADRDA work group under the auspices of department of health and human services task force on alzheimer’s disease. Neurology, 34(7), 939–944.
Article CAS Google Scholar
Moher, D., Liberati, A., Tetzlaff, J., Altman, D. G., & PRISMA Group. (2009). Preferred reporting items for systematic reviews and meta-analyses: The PRISMA statement. Annals of Internal Medicine, 151, 264–269.
Article Google Scholar
Morris, J. C. (1993). The clinical dementia rating (CDR): Current version and scoring rules. Neurology, 43(11), 2412–2414.
Article CAS Google Scholar
Müller, S., Herde, L., Preische, O., Zeller, A., Heymann, P., Robens, S., & Laske, C. (2019). Diagnostic value of digital clock drawing test in comparison with CERAD neuropsychological battery total score for discrimination of patients in the early course of Alzheimer’s disease from healthy individuals. Science and Reports, 9(1), 3543.
Article Google Scholar
Müller, S., Preische, O., Heymann, P., Elbing, U., & Laske, C. (2017). Increased diagnostic accuracy of digital vs. conventional clock drawing test for discrimination of patients in the early course of alzheimer’s disease from cognitively healthy individuals. Frontiers in Aging Neuroscience, 11(9), 101.
Nasreddine, Z. S., Phillips, N. A., Bédirian, V., Charbonneau, S., Whitehead, V., Collin, I., & Chertkow, H. (2005). The Montreal Cognitive Assessment, MoCA: A brief screening tool for mild cognitive impairment. Journal of the American Geriatrics Society, 53(4), 695–699.
Article Google Scholar
Ota, K., Murayama, N., Kasanuki, K., Kondo, D., Fujishiro, H., Arai, H., & Iseki, E. (2015). Visuoperceptual assessments for differentiating dementia with lewy bodies and alzheimer’s disease: illusory contours and other neuropsychological examinations. Archives of Clinical Neuropsychology, 30(3), 256–263.
Article Google Scholar
Park, J. K., Jeong, E. H., & Seomun, G. A. (2018). The clock drawing test: A systematic review and meta-analysis of diagnostic accuracy. Journal of Advanced Nursing Actions, 74(12), 2742–2754.
Article Google Scholar
Pal, A., Biswas, A., Pandit, A., Roy, A., Guin, D., Gangopadhyay, G., & Senapati, A. K. (2016). Study of visuospatial skill in patients with dementia. Annals of Indian Academy of Neurology, 19, 83–88.
Article Google Scholar
Petersen, R. C., Smith, G. E., Waring, S. C., Ivnik, R. J., Tangalos, E. G., & Kokmen, E. (1999). Mild cognitive impairment: Clinical characterization and outcome. Archives of Neurology, 56, 303–308.
Article CAS Google Scholar
Pinto, E., & Peters, R. (2009). Literature review of the Clock Drawing Test as a tool for cognitive screening. Dementia and Geriatric Cognitive Disorders, 27, 201–213.
Article Google Scholar
Prince, M., Bryce, R., & Ferri. C. (2011) The benefits of early diagnosis and intervention. Alzheimer’s Disease International World Alzheimer Report 2011, available at https://www.alz.co.uk/research/world-report-2011 assessed at 18 April 2020.
Reitsma, J. B., Glas, A. S., Rutjes, A. W., Scholten, R. J. P. P., Bossuyt, P. M., & Zwinderman, A. H. (2005). Bivariate analysis of sensitivity and specificity produces informative summary measures in diagnostic reviews. Journal of Clinical Epidemiology, 58, 982–990.
Article Google Scholar
Robens, S., Heymann, P., Gienger, R., Hett, A., Müller, A., Laske, C., & Elbing, U. (2019). The Digital Tree Drawing Test for Screening of Early Dementia: An Explorative Study Comparing Healthy Controls, Patients with Mild Cognitive Impairment, and Patients with Early Dementia of the Alzheimer Type. Journal of Alzheimer’s Disease, 68(4), 1561–1574.
Article Google Scholar
Rutter, C. M., & Gatsonis, C. A. (2001). A hierarchical regression approach to meta-analysis of diagnostic test accuracy evaluations. Statistics in Medicine, 20, 2865–2884.
Article CAS Google Scholar
Shigemoria, T., Harbi, Z., Kawanaka, H., Hicks, Y., Setchi, R., Takase, H., & Tsuruoka, S. (2015). Feature Extraction Method for Clock Drawing Test. Procedia Computer Science, 60, 1707–1714.
Article Google Scholar
Shulman, K. I., Shedletsky, R., & Silver, I. L. (1986). The challenge of time: Clock-drawing and cognitive function in the elderly. International Journal of Geriatric Psychiatry, 1, 135–140.
Article Google Scholar
Souillard-Mandar, W., Davis, R., Rudin, C., Au, R., Libon, D. J., Swenson, R., & Penney, D. L. (2016). Learning Classification Models of Cognitive Conditions from Subtle Behaviors in the Digital Clock Drawing Test. Mach Learn Actions, 102(3), 393–441.
Article Google Scholar
Sunderland, T., Hill J. L., Mellow, A. M., Lawlor, B. A., Gundersheimer, J., Newhouse, P.A., & Grafman, J, H. (1989). Clock drawing in alzheimer’s disease. A novel measure of dementia severity. Journal of the American Geriatrics Society, 37(8), 725–729.
Traykov, L., Raoux, N., Latour, F., Gallo, L., Hanon, O., Baudic, S., et al. (2007). Executive Functions Deficit in Mild Cognitive Impairment. Cognitive and Behavioral Neurology, 20(4), 219–224.
Article Google Scholar
Tsoi, K. K. F., Chan, J. Y. C., Hirai, H. W., Wong, S. Y. S., & Kwok, T. C. Y. (2015). Cognitive tests to detect dementia. A systematic review and meta-analysis. JAMA Internal Medicine, 175, 1450–1458.
Article Google Scholar
Tsoi, K. K. F., Chan, J. Y. C., Hirai, H. W., Wong, A., Mok, V. C. T., Lam, L. C. W., & Wong, S. Y. S. (2017). Recall Tests Are Effective to Detect Mild Cognitive Impairment: A Systematic Review and Meta-analysis of 108 Diagnostic Studies. Journal of the American Medical Directors Association, 18(9), 807.e17-807.e29.
Article Google Scholar
Tsoi, K. K. F., Lam, M. W. Y., Chu, C., Wong, M. P. F., & Meng, H. (2018). Machine Learning on Digital Drawing Date For Preliminary Dementia Screening. Alzheimer’s and Dementia, 14(7), 196. https://doi.org/10.1016/j.jalz.2018.06.2039
Article Google Scholar
Whiting, P. F., Rutjes, A. W. S., Westwood, M. E., Mallett, S., Deeks, J. J., Reitsma, J. B., & QUADAS-2 Group. (2011). QUADAS-2: A revised tool for the quality assessment of diagnostic accuracy studies. Annals of Internal Medicine, 155, 529–536.
Article Google Scholar
Winblad, B., Palmer, K., Kivipelto, M., Jelic, V., Fratiglioni, L., Wahlund, L. O., & Petersen, R. C. (2004). Mild cognitive impairment - Beyond controversies, towards a consensus: Report of the international working group on mild cognitive impairment. Journal of Internal Medicine, 256, 240–246.
Article CAS Google Scholar
World Health Organization. (2019). Dementia. Retrieved on January 12 2021. Retrieval from: https://www.who.int/news-room/fact-sheets/detail/dementia

Download references

Acknowledgements

We thank the INS Global Engagement Committee Research Editing and Consulting Program (RECP) and Dr. Sarah L. Garcia-Beaumier, consultant of the RECP, for help in manuscript editing.

Funding

This is a self-initiated study. No funding was received.

Author information

Authors and Affiliations

Department of Medicine and Therapeutics, The Chinese University of Hong Kong, Hong Kong, China
Joyce Y. C. Chan, Adrian Wong & Timothy C. Y. Kowk
Stanley Ho Big Data Decision Analytics Research Centre, The Chinese University of Hong Kong, Hong Kong, China
Baker K. K. Bat & Kelvin K. F. Tsoi
JC School of Public Health and Primary Care, The Chinese University of Hong Kong, Hong Kong, China
Tak Kit Chan, Zhaohua Huo, Benjamin H. K. Yip & Kelvin K. F. Tsoi

Authors

Joyce Y. C. Chan
View author publications
You can also search for this author in PubMed Google Scholar
Baker K. K. Bat
View author publications
You can also search for this author in PubMed Google Scholar
Adrian Wong
View author publications
You can also search for this author in PubMed Google Scholar
Tak Kit Chan
View author publications
You can also search for this author in PubMed Google Scholar
Zhaohua Huo
View author publications
You can also search for this author in PubMed Google Scholar
Benjamin H. K. Yip
View author publications
You can also search for this author in PubMed Google Scholar
Timothy C. Y. Kowk
View author publications
You can also search for this author in PubMed Google Scholar
Kelvin K. F. Tsoi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Kelvin K. F. Tsoi.

Ethics declarations

Ethical Approval

This study was a systematic review and meta-analysis and ethical approval was not required.

Conflict of Interests

None.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (DOCX 338 KB)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Chan, J.Y.C., Bat, B.K.K., Wong, A. et al. Evaluation of Digital Drawing Tests and Paper-and-Pencil Drawing Tests for the Screening of Mild Cognitive Impairment and Dementia: A Systematic Review and Meta-analysis of Diagnostic Studies. Neuropsychol Rev 32, 566–576 (2022). https://doi.org/10.1007/s11065-021-09523-2

Download citation

Received: 19 March 2021
Accepted: 09 August 2021
Published: 16 October 2021
Issue Date: September 2022
DOI: https://doi.org/10.1007/s11065-021-09523-2

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Evaluation of Digital Drawing Tests and Paper-and-Pencil Drawing Tests for the Screening of Mild Cognitive Impairment and Dementia: A Systematic Review and Meta-analysis of Diagnostic Studies

Abstract

Similar content being viewed by others

Diagnosis of Early Alzheimer’s Disease: Clinical Practice in 2021

Alzheimer’s Disease: Epidemiology and Clinical Progression

Considerations for using the Wisconsin Card Sorting Test to assess cognitive flexibility

Background

Methods

Search Strategy

Inclusion and Exclusion Criteria

Data Extraction

Type of Drawing Tests

Risk of Bias and Reporting Quality

Outcomes

Data Synthesis and Statistical Analysis

Results

Literature Search and Study Selection

Study Characteristics

Performance of Digital and Paper-and-pencil CDT in the Screening of MCI

Performance of Digital and Paper-and-pencil CDT in the Screening of Dementia

Performance of Other Types of Digital and Paper-and-pencil Drawing Tests

Sensitivity Analyses

Discussion

Conclusions

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Ethical Approval

Conflict of Interests

Additional information

Publisher's Note

Supplementary Information

Supplementary file1 (DOCX 338 KB)

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation