A standardized workflow for long-term longitudinal actigraphy data processing using one year of continuous actigraphy from the CAN-BIND Wellness Monitoring Study

Slyepchenko, Anastasiya; Uher, Rudolf; Ho, Keith; Hassel, Stefanie; Matthews, Craig; Lukus, Patricia K.; Daros, Alexander R.; Minarik, Anna; Placenza, Franca; Li, Qingqin S.; Rotzinger, Susan; Parikh, Sagar V.; Foster, Jane A.; Turecki, Gustavo; Müller, Daniel J.; Taylor, Valerie H.; Quilty, Lena C.; Milev, Roumen; Soares, Claudio N.; Kennedy, Sidney H.; Lam, Raymond W.; Frey, Benicio N.

doi:10.1038/s41598-023-42138-6

Download PDF

Article
Open access
Published: 15 September 2023

A standardized workflow for long-term longitudinal actigraphy data processing using one year of continuous actigraphy from the CAN-BIND Wellness Monitoring Study

Anastasiya Slyepchenko¹,
Rudolf Uher²,
Keith Ho³,
Stefanie Hassel⁴,
Craig Matthews¹,
Patricia K. Lukus⁵,
Alexander R. Daros⁶,
Anna Minarik²,
Franca Placenza⁷,
Qingqin S. Li⁸,
Susan Rotzinger³,
Sagar V. Parikh⁹,
Jane A. Foster^1,10,
Gustavo Turecki¹¹,
Daniel J. Müller⁶,
Valerie H. Taylor⁴,
Lena C. Quilty^6,13,
Roumen Milev¹²,
Claudio N. Soares¹²,
Sidney H. Kennedy^3,13,
Raymond W. Lam¹⁴ &
…
Benicio N. Frey^1,5

Scientific Reports volume 13, Article number: 15300 (2023) Cite this article

1446 Accesses
1 Altmetric
Metrics details

Subjects

Abstract

Monitoring sleep and activity through wearable devices such as wrist-worn actigraphs has the potential for long-term measurement in the individual’s own environment. Long periods of data collection require a complex approach, including standardized pre-processing and data trimming, and robust algorithms to address non-wear and missing data. In this study, we used a data-driven approach to quality control, pre-processing and analysis of longitudinal actigraphy data collected over the course of 1 year in a sample of 95 participants. We implemented a data processing pipeline using open-source packages for longitudinal data thereby providing a framework for treating missing data patterns, non-wear scoring, sleep/wake scoring, and conducted a sensitivity analysis to demonstrate the impact of non-wear and missing data on the relationship between sleep variables and depressive symptoms. Compliance with actigraph wear decreased over time, with missing data proportion increasing from a mean of 4.8% in the first week to 23.6% at the end of the 12 months of data collection. Sensitivity analyses demonstrated the importance of defining a pre-processing threshold, as it substantially impacts the predictive value of variables on sleep-related outcomes. We developed a novel non-wear algorithm which outperformed several other algorithms and a capacitive wear sensor in quality control. These findings provide essential insight informing study design in digital health research.

Evaluating reliability in wearable devices for sleep staging

Article Open access 18 March 2024

Real-world longitudinal data collected from the SleepHealth mobile app study

Article Open access 27 November 2020

Wearable-based accelerometer activity profile as digital biomarker of inflammation, biological age, and mortality using hierarchical clustering analysis in NHANES 2011–2014

Article Open access 08 June 2023

Introduction

Activity and sleep monitoring through ambulatory devices has become ubiquitous through use of commercial devices such as smartphones and smartwatches. Actigraphy, defined as activity and sleep monitoring through a research or medical-grade device worn on the body, has been in use for decades. It has been implemented in monitoring various populations, including individuals with sleep or biological rhythm disorders¹, dementia², and depression³, among others. Sleep and activity are the key monitoring targets for many disorders. For instance, sleep and activity are linked to quality of life⁴, mental, and physical health outcomes⁵. Activity and sleep disturbances have also been linked with increased risk of hypertension, diabetes mellitus, cardiovascular disease, coronary heart disease, obesity and mortality^6,7,8.

Actigraphs are devices typically worn on the wrist, chest, or hip, which use motion sensing accelerometers to measure activity on one or three axes. One advantage of actigraphs is the potential for prolonged monitoring within the individual’s natural environment, which requires minimal effort on behalf of the device wearer and interaction with the device as compared to methods such as take-home questionnaires and ecological momentary assessment^1,9. To date, the majority of studies have focused on periods of continuous data collection of 2 weeks or less¹⁰. Longer periods of data collection may be more informative, however, they have received less attention, likely because they require a more complex approach.

Detection of early signs of clinical changes is an important application of actigraphy. For instance, changes in sleep may be among the initial symptoms preceding the onset of a major depressive episode¹¹. Actigraphy is therefore a promising tool to monitor early warning signs of depressive relapse. Actigraphy can be used to evaluate sleep parameters (e.g., total sleep time, sleep maintenance efficiency, wake after sleep onset), sleep timing (e.g., sleep onset time, time out of bed, mid sleep point), physical activity parameters (e.g., total activity counts, physical activity energy expenditure), circadian activity rhythms (e.g., cosinor analysis, which yields information about timing and intensity of activity), and other parameters. However, methods of actigraphy data collection and analysis, including collection parameters, devices used, data pre-processing, and variable extraction have not been standardized¹.

Accurately and efficiently differentiating periods of wear from non-wear in actigraph data is a major challenge in actigraphy research. Ideally, participants should record off-wrist time in a dedicated log maintained throughout the duration of the study. However, this may be challenging in clinical populations, especially if participants suffer from difficulties with memory or attention, life stress, or other challenges that impair their ability to accurately record off-wrist time. As a consequence, automatic methods of detecting wear and non-wear periods have been developed. For instance, the ActiGraph GT9X Link is equipped with a capacitive sensor, which indicates whether the participant is wearing the device, based on the proximity of the device to skin, however, this wear sensor has technical issues, with non-wear being noted during times of apparent wear of the actigraph, as recorded by participants¹². Consequently, the wear sensor substantially underestimates wear time compared to participant diaries, with a sensitivity of 93% but a specificity of 49%¹³. Additionally, there are several non-wear detection algorithms, though some of these were not developed to account for non-wear episodes during the night, or during sleep periods, and the majority of these algorithms were developed using data from actigraphs worn at the hip^14,15,16. Importantly, the choice of pre-processing approaches, such as non-wear detection, sleep detection, and rules such as thresholds for what constitutes a valid number of days for actigraphy analysis can significantly impact outcomes in actigraphy studies^12,17 Periods of non-wear may also be associated with outcomes of interest in mental health research, further supporting the importance of their accurate detection as part of studies of actigraphy in clinical populations.

The aim of this paper is to report on a standardized pipeline for quality control, pre-processing, and analysis of actigraphy data collected over an extended period of time, developed with the use of open-source packages.

Methods

Data collection

Study design

These actigraphy data were collected as part of the Wellness Monitoring for Major Depressive Disorder (Wellness Monitoring Study), a longitudinal observational study conducted by the Canadian Biomarker Integration Network in Depression (CAN-BIND), which aimed to identify predictive biomarkers of relapse of major depressive disorder (MDD) (ClinicalTrials.gov Identifier: NC02934334). The Wellness Monitoring Study used ambulatory monitoring to establish which variables can act as “warning signals” prior to a relapse of MDD. Several symptom domains were evaluated, including mood and anxiety symptoms, sleep, activity, biological rhythms, anhedonia, pain, quality of life, treatment compliance-related variables, speech characteristics and voice characteristics. The domains were assessed through different methods, including self-report questionnaires, clinician-rated assessments, audio recording of voice, and objective monitoring of activity, sleep and biological rhythms with actigraphy.

Participants were enrolled into the study if they had a diagnosis of MDD, responded to treatment for their most recent major depressive episode, and had a current MADRS score < 14 at baseline and screening visits, resulting in a total of 101 participants who completed a baseline visit. Following written informed consent, participants received a study-specific smartphone (LogPad®, ERT, Clario [formerly, PHT]) and wrist-worn actigraph, which were used for the duration of the study. Further information about the study sample is provided in the Supplementary Materials, including supplementary Figure 1 which describes participants in the Wellness Monitoring study.

Participants completed a screening visit, a baseline visit within 2 weeks of screening, and a minimum one-year observational phase (early withdrawal allowed). Most participants completed screening and baseline visits on the same day. During the observational phase of the study, participants completed in-person assessments every 8 weeks in addition to continuous ambulatory monitoring. Participants enrolled on a rolling basis and had variable lengths of follow up periods with target durations of at least 1 year since last patient enrolled.

At baseline, and subsequent 8-weekly follow-up visits, participants were assessed through an on-site electronic data collection device (the SitePad®) which recorded measures of depressive symptom severity, healthcare service use, and symptom severity. Additionally, participants completed self-report questionnaires through the Brain-CODE REDCap interface and provided blood samples, as well as a series of weekly self-reports, and biweekly speech and voice characteristics through the LogPad® device. Further information about the study inclusion/exclusion criteria, treatment and relapse is provided in the supplementary material. All procedures contributing to this work comply with the ethical standards of the relevant national and institutional committees on human experimentation and with the Helsinki Declaration of 1975, as revised in 2008. Study procedures were approved by local research ethics boards and all participants provided informed consent before study entry.

Data acquisition: raw actigraphy data

The Actigraph GT9X-BT Link® (ActiGraph, Penascola, Florida, USA) device was used to collect sleep, activity and biological rhythms parameters through the observational phase of the study. Study coordinators uploaded the data to the CentrePoint Study Admin System (http://www.actigraphcorp.com/product-category/study-admin/) and monitored adherence during in-person visits. CentrePoint is a cloud-based technology platform developed by Actigraph, which preserved data integrity, as well as network security, availability, and standards compliance. The GT9X Link contains a capacitive touch wear sensor¹⁸.

Participants were instructed to wear the GT9X Link® device 24 h per day for the entire duration of the study, and received a charging dock and USB cable to charge the device from home. Data were collected at 30 Hz on the non-dominant wrist. At each in-person visit, data were extracted to the CentrePoint system by study coordinators. Data from the CentrePoint system were transferred to OBI’s Brain-CODE platform at the completion of the study. Data were first extracted as raw .gt3x files, at intervals corresponding to occasions on which data were uploaded. Data were additionally aggregated into minute-by-minute epochs, as one .csv file for each participant, and were initially sleep scored using the Cole-Kripke algorithm¹⁹ (Fig. 1: Raw Actigraphy Data).

Raw actigraphy data provided information about the direction and orientation of the actigraph, while count data only provided information about the amount of movement. Count data aggregated by epoch are traditionally used as the basis of calculating sleep^[20]and energy expenditure parameters²⁰, as well as non-wear, while more recent actigraphy processing methods use raw data^16,21.

Data processing and analysis

Summary

Figure 1 shows a summary of the automated data pre-processing pipeline, as executed in R Statistical Software (v 4.0). As part of this pre-processing pipeline, we assessed data missingness and scored sleep and wake for minute-by-minute epochs using the Cole-Kripke¹⁹ and Tudor-Locke²² algorithms. Next, we tested the accuracy of four methods of non-wear detection: (1) the built-in wear sensor available in this actigraph model; scored the minute-by-minute epoch data using the (2) Choi¹⁴ and (3) Troiano¹⁵ algorithms; and (4) used the raw 30 Hz actigraphy data for scoring using the van Hees algorithm²³. From these four methods, we created a new non-wear scoring algorithm (the majority algorithm), and conducted visual quality control of this majority algorithm (See “Non-wear detection” section below). Next, we combined the sleep intervals with non-wear intervals, and conducted sensitivity analyses to assess the influence of valid day selection and percentage of overlap between non-wear and sleep on the relationship between sleep variables and the main outcome measure of this study – the Montgomery-Åsberg Depression Rating Scale (MADRS)²⁴, which was collected at each in-person visit.

Data trimming

An important step in data pre-processing is to trim the data including only data that will be used for analysis. For instance, in case of withdrawal from the study, participants may have worn the actigraph (or the actigraph may have collected data) until it is returned to the lab, at a later date than the official withdrawal date from the study. Additionally, researchers may only be interested in analyzing a specific portion of the collected data, in which case data trimming is also necessary. In the Wellness Monitoring Study, data were trimmed to 1 year of collection, and data that extended following the participant’s enrollment or collected due to configuration error prior to enrollment in the study were trimmed based on study enrolment dates. Duplicate rows were removed (Fig. 1: Data Trimming).

It is important to ensure that data for all dates were accounted for, including periods of missing data, if such paradata were to be recorded or reported. Paradata refers to administrative data that were obtained during the process of collection, management and treatment of actigraphy data²⁵. If a participant was asked to wear multiple actigraph devices throughout the duration of the study, the periods of overlap must be correctly accounted for, and the correct data interval should be used. We maintained accurate paradata of the rows that were removed, and the number of missing minutes per day, per participant, which will be stored and made available with the pre-processed data.

Sleep scoring

Minute-by-minute epoch data were scored for sleep and wake using the Cole-Kripke and Tudor-Locke algorithms deployed in the actigraph.sleepr package (https://github.com/dipetkov/actigraph.sleepr), which is an open-source implementation of the ActiLife software’s sleep and non-wear detection algorithms (Fig. 1: Sleep/Wake Scoring: Cole-Kripke and Tudor Locke Algorithms). From this analysis, epoch-based scoring of minute epochs and sleep intervals were obtained. Sleep intervals were characterized by the following variables: sleep maintenance efficiency (SE, %), sleep duration (mins), activity counts, non-zero epochs, total sleep time (TST, mins), wake after sleep onset (WASO, mins), number of awakenings, movement index, fragmentation index, sleep fragmentation index, sleep onset time (HH:MM:SS), time out of bed (HH:MM:SS), number of one minute sleep intervals, mean mid sleep time ([time out of bed – sleep onset time]/2), average awakening (mins). Fragmentation index is calculated as a percentage of sleep periods that last 1 min compared to number of periods of sleep during the sleep period. Movement index consists of the percentage of epochs during the sleep period where y-axis counts were larger than zero. Sleep fragmentation index is the sum of the movement index and fragmentation index²⁶.

Non-wear scoring

In the Wellness Monitoring Study, we used the wear sensor embedded in the Actigraph GT9X Link, in addition to the Troiano, Choi and van Hees algorithms to detect non-wear. The Troiano and Choi algorithms were chosen due to their wide use, ease of implementation, and availability through the ActiLife software. The van Hees algorithm was chosen due to its superior performance in Syed and colleagues’ study²⁷, and ease of implementation. The Troiano and Choi algorithms use epoch-aggregated count data^14,15. The Troiano algorithm defines non-wear intervals as 60 or more consecutive minute epochs with no activity, allowing for 1 or 2 min of counts of 0 to 100¹⁵. Since this algorithm is prone to classifying sedentary activity as non-wear time, Choi and colleagues proposed a modified algorithm where non-wear was classified as intervals of at least 90 min with consecutive minute epochs of no activity. Intervals of 1 or 2 min with non-zero counts would not change this classification, if there was no activity 30 min before or after that interval¹⁴. Newer approaches such as the van Hees algorithm use raw data¹⁶. Van Hees’ algorithm is based on raw data, where a period is deemed to be non-wear when the standard deviation of movement is lower than 3.0mG (1mG = 0.00981 m/s²) or the value range is lower than 50 mg for at least 2 of 3 axes for a given 30-min period^16,23. These approaches are useful to detect longer periods of non-wear, however, shorter periods of non-wear (e.g., taking the actigraph off for showers), will not be detected.

The capacitive sensor on the Actigraph GT9X Link provided epoch-aggregated non-wear detection at the minute level. The capacitive sensor consists of a metallic plate. Based on the concept of capacitive coupling, the sensor charges more quickly when it is in closer proximity to our bodies. The sensor therefore measures the amount of time that the capacitor uses to charge, and therefore allows estimation of non-wear²⁸. Troiano¹⁵ and Choi¹⁴ algorithms were used to score the activity (motion) data from csv files containing minute-by-minute data using the actigraph.sleepr package (Fig. 1: Non-wear Scoring: Choi and Troiano Algorithms). Additionally, non-wear scoring was performed on the raw data gt3x files using the van Hees algorithm through the GGIR package²³. While using this package, we specified a 5 s window for calculating acceleration and angle, 900 s for the epoch length to calculate non-wear and signal clipping, and 3600 s for the window of wear detection (Fig. 1: Non-wear Scoring: Van Hees Algorithm). Agreement between algorithms during each epoch was evaluated through minute-by-minute overlap of non-wear detected by the different algorithms and the wear sensor. Additional information about data processing is provided in the Supplement.

Development of a novel non-wear algorithm: the majority algorithm

A novel non-wear algorithm, the majority algorithm, was developed by calculating the percentage of overlap between the wear sensor, Troiano, Choi and Van Hees algorithms in each minute epoch (Fig. 1: Non-wear Scoring: Development of the Majority Algorithm). If 3 or 4 of the 4 methods of detection indicated that a minute epoch should be classified as non-wear, this minute epoch was classified as non-wear. As the Choi algorithm is an updated version of the Troiano algorithm, we compared the performance of a 4-method version of the majority algorithm (which combined the wear sensor, Troiano, Choi and van Hees algorithms) to a 3-method version of the majority algorithm (which only used the wear sensor, Choi and van Hees algorithms). For the 3-method version, if 2 or 3 of the 3 methods of detection indicated that a minute epoch should be classified as non-wear, this minute epoch was classified as non-wear. To validate the use of this algorithm, we performed visual quality control to evaluate performance of the majority algorithm in a subset of participants. We selected a majority of these participants based on their relapse status, as this was the major outcome in the Wellness Monitoring Study (see Supplementary Material). Each participant file was reviewed day-by-day, where false non-wear detection was identified by one or two trained independent scorers (see Supplementary Material for further details). Accuracy, positive predictive value, sensitivity and specificity statistics were calculated for epoch-level data for each of the 5 algorithms (Choi, Troiano, van Hees, majority (4), and majority (3)) and the wear sensor, as compared to visual quality control at the day level. As 6 of the participant data files were scored by 2 scorers, we averaged the results of the accuracy, positive predictive value, sensitivity and specificity statistics for these participants for the outputs of the algorithms compared to visual quality control. To test the difference in performance of the algorithms, we fitted mixed linear models, with day-level performance statistics as dependent variables and algorithm*day as the independent variables using the lme4 package. We compared the performance of the different algorithms using estimated marginal means of the models, with a Tukey correction for multiple comparisons using the emmeans package. Inter-rater reliability (Cohen’s kappa) was calculated.

Addressing data missingness

Some analytic procedures require complete data. Data missingness can be classified as missing completely at random (MCAR), meaning that missing data are missing independently of observed or missing data. This type of missingness does not cause bias, despite increasing standard error. Missing at random (MAR) data occur when the mechanism of missingness is a partial result of the observed data, and if the mechanism of the missing data is a result of the missing data, this indicates the data are not missing at random (NMAR)²⁹.

It is plausible that participants’ non-wear may correspond with periods of relapse of depression, which is the key outcome measured in the Wellness Monitoring Study, indicating that these data are likely not MAR or MCAR. Additionally, summary statistics regarding non-wear can be used in modeling outcomes during the analysis stage. Therefore, we intend to use missing data as part of our modelling approach, where variables describing non-wear and missingness will be included in predictive models for mental health outcomes.

At the epoch level, we used the average day imputation method, where missing data are imputed by an average of the values collected during the same time period that has missing data (for instance, if data are missing from 7:00 to 7:15, this algorithm will create an average for that missing interval based on the data that were collected)³⁰. To perform this average day imputation, we used a window of 7 days (i.e., 3 days prior to and 3 days following the day with missing data). We did not impute full days of data – only days with partial missing data were imputed. In this study, data could have been missing as a result of non-wear (based on the majority (3) algorithm) or as a result of data not being collected for the period (Fig. 1: Addressing Data Missingness).

Spearman correlations were applied to assess the relationship between depressive symptoms according to the MADRS and data missingness or non-wear patterns. As the data for sleep and depressive symptoms were assessed at different frequencies, we aggregated these data by creating an average of each sleep variable.

Sensitivity analyses

Many studies in actigraphy literature use filtering approaches, where days are only considered valid if the actigraph is worn over a certain number of hours for each day³¹. This threshold has not been standardized, though the most commonly used threshold is 10 h or more of available data in a day³¹, for the day to be considered valid. A sensitivity analysis was conducted to test influence of non-wear on the relationship between sleep and MADRS scores, the main symptom outcome measure in this study. This sensitivity analysis consisted of two components: (1) number of valid hours of data per day for the day to be considered valid and (2) overlap of the sleep interval with non-wear, and how these components influenced the relationship between sleep variables and depressive symptoms (Fig. 1: Sensitivity Analyses).

First, this sensitivity analysis used hourly thresholds starting from > 6 to 24 valid hours per day of analysis for the relevant sleep interval to be included in the analysis, as well as all collected data. The second component of the analysis selected several thresholds for excluding intervals of sleep based on overlap with non-wear. Overlap of sleep with non-wear intervals was calculated for each sleep interval, first by generating the number of non-wear minutes in each sleep interval, and subsequently calculating percentage of non-wear minutes per duration of the sleep interval. Thresholds were tested in 10% intervals, ranging from < 10% overlap to up to 100% overlap. Sleep intervals exceeding a given threshold (e.g. > 80% overlap) were excluded from analysis for each iteration of this analysis. Since MADRS scores were obtained every 8 weeks for the duration of the study, and at each relapse verification visit, we averaged sleep values across each 8-week epoch. For each combination of thresholds, we conducted mixed linear modeling with the following variables, following standardization, as fixed-effects variables used to model of MADRS score: sleep variables (SE, duration, activity counts, non-zero epochs, TST, number of awakenings, movement index, fragmentation index, sleep onset time, out of bed time, number of one minute sleep intervals, average awakenings), time since study enrolment and number of missing or non-wear minutes, and participant ID as a random intercept . We evaluated 190 combinations of overlap threshold and valid day selection, and chose the threshold combination with the lowest marginal R² ³².

Statistical software

All analyses were implemented in R statistical software (v. 4.0).

Results

Collected and missing data

Summary statistics outlining collected data and missingness in the Wellness Monitoring Study are outlined in Table 1, describing missingness due to a lack of data collection at the minute epoch level. Overall, participants were observed for a total of 31,175 days, amounting to 44,891,400 rows (minute epochs) of data. Overall, 36,600,320 rows of data were collected (25,416.89 days), with 18.47% or 8,291,080 rows of data missing across the period of data collection (5,757.69 days). If aggregated at the participant level, each participant had between 0.11 to 100% of data missing. A total of 95 participants had available actigraphy data, and completed 8 weeks of data collection. By 26 weeks of data collection, 84 participants (88.4%) continued data collection, and 73 participants (76.8%) remained by the 52^nd week of data collection.

Table 1 Data missingness in the wellness monitoring study.

Full size table

Non-wear detection

Summary of non-wear according to different algorithms

Table 2 displays non-wear statistics obtained from the non-wear detection methods throughout the study. At the day level, according to the 3 non-wear algorithms, there was a mean of 12.55 to 16.74% of data missing overall throughout the study, whereas the wear sensor detected 16.29% of non-wear throughout the study. At the participant level, where mean statistics were aggregated per participant, each participant had 12.43 to 16.62% of non-wear. Figure 2 shows the distribution of non-wear per day as detected by the different methods.

Table 2 Summary of non-wear in wellness study according to different methods of detection.

Full size table

Overlap of non-wear detection methods

Next, we assessed overlap of non-wear detected by the different algorithms and the wear sensor, finding a high proportion of overlap between all non-wear algorithms (91.55 ± 14.96%) at the day level across all participants. However, overlap with the wear sensor was lower, with a total of 79.32 ± 27.71% overlap of all methods of wear detection (Table 3). Additionally, this overlap of non-wear detection methods did not substantially change over time, as indicated by Figure S3c.

Table 3 Mean and standard deviation daily percent overlap of non-wear in wellness study according to different methods of detection – (Day level) (n = 31,175).

Full size table

Development of a novel non-wear algorithm: the majority algorithm

Table 4 shows performance of the 3-method and 4-method non-wear majority algorithms compared to the other methods of non-wear detection.

Detailed visual quality control was conducted to test the performance of the 4-method majority algorithm on data from 19 participants (20% of the total sample), for a total of 4,600 days, or 6,624,026 rows. Figure S2a shows an example of the visualization used to conduct quality control for the majority algorithm. Most selected participants (n = 15) were chosen from based on their status as relapsers at some point during the study, and additional participants were selected from the non-relapser group to strengthen the validity of this evaluation (n = 4). Inter-rater reliability measured through Cohen’s kappa was κ = 0.94, indicating near perfect inter-rater reliability³³, calculated from 1,991 days of data obtained from 6 participants assessed by 2 raters.

Table 4 Performance of non-wear detection methods in visual quality control.

Full size table

A visualization of the comparative performance of the wear sensor, Choi, Troiano, van Hees and majority (3- and 4- method) algorithms can be found in Supplementary Figure S2b, and results from models comparing algorithm performance statistics can be found in Supplementary Tables S1 and S2. Between the wear sensor, Choi algorithm, Troiano algorithm, van Hees algorithm and majority algorithms (3-method and 4-method versions), the majority algorithms had the best overall performance. The majority algorithms had significantly better accuracy than the wear sensor, Choi and Troiano algorithms (3-method: 0.9887; 4-method: 0.9884; wear sensor: 0.8839; Choi algorithm: 0.9816; Troiano algorithm: 0.9609). The van Hees (0.9866) and majority algorithms had similar accuracy, though the van Hees algorithm’s accuracy did not significantly differ from the Choi algorithm. The majority and van Hees algorithms had performed significantly better than all other methods in specificity (4-method: 0.9982; 3-method:0.9972; van Hees algorithm: 0.9967; Choi algorithm: 0.9885; Troiano algorithm: 0.9632; wear sensor: 0.9154) and PPV (4-method: 0.9665; 3-method: 0.9641; van Hees algorithm: 0.9515; Choi algorithm: 0.9101; Troiano algorithm: 0.6723; wear sensor: 0.6197). Finally, the Troiano algorithm significantly outperformed all other algorithms in terms of sensitivity, followed by the Choi and majority algorithms (Troiano algorithm: 0.9823; Choi algorithm: 0.9617; 4-method: 0.9608; 3-method: 0.9592; wear sensor: 0.9444; van Hees algorithm: 0.9289). The wear sensor had the poorest performance in non-wear detection. Notably, these statistics only capture visually noted intervals of non-wear, which were typically over the length of an hour. Since the 3- and 4-method majority algorithms had comparable performance, which exceeded the single algorithms in accuracy, we used the 3-method majority algorithm in the remainder of our analyses.

In line with previous investigations³⁴, non-wear increased with time since baseline, and variability in non-wear increased with time since baseline, as data from fewer participants were available (see Fig. 3, mean of 4.8% in the first week to 23.6% at the end of 12 months of data collection).

Managing data missingness

Addressing data missingness in the Wellness Monitoring Study

When we combined non-wear scoring with sleep interval scoring, there were, expectedly, periods of overlap between these intervals. We used Spearman correlations to see whether there was a relationship between the main clinical outcome (depressive symptoms according to the MADRS) and data missingness or non-wear patterns. Depressive symptoms according to the MADRS did not correlate with data missingness (rho = − 0.04), nor with non-wear patterns according to any of the methods of non-wear detection (rho = − 0.03 to 0.02) (Figure S3).

Sensitivity analyses

Next, we conducted a sensitivity analysis of the influence of the overlap of sleep intervals with non-wear intervals, and influence of valid day criteria. We tested 200 thresholds for excluding sleep intervals which overlapped with non-wear, and their combination with thresholds of number of hours of data per day for the day to be considered valid, and assessed whether these thresholds impacted the relationship of individual sleep metrics with depressive symptoms. Overlap thresholds were tested in 10% increments, ranging between < 10% overlap and 100% overlap. Valid day thresholds were tested in hourly increments ranging from all collected data, > 6 valid hours to 24 valid hours. See Supplementary Figure S5 for an illustration of this thresholding approach (Table 5).

Altogether, there were 30,093 sleep intervals available for evaluation, and a maximum of 12,438 sleep intervals were excluded through the non-wear percentage threshold approach. There were 515 instances of MADRS observations across the study for 94 participants with valid actigraphy data. The threshold combination of > 20 valid hours and up to 30% overlap between sleep and non-wear intervals, was chosen based on the highest marginal R² value for mixed linear models (See Table 5). This yielded 22,853 total sleep intervals.

Table 5 Sensitivity analysis results: combinations of non-wear thresholds based on 24-h non-wear and % overlap between sleep intervals with non-wear.

Full size table

Discussion

In this study, we present a data-driven pre-processing pipeline for a long-term actigraphy study using the example of the Wellness Monitoring Study which lasted over the course of 12 months of continuous data collection. This study provides a guideline for future digital health research using large, longitudinal actigraphy datasets. Importantly, a novel algorithm for non-wear detection, the majority algorithm was developed, which involved an extensive visual quality control procedure. The majority algorithm significantly outperformed the use of single common non-wear detection methods in terms of accuracy, specificity and positive predictive value, including the GT9X Link wear sensor, the Choi, and Troiano algorithms, and outperformed the van Hees algorithm in sensitivity. A key advantage of the majority non-wear algorithm is that it is relatively easy to implement and will be useful for other models of ActiGraph devices, which also use the capacitive sensor for detecting skin conductance and non-wear. Moreover, this algorithm was developed using open-source packages that are widely available to the public. We found that the wear sensor had the worst performance compared to the algorithms that were calculated, though it was likely able to detect short periods of non-wear that the visual quality control procedure was likely unable to detect, as the visual quality control procedure was not able to verify short non-wear periods. Additionally, the non-wear algorithms were only able to capture intervals of non-wear that were typically over the length of an hour. Our findings of inconsistency in wear sensor performance are similar to both Pulakka and colleagues’ and Arguello and colleagues’, who also witnessed off-wrist time shown by the wear sensor during apparent wear time, and poor sensitivity of the wear sensor^12,13.

As expected, compliance with actigraph wear decreased progressively over the course of the year, from a mean of 4.8% at the beginning of the study, to a mean of 23.6% by the end of the year-long study. To date, the majority of studies using actigraphy have used significantly shorter periods of data collection¹⁰, with some studies reporting wear compliance through periods of 16 weeks to 1 year^34,35,36. In a 16-week longitudinal actigraphy study, Thurman and colleagues found 95.1% compliance with actigraphy measurements, with no changes over time³⁴. In contrast, in a 6-month longitudinal study of pain in patients with sickle cell disease, of the possible 6 months of data collection, participants completed a median of only 85 days of actigraphy data, with a range of 7 to 179 days of data collected, as a result of compliance and technical issues³⁵. A feasibility actigraphy study of 8 participants followed for a total of 150 weeks with the aim of predicting relapses in bipolar disorder had a total of 30% of data missing³⁶. This suggests that there is a range of compliance in studies with actigraphy devices, where longer study duration is associated with lower compliance. We interpret the approximately 70% completeness in actigraphy data obtained in 95 participants with major depression over a 12-month study as positive.

An important strength of our methods study is the amount of data available to us through the longitudinal, naturalistic design of the Wellness Monitoring Study. This unique, longitudinal dataset showed that non-wear increases over the course of a year, though a substantial proportion (n = 59) of participants continued to wear their actigraph until the end of the year mark. Moreover, we were able to address challenges of data pre-processing consistency, by providing a pre-processing pipeline for data extraction, trimming, sleep and non-wear scoring, combining sleep and non-wear intervals, and non-wear threshold selection. In a real-world application, where actigraphs are used to detect, for instance, early signs of relapse, or subtle changes in physical activity, longitudinal data spanning a substantial period of participants’ lives may be used as an early signal.

We found that a threshold of 20 or more valid hours per day combined with 30% or less overlap of sleep intervals with non-wear yielded the best performance of sleep variables as an explanatory variable for depressive symptoms. The findings of our sensitivity analyses support the importance of selecting an appropriate valid day and/or percentage overlap of sleep interval with non-wear criteria in order to obtain stable estimates of the influence of sleep variables on depressive symptoms. This finding is in line with previous studies^12,17, which indicated that pre-processing choices, such as selecting valid day filtering rules impact the influence of physical activity on outcomes. We suggest that future studies control for non-wear based on similar considerations, accounting for the influence of these non-wear thresholds on outcomes.

Limitations

One limitation of this study is the lack of ability of the ActiGraph GT9X Link to adequately detect sleep onset latency without use of a sleep diary. This type of actigraph provides an output of “0” for each of the instances of this value if a sleep diary is not used. This likely means that our estimates of sleep maintenance efficiency were possibly overestimated. Notably, the participants in our study were diagnosed with MDD, and may not reflect the patterns of activity in the general population, and may have a different propensity to remove the actigraph (for instance, during relapse) compared to the general population.

The majority algorithm should be further validated in an independent dataset which is able to provide the actual accurate periods of non-wear, as opposed to visual quality control through a sleep diary or some other measure. Having a sleep diary would allow us to verify the periods of sleep accurately as well, however, in a dataset of this size, with over 31,000 days of data collected, comparing actigraphy data with data from several thousand of sleep diaries would be a significant challenge.

Future directions

Recently, Syed and colleagues trained a deep convolutional neural network algorithm to detect non-wear from raw data by attempting to identify the instance of the hip-worn actigraph being removed and replaced, providing a more precise non-wear algorithm, which performed with high positive predictive value, sensitivity and F1 scores (all above 0.99). One drawback to this algorithm is the need to resample to a frequency of 100 Hz, indicating that data points that do not exist must be interpolated and the effects of resampling on the integrity of the data have not been explored³⁷. Additionally, future studies should investigate the influence of actigraph non-wear time with clinical characteristics of MDD, including relapse, mood symptom worsening, behavioural inhibition, and psychosocial functioning.

Conclusions

This study provides a standardized pre-processing pipeline for a longitudinal actigraphy study, in which data were collected continuously in 95 participants for one year. A novel non-wear algorithm was proposed which outperformed several single algorithms and a capacitive wear sensor in an intensive quality control procedure. Compliance with actigraph wear decreased over time, and sensitivity analyses demonstrated the importance of selecting pre-processing thresholds, as they substantially impacted the predictive value of variables on our main clinical outcome.

Data availability

CAN-BIND and the CAN-BIND Wellness Monitoring study are open science. Data will be released through Ontario Brain Institute’s Brain—CODE platform, which provides the ability to capture and manage data, and enables researchers to share their data, maximizing data discovery (https://www.braincode.ca).

References

Smith, M. T. et al. Use of actigraphy for the evaluation of sleep disorders and circadian rhythm sleep-wake disorders: An American academy of sleep medicine systematic review, meta-analysis, and GRADE assessment. J. Clin. Sleep Med. 14, 1209–1230. https://doi.org/10.5664/jcsm.7228 (2018).
Article PubMed PubMed Central Google Scholar
Martin, J. L. & Hakim, A. D. Wrist actigraphy. Chest 139, 1514–1527. https://doi.org/10.1378/chest.10-1872 (2011).
Article PubMed PubMed Central Google Scholar
Minaeva, O. et al. Level and timing of physical activity during normal daily life in depressed and non-depressed individuals. Transl. Psychiatry 10, 1–11. https://doi.org/10.1038/s41398-020-00952-w (2020).
Article Google Scholar
Slyepchenko, A. et al. Association of functioning and quality of life with objective and subjective measures of sleep and biological rhythms in major depressive and bipolar disorder. Aust. N. Z. J. Psychiatry 53, 683–696. https://doi.org/10.1177/0004867419829228 (2019).
Article PubMed Google Scholar
Baglioni, C. et al. Sleep and mental disorders: A meta-analysis of polysomnographic research. Psychol. Bull. 142, 969–990. https://doi.org/10.1037/bul0000053 (2016).
Article PubMed PubMed Central Google Scholar
Gangwisch, J. E. et al. Short sleep duration as a risk factor for hypertension. Hypertension 47, 833–839. https://doi.org/10.1161/01.HYP.0000217362.34748.e0 (2006).
Article CAS PubMed Google Scholar
Itani, O., Jike, M., Watanabe, N. & Kaneita, Y. Short sleep duration and health outcomes: a systematic review, meta-analysis, and meta-regression. Sleep Med. 32, 246–256. https://doi.org/10.1016/j.sleep.2016.08.006 (2017).
Article PubMed Google Scholar
Pescatello, L. S. et al. Physical activity to prevent and treat hypertension: A systematic review. Med. Sci. Sports Exerc. 51, 1314–1323. https://doi.org/10.1249/mss.0000000000001943 (2019).
Article PubMed Google Scholar
de Vries, L. P., Baselmans, B. M. L. & Bartels, M. Smartphone-based ecological momentary assessment of well-being: A systematic review and recommendations for future studies. J. Happiness Stud. 22, 2361–2408. https://doi.org/10.1007/s10902-020-00324-7 (2021).
Article PubMed Google Scholar
Tazawa, Y. et al. Actigraphy for evaluation of mood disorders: A systematic review and meta-analysis. J. Affect. Disord. 253, 257–269. https://doi.org/10.1016/j.jad.2019.04.087 (2019).
Article PubMed Google Scholar
Benasi, G., Fava, G. A. & Guidi, J. Prodromal symptoms in depression: A systematic review. Psychother Psychosom 90, 365–372. https://doi.org/10.1159/000517953 (2021).
Article PubMed Google Scholar
Pulakka, A. et al. Classification and processing of 24-hour wrist accelerometer data. J. Meas. Phys. Behav. 1, 51–59. https://doi.org/10.1123/jmpb.2017-0008 (2018).
Article Google Scholar
Arguello, D. et al. Validity of proximity sensor-based wear-time detection using the ActiGraph GT9X. J. Sports Sci. 36, 1502–1507. https://doi.org/10.1080/02640414.2017.1398891 (2018).
Article PubMed Google Scholar
Choi, L., Liu, Z., Matthews, C. E. & Buchowski, M. S. Validation of accelerometer wear and nonwear time classification algorithm. Med. Sci. Sports Exerc. 43, 357. https://doi.org/10.1249/MSS.0b013e3181ed61a3 (2011).
Article PubMed PubMed Central Google Scholar
Troiano, R. P. et al. Physical activity in the United States measured by accelerometer. Med. Sci. Sports Exerc. 40, 181. https://doi.org/10.1249/mss.0b013e31815a51b3 (2008).
Article PubMed Google Scholar
van Hees, V. T. et al. Estimation of daily energy expenditure in pregnant and non-pregnant women using a wrist-worn tri-axial accelerometer. PLoS ONE 6, e22922. https://doi.org/10.1371/journal.pone.0022922 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Lee, P. H. A sensitivity analysis on the variability in accelerometer data processing for monitoring physical activity. Gait. Posture 41, 516–521. https://doi.org/10.1016/j.gaitpost.2014.12.008 (2015).
Article PubMed Google Scholar
ActiGraph Corporation. ActiGraph GT9X Link. https://s3.amazonaws.com/actigraphcorp.com/wp-content/uploads/2018/03/06174921/ActiGraph_Link_MarketingSheet_12302016_FINAL_WEB.pdf.
Cole, R. J., Kripke, D. F., Gruen, W., Mullaney, D. J. & Gillin, J. C. Automatic sleep/wake identification from wrist activity. Sleep 15, 461–469. https://doi.org/10.1093/sleep/15.5.461 (1992).
Article CAS PubMed Google Scholar
Troiano, R. P. Translating accelerometer counts into energy expenditure: advancing the quest. J. Appl. Physiol. 1985(100), 1107–1108. https://doi.org/10.1152/japplphysiol.01577.2005 (2006).
Article Google Scholar
van Hees, V. T. et al. Estimating sleep parameters using an accelerometer without sleep diary. Sci. Rep. 8, 12975. https://doi.org/10.1038/s41598-018-31266-z (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Tudor-Locke, C., Barreira, T. V., Schuna, J. M. Jr., Mire, E. F. & Katzmarzyk, P. T. Fully automated waist-worn accelerometer algorithm for detecting children’s sleep-period time separate from 24-h physical activity or sedentary behaviors. Appl. Physiol. Nutr. Metab. 39, 53–57. https://doi.org/10.1139/apnm-2013-0173 (2014).
Article PubMed Google Scholar
Migueles, J. H., Rowlands, A. V., Huber, F., Sabia, S. & van Hees, V. T. GGIR: a research community–driven open source R package for generating physical activity and sleep outcomes from multi-day raw accelerometer data. J. Meas. Phys. Behav. 2, 188–196. https://doi.org/10.1123/jmpb.2018-0063 (2019).
Article Google Scholar
Montgomery, S. A. & Asberg, M. A new depression scale designed to be sensitive to change. Br. J. Psychiatry 134, 382–389. https://doi.org/10.1192/bjp.134.4.382 (1979).
Article CAS PubMed Google Scholar
Tudor-Locke, C. et al. A model for presenting accelerometer paradata in large studies: ISCOLE. Int. J. Behav. Nutr. Phys. Act. 12, 52. https://doi.org/10.1186/s12966-015-0213-5 (2015).
Article PubMed PubMed Central Google Scholar
Actigraph Corporation. What is Sleep Fragmentation and how is it calculated?, https://actigraphcorp.my.site.com/support/s/article/What-is-Sleep-Fragmentation-and-how-is-it-calculated.
Syed, S., Morseth, B., Hopstock, L. A. & Horsch, A. Evaluating the performance of raw and epoch non-wear algorithms using multiple accelerometers and electrocardiogram recordings. Sci. Rep. 10, 5866. https://doi.org/10.1038/s41598-020-62821-2 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
ActiGraph Corporation. wGT3X-BT and GT9X Wear Sensor Details and Commonly Asked Questions, https://actigraphcorp.my.site.com/support/s/article/wGT3X-BT-and-GT9X-Wear-Sensor-Details-and-Commonly-Asked-Questions.
Newman, D. A. Missing data: Five practical guidelines. Organ. Res. Methods 17, 372–411. https://doi.org/10.1177/1094428114548590 (2014).
Article ADS Google Scholar
van Hees, V. T. et al. Separating movement and gravity components in an acceleration signal and implications for the assessment of human daily physical activity. PLoS One 8, e61691. https://doi.org/10.1371/journal.pone.0061691s (2013).
Article ADS PubMed PubMed Central Google Scholar
Cain, K. L., Sallis, J. F., Conway, T. L., Van Dyck, D. & Calhoon, L. Using accelerometers in youth physical activity studies: A review of methods. J. Phys. Act. Health 10, 437–450. https://doi.org/10.1123/jpah.10.3.437 (2013).
Article PubMed PubMed Central Google Scholar
Nakagawa, S. & Schielzeth, H. A general and simple method for obtaining R2 from generalized linear mixed-effects models. Methods Ecol. Evol. 4, 133–142. https://doi.org/10.1111/j.2041-210x.2012.00261.x (2013).
Article Google Scholar
Landis, J. R. & Koch, G. G. The measurement of observer agreement for categorical data. Biometrics 33, 159–174 (1977).
Article CAS PubMed MATH Google Scholar
Thurman, S. M. et al. Individual differences in compliance and agreement for sleep logs and wrist actigraphy: A longitudinal study of naturalistic sleep in healthy adults. PLoS One 13, e0191883. https://doi.org/10.1371/journal.pone.0191883 (2018).
Article CAS PubMed PubMed Central Google Scholar
Pittman, D. D. et al. Evaluation of longitudinal pain study in sickle cell disease (ELIPSIS) by patient-reported outcomes, actigraphy, and biomarkers. Blood 137, 2010–2020. https://doi.org/10.1182/blood.2020006020 (2021).
Article CAS PubMed PubMed Central Google Scholar
Novák, D., Albert, F. & Španiel, F. Analysis of actigraph parameters for relapse prediction in bipolar disorder: A feasibility study. Annu. Int. Conf. IEEE Eng. Med. Biol. Soc. 4972–4975, 2014. https://doi.org/10.1109/embc.2014.6944740 (2014).
Article Google Scholar
Syed, S., Morseth, B., Hopstock, L. A. & Horsch, A. A novel algorithm to detect non-wear time from raw accelerometer data using deep convolutional neural networks. Sci. Rep. 11, 8832. https://doi.org/10.1038/s41598-021-87757-z (2021).
Article ADS CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

CAN-BIND is an Integrated Discovery Program carried out in partnership with, and with financial support from, the Ontario Brain Institute, an independent nonprofit corporation funded partially by the Ontario government. The opinions, results and conclusions are those of the authors, and no endorsement by the Ontario Brain Institute is intended or should be inferred. Additional funding is provided by the Canadian Institutes of Health Research, Lundbeck, Bristol-Myers Squibb and Servier. Funding and/or in-kind support is also provided by the investigators’ universities and academic institutions.

Author information

Authors and Affiliations

Department of Psychiatry and Behavioural Neurosciences, McMaster University, 100 West 5th Street, Suite C124, Hamilton, ON, L8N 3K7, Canada
Anastasiya Slyepchenko, Craig Matthews, Jane A. Foster & Benicio N. Frey
Department of Psychiatry, Dalhousie University, Halifax, NS, Canada
Rudolf Uher & Anna Minarik
Centre for Depression and Suicide Studies, St. Michael’s Hospital, Toronto, ON, Canada
Keith Ho, Susan Rotzinger & Sidney H. Kennedy
Department of Psychiatry, Cumming School of Medicine, and Hotchkiss Brain Institute, University of Calgary, Calgary, AB, Canada
Stefanie Hassel & Valerie H. Taylor
Mood Disorders Program, St. Joseph’s Healthcare Hamilton, Hamilton, ON, Canada
Patricia K. Lukus & Benicio N. Frey
Campbell Family Mental Health Research Institute, Centre for Addiction and Mental Health, Toronto, ON, Canada
Alexander R. Daros, Daniel J. Müller & Lena C. Quilty
University Health Network, University of Toronto, Toronto, ON, Canada
Franca Placenza
Neuroscience, Janssen Research & Development, LLC, Titusville, NJ, 08560, USA
Qingqin S. Li
Department of Psychiatry, University of Michigan, Ann Arbor, USA
Sagar V. Parikh
Center for Depression Research and Clinical Care, UT Southwestern Medical Center, Dallas, TX, USA
Jane A. Foster
Douglas Institute, Department of Psychiatry, McGill University, Montreal, QC, Canada
Gustavo Turecki
Department of Psychiatry, Queen’s University and Providence Care Hospital, Kingston, ON, Canada
Roumen Milev & Claudio N. Soares
Department of Psychiatry, University of Toronto, Toronto, ON, Canada
Lena C. Quilty & Sidney H. Kennedy
Department of Psychiatry, University of British Columbia, Vancouver, BC, Canada
Raymond W. Lam

Authors

Anastasiya Slyepchenko
View author publications
You can also search for this author in PubMed Google Scholar
Rudolf Uher
View author publications
You can also search for this author in PubMed Google Scholar
Keith Ho
View author publications
You can also search for this author in PubMed Google Scholar
Stefanie Hassel
View author publications
You can also search for this author in PubMed Google Scholar
Craig Matthews
View author publications
You can also search for this author in PubMed Google Scholar
Patricia K. Lukus
View author publications
You can also search for this author in PubMed Google Scholar
Alexander R. Daros
View author publications
You can also search for this author in PubMed Google Scholar
Anna Minarik
View author publications
You can also search for this author in PubMed Google Scholar
Franca Placenza
View author publications
You can also search for this author in PubMed Google Scholar
Qingqin S. Li
View author publications
You can also search for this author in PubMed Google Scholar
Susan Rotzinger
View author publications
You can also search for this author in PubMed Google Scholar
Sagar V. Parikh
View author publications
You can also search for this author in PubMed Google Scholar
Jane A. Foster
View author publications
You can also search for this author in PubMed Google Scholar
Gustavo Turecki
View author publications
You can also search for this author in PubMed Google Scholar
Daniel J. Müller
View author publications
You can also search for this author in PubMed Google Scholar
Valerie H. Taylor
View author publications
You can also search for this author in PubMed Google Scholar
Lena C. Quilty
View author publications
You can also search for this author in PubMed Google Scholar
Roumen Milev
View author publications
You can also search for this author in PubMed Google Scholar
Claudio N. Soares
View author publications
You can also search for this author in PubMed Google Scholar
Sidney H. Kennedy
View author publications
You can also search for this author in PubMed Google Scholar
Raymond W. Lam
View author publications
You can also search for this author in PubMed Google Scholar
Benicio N. Frey
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.S. led, conceptualized and performed the analyses, prepared the first draft of the manuscript. K.H., S.H., C.M., P.K.L., A.R.D., A.M., and F.P. developed analytic approach, performed quality control procedure, wrote manuscript. R.U. conceptualized study, developed analytical approach, wrote manuscript. Q.S.L. wrote manuscript. S.R., S.V.P., J.A.F., G.T., D.J.M., V.H.T., L.C.Q., R.M., C.N.S., S.H.K., R.W.L., B.N.F. conceptualized study, wrote manuscript.

Corresponding author

Correspondence to Benicio N. Frey.

Ethics declarations

Competing interests

A.S., A.R.D., B.N.F., L.C.Q., and S.H. have no competing interests to declare. S.H.K. has received funding for Consulting or Speaking engagements from Abbvie, Boehringer-Ingelheim, Janssen, Lundbeck, Lundbeck Institute, Merck, Otsuka Pfizer, Sunovion and Servier. He has received Research Support from Abbott, Brain Canada, CIHR (Canadian Institutes of Health Research), Janssen, Lundbeck, Neurocrine, Ontario Brain Institute, Otsuka, Pfizer, SPOR (Canada's Strategy for Patient-Oriented Research). He holds stock/stock options in Field Trip Health. SR has grant funding from Ontario Brain Institute and holds a patent: Teneurin C-terminal associated peptides (TCAP) and methods and uses thereof.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Slyepchenko, A., Uher, R., Ho, K. et al. A standardized workflow for long-term longitudinal actigraphy data processing using one year of continuous actigraphy from the CAN-BIND Wellness Monitoring Study. Sci Rep 13, 15300 (2023). https://doi.org/10.1038/s41598-023-42138-6

Download citation

Received: 29 December 2022
Accepted: 05 September 2023
Published: 15 September 2023
DOI: https://doi.org/10.1038/s41598-023-42138-6

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Evaluating reliability in wearable devices for sleep staging

Real-world longitudinal data collected from the SleepHealth mobile app study

Wearable-based accelerometer activity profile as digital biomarker of inflammation, biological age, and mortality using hierarchical clustering analysis in NHANES 2011–2014

Introduction

Methods

Data collection

Study design

Data acquisition: raw actigraphy data

Data processing and analysis

Summary

Data trimming

Sleep scoring

Non-wear scoring

Development of a novel non-wear algorithm: the majority algorithm

Addressing data missingness

Sensitivity analyses

Statistical software

Results

Collected and missing data

Non-wear detection

Summary of non-wear according to different algorithms

Overlap of non-wear detection methods

Development of a novel non-wear algorithm: the majority algorithm

Managing data missingness

Addressing data missingness in the Wellness Monitoring Study

Sensitivity analyses

Discussion

Limitations

Future directions

Conclusions

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Information.

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links