Introduction

According to Korea's 2021 statistics on older adults, older adults comprise 16.5% of the total population in South Korea. Among them, community-dwelling older adults living alone account for 35.1%, a proportion that is expected to continue rising1. According to the 2019 statistical data, as the number of older adults has increased, depressed patients in this group have come to account for 40.4% of the total population, and this proportion is continuously increasing. Depression in older adults negatively affects their overall life and, depending on its severity, may lead to death; prolonged depression can be fatal for older adults living alone2,3,18. Among the many factors that cause depression in older adults, social isolation and self-neglect are the most common4,19. The recent "social distancing" guidelines15,16,17, under which people were advised to stay at home during the COVID-19 pandemic, indirectly contributed to the onset of depression in older adults5. Isolation resulting from reduced intimacy with neighbors and fewer interpersonal relationships under "social distancing" can lead to depression19. Further, the rate of self-neglect, in which older adults do not take care of themselves and remain inactive, is increasing every year20. Self-neglect is also significantly correlated with depression4. In addition, factors such as economic status and loss of appetite lead to irregular nutritional intake in older adults, which increases their nutritional risk21,22. Although such irregular eating habits do not cause short-term health deterioration, they can worsen health in the long term.

Social isolation, self-neglect, and abnormal nutritional habits can be discovered by observing daily life patterns and through the activities of daily living (ADL) assessment. ADL refers to the basic activities that an individual must perform to function independently in daily life10. A correlation between ADLs and cognitive dysfunction has been reported11. Because ADLs are used as indicators to measure the severity of dementia24 and to determine the effectiveness of dementia treatment agents12, they can provide specific information on the ability of older adults to manage their daily life independently13. Currently, ADL assessments are conducted either directly with older adults or through observations of and interviews with their family members and nurses13,14. However, the reliability of such assessments is questionable, as they may be affected by subjective factors12.

In recent times, the fourth industrial revolution has garnered considerable attention, and the development of technologies to realize it has accelerated. Consequently, numerous studies are being conducted to solve current societal problems through the convergence of fourth-industrial-revolution technologies6. Technologies for monitoring older adults have been proposed to maintain their independence, aid in decision-making, and reduce the burden on caregivers7,40. Further, older adult monitoring systems can help before unexpected events occur39. The focus of current monitoring technology is on identifying emergency situations involving older adults8,9. However, a monitoring system focusing on emergencies cannot capture the general lifestyle of an older adult, such as inactivity and self-neglect. Several researchers have also studied non-emergency, daily life detection. Yang and Hsu monitored the rhythm of the daily life of older adults using four movement detectors and one appliance usage detector; however, they could not detect the exact action being performed41. Awais et al. performed daily living behavior monitoring through four sensors attached to the body, but only four simple actions (sit, stand, walk, and lie down) were detectable42. Matsui et al. used several motion sensors, ambient sensors, and door sensors to monitor the daily life of older adults, but only five behaviors (bathing, cooking, eating, going out, and sleeping) could be detected, with low precision43. Fernando et al. detected falls and daily life behaviors of older adults using a camera; however, only a small number of daily life categories (standing, sitting, walking, falling, and grabbing) were monitored44. As these previous studies show, current technology is limited in accurately monitoring daily life, including ambiguous movements, and thus many categories of daily life cannot be detected.

Therefore, this study was performed to design a monitoring system that can automatically evaluate various categories of behaviors related to social isolation, inactivity, and unhealthy habits. The designed monitoring system can detect several behavioral categories together, unlike current monitoring systems, which focus on detecting life-threatening emergencies. Furthermore, instead of performing simple monitoring, the developed system can be applied to data from older adults' actual daily lives. The study was therefore planned in two stages: first, we advanced the camera-based technology for monitoring older adults' daily lives; second, we plan to apply this technology in the real settings of older adults institutionalized in nursing homes in Kang Hwa, South Korea. This article focuses on the first stage of our study. We expect this camera-based monitoring system to improve the quality of health care services. Furthermore, the monitoring system can make the ADL evaluation, which currently depends on observations and interviews, objective and quantitative, and ensure the safe management of older adults40,45.

Methods

Materials

In this study, we used the ETRI-Activity3D dataset, which was built by filming the lives of older adults from a robot's point of view with the aim of addressing the problems of an aging society and learning the behaviors of older adults32. The data were collected in a 102 m2 apartment setting reflecting a real home environment, and the dataset was constructed to elicit behaviors as close as possible to those of older adults in their daily lives. Figure 1 shows a typical example of the video data. The actual data used from the ETRI-Activity3D dataset are listed in Table 1. Use of the ETRI-Activity3D data was approved on April 20, 2021, after an agreement was submitted on the ETRI AI Nanum website (https://nanum.etri.re.kr/share/dhkim008/robot_environment2?lang=en_KR).

Figure 1

Sample of the ETRI-Activity3D dataset.

Table 1 Definition of ETRI-Activity3D data.

There were a total of 55 classes in the ETRI-Activity3D dataset. However, in accordance with the behavior classes for intrinsic risk outlined above, the categories were defined as (1) behaviors that can be used to assess the ability to perform ADLs as part of the basic daily routine; (2) behaviors that may be unhealthy in the long term; and (3) behaviors that can indicate the social relationships of older adults. Accordingly, we defined a total of 13 classes satisfying these conditions, as presented in Table 2. Several behavior classes were merged and redefined into a single class. The behavior of using a vacuum cleaner and that of cleaning the floor while bending forward were combined and defined as "Cleaning the room." Similarly, reading a book and reading a newspaper were combined and defined as "Reading." The actions of making or receiving a call and operating a smartphone were combined and defined as "Using a phone".
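This class merging can be expressed as a simple lookup table. The following minimal Python sketch illustrates the idea; the dictionary keys paraphrase the original ETRI-Activity3D class names and are not the dataset's exact identifiers.

```python
# Illustrative mapping from original ETRI-Activity3D behavior labels to the
# merged classes defined in Table 2; key strings paraphrase the original
# class names and are not the dataset's exact labels.
CLASS_MERGE = {
    "using a vacuum cleaner": "Cleaning the room",
    "cleaning the floor while bending forward": "Cleaning the room",
    "reading a book": "Reading",
    "reading a newspaper": "Reading",
    "making or receiving a call": "Using a phone",
    "operating a smartphone": "Using a phone",
}

def merge_label(original_label: str) -> str:
    """Return the merged class for a label, or the label itself if unmerged."""
    return CLASS_MERGE.get(original_label, original_label)
```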

Table 2 Definition of behavior classes used by the developed system.

Action recognition

Action recognition technology is essential for monitoring the daily lives of older adults and has been widely applied to video information retrieval, daily life security, and CCTV surveillance23. This technology is divided into two tasks: action classification and action detection24. Action classification classifies the type of action a person is performing in a video; consequently, the video used as input should be trimmed to contain a single action. In contrast, action detection identifies which action is being performed at a given point in an untrimmed video, based on specific class criteria23,24. Since most videos collected in real life are unedited and include multiple actions rather than a single specific action, action detection is essential for recognizing the target actions of a person in an actual video24,25.

The methods of action recognition can be broadly divided into the following categories: (i) methods that use RGB image data directly26,27,34, and (ii) methods that detect an action using the skeleton coordinates of the human body derived from the RGB image data28,29. However, when RGB images are used directly, action detection may be sensitive to information other than the action itself, such as background and color19,21,22,34. Performing action recognition with the human skeleton coordinates extracted from RGB images overcomes this limitation, because only the motion information is retained, unaffected by background or lighting30,31. For this reason, Posec3d28, which performs skeleton-based action recognition, was used in this study.

Posec3d can be divided into two main segments: a pose estimator and a behavior detector. In the pose estimator, the input video is sliced into images at a certain rate (frames per second; fps), and the human skeleton coordinates are derived from each image. In this study, the pose estimator used the top-down method, which has an advantage over the bottom-up method in terms of accuracy36; that is, the human body was detected first, and then the human skeleton coordinates were derived. Specifically, the object detection algorithm Faster R-CNN (faster region-based convolutional neural network)37 was used to detect a person, and the pose estimation algorithm HRNet (high-resolution network)35 was used to extract the skeleton coordinates of the detected person. Each extracted set of skeleton coordinates was converted into a two-dimensional (2D) heatmap, and the heatmaps were stacked along the time axis to construct a 3D heatmap. The behavior detector fed this 3D heatmap into a 3D-ResNet-based 3D CNN to detect and identify the actions performed by the person in the video. Figure 2 shows the framework of Posec3d.
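To illustrate the heatmap construction step, the following minimal Python sketch shows how per-frame keypoints can be rendered as 2D Gaussian heatmaps and stacked over time into a 3D volume. It is a sketch under simplifying assumptions (keypoints already scaled to the heatmap resolution, a single person per frame, and illustrative resolution and sigma values), not the Posec3d implementation itself.

```python
import numpy as np

def keypoints_to_heatmaps(keypoints, scores, h=64, w=64, sigma=2.0):
    """Render one frame's keypoints as per-joint 2D Gaussian heatmaps.

    keypoints: (K, 2) array of (x, y) positions, assumed already scaled to
    the heatmap resolution; scores: (K,) keypoint confidences.
    Returns a (K, h, w) array. The resolution and sigma are illustrative."""
    ys, xs = np.mgrid[0:h, 0:w]
    heatmaps = np.zeros((len(keypoints), h, w), dtype=np.float32)
    for k, ((x, y), conf) in enumerate(zip(keypoints, scores)):
        gaussian = np.exp(-((xs - x) ** 2 + (ys - y) ** 2) / (2 * sigma ** 2))
        heatmaps[k] = conf * gaussian  # weight each joint map by its confidence
    return heatmaps

def stack_heatmaps(frame_keypoints, frame_scores):
    """Stack per-frame heatmaps along time into a (T, K, h, w) 3D volume,
    the kind of input the 3D CNN behavior detector consumes."""
    return np.stack([keypoints_to_heatmaps(kp, sc)
                     for kp, sc in zip(frame_keypoints, frame_scores)])
```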

Figure 2

Posec3d framework28.

Development of the ADL monitoring system

Framework

The study framework (Fig. 3) records older adults' behaviors using Posec3d and is designed to evaluate their ADLs in order to detect intrinsic risks. Posec3d achieves state-of-the-art performance in action recognition using human skeleton coordinates33.

Figure 3

Behavior monitoring system framework.

In this study, we evaluated behaviors of older adults related to social isolation, self-neglect, and long-term unhealthy habits. The behaviors examined in this study are: (1) basic behaviors that can be used to evaluate the personal capacity for ADLs, (2) behaviors that can be unhealthy in the long term, and (3) behaviors that can be assessed to identify the social relationships of older adults.

Data processing

All the performance evaluation processes were carried out using Python. Figure 4 shows the series of steps used to preprocess the data. These steps created the annotation source used for training the Posec3d algorithm.

Figure 4

Illustration of the process of creating annotations.

The training and test data were split in a ratio of 8:2 based on the number of participants, yielding a training dataset (8 males and 8 females) and a test dataset (2 males and 2 females). The ETRI-Activity3D videos were filmed at 25 fps and were sliced into images at 25 fps using ffmpeg.
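A minimal sketch of this slicing step is shown below; it wraps the real ffmpeg command-line tool from Python, and the file paths and frame-naming pattern are illustrative assumptions.

```python
import subprocess
from pathlib import Path

def slice_video(video_path: str, out_dir: str, fps: int = 25) -> None:
    """Slice a video into JPEG frames at the given rate using ffmpeg."""
    Path(out_dir).mkdir(parents=True, exist_ok=True)
    subprocess.run(
        ["ffmpeg", "-i", video_path, "-vf", f"fps={fps}",
         str(Path(out_dir) / "img_%05d.jpg")],  # illustrative naming pattern
        check=True,
    )

# Example (hypothetical paths):
# slice_video("videos/sample_video.mp4", "frames/sample_video", fps=25)
```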

An annotation is a collection of information about the data used for training. In this study, the annotation contained the information listed in Table 3 for each video. This information was stored in a dictionary format, and the final annotation took the form of a list containing the annotation of each video. Here, "Frame_dir" distinguishes the name of the video, and "Img_shape" and "Original_shape" indicate a frame height of 1080 and a width of 1920 pixels, stored as (1080, 1920). Because each video had a different length, the total number of frames varied between videos.
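As a concrete illustration, one per-video annotation entry could look like the sketch below; the key names follow the fields described above and in Table 3, while the leading single-person dimension, array shapes, and example values are assumptions for illustration only.

```python
import numpy as np

num_frames, num_keypoints = 120, 17  # frame count varies per video; 17 keypoints (Table 4)

annotation = {
    "frame_dir": "video_0001",        # hypothetical video identifier
    "img_shape": (1080, 1920),        # (height, width) of the sliced frames
    "original_shape": (1080, 1920),
    "total_frames": num_frames,
    # skeleton coordinates: (persons, frames, keypoints, xy); one person assumed
    "keypoint": np.zeros((1, num_frames, num_keypoints, 2), dtype=np.float32),
    # confidence of each keypoint: (persons, frames, keypoints)
    "keypoint_score": np.zeros((1, num_frames, num_keypoints), dtype=np.float32),
    "label": 0,                       # behavior class number (Table 4)
}

annotations = [annotation]  # the final annotation is a list of such dictionaries
```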

Table 3 Annotation definitions and examples used in the system.

The keypoints were extracted using HRNet35, and a total of 17 skeleton coordinates were used, as shown in Table 4. The order of the skeleton coordinates is specified in Table 4. Here, "Keypoint_score" indicates the confidence value of each keypoint; a high confidence value means that the corresponding skeleton coordinate was detected with high accuracy. Furthermore, "Label" encodes the behavior filmed in the video, with each behavior represented by a number (Table 4).

Table 4 Definitions of the keypoints, used for pose extraction, and behavior class number.

Algorithm for judging the behavior of older adults

Posec3d is an algorithm for action classification rather than action detection when recognizing actions from skeleton coordinates; that is, it assigns the entire input video to a single class. However, it is difficult to use in its original form, because monitoring older adults' daily behaviors requires detection, i.e., identifying which behavior is performed at which time over the course of the daily routine. Therefore, a window of 90 frames was defined as the duration of one action, and the problem was addressed by re-running Posec3d every five frames. Using this approach, action detection was performed every five frames to detect the actions of the older adults, and the detected actions, together with their corresponding times, were recorded in the database. The video was divided into image data according to the set number of frames per second; for example, at 25 fps, five images correspond to 0.2 s and 250 images correspond to 10 s. In this way, time information was derived, which allowed us to record the start and end times of the older adult's activities, build a database, and determine from the database which activities the older adult performed at which times. Through this process, the repeated daily behaviors of older adults could be identified from the stored database and used as baseline data for ADLs.
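A minimal sketch of this sliding-window detection step is given below. It assumes a hypothetical `classify_window` function that wraps the trained Posec3d model and a 25 fps frame sequence; merging consecutive identical predictions into start/end records is a simplified illustration of the database logic described above.

```python
FPS = 25
WINDOW = 90   # frames treated as one action (3.6 s at 25 fps)
STRIDE = 5    # re-run detection every 5 frames (0.2 s at 25 fps)

def detect_actions(frames, classify_window):
    """Slide a 90-frame window over the video in 5-frame steps and merge
    consecutive identical predictions into (behavior, start_s, end_s) records."""
    records = []
    for start in range(0, len(frames) - WINDOW + 1, STRIDE):
        label = classify_window(frames[start:start + WINDOW])
        t_start, t_end = start / FPS, (start + WINDOW) / FPS
        if records and records[-1][0] == label:
            # same behavior continues: extend the previous record's end time
            records[-1] = (label, records[-1][1], t_end)
        else:
            records.append((label, t_start, t_end))
    return records  # rows of (behavior, start time [s], end time [s]) for the database
```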

Statistical analysis and experiment

To evaluate the accuracy of the machine learning and deep learning models, we used performance indicators such as accuracy, precision, recall, and F1-score. Accuracy is the ratio of the number of true positives and true negatives to the total number of predictions. Precision is the ratio of true positives to the sum of true positives and false positives. Recall is the ratio of true positives to the sum of true positives and false negatives. The F1-score is the harmonic mean of precision and recall. Because the data used in this study showed some mild class imbalance, we aimed to achieve an F1-score of more than 95%. In this experiment, we used the Ubuntu 18.04 LTS operating system and trained the model on two RTX 3090 GPUs. The total batch size was set to 64, the learning rate was set to 0.025, the model was trained for 1000 epochs, and stochastic gradient descent was used as the optimizer, with cross-entropy as the loss function. The model was validated on the test annotations every 10 epochs, and the model with the highest accuracy on the older adults' target behaviors was selected as the final model. Specifically, among the 100 validations performed over the 1000 epochs, the model at epoch 940 showed the best performance and was selected, as depicted in Fig. 5.
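For reference, the sketch below shows one way to compute these indicators from the true and predicted labels of the test set using scikit-learn; the use of this library and of macro averaging over the 13 classes are illustrative assumptions, not the study's own evaluation script.

```python
from sklearn.metrics import (accuracy_score, confusion_matrix, f1_score,
                             precision_score, recall_score)

def evaluate(y_true, y_pred):
    """Compute accuracy, precision, recall, F1-score, and the confusion matrix.

    Macro averaging weights every behavior class equally, which is one
    reasonable choice for a mildly imbalanced multi-class problem."""
    return {
        "accuracy": accuracy_score(y_true, y_pred),
        "precision": precision_score(y_true, y_pred, average="macro"),
        "recall": recall_score(y_true, y_pred, average="macro"),
        "f1": f1_score(y_true, y_pred, average="macro"),
        "confusion_matrix": confusion_matrix(y_true, y_pred),
    }
```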

Figure 5

Result of training Posec3d. Based on the highest accuracy, the model at epoch 940 was selected as the final model.

Research ethics

The experimental data for verification were approved by the institutional review board of Dongguk University (DUIRB-202106-15). All methods were carried out in accordance with the relevant guidelines and regulations. Informed consent was obtained from all subjects and/or their legal guardians for the inclusion of their information/images in an online open-access publication. Additionally, informed consent for participation was obtained from all subjects.

Results

Evaluation of the test dataset

We obtained excellent behavior recognition performance, with 98% accuracy, 98% precision, 99% recall, and a 98% F1-score, showing that all the indicators met the performance criterion. In Fig. 6a, the actual and predicted label values are expressed as a confusion matrix. Since the number of samples per class was not the same, a confusion matrix normalized by class size is shown in Fig. 6b. The x- and y-axes in Fig. 6a,b represent the predicted and actual labels, respectively. Table 5 presents the performance indicators for each class.

Figure 6

Confusion matrix of the results predicted by the behavior monitoring system. (a) Unnormalized confusion matrix. (b) Confusion matrix normalized by the number of samples in each class.

Table 5 Performance indicators for each behavior class of the system.

The average F1-score was 98%, exceeding the 95% threshold established for reliable behavior detection. However, as evident from Table 5, not all behaviors are detected equally well. The precision of "Eating" is 96%, but its recall is 89%, which is inferior to the performance of the other behavior classes. Figure 6 reveals that "Eating" is most often falsely detected as "Reading," because reaching for food and grabbing a book involve similar arm movements. Most of the behaviors are performed in standing or sitting positions, whereas class 12, "Lying," is performed in a lying position and is therefore distinct from the others. As a result, "Lying" is detected excellently, with 100% accuracy, precision, and recall.

Application of real data

The model was applied to actual videos to verify whether the actions shown in them were properly predicted. As shown in Fig. 7, the model was applied in older adults' actual residential spaces to test and confirm that their actions were recorded in the database. In addition, it was applied to various existing videos to verify its workability in a real environment. The data shown in Fig. 7a,b were collected in a real environment, and permission to use the data was obtained from the person in the video. The data shown in Fig. 7c,d were published on YouTube38. The data in Fig. 7a,b could be transformed into a behavioral database because the actual shooting time (15:00) was known. However, the data in Fig. 7c,d could not be transformed in the same way, because the actual time could not be determined from the news footage. For this reason, the database for these videos was built to display changes in behavior in seconds as the videos were replayed. As shown in Fig. 7, when a video is given, the position of the person in the video and the person's skeleton coordinates are detected. When the images in Fig. 7a,b were applied to the proposed system, some false detections occurred; for example, "Using a phone" was falsely detected as "Using a remote." To handle this, the database was designed so that a behavior detected only for a short time, and differing from the three detections before and after it, was identified as a false detection and removed. The behaviors could therefore be organized into a database as shown on the right side of Fig. 7. These results confirm that various behaviors defined as intrinsic health risks were successfully detected and demonstrate the feasibility of long-term monitoring using the proposed system.
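The removal of such short-lived false detections can be sketched as a simple temporal filter over the detection records. The sketch below is a simplified version of the rule described above: it compares one neighboring record on each side rather than three, and the minimum-duration threshold is an illustrative assumption.

```python
def remove_short_misdetections(records, min_duration=1.0):
    """Drop a briefly detected behavior that disagrees with its neighbors,
    then merge adjacent records that share the same behavior.

    records: list of (behavior, start_s, end_s) tuples ordered by time.
    min_duration: shortest duration (s) kept when neighbors disagree (illustrative)."""
    cleaned = []
    for i, (label, start, end) in enumerate(records):
        prev_label = records[i - 1][0] if i > 0 else None
        next_label = records[i + 1][0] if i + 1 < len(records) else None
        is_short = (end - start) < min_duration
        if is_short and prev_label is not None and prev_label == next_label \
                and label != prev_label:
            continue  # treat as a false detection sandwiched inside a stable behavior
        if cleaned and cleaned[-1][0] == label:
            cleaned[-1] = (label, cleaned[-1][1], end)  # merge with the previous record
        else:
            cleaned.append((label, start, end))
    return cleaned
```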

Figure 7

Application of the proposed method to monitor the behavior of an actual older adult. (a,b) Validation through collected data. (c,d) Validation using online news data.

Discussion

The focus of this study was to detect, with high accuracy, the various behaviors defined as chronic health risks in older adults. Identifying such behaviors with high accuracy makes it possible to detect an older adult's various daily behaviors in a dwelling, and storing these behaviors in a database enables long-term monitoring. Since older adults spend much of their time at home, the home is a valuable place for them and not just a place to live46. Therefore, we detected and identified their behaviors in a house.

Unlike the study of Yang and Hsu, which detected the rhythms of daily life in older adults but not the exact actions, our study had a considerably broader objective, as the proposed monitoring system covers a wide range of intrinsic health behaviors of older adults41. Awais et al. established a monitoring system for ADL evaluation but detected only four behaviors (sit, stand, walk, and lie down), reporting an F1-score of 87.2%42, and Matsui et al., who detected only five behaviors, reported an F1-score of 40.7%43; both are lower than the F1-score obtained in our study. When only a small number of behaviors are detected, building and utilizing a database of older adults' behaviors is severely limited, because any unlearned behavior cannot be detected. In this study, we overcame these limitations and performed detection for 13 different behaviors, achieving an average F1-score of 98%. Through the proposed monitoring, ADL evaluations can be performed, which can reduce the burden on the observer. This study is foundational research focused on the design and development of a long-term monitoring system for behavior identification and risk avoidance for older adults. After a database has been established through long-term monitoring with the proposed system, the main behavioral patterns of an older adult in the house can be derived. Further, it is possible to detect abnormal behaviors that deviate from these main patterns and to detect diseases such as dementia47. Such a system serves as a telemedicine tool that can manage the health of older adults and improve their quality of life in a non-invasive way.

Conclusions

We used a Posec3d-based algorithm that performs deep-learning-based action recognition to develop an older adult monitoring system capable of monitoring older adults living alone in a residential setting. The monitoring system organizes and stores the detected actions in a database together with their start and end times. In particular, older adults' behaviors that can lead to social isolation, self-neglect, and health deterioration were defined as intrinsic risks in this study. Accordingly, three behavior categories were defined: (1) behaviors that can assess the ability to perform ADLs in daily life, (2) behaviors that can be unhealthy in the long run, and (3) behaviors that indicate the social relationships of older adults. The ETRI-Activity3D dataset, which includes videos of older adults, was used. Compared with other behavioral datasets, this dataset captures distinctive physical features of older adults, such as a curved spine, that differ from those of young adults. In this way, we could perform monitoring specifically customized for older adults.

Diverse older adult data could not be used because of storage limitations. To overcome the lack of diversity in the study data, we will apply this technology in real settings with older adults, including personal houses and institutional settings such as nursing homes, in the next stage of this research. In this study, human skeleton data, which contain only motion information, were used, and thus similar actions could not always be detected properly. Therefore, in future studies, action recognition should be performed using RGB image data and skeleton data in a multimodal form. Moreover, more accurate and comprehensive action recognition studies need to be conducted to further strengthen the health management of older adults living alone.