Paper
10 November 2022 Disk failure prediction via Lightgbm model
Sitao Ji, Xingquan Zuo, Hai Huang
Author Affiliations +
Proceedings Volume 12331, International Conference on Mechanisms and Robotics (ICMAR 2022); 123313G (2022) https://doi.org/10.1117/12.2652965
Event: International Conference on Mechanisms and Robotics (ICMAR 2022), 2022, Zhuhai, China
Abstract
Disk failure prediction has become a major concern with the rapid expansion of data centers and the demand for reliable storage systems. Based on extremely unbalanced positive and negative samples in the disk data set, this paper proposes a fast and accurate disk failure prediction model. Target disks are selected from the actual disk data set, and the correlation matrix is used to filter the features with high correlation coefficients, which makes the model easy to fit. Aiming at the imbalance of positive and negative samples, an under-sampling strategy is implemented through Mini Batch K-Means clustering, and the Focal Loss function is introduced into the LightGBM model to reduce the impact of the imbalance data. Compared with several traditional machine learning models, the effectiveness of the proposed prediction model is verified on the public real data set.
© (2022) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Sitao Ji, Xingquan Zuo, and Hai Huang "Disk failure prediction via Lightgbm model", Proc. SPIE 12331, International Conference on Mechanisms and Robotics (ICMAR 2022), 123313G (10 November 2022); https://doi.org/10.1117/12.2652965
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Data modeling

Statistical modeling

Failure analysis

Machine learning

Data storage

Manufacturing

Clouds

Back to Top