Taylor & Francis Group
Browse

sorry, we can't preview this file

uasa_a_2169699_sm8136.rar (669.02 kB)

A General M-estimation Theory in Semi-Supervised Framework

Download (669.02 kB)
dataset
posted on 2023-02-28, 21:40 authored by Shanshan Song, Yuanyuan Lin, Yong Zhou

We study a class of general M-estimators in the semi-supervised setting, wherein the data are typically a combination of a relatively small labeled dataset and large amounts of unlabeled data. A new estimator, which efficiently uses the useful information contained in the unlabeled data, is proposed via a projection technique. We prove consistency and asymptotic normality, and provide an inference procedure based on K-fold cross-validation. The optimal weights are derived to balance the contributions of the labeled and unlabeled data. It is shown that the proposed method, by taking advantage of the unlabeled data, produces asymptotically more efficient estimation of the target parameters than the supervised counterpart. Supportive numerical evidence is shown in simulation studies. Applications are illustrated in analysis of the homeless data in Los Angeles. Supplementary materials for this article are available online.

Funding

Lin’s work was supported by the Hong Kong Research Grants Council (grant no. 14306219 and 14306620), the National Natural Science Foundation of China (grant no. 11961028) and Direct Grants for Research, The Chinese University of Hong Kong. Zhou’s work was supported by the State Key Program of National Natural Science Foundation of China (71931004) and the National Key R&D Program of China (2021YFA1000100, 2021YFA1000101).

History