Welcome to Journal of University of Chinese Academy of Sciences,Today is

Journal of University of Chinese Academy of Sciences

Previous Articles     Next Articles

Robust subgroup analysis based on Huber loss for high-dimensional heterogeneous data

SU Jing, HAN Chao, ZHANG Weiping   

  1. Department of Statistics and Finance, School of Management, University of Science and Technology of China,Hefei 230026, China
  • Received:2025-03-25 Revised:2025-06-19

Abstract: Based on the general linear regression model, this paper considers the heterogeneity of individual intercepts and the high dimensional covariates in the model. In order to deal with the problem of data anomalies and improve the robustness of the model, we adopt Huber loss function. Meanwhile, we propose a center-based penalty to identify potential subgroups and implement covariates selection by using concave penalty. In the aspect of algorithm, we design a new hybrid algorithm based on alternating direction multiplier method (ADMM) and coordinate descent method to solve the objective function. At the theoretical level, this paper successfully constructs the asymptotic property of Oracle estimators, and rigorously proves its close relationship with the objective function, which guarantees the effectiveness of the proposed method in potential subgroup identification and variable selection. Numerical simulation and empirical data analysis fully demonstrate the robustness and effectiveness of the proposed method in subgroup identification and high-dimensional data processing.

Key words: Huber loss, subgroup analysis, high-dimensional data, Oracle properties

CLC Number: