已发表论文

相对脂肪量和物理指数作为胆结石形成的预测因素:来自机器学习和逻辑回归的见解

 

Authors Deng L, Wang S, Wan D, Zhang Q, Shen W, Liu X, Zhang Y

Received 21 November 2024

Accepted for publication 23 January 2025

Published 31 January 2025 Volume 2025:18 Pages 509—527

DOI https://doi.org/10.2147/IJGM.S507013

Checked for plagiarism Yes

Review by Single anonymous peer review

Peer reviewer comments 2

Editor who approved publication: Dr Christian Selinger

Laifu Deng,1,* Shuting Wang,1,* Daiwei Wan,1,* Qi Zhang,2 Wei Shen,1 Xiao Liu,1,* Yu Zhang1,* 

1Department of General Surgery, Wuxi Medical Center of Nanjing Medical University, Wuxi, People’s Republic of China; 2Department of Oncology, Tengzhou Central People’s Hospital, Jining Medical College, Shandong, People’s Republic of China

*These authors contributed equally to this work

Correspondence: Xiao Liu; Yu Zhang, Department of General Surgery, Wuxi Medical Center of Nanjing Medical University, Wuxi, People’s Republic of China, Tel +8613561164619 ; +8613793706777, Email ningzhouwxrmyy@163.com; yuzhangjs@yeah.net

Purpose: Gallstones (GS), a prevalent disorder of the biliary tract, markedly impair patients’ quality of life. This study aims to construct predictive models employing diverse machine learning algorithms to elucidate risk factors linked to gallstone formation.
Patients and Methods: This study integrated data from the National Health and Nutrition Examination Survey (NHANES) with a cohort of 7868 participants from Wuxi People’s Hospital and Wuxi Second People’s Hospital, including 830 individuals diagnosed with gallstones. To develop our predictive model, we employed four algorithms—Logistic Regression, Gaussian Naive Bayes (GNB), Multi-Layer Perceptron (MLP), and Support Vector Machine (SVM). The models were validated internally through k-fold cross-validation and externally using independent datasets. Furthermore, we substantiated the link between relative fat mass (RFM) and gallstone formation by employing four logistic regression models, conducting subgroup analyses, and applying restricted cubic spline (RCS) curves.
Results: The logistic regression algorithm demonstrated superior predictive capability for all risk factors associated with gallstone occurrence compared to other machine learning models. SHAP analysis identified RFM, weight-to-waist index (WWI), waist circumference (WC), waist-to-height ratio (WHtR), and body mass index (BMI) as prominent predictors of gallstone occurrence, with RFM emerging as the primary determinant. A fully adjusted multivariate logistic regression analysis revealed a robust positive association between RFM and gallstones. Subgroup analysis further indicated that subgroup factors did not alter the positive relationship between RFM and gallstone prevalence.
Conclusion: Among the four algorithmic models, logistic regression proved most effective in predicting gallstone occurrence. The model developed in this study offers clinicians a valuable tool for identifying critical prognostic factors, facilitating personalized patient monitoring and tailored management.

Keywords: gallstones, atherogenic index of plasma, national health and nutrition examination survey, cross-sectional studies, risk factor, machine learning