Research Article Details
Article ID: | A50278 |
PMID: | 35444985 |
Source: | Front Public Health |
Title: | A Machine Learning Based Framework to Identify and Classify Non-alcoholic Fatty Liver Disease in a Large-Scale Population. |
Abstract: | Non-alcoholic fatty liver disease (NAFLD) is a common serious health problem worldwide, which lacks efficient medical treatment. We aimed to develop and validate the machine learning (ML) models which could be used to the accurate screening of large number of people. This paper included 304,145 adults who have joined in the national physical examination and used their questionnaire and physical measurement parameters as model's candidate covariates. Absolute shrinkage and selection operator (LASSO) was used to feature selection from candidate covariates, then four ML algorithms were used to build the screening model for NAFLD, used a classifier with the best performance to output the importance score of the covariate in NAFLD. Among the four ML algorithms, XGBoost owned the best performance (accuracy = 0.880, precision = 0.801, recall = 0.894, F-1 = 0.882, and AUC = 0.951), and the importance ranking of covariates is accordingly BMI, age, waist circumference, gender, type 2 diabetes, gallbladder disease, smoking, hypertension, dietary status, physical activity, oil-loving and salt-loving. ML classifiers could help medical agencies achieve the early identification and classification of NAFLD, which is particularly useful for areas with poor economy, and the covariates' importance degree will be helpful to the prevention and treatment of NAFLD. |
DOI: | 10.3389/fpubh.2022.846118 |

Strategy ID | Therapy Strategy | Synonyms | Therapy Targets | Therapy Drugs |
---|
Diseases ID | DO ID | Disease Name | Definition | Class | |
---|---|---|---|---|---|
I12 | 10763 | Hypertension | An artery disease characterized by chronic elevated blood pressure in the arteries. https://en.wikipedia.org/wiki/Hypertension, https://www.ncbi.nlm.nih.gov/pubmed/24352797 | disease of anatomical entity/ cardiovascular system disease/vascular disease/ artery disease | Details |
I05 | 9352 | Type 2 diabetes mellitus | A diabetes that is characterized by chronic hyperglycaemia with disturbances of carbohydrate, fat and protein metabolism resulting from defects in insulin secretion, insulin action, or both. A diabetes mellitus that is characterized by high blood sugar, insulin resistance, and relative lack of insulin. http://en.wikipedia.org/wiki/Diabetes, http://en.wikipedia.org/wiki/Diabetes_mellitus_type_2 | disease of metabolism/inherited metabolic disorder/ carbohydrate metabolic disorder/glucose metabolism disease/diabetes/ diabetes mellitus | Details |
Drug ID | Drug Name | Type | DrugBank ID | Targets | Category | Latest Progress | |
---|---|---|---|---|---|---|---|
D328 | Serine | Chemical drug | DB00133 | SRR | Improve insulin resistance | Under clinical trials | Details |
D083 | CLA | Chemical drug | DB01211 | KCNH2; SLCO1B1; SLCO1B3 | -- | Under clinical trials | Details |
D094 | Cysteamine | Chemical drug | DB00847 | GSS stimulant | Renal drug | Under clinical trials | Details |
D095 | Cysteamine bitartrate | Chemical drug | DB00847 | -- | -- | Under clinical trials | Details |