Logo image
The determinants of health assessment in the United States: A supervised learning approach
Journal article   Open access   Peer reviewed

The determinants of health assessment in the United States: A supervised learning approach

Guillaume Coqueret
Healthcare Analytics
01/11/2022

Abstract

Supervised learning Feature importance Permutation importance Descriptive analytics Tabular networks Data Mining
In this article, we exploit a large dataset of surveys to answer a simple questions: which factors drive good (or bad) health? Using a set of 14 very diverse predictors (both socioeconomic and physiological), we perform sets of supervised learning tasks to determine which variables best explain the self-assessment of health conditions. Our predictive algorithms range from simple regressions to tabular networks and include random forests, all of which allow for some interpretability, directly or indirectly, either via feature importance or via conditional permutation influence of the trained models. Our results indicate that two indicators, in particular, emerge as potent determinants of physical well-being: income and exercise. The body mass index is the third main driver, though its role is less prominent. Importantly, for reproducibility, the dataset used in the study is in open access.
pdf
HA_Coqueret_202211686.91 kBDownloadView
Open Access CC BY V4.0
url
https://doi.org/10.1016/j.health.2022.100106View
Published (Version of record) Open

Metrics

10 Record Views

Details

Logo image