The effect of air pollution and household environmental indicators on anemia status among under-five children in low-middle income countries: exploring potential machine learning applications on demographic and health surveys. Journal Abstract - Guideline Central

The effect of air pollution and household environmental indicators on anemia status among under-five children in low-middle income countries: exploring potential machine learning applications on demographic and health surveys.

Published: 2026 Jan 19

Authors

, , , ,

Abstract

Exposure to air pollution has been associated with anemia in children, but little effort has been made in low- and middle-income countries (LMICs) in which the prevalence of anemia is persistently high. This study aimed to assess the effects of air pollution and household environmental indicators on anemia among children aged 6-59 months using machine learning algorithms. The Demographic and Health Survey (DHS) datasets from 45 LMICs were linked with the satellite-derived estimates of annual average particulate matter (PM) and nitrogen dioxide (NO) based on children's area of residence. The modified Poisson regression model was used to assess the association between exposure to air pollutants, household environmental indicators, and anemia status of children. Machine learning algorithms (MLA) such as logistic regression, Ridge, Lasso, elastic net, Artificial Neural Network, Naïve Bayes, Boosting, and Random Forest were used for predicting the anemia status. We randomly split the dataset into two (train/test), and the model performance was evaluated using sensitivity, specificity and the area under the receiver operating characteristic curve (AU-ROC). The study included 177,251 under-five children, of which 99,290 (56%) were anemic and varied across countries ranging from 16% (Armenia) to 81% (Mali). A child who lived in areas with a PM concentration above the WHO recommended guidelines has a 26% higher risk of being anemic (aPR = 1.26; 95% CI 1.22-1.30) and a child from households having clean fuel for cooking, improved water, and improved sanitation have 24%, 3%, and 13% lower risk of being anemic. The random forest MLA achieved the best classification accuracy of 68%, specificity of 54%, sensitivity of 79% and AUC of 74%. The MLA is more effective than traditional analytical approaches in predicting anemia status. The RF revealed that the age of the child, wet day of the environment, the location of the child, and PM are the most important features to predict the anemia status of a child.

Keywords: Air pollution, Anemia prediction, Environmental quality, Low-middle income countries, Machine learning, Modified poisson regression

Source

Scientific reports

Publication Type

Journal Article

Language

English

PubMed ID

41554788

MeSH terms

You rely on Guideline Central for transparency

Guideline Central and select third party use “cookies” on this website to enhance the user experience.

This technology helps us gather statistical and analytical information to optimize the relevant content for you.

The user also has the option to opt-out which may have an effect on the browsing experience.