Project
Analysis of Interrelationships between Sanitation, Child Malnutrition & Mortality in India


Home | LinkedIn | GitHub | Portfolio

This page acts as my project page for the PP434 | Automated Data Visualisation for Policymaking course, part of the MPP programme at LSE. The project is divided into 6 parts that analyse the impact of various indicators on child mortality in India and the policy response to it.

Part I: Introduction

Objectives: This project aims to analyse the relationship between child mortality outcomes in children (under five) across Indian states and various factors that may infleunce it such as malnutrition, women's education and improved access to sanitation (government intervention). The questions for analysis are "Which indicators of child and maternal health along with sanitation infleunce child mortality and to what extent?" and "How has India progressed on child health outcomes across time and regions"?.

Data: National Family Health Surveys (NFHS) from (NFHS) 1-5 (1992-2021) and the Census of India (2011). Option to use API is there in Open Data Government Platform India.

Challenges: Data cleaning challenges included handling missing values, inconsistent formats, and large datasets.NFHS 1 and 2 data are available only in PDFs which had to be scraped out. There were more than 135 columns in NFHS 5 dataset from which relevant data was extracted. Python tools like Pandas helped tidy the data efficiently and conducted correlation and regression analysis in Google Co-Lab. Links are given below in each part.
It was also difficult to get the exact geojson base map of India with correct sub-national and national administrative boundaries of India as new states and Union Territories have been formed. Simple Maps was used.

Part II: Relationship between Child Malnutrition & Access to Sanitation in India

Analytical Framework: Used NFHS-5 (2019-21) state-wise data. Bubble correlation charts (population-adjusted) and correlation analysis explore the relationship between child malnutrition variables and access to sanitation. Filtering the NFHS 5 dataset & estimating correlation - Google CoLab Project Data Analysis Notebook

Chart 1: Correlation Coefficient = -0.466.

Chart 2: Correlation Coefficient = -0.457.

Chart 3: Correlation Coefficient = -0.512.

Interpretation:A moderate negative correlation exists between child malnutrition and improved sanitation. Poor sanitation exacerbates tropical enteropathy, a condition causing chronic intestinal inflammation, disrupting nutrient absorption, impairing growth, and weakening immunity in children.

PartIII: Linking Child Mortality with Health and Education Indicators

Regression Model

Simple linear regression model is contructed:

U5MR_i = β_0 + β_1(Women with 10+ years of schooling_i) + β_2(Improved sanitation facility_i)+ β_3(Below normal BMI Women_i) + β_4(Stunted_i) + β_5(Wasted_i) + β_6(Underweight_i) + β_7(Children with diarrhoea taken to a health facility_i) + ϵ_i


Dependent Variable: U5MR_i: Under-five mortality rate for state_i, defined as deaths per 1000 live births for children under 5.
Independent Variables for state_i:Women’s education, sanitation, maternal health (below normal BMI), % of stunted, wasted, underweight children; and diarrhoea care access (proxy for health access and diarrhoea is 3rd leading cause of child mortality. Statista- Causes of Death of Children India
Coefficients:β_0, β_1, β_2, β_3, β_4, β_5, β_6 are parameters to be estimated.

Assumptions:Linear relationship. Errors are independently and identically distributed.

Analytical Framework: Simple OLS regression of the filtered NFHS 5 dataset and estimating the coefficients - Google CoLab Project Regression Notebook (OpenAI, 2025). Based on this regression, a trellis chart is created which depicts the regression of U5MR for state_i on independent variables that represent indicators of sanitation, child health, women's education and health.

Chart 4: Trellis chart with regression lines for different indicators against U5MR. Source:NFHS 5.

Interpretation:
R-squared (0.731): 73.1% of the variability in under-five mortality is explained by the model, indicating a good fit.
Significant Estimates: Four variables are significant at the 10% level (due to small sample size), on average:
- Women (15-49) with 10+ years of schooling (%): β_1 = -0.3311 Higher female education associated with reduced under-five mortality.
- Improved sanitation (%): β_2 = -0.2611 Greater access to sanitation associated with lower under-five mortality.
- Women with BMI <18.5 (%): β_3 = 1.3275 Poor maternal nutrition associated with increased under-five mortality.
- Stunted children (%): β_4 = 1.0098 Higher stunting rates associated with increased under-five mortality.

Limitations: Omitted variable bias and lack of state-specific or time-invariant fixed effects.

Part IV: Regional Disparities in Child Health Outcomes

Mapping and Ranking Regional Disparities

Identifying regional disparities in child health outcomes in India is crucial for identifying inequities, prioritizing resources, and designing targeted interventions address it.

Chart 5: Choropleth map shows state-wise U5MR disparities. Source:NFHS 5.

Chart 6: Histogram ranks states on child malnutrition indicators.. Source:NFHS 5.

Interpretation: 'Aspirational States'
A class of 'aspirational states'—Bihar, Uttar Pradesh, Madhya Pradesh, Gujarat, Jharkhand, and union territories like Dadra and Nagar Haveli and Daman and Diu—are rapidly progressing yet remain behind key policy targets. The state of Meghalaya is an outlier in stunting while performing better in other indicators, warranting further investigation.

Part V: Assessing India's Progress

India's Progress & Policy Response on Sanitation

Government interventions, such as the Total Sanitation Campaign (1999) and Swachh Bharat Mission (2014), have targeted open defecation and waste management.

Chart 7: Trellis plot showing India's progress in child health outcomes remained consistent in last 2 decades. Stunting% and Wasting% is not comparable across NFHS surveys due to difference in methodologies.

Chart 8: Histogram showing India's massive construction of househould toilets through Swacch Bharat Mission Phase I. Almost 50% of it is in 'Aspirational States'.

Part VI: Conclusion

Summary and Policy Recommendations

Women’s education and health significantly reduce under-five mortality, while improved sanitation reduces disease exposure, addressing tropical enteropathy and improving child survival. Stunted children face higher risks of disease-related mortality, and regional disparities persist in sanitation and nutrition, with 'aspirational states' catching up. The potential impact of other variables such as economic development (per capita state GDP), efficiency of public food distribution systems and availability of pediatricians on child mortality can be examined further. To meet SDGs on child health and sanitation by 2030, India should boost women’s education, especially in aspirational states, enhance sanitation quality and waste management, and strengthen maternal health alongside community nutrition programmes to reduce stunting and improve child survival.

Bibliography

Use of Open AI for generating the python code for regression analysis in Part III.