Back to Portfolio

Indian Healthcare System Analysis

Comprehensive analysis of healthcare infrastructure, outcomes, and disparities across 707 districts in India using National Family Health Survey (NFHS-5) data, uncovering critical gaps and policy insights.

Python SQL Power BI Healthcare
Indian Healthcare Analysis Dashboard showing district-level health indicators
707 Districts Analyzed
36 States & UTs
80+ Health Indicators
6.4L Households Surveyed

Overview & Problem Statement

India's healthcare system serves 1.4 billion people across vastly different geographies, economic conditions, and cultural contexts. Despite significant progress in improving health outcomes over the past two decades, stark disparities persist between urban and rural areas, between states, and across socioeconomic groups.

This project leverages the NFHS-5 (2019-21) dataset, India's most comprehensive household-level health survey, to analyze healthcare infrastructure availability, immunization coverage, maternal health indicators, nutrition outcomes, and sanitation access at the district level. The goal is to identify the most critical gaps and generate actionable policy recommendations.

By combining SQL-based data processing with Power BI visualizations, the analysis creates an interactive dashboard that enables policymakers and researchers to explore health disparities at granular geographic levels.

Methodology

A multi-stage analytical pipeline was developed to transform raw NFHS-5 survey data into actionable insights through rigorous cleaning, analysis, and visualization.

1

Data Extraction

Extracted district-level indicators from NFHS-5 datasets covering maternal health, child health, nutrition, family planning, and healthcare infrastructure across all 707 districts.

2

SQL Processing

Used SQL queries to aggregate, join, and transform multi-table survey data. Created composite health indices by combining related indicators with weighted scoring.

3

Statistical Analysis

Applied Python (Pandas, NumPy, SciPy) for correlation analysis between infrastructure availability and health outcomes, clustering districts by health performance profiles.

4

Dashboard Development

Built interactive Power BI dashboards with drill-down capabilities from national to state to district level, featuring heat maps, scatter plots, and trend comparisons.

Key Findings

  • Full immunization coverage varies dramatically from 33.4% in Nagaland to 90.5% in Odisha, with a national average of 76.4%. Districts within the same state show up to 40 percentage point differences, indicating localized access barriers beyond state policy.
  • Institutional delivery rates have improved nationally to 88.6%, but 47 districts (predominantly in central and northeastern India) still report rates below 60%, correlating strongly with distance to the nearest Community Health Centre and road connectivity.
  • Child stunting (low height-for-age) remains above 40% in 162 districts, concentrated in a contiguous belt across Uttar Pradesh, Bihar, Jharkhand, Madhya Pradesh, and Gujarat. These districts also show the lowest dietary diversity scores.
  • States that increased public healthcare expenditure to above 8% of their budget (Kerala, Tamil Nadu, Rajasthan) consistently outperform on maternal mortality, neonatal mortality, and vaccination metrics, even after controlling for per-capita income.
  • Gender disparities in healthcare access persist: women in the lowest wealth quintile are 3.2x less likely to receive four or more antenatal care visits compared to the highest quintile, a gap that has narrowed only marginally since NFHS-4.
  • A cluster analysis reveals five distinct district health profiles, from "high performing" (mostly urban districts in southern and western states) to "critically underserved" (remote tribal and border districts), each requiring tailored intervention strategies.
Note: The full Power BI dashboard includes interactive maps, filter panels, and drill-through pages for detailed district-level exploration. Add your actual charts and dashboard screenshots here for the complete presentation.

Tools & Technologies

The project combines data engineering, statistical analysis, and business intelligence tools to deliver a comprehensive healthcare analytics solution.

Python Pandas NumPy SciPy SQL (PostgreSQL) Power BI DAX NFHS-5 Microdata Jupyter Notebooks Git / GitHub

Conclusion

This comprehensive district-level analysis of India's healthcare system reveals that while national averages show encouraging improvement trends, they mask severe intra-state and inter-district disparities. The geographic clustering of poor health outcomes in specific corridors suggests that targeted, district-level interventions would be more effective than state-level policy alone.

The strong correlation between public health expenditure and outcomes underscores the need for increased and better-directed health spending, particularly in the 162 districts identified as critically underserved. Nutrition programs, institutional delivery incentives, and last-mile immunization campaigns should be prioritized in these areas.

The Power BI dashboard created as part of this project provides a ready-to-use tool for health administrators and researchers to identify priority districts, track progress over time, and allocate resources more effectively across India's diverse healthcare landscape.