Data Hub

HPC Data Hub

Phase 4 Release of COVID-19 and Sociodemographic Data (4/26/2021)

(COVID-19 confirmed cases and deaths are updated weekly.)

The HPC Data Hub is a data service infrastructure of the Hopkins Population Center (HPC). The HPC Data Hub offers U.S. county-level data on COVID-19 and sociodemographic data necessary for population-based social science, epidemiological, medical and public health research to provide evidence-based policy recommendations for curbing the pandemic. 

Timely and effective data on social, economic and health disparities are needed to respond appropriately to the pandemic as local situations change. Drawing from trusted sources, this data hub collects and manages county-level data on sociodemographic and health factors that influence the spread of COVID-19. The data hub also includes data on the status of COVID-19 related policies being rolled out across the country. 

The Phase-4 release data replace the Phase-3 data with important additions (e.g., updated state policies; vaccine distribution and implementation; biweekly state patterns of pandemic patterns by race/ethnicity). The data files and the corresponding dictionary files are coupled in 3 zipped files at Click the arrow next to the “Code” tab and choose “download zip” which includes a folder for pandemic time-series data and two zip files: and

  1. Pandemic folder includes 4 datafiles in csv format and 1 dictionary file in csv format
    • Daily data on COVID-19 tested and confirmed cases and deaths (weekly updates)
    • Time-series data on human mobility and social distancing
    • Timing data on state policy responses
    • Preexisting health care capacity variables
    • Weekly data on vaccine allocation
  2. includes 1 datafile in csv format and 1 dictionary file in xlsx format, containing the following data
    • Existing health and health care disparity 
    • Individual tax filing, individual and household income brackets
    • Population density per area and crowdedness per housing unit
    • Demographic structure by age, gender and race-ethnicity
    • Prevalence rates of diabetes, HIV, and smoking, conditions associated with more severe COVID-19 symptoms
  3. Unemployment folder includes 1 datafile in csv format and 1 dictionary file in csv format
    • Monthly unemployment rate and size of labor force from January 2019 (monthly updates when its available at BLS)
    • The county identity of spatial neighbors (for spatial analysis)

All data files include county names and FIPS codes to facilitate data combination of Data Hub files and external files. The daily data in this Data Hub is scheduled to routine update every Sunday.

User registration and feedback

The success of HPC Data Hub relies on users’ questions, feedback, and suggestions. The Github repository includes a registration form (to inform you of data updates) and a feedback form. The HPC Data Hub team is devoted to timely responding to users’ questions and suggestions. 

User Registration Form

User Feedback Form

The HPC Data Hub Team

  • Phase 4
    • Faculty: Lingxin Hao
    • Student: Xingyun Wu
  • Phases 1-3
    • Faculty: Qingfeng Li (lead), Alexandre White, Lingxin Hao
    • Students: Xingyun Wu, Apoorv Dayal, Aditya Suru, Jiaolong He, Giuliana Nicolucci-Altman, Gwyneth Wei