HPC Data Hub

Phase 2 Release of COVID-19 and Sociodemographic Data

5/18/2020

Updated Weekly

The HPC Data Hub is a data service infrastructure of the Hopkins Population Center (HPC). It offers U.S. county-level COVID-19 and sociodemographic data needed for population-based social science, epidemiological, medical, and public health research that can inform evidence-based policy recommendations for curbing the pandemic.

Timely and effective data on social, economic, and health disparities are needed to respond appropriately to the pandemic as local situations change. Drawing from trusted sources, this data hub collects and manages county-level data on sociodemographic and health factors that influence the spread of COVID-19. The data hub also includes data on the status of COVID-19-related policies being rolled out across the country.

The Phase 2 release replaces the Phase 1 data and adds important new content. The data files and their corresponding dictionary files are bundled into 3 zip files in our GitHub repository (click to download the desired zip files):

  1. Pandemic.zip includes 4 datafiles in csv format and 1 dictionary file in csv format
    1. Daily data on COVID-19 tested and confirmed cases and deaths
    2. Daily data on human mobility and social distancing
    3. Timing data on state policy responses
    4. Preexisting health care capacity variables
  2. Prepandemic.zip includes 1 datafile in csv format and 1 dictionary file in xlsx format
    1. Existing health and health care disparity
    2. Individual tax filing, individual and household income brackets
    3. Population density per area and crowdedness per housing unit
    4. Demographic structure by age, gender and race-ethnicity
    5. Prevalence rates of diabetes, HIV, and smoking, conditions associated with more severe COVID-19 symptoms
  3. Unemployment.zip includes 1 datafile in csv format and 1 dictionary file in xlsx format
    1. Monthly unemployment rate and size of labor force from January 2019 to March 2020
    2. The county identity of spatial neighbors (for spatial analysis)
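
As a minimal sketch of working with one of these archives once it has been downloaded, the Python snippet below reads a csv file directly out of a zip file using only the standard library. The member name `example_cases.csv`, its columns, and its values are hypothetical placeholders, not the actual contents of Pandemic.zip; the dictionary file in each archive lists the real file and variable names.

```python
import csv
import io
import zipfile


def read_csv_from_zip(zip_source, member_name):
    """Return the rows of one csv member of a zip archive as a list of dicts.

    zip_source may be a path to a downloaded archive or a file-like object.
    """
    with zipfile.ZipFile(zip_source) as archive:
        with archive.open(member_name) as raw:
            # Zip members open in binary mode; wrap them for text decoding.
            text = io.TextIOWrapper(raw, encoding="utf-8", newline="")
            return list(csv.DictReader(text))


# Build a tiny in-memory archive so the sketch runs without any download.
# The file name, columns, and values are made up for illustration.
demo_zip = io.BytesIO()
with zipfile.ZipFile(demo_zip, "w") as archive:
    archive.writestr(
        "example_cases.csv",
        "fips,county,date,cases\n99999,Example County,2020-05-17,12\n",
    )
demo_zip.seek(0)

rows = read_csv_from_zip(demo_zip, "example_cases.csv")
print(rows[0]["county"], rows[0]["cases"])
```

With a real download, a local path such as `"Pandemic.zip"` would be passed in place of `demo_zip`. Note that `csv.DictReader` returns every value as a string, so numeric columns need explicit conversion before analysis.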

All data files include county names and FIPS codes so that Data Hub files can be merged with one another and with external files. The daily data in this Data Hub are scheduled for a routine update every Sunday.
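
A merge on FIPS codes might look like the following sketch, which uses only the Python standard library. The column names (`fips`, `cases`, `median_income`) and all values are illustrative assumptions, not actual Data Hub variables. One practical detail is that spreadsheet tools often strip the leading zero from codes such as `01001`, so the sketch pads every code to five digits before matching.

```python
def normalize_fips(code):
    """Return a county FIPS code as a five-character, zero-padded string."""
    return str(code).strip().zfill(5)


def merge_on_fips(left_rows, right_rows, key="fips"):
    """Inner-join two lists of dict rows on a normalized FIPS code.

    Assumes right_rows holds at most one row per county.
    """
    right_by_fips = {normalize_fips(row[key]): row for row in right_rows}
    merged = []
    for row in left_rows:
        fips = normalize_fips(row[key])
        match = right_by_fips.get(fips)
        if match is not None:
            # Right-hand columns are added to the left-hand row; the key is
            # stored once, in normalized form.
            merged.append({**row, **match, key: fips})
    return merged


# Toy rows standing in for a Data Hub file and an external file.
# All names and numbers here are made up for illustration.
hub_rows = [
    {"fips": "01001", "county": "County A", "cases": 10},
    {"fips": "24510", "county": "County B", "cases": 20},
]
external_rows = [
    {"fips": 1001, "median_income": 50000},  # leading zero was lost
    {"fips": "24510", "median_income": 45000},
]

merged = merge_on_fips(hub_rows, external_rows)
for row in merged:
    print(row["fips"], row["county"], row["cases"], row["median_income"])
```

Because the lookup keeps one right-hand row per county, a file with one row per county belongs on the right, while a daily file with many rows per county belongs on the left.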

The Data Hub team is currently working on Phase 3, which will focus on: 

  1. validating the mobility measures
  2. adding unemployment claims data
  3. adding further data on co-morbidities associated with COVID-19 by county

User registration and feedback

The success of the HPC Data Hub relies on users’ questions, feedback, and suggestions. The GitHub repository includes a registration form (so that we can inform you of data updates) and a feedback form. The HPC Data Hub team is committed to responding promptly to users’ questions and suggestions.

User Registration Form

User Feedback Form

The HPC Data Hub Team

Faculty: Dr. Qingfeng Li (lead), Dr. Alexandre White, Dr. Lingxin Hao

Students: Aditya Suru, Jiaolong He, Giuliana Nicolucci-Altman, Gwyneth Wei