Third-party data solutions for COVID-19
Source third-party data sets on the evolution of testing, vaccination, and treatment of COVID-19 on AWS Data Exchange.
AWS Data Exchange makes it easy to find, subscribe to, and use third-party data in the cloud. As part of our mission, we are sourcing third-party data to help academics, researchers, and the healthcare community triage COVID-19 related issues. As the nature of the COVID-19 pandemic and the evolution of testing, vaccination, and treatment of COVID-19 has progressed, the analytics and data needs of customers have also shifted.
Today, they are running analyses that incorporate testing data, vaccination administration data, human host outcomes data, and viral variant metagenomics. To better serve these customers, AWS Data Exchange now has data in the categories including of COVID-19 testing, viral variant data, host outcomes data, and vaccination status data, as well as nearly 40 other COVID data sets.
If you’re new to AWS Data Exchange, follow our step-by-step guides on how to subscribe to data sets and how you can automate the exporting of data from AWS Data Exchange into your own Amazon S3 bucket.
On-Demand Webinar
Using third-party healthcare data to accelerate drug discovery and clinical trial protocol design
In this virtual session, you'll hear from QIAGEN and IBM Watson Health thought leaders as they uncover prescriptive guidance on how healthcare and life sciences data sets can accelerate the bench-to-bedside cycle through enhanced visualizations, machine learning algorithms, and curated genomics data.
Explore data sources for COVID-19 research and development
-
Testing and Variant Sequencing
-
Host/Human Outcomes
-
Surveillance and Projections
-
Social Determinants and Outcomes
-
Testing and Variant Sequencing
-
COVID-19 Testing and Variant Sequencing
Case projection data that is filterable by geography and can be scaled to region, city, zip code, or census tracts.
COVID-19 Diagnostic Test Results
Since March 2020, Ovation has collected de-identified diagnostic test results in real time from testing facilities outside a normal hospital setting.
Vaccination Status/DistributionVaccination Status/Distribution
De-identified patient-level data on vaccination status, brand of vaccine, and other outcomes data.
IBM MarketScan Research Databases
The IBM® MarketScan® Research Databases are one of the largest and longest-running proprietary sources of US claims data currently available for conducting healthcare research.
-
Host/Human Outcomes
-
Host/Human COVID-19 Outcomes
Patient-level clinical outcomes data, diagnoses and comorbidities, lab values, medications, and clinical notes in some cases.
EHR Data: Data Dictionary
insightDB EHR / RWE Data -- Sample Structured and Unstructured Data with associated Data Description and Data Dictionary.
-
Surveillance and Projections
-
COVID-19 Testing Surveillance and C19 Projections
Data on testing results for COVID-19, testing modality, and some associated clinical data.
COVID-19 US state-by-state projections: non-commercial use
Regularly updated COVID-19 US state projections provided by The Institute for Health Metrics and Evaluation (IHME) at the University of Washington.
Daily Global & U.S. COVID-19 Cases & Testing Data (Enigma Aggregation)
Daily U.S and global cases, deaths and testing data related to COVID-19 provided at the country, U.S. state, and U.S county level.
Daily Global & U.S. COVID-19 Cases & Testing Data (Aggregated Data)
This dataset provides daily data on Covid-19 cases, deaths and testing at various geographical levels i.e. the country, U.S. states, and U.S counties.
-
Social Determinants and Outcomes
-
Social Determinants of Health/Patient Reported Outcomes
Data on socioeconomic status, employment, access to healthcare, health food, and other Social Determinants of Health (SDOH) factors, as well as patient-reported outcomes data associated with COVID-19 infections.
Herceptin Patient-Reported Outcomes Data set
This data set from Navigating Cancer contains side effects (aligned to PRO-CTCAE) for patients that are on Herceptin.
Medisafe in-app COVID 19 Vaccine Survey
In October 2020, Medisafe deployed an in-app survey to patients to identify likeliness of getting COVID-19 vaccine when it comes to market.
Diversity, Equity and Inclusion - Access to Healthcare
This product consists of data sets that relate to access to healthcare among diverse populations in the US and the world.
Customers Using AWS Data Exchange
“Our team of researchers are now analyzing trends in disease spread, its geography, and time evolution by leveraging datasets from the AWS COVID-19 data lake, combined with our own data, in order to better predict COVID epidemiology,”
- Jim Karkanias, VP of Data Science and IT, Chan Zuckerberg Biohub
“The Domo Coronavirus Tracker uses AWS Data Exchange to programmatically feed multiple data sets from different sources through a specialized Domo connector directly into our dashboard. AWS Data Exchange augments Domo’s existing library of more than 1,000 connectors, unlocking new sources of data insight for our team and customers without requiring new code for different APIs from each individual data source,”
-Ben Schein, VP of Data Curiosity, Domo
“When the technological tides shifted to face COVID-19, the availability of GPS data catalyzed our lab’s investigation into new problems. AWS Data Exchange made access to the data easy.”
-Sofia Hurtado, UT Austin System Level Design Group
The AWS COVID-19 data lake is a centralized repository of up-to-date and curated datasets focused on the spread and characteristics of the novel coronavirus (SARS-CoV-2).
Become a data provider
Make it easy to reach AWS customers with your data files, tables, and APIs on the AWS Data Exchange catalog.