MIT COVID-19 Survey Aggregate Data & API

COVID-19 Beliefs, Behaviors & Norms Survey

About

Note: The API is no longer functional. If you are interested in this data, please request individual level data. Thanks!

Here you can access aggregate data from the COVID-19 Beliefs, Behaviors & Norms Survey. There are two ways to access the data. First, downloading the data via simple online form, Second, there is a REST API for programmatically accessing the data. The data provided through these two endpoints contains of summary statistics of the dataset. For the full dataset containing individual level responses, please request the data through this link.

Please sign up for the mailing list for aggregate data users.

Online download

The first way to download aggregated data is through the form below. This returns a CSV file aggregated by gender, age group and country.

You can get the aggregate data for all measures, and all countries here.

Choose Age Group:
Choose Gender:
Choose Education:
Choose Country:

Choose Signal:

Through the API

The second way to get data is through the API. Note The API allows a better way of querying and slicing the data. The API contains many more functionalities and signals that are not available in the dropdown. There are five parameters used for query,age, gender, education, country, signal, and output. country is a required parameter.

Parameters	Description	Format
`age`	age range for the data	could be one of `all`,`20-30`,`31-40`,`41-50`,`51-60`,`61-70`, or `71-80`, default: `all`
`gender`	gender of the users	`male`,`female`,`other`,or `unknown`, default: `all`
`education`	education level of the users	`less_than_primary`,`primary_school`,`secondary_school`,`college`,or `graduate_school`, default: `all`
`race`	race/ethnicity of the users (US only, available after wave6)	`american_indian_or_alaska_native`, `asian`, `black_or_african_american`, `native_hawaiian_or_other_pacific_islander`, `some_other_race`, `white`, or `hispanic`, default: `all`
`country`	use to query data for a country (two letter ISO code)	Should be one of , `all`,`AF`,`AO`,`AR`,`AU`,`AZ`,`BD`,`BO`,`BR`,`CA`,`CI`,`CL`,`CM`,`CO`,`DE`,`DZ`,`EC`,`EE`,`EG`,`ES`,`FR`,`GB`,`GE`,`GH`,`GT`,`HN`,`ID`,`IN`,`IQ`,`IT`,`JM`,`JP`,`KE`,`KH`,`KR`,`KZ`,`LK`,`MA`,`MM`,`MN`,`MX`,`MY`,`MZ`,`NG`,`NL`,`NP`,`PE`,`PH`,`PK`,`PL`,`PT`,`RO`,`SD`,`SG`,`SN`,`TH`,`TR`,`TT`,`TW`,`TZ`,`UA`,`UG`,`AE`,`US`,`UY`,`VE`,`VN`,`ZA`. A two letter ISO code, see this link for reference. Note: When `country` is set to `all`, the default for `wave` is `all`.
`signal`	Specific signal you want from the data (defined below)	Could be one of `all`,`vaccine_accept`,`mismatch_index`,`community_risk_index`,`measures_taken`,`locations_would_attend`,`distancing_familiarity`,`effect_mask`,`news_sources`,`news_mediums`,`health`,`retail`,`restaurants`,or `worship`. Multiple signals can be requested at once using e.g. `{vaccine_accept,mismatch_index,restaurants}`, default: `all`.
`wave`	The survey was conducted in waves (two week periods). Weights are computed for users in a single wave once all the data is collected. `wave1` is July 6 - July 20. `wave2` is July 21 - August 3, `wave3` is Aug 3 - Aug 16, 2020, `wave4` is Aug 17 - Aug 30, 2020, `wave5` is Aug 31 - Sep 13, 2020, `wave6` is Sep 14 - Sep 27, 2020, `wave7` is Sep 28 - Oct 11, 2020, `wave8` is Oct 12 - Oct 25, 2020 `wave9` is Oct 26 - Nov 8, 2020, `wave10` is Nov 9 - Nov 22, 2020 `wave11` is Nov 23 - Dec 6, 2020 and so on. The survey is expected to continue until March 2021. Note: this parameter only applies for a subset of countries (`US`,`BR`,`MX`,`IN`,`GB`,`JP`,`DE`,`IT`,`AR`,`ID`,`CO`,`TR`,`FR`,`EG`,`VN`,`PH`,`PK`,`BD`,`RO`,`PL`,`TH`,`MY`,`NG`)	`wavei` where i is 1,2,3,..., or`all`, default: `all`
`us_state`	US state (2 letter code. Use this link for the mapping.	Should be one of `CA`,`TX`,`FL`,`NY`,`PA`,`OH`,`IL`,`WA`,`GA`,`NC`,`VA`,`MI`,`MA`,`NJ`,`IN`,`MO`,`AZ`,`TN`,`CO`,`MD`,`MN`,`WI`,`OR`,`KY`,`OK`,`AL`,`SC`,`NV`,`UT`,`LA`,`KS`,`AR`,`IA`,`CT`,`NM`,`ID`,`MS`,`WV`,`ME`,`NE`,`MT`,`NH`,`RI`,`DE`,`HI`,`SD`,`AK`,`ND`,`VT`,`WY`, default: `None`
`india_state`	India state (2 letter code. Use this link for the mapping.)	Should be one of `MH`,`UP`,`WB`,`KA`,`DL`,`TN`,`GJ`,`BR`,`RJ`,`TG`,`KL`,`AP`,`MP`,`HR`,`OR`,`JH`,`PB`,`AS`,`UT`,`JK`,`CT`,`HP`,`CH`,`GA`,`MN`,`TR`,`ML`,`AR`,`NL`,`SK`,`MZ`,`PY`,`DD`,`AN`,`LD`, default: `None`
`timeseries`	boolean parameter which iterates over all waves and returns data for each wave.	not set by default, works if set to `true`
`output`	format of the output	`json`, default: `json`

Signal Definitions

This section provides the details on how the signals are obtained and what they mean.

Signal name	Description	Example output value
`vaccine_accept`	Responses to the question `If a vaccine for COVID-19 becomes available, would you choose to get vaccinated?`. (-1 in the response means the user saw the question but did not answer.)	`"vaccine_accept": { "weighted": { "Yes": 0.676, "-1": 0.029, "Don't know": 0.184, "No": 0.111 } }`
`norms_vaccine`	Responses to the question `Out of 100 people in your community, how many do you think would take a COVID-19 vaccine if it were made available?` (only available wave9 onwards).	`"norms_vaccine": { "weighted": 64.48364707608343 }`
`density`	Responses to the question `Which of these best describes the area where you are currently`	`"density": { "weighted": { "City": 0.6585615765177799, "Town": 0.176444429672886, "Village or rural area": 0.1649939938093341 } }`
`flu_vaccine`	Responses to the question `Have you taken a flu vaccine this fall or do you plan to take one in the coming weeks?` (only available wave9 onwards, only fielded in North American countries).	`"flu_vaccine": { "weighted": { "Don't know": 0.045961573076853605, "No": 0.29128329504646727, "Yes": 0.6627551318766807 } }`
`future_vaccine`	Responses to the question `If a vaccine against COVID-19 infection is available in the market, would you take it?` (only available wave9 onwards).	`"future_vaccine": { "weighted": { "No, definitely not": 0.10949627210730835, "Probably": 0.1524073787589803, "Probably not": 0.08056696679439905, "Unsure": 0.2080238770599849, "Yes, definitely": 0.4495055052793251 } }`
`future_vaccine_recommended`	Responses to the question `Would you be more or less likely to take a vaccine against COVID-19 infection if it were made available and recommended to you by each of the following` (only available wave9 onwards).	`"future_vaccine_recommended": { "friends_and_family": { "weighted": { "Less Likely": 0.071, "More Likely": 0.389, "No Impact": 0.54 } }, "government_health_officials": { "weighted": { "Less Likely": 0.148, "More Likely": 0.469, "No Impact": 0.384 } },... }`
`work_industry`	Responses to the question `What is the main activity of the business or organization in which you were working before February 2020?`	`"work_industry": { "weighted": { "Agriculture": 0.011531212310096538, "Buying and selling": 0.06271041558504216, "Construction": 0.017675649241784653, "Education": 0.1358951833575737, "Tourism": 0.01542255113586681, "Transportation": 0.01998329350087886 ... } }`
`mismatch_index`	Average of the disagreement in personal beliefs of a user and their beliefs of others. This is based on responses to the questions: Q_individual `How important is it for you to take actions to prevent the spread of COVID-19 in your community?` (possible answers: Extremely important, Very important, Moderately important, Slightly important, Not important at all) and Q_societal `How important do other people in your community think it is to take actions to prevent the spread of COVID-19?`. We first discretize the results (20 = Not important at all and 100 = Extremely important) and take the difference between a users responses Q_individual - Q_societal. `mismatch_index` is the mean of the distribution of the selected population. Interpretation: A value close to zero indicates agreement in beliefs in the society, a higher value indicates disbelief.	`"mismatch_index": {"weighted": 19.031}`
`community_risk_index`	Based on response to the question: `How dangerous do you think the COVID-19 risk is to your community?` with possible answers: {Not at all dangerous, Slightly dangerous, Moderately dangerous, Very dangerous, Extremely dangerous}. The measure returns the fraction of users who respond to a particular option.	`"community_risk_index": { "weighted": { "Very dangerous": 0.366, "Moderately dangerous": 0.242, "Slightly dangerous": 0.139, "Not at all dangerous": 0.036, "-1": 0.004, "Extremely dangerous": 0.213 }}`
`measures_taken`	Based on response to the question: `What measures have you taken to prevent infection from COVID-19 in the past week?` (multiple choices can be selected). The measure returns the fraction of users who answered Yes/No for each choice.	`"measures_taken": { "avoid_touching_face": { "weighted": { "Yes": 0.963, "No": 0.037 } }, "getting_the_flu_vaccine": { "weighted": { "Yes": 0.839, "No": 0.161 } }, "meter_distance": { "weighted": { "Yes": 0.658, "No": 0.342 } }, ... }`
`locations_would_attend`	Based on response to the question: `Which of the following businesses, locations, or events would you visit or attend in the coming two weeks if they were operating at full capacity?`. The measure returns the fraction of users who answered Yes/No for each choice.	`"locations_would_attend": { "health_care_facilities": { "weighted": { "Yes": 0.277, "No": 0.723 } }, "parks_and_beaches": { "weighted": { "Yes": 0.254, "No": 0.746 } }, "places_of_employment": { "weighted": { "Yes": 0.141, "No": 0.859 } }, ... }`
`distancing_familiarity`	Based on response to the question: `Which of the following best describes your familiarity with the term 'physical distancing' during the COVID-19 pandemic?` with possible answers: {I have not heard of it, I have heard of it but do not know what it means, I have heard of it and have some idea of what it means, I have heard of it and know what it means}. The measure returns the fraction of users who respond to a particular option.	`"distancing_familiarity": { "weighted": { "I have heard of it and have some idea of what it means": 0.125, "I have heard of it and know what it means": 0.788, "I have heard of it but do not know what it means": 0.045, "I have not heard of it": 0.042 } }`
`effect_mask`	Based on response to the question: `How effective is wearing a face mask for preventing the spread of COVID-19?` with possible answers: {Extremely effective, Very effective, Moderately effective, Slightly effective, Not effective at all}. The measure returns the fraction of users who respond to a particular option.	`"effect_mask": { "weighted": { "Extremely effective": 0.485, "Moderately effective": 0.125, "Not effective at all": 0.011, "Slightly effective": 0.037, "Very effective": 0.342 } }`
`news_sources`	Based on response to the question: `In the past week, from which of the following, if any, have you received news and information about COVID-19? Select all that apply.` with possible answers: {`government_health_authorities`:Government health authorities or other officials, `journalists`:Journalists, `local_health_workers`:Local health workers, clinics, and community organizations, `ordinary_people_i_dont_know_personally`:Ordinary people I don't know personally, `ordinary_people_i_know_personally`:Ordinary people I know personally, `politicians`:Politicians, `scientists`:Scientists, doctors, and health experts, `world_health_organization`:World Health Organization (WHO)}. The measure returns the fraction of users who selected each of these choices.	`"news_sources": { "government_health_authorities": { "weighted": { "No": 0.545, "Yes": 0.455 } }, "journalists": { "weighted": { "No": 0.417, "Yes": 0.583 } }, "local_health_workers": { "weighted": { "No": 0.726, "Yes": 0.274 } },... }`
`news_mediums`	Based on response to the question: `In the past week, from which of the following, if any, have you received news and information about COVID-19? Select all that apply.` with possible answers: {`online_sources`: Online sources (websites, apps, social media), `messaging_apps`:Messaging apps / SMS/ text messaging, `newspapers`:Newspapers, `television`: Television, `radio`: Radio}. The measure returns the fraction of users who selected each of these choices.	`"news_mediums": { "messaging_apps": { "weighted": { "No": 0.804, "Yes": 0.196 } }, "newspapers": { "weighted": { "No": 0.461, "Yes": 0.539 } }, "online_sources": { "weighted": { "No": 0.164, "Yes": 0.836 } },... }`
`news_sources_trust`	Based on response to the question: `How much do you trust each of the following as a source of COVID-19 news and information?` with possible answers: {`local_health_workers`: Local health workers,clinics, and community organizations, `scientists`:Scientists, doctors, and health experts, `government_health_authorities`:Governmenthealth authorities or other officials, `politicians`: Politicians, `journalists`: Journalists, `ordinary_people_i_do_not_know_personally`: Ordinary people I don't know personally, `ordinary_people_i_know_personally`: Ordinary people I know personally}. The measure returns the fraction of users who selected each of these choices.	`"news_sources_trust": { "government_health_authorities": { "weighted": { "Do not trust": 0.073, "Somewhat trust": 0.369, "Trust": 0.559 } }, "journalists": { "weighted": { "Do not trust": 0.23, "Somewhat trust": 0.515, "Trust": 0.255 } }, "local_health_workers": { "weighted": { "Do not trust": 0.078, "Somewhat trust": 0.38, "Trust": 0.542 } },... }`
`news_mediums_trust`	Based on response to the question: `How much do you trust each of the following as a source of COVID-19 news and information?` with possible answers: {`online_sources`: Online sources (websites, apps,social media), `messaging_apps`: Messaging apps / SMS/ textmessaging, `newspapers`: Newspapers, `television`: Television, `radio`: Radio. The measure returns the fraction of users who selected each of these choices.	`"news_mediums_trust": { "messaging_apps": { "weighted": { "Do not trust": 0.254, "Somewhat trust": 0.513, "Trust": 0.233 } }, "newspapers": { "weighted": { "Do not trust": 0.074, "Somewhat trust": 0.396, "Trust": 0.53 } }, "online_sources": { "weighted": { "Do not trust": 0.124, "Somewhat trust": 0.573, "Trust": 0.303 } },... }`
`prevention_distancing`	Based on response to the question: `How often are you able to stay at least 1 meteraway from people not in your household?` with possible answers: `[Never, Rarely, Sometimes, Often,`, and `Always` The measure returns the fraction of users who selected each of these choices.	`"prevention_distancing": { "weighted": { "Always": 0.3450418847766386, "Never": 0.04772093455516377, "Often": 0.3305687100781024, "Rarely": 0.11617205426774309, "Sometimes": 0.16049641632235195 } }`
`prevention_mask`	Based on response to the question: `How often are you able to wear a mask or face coveringwhen you are in public?` with possible answers: `[Never, Rarely, Sometimes, Often,`, and `Always` The measure returns the fraction of users who selected each of these choices. This question wasonly added starting in wave 3 on 8/4/2020 at 2:15PM ET.	`"prevention_mask": { "weighted": { "Always": 0.7769272900510984, "Never": 0.008249041768208728, "Often": 0.15339189557185529, "Rarely": 0.027011776599326836, "Sometimes": 0.03441999600950834 } }`
`health`,`retail`,`restaurants`,`worship`	Based on response to the question(s): `Compared to a [location] with no precautions at all, are you more or less likely to visit a restaurant with the following precautions?`, where location could be `health`,`retail`,`restaurants`,`worship`. Possible answers of this question are: {More likely to visit, Doesn't affect my actions, Less likely to visit}. The measure returns the fraction of users who selected each of these choices.	`"restaurants": { "everyone_has_to_pass_a_temperature_check": { "weighted": { "Doesn't affect my actions": 0.175, "More likely to visit": 0.506, "-1": 0.21, "Less likely to visit": 0.109 } }, "operating_at_limited_capacity": { "weighted": { "Doesn't affect my actions": 0.151, "More likely to visit": 0.361, "-1": 0.106, "Less likely to visit": 0.383 } }, ... }`

API Request examples

Method: GET

Example 1. Retrieve the data for a specific signal (say, locations people would attend locations_would_attend) in a specific country, age group for all genders using the http://p5000-covidsurvey.mit.edu.ezproxy.canberra.edu.au/query end point

This could be done using the link http://p5000-covidsurvey.mit.edu.ezproxy.canberra.edu.au/query?age=20-30&gender=all&country=US&signal=locations_would_attend would return the response:

Using the link http://p5000-covidsurvey.mit.edu.ezproxy.canberra.edu.au/query?age=20-30&gender=all&country=US&output=csv&signal=locations_would_attend would return the same response as a csv file.

Example 2. Retrieve the data for multiple signals (say, vaccine_accept, mismatch_index, and community_risk_index) in all countries, for a specific age group and gender

This could be done using the link http://p5000-covidsurvey.mit.edu.ezproxy.canberra.edu.au/query?age=31-40&gender=female&country=all&signal={vaccine_accept,mismatch_index,community_risk_index} would return the response:

Method: Python

A simple example showing how to use this api in Python:


xxxxxxxxxx
import requests
import json
# request data from api
response = requests.get("http://p5000-covidsurvey.mit.edu.ezproxy.canberra.edu.au/query?age=20-30&gender=female&country=US&signal=vaccine_accept").text
#convert json data to dic data for use!
jsonData = json.loads(response)

Method: R

A simple example showing how to use this api in R:


xxxxxxxxxx
library(tidyverse)
library(httr)
library(jsonlite)
# adding url
path <- "http://p5000-covidsurvey.mit.edu.ezproxy.canberra.edu.au/query?age=20-30&gender=female&country=US&signal=vaccine_accept"
# request data from api
request <- GET(url = path)
# make sure the content is encoded with 'UTF-8'
response <- content(request, as = "text", encoding = "UTF-8")
# now we can have a dataframe for use!
coviddata <- fromJSON(response, flatten = TRUE) %>% data.frame()

Citing

We hope this API is useful to different research groups. If you use the API and would like to cite it, please use:

Collis, A., Garimella, K., Moehring, A., Rahimian, M.A., Babalola, S., Gobat, N., Shattuck, D., Stolow, J., Eckles, D., and Aral, S. (2020). Global survey on COVID-19 beliefs, behaviors, and norms. Technical report, MIT Sloan School of Management