Note: The API is no longer functional. If you are interested in this data, please request individual level data. Thanks!
Here you can access aggregate data from the COVID-19 Beliefs, Behaviors & Norms Survey. There are two ways to access the data. First, downloading the data via simple online form, Second, there is a REST API for programmatically accessing the data. The data provided through these two endpoints contains of summary statistics of the dataset. For the full dataset containing individual level responses, please request the data through this link.
Please sign up for the mailing list for aggregate data users.
The first way to download aggregated data is through the form below. This returns a CSV file aggregated by gender, age group and country.
You can get the aggregate data for all measures, and all countries here.
The second way to get data is through the API. Note The API allows a better way of querying and slicing the data. The API contains many more functionalities and signals that are not available in the dropdown.
There are five parameters used for query,age
, gender
, education
, country
, signal
, and output
. country
is a required parameter.
Parameters | Description | Format |
---|---|---|
age | age range for the data | could be one of all ,20-30 ,31-40 ,41-50 ,51-60 ,61-70 , or 71-80 , default: all |
gender | gender of the users | male ,female ,other ,or unknown , default: all |
education | education level of the users | less_than_primary ,primary_school ,secondary_school ,college ,or graduate_school , default: all |
race | race/ethnicity of the users (US only, available after wave6) | american_indian_or_alaska_native , asian , black_or_african_american , native_hawaiian_or_other_pacific_islander , some_other_race , white , or hispanic , default: all |
country | use to query data for a country (two letter ISO code) | Should be one of ,
all ,AF ,AO ,AR ,AU ,AZ ,BD ,BO ,BR ,CA ,CI ,CL ,CM ,CO ,DE ,DZ ,EC ,EE ,EG ,ES ,FR ,GB ,GE ,GH ,GT ,HN ,ID ,IN ,IQ ,IT ,JM ,JP ,KE ,KH ,KR ,KZ ,LK ,MA ,MM ,MN ,MX ,MY ,MZ ,NG ,NL ,NP ,PE ,PH ,PK ,PL ,PT ,RO ,SD ,SG ,SN ,TH ,TR ,TT ,TW ,TZ ,UA ,UG ,AE ,US ,UY ,VE ,VN ,ZA .
A two letter ISO code, see this link for reference. Note: When country is set to all , the default for wave is all . |
signal | Specific signal you want from the data (defined below) | Could be one of all ,vaccine_accept ,mismatch_index ,community_risk_index ,measures_taken ,locations_would_attend ,distancing_familiarity ,effect_mask ,news_sources ,news_mediums ,health ,retail ,restaurants ,or worship . Multiple signals can be requested at once using e.g. {vaccine_accept,mismatch_index,restaurants} , default: all . |
wave | The survey was conducted in waves (two week periods). Weights are computed for users in a single wave once all the data is collected. wave1 is July 6 - July 20. wave2 is July 21 - August 3,
wave3 is Aug 3 - Aug 16, 2020,
wave4 is Aug 17 - Aug 30, 2020,
wave5 is Aug 31 - Sep 13, 2020,
wave6 is Sep 14 - Sep 27, 2020,
wave7 is Sep 28 - Oct 11, 2020,
wave8 is Oct 12 - Oct 25, 2020
wave9 is Oct 26 - Nov 8, 2020,
wave10 is Nov 9 - Nov 22, 2020
wave11 is Nov 23 - Dec 6, 2020
and so on. The survey is expected to continue until March 2021. Note: this parameter only applies for a subset of countries (US ,BR ,MX ,IN ,GB ,JP ,DE ,IT ,AR ,ID ,CO ,TR ,FR ,EG ,VN ,PH ,PK ,BD ,RO ,PL ,TH ,MY ,NG ) | wavei where i is 1,2,3,..., orall , default: all |
us_state | US state (2 letter code. Use this link for the mapping. |
Should be one of CA ,TX ,FL ,NY ,PA ,OH ,IL ,WA ,GA ,NC ,VA ,MI ,MA ,NJ ,IN ,MO ,AZ ,TN ,CO ,MD ,MN ,WI ,OR ,KY ,OK ,AL ,SC ,NV ,UT ,LA ,KS ,AR ,IA ,CT ,NM ,ID ,MS ,WV ,ME ,NE ,MT ,NH ,RI ,DE ,HI ,SD ,AK ,ND ,VT ,WY , default: None |
india_state | India state (2 letter code. Use this link for the mapping.) |
Should be one of MH ,UP ,WB ,KA ,DL ,TN ,GJ ,BR ,RJ ,TG ,KL ,AP ,MP ,HR ,OR ,JH ,PB ,AS ,UT ,JK ,CT ,HP ,CH ,GA ,MN ,TR ,ML ,AR ,NL ,SK ,MZ ,PY ,DD ,AN ,LD , default: None |
timeseries | boolean parameter which iterates over all waves and returns data for each wave. | not set by default, works if set to true |
output | format of the output | json , default: json |
This section provides the details on how the signals are obtained and what they mean.
Signal name | Description | Example output value |
---|---|---|
vaccine_accept | Responses to the question If a vaccine for COVID-19 becomes available, would you choose to get vaccinated? . (-1 in the response means the user saw the question but did not answer.) | "vaccine_accept": {
"weighted": {
"Yes": 0.676,
"-1": 0.029,
"Don't know": 0.184,
"No": 0.111
}
} |
norms_vaccine | Responses to the question Out of 100 people in your community, how many do you think would take a COVID-19 vaccine if it were made available? (only available wave9 onwards). |
"norms_vaccine": {
"weighted": 64.48364707608343
} |
density | Responses to the question Which of these best describes the area where you are currently
|
"density": {
"weighted": {
"City": 0.6585615765177799,
"Town": 0.176444429672886,
"Village or rural area": 0.1649939938093341
}
} |
flu_vaccine | Responses to the question Have you taken a flu vaccine this fall or do you plan to take
one in the coming weeks? (only available wave9 onwards, only fielded in North American countries). |
"flu_vaccine": {
"weighted": {
"Don't know": 0.045961573076853605,
"No": 0.29128329504646727,
"Yes": 0.6627551318766807
}
} |
future_vaccine | Responses to the question If a vaccine against COVID-19 infection is available in the market, would you take it? (only available wave9 onwards). |
"future_vaccine": {
"weighted": {
"No, definitely not": 0.10949627210730835,
"Probably": 0.1524073787589803,
"Probably not": 0.08056696679439905,
"Unsure": 0.2080238770599849,
"Yes, definitely": 0.4495055052793251
}
} |
future_vaccine_recommended | Responses to the question
Would you be more or less likely to take a vaccine against COVID-19 infection if it were
made available and recommended to you by each of the following
(only available wave9 onwards). |
"future_vaccine_recommended": {
"friends_and_family": {
"weighted": {
"Less Likely": 0.071,
"More Likely": 0.389,
"No Impact": 0.54
}
},
"government_health_officials": {
"weighted": {
"Less Likely": 0.148,
"More Likely": 0.469,
"No Impact": 0.384
}
},...
} |
work_industry | Responses to the question
What is the main activity of the business or organization in
which you were working before February 2020?
|
"work_industry": {
"weighted": {
"Agriculture": 0.011531212310096538,
"Buying and selling": 0.06271041558504216,
"Construction": 0.017675649241784653,
"Education": 0.1358951833575737,
"Tourism": 0.01542255113586681,
"Transportation": 0.01998329350087886
...
}
} |
mismatch_index | Average of the disagreement in personal beliefs of a user and their beliefs of others. This is based on responses to the questions: Q_individual How important is it for you to take actions to prevent the spread of COVID-19 in your community? (possible answers: Extremely important, Very important, Moderately important, Slightly important, Not important at all) and
Q_societal How important do other people in your community think it is to take actions to prevent the spread of COVID-19? . We first discretize the results (20 = Not important at all and 100 = Extremely important) and take the difference between a users responses Q_individual - Q_societal. mismatch_index is the mean of the distribution of the selected population. Interpretation: A value close to zero indicates agreement in beliefs in the society, a higher value indicates disbelief. | "mismatch_index": {"weighted": 19.031} |
community_risk_index | Based on response to the question: How dangerous do you think the COVID-19 risk is to your community? with possible answers: {Not at all dangerous, Slightly dangerous, Moderately dangerous, Very dangerous, Extremely dangerous}. The measure returns the fraction of users who respond to a particular option. |
"community_risk_index": {
"weighted": {
"Very dangerous": 0.366,
"Moderately dangerous": 0.242,
"Slightly dangerous": 0.139,
"Not at all dangerous": 0.036,
"-1": 0.004,
"Extremely dangerous": 0.213
}}
|
measures_taken | Based on response to the question: What measures have you taken to prevent infection from COVID-19 in the past week? (multiple choices can be selected). The measure returns the fraction of users who answered Yes/No for each choice. |
"measures_taken": {
"avoid_touching_face": {
"weighted": {
"Yes": 0.963,
"No": 0.037
}
},
"getting_the_flu_vaccine": {
"weighted": {
"Yes": 0.839,
"No": 0.161
}
},
"meter_distance": {
"weighted": {
"Yes": 0.658,
"No": 0.342
}
},
...
}
|
locations_would_attend | Based on response to the question: Which of the following businesses, locations, or events would you visit or attend in the coming two weeks if they were operating at full capacity? . The measure returns the fraction of users who answered Yes/No for each choice. |
"locations_would_attend": {
"health_care_facilities": {
"weighted": {
"Yes": 0.277,
"No": 0.723
}
},
"parks_and_beaches": {
"weighted": {
"Yes": 0.254,
"No": 0.746
}
},
"places_of_employment": {
"weighted": {
"Yes": 0.141,
"No": 0.859
}
}, ... } |
distancing_familiarity | Based on response to the question: Which of the following best describes your familiarity with the term 'physical distancing' during the COVID-19 pandemic? with possible answers: {I have not heard of it, I have heard of it but do not know what it means, I have heard of it and have some idea of what it means, I have heard of it and know what it means}. The measure returns the fraction of users who respond to a particular option. |
"distancing_familiarity": {
"weighted": {
"I have heard of it and have some idea of what it means": 0.125,
"I have heard of it and know what it means": 0.788,
"I have heard of it but do not know what it means": 0.045,
"I have not heard of it": 0.042
}
} |
effect_mask | Based on response to the question: How effective is wearing a face mask for preventing the spread of COVID-19? with possible answers: {Extremely effective, Very effective, Moderately effective, Slightly effective, Not effective at all}. The measure returns the fraction of users who respond to a particular option. |
"effect_mask": {
"weighted": {
"Extremely effective": 0.485,
"Moderately effective": 0.125,
"Not effective at all": 0.011,
"Slightly effective": 0.037,
"Very effective": 0.342
}
} |
news_sources | Based on response to the question: In the past week, from which of the following, if any, have you received news and information about COVID-19? Select all that apply. with possible answers: {government_health_authorities :Government health authorities or other officials, journalists :Journalists, local_health_workers :Local health workers, clinics, and community organizations, ordinary_people_i_dont_know_personally :Ordinary people I don't know personally, ordinary_people_i_know_personally :Ordinary people I know personally, politicians :Politicians, scientists :Scientists, doctors, and health experts, world_health_organization :World Health Organization (WHO)}. The measure returns the fraction of users who selected each of these choices. |
"news_sources": {
"government_health_authorities": {
"weighted": {
"No": 0.545,
"Yes": 0.455
}
},
"journalists": {
"weighted": {
"No": 0.417,
"Yes": 0.583
}
},
"local_health_workers": {
"weighted": {
"No": 0.726,
"Yes": 0.274
}
},...
} |
news_mediums | Based on response to the question: In the past week, from which of the following, if any, have you received news and information about COVID-19? Select all that apply. with possible answers: {online_sources : Online sources (websites, apps, social media), messaging_apps :Messaging apps / SMS/ text messaging, newspapers :Newspapers, television : Television, radio : Radio}. The measure returns the fraction of users who selected each of these choices. |
"news_mediums": {
"messaging_apps": {
"weighted": {
"No": 0.804,
"Yes": 0.196
}
},
"newspapers": {
"weighted": {
"No": 0.461,
"Yes": 0.539
}
},
"online_sources": {
"weighted": {
"No": 0.164,
"Yes": 0.836
}
},...
} |
news_sources_trust | Based on response to the question: How much do you trust each of the following as a source of COVID-19 news and information? with possible answers: {local_health_workers : Local health workers,clinics, and community organizations, scientists :Scientists, doctors, and health experts, government_health_authorities :Governmenthealth authorities or other officials, politicians : Politicians, journalists : Journalists, ordinary_people_i_do_not_know_personally : Ordinary people I don't know personally, ordinary_people_i_know_personally : Ordinary people I know personally}. The measure returns the fraction of users who selected each of these choices. |
"news_sources_trust": {
"government_health_authorities": {
"weighted": {
"Do not trust": 0.073,
"Somewhat trust": 0.369,
"Trust": 0.559
}
},
"journalists": {
"weighted": {
"Do not trust": 0.23,
"Somewhat trust": 0.515,
"Trust": 0.255
}
},
"local_health_workers": {
"weighted": {
"Do not trust": 0.078,
"Somewhat trust": 0.38,
"Trust": 0.542
}
},...
} |
news_mediums_trust | Based on response to the question: How much do you trust each of the following as a source of COVID-19 news and information? with possible answers: {online_sources : Online sources (websites, apps,social media), messaging_apps : Messaging apps / SMS/ textmessaging, newspapers : Newspapers, television : Television, radio : Radio. The measure returns the fraction of users who selected each of these choices. |
"news_mediums_trust": {
"messaging_apps": {
"weighted": {
"Do not trust": 0.254,
"Somewhat trust": 0.513,
"Trust": 0.233
}
},
"newspapers": {
"weighted": {
"Do not trust": 0.074,
"Somewhat trust": 0.396,
"Trust": 0.53
}
},
"online_sources": {
"weighted": {
"Do not trust": 0.124,
"Somewhat trust": 0.573,
"Trust": 0.303
}
},...
} |
prevention_distancing | Based on response to the question: How often are you able to stay at least 1 meteraway from people not in your household? with possible answers: [Never, Rarely, Sometimes, Often, , and Always
The measure returns the fraction of users who selected each of these choices. |
"prevention_distancing": {
"weighted": {
"Always": 0.3450418847766386,
"Never": 0.04772093455516377,
"Often": 0.3305687100781024,
"Rarely": 0.11617205426774309,
"Sometimes": 0.16049641632235195
}
}
|
prevention_mask | Based on response to the question: How often are you able to wear a mask or face coveringwhen you are in public? with possible answers: [Never, Rarely, Sometimes, Often, , and Always
The measure returns the fraction of users who selected each of these choices. This question wasonly added starting in wave 3 on 8/4/2020 at 2:15PM ET. |
"prevention_mask": {
"weighted": {
"Always": 0.7769272900510984,
"Never": 0.008249041768208728,
"Often": 0.15339189557185529,
"Rarely": 0.027011776599326836,
"Sometimes": 0.03441999600950834
}
}
|
health ,retail ,restaurants ,worship | Based on response to the question(s): Compared to a [location] with no precautions at all, are you more or less likely to visit a restaurant with the following precautions? , where location could be health ,retail ,restaurants ,worship . Possible answers of this question are: {More likely to visit, Doesn't affect my actions, Less likely to visit}. The measure returns the fraction of users who selected each of these choices. |
"restaurants": {
"everyone_has_to_pass_a_temperature_check": {
"weighted": {
"Doesn't affect my actions": 0.175,
"More likely to visit": 0.506,
"-1": 0.21,
"Less likely to visit": 0.109
}
},
"operating_at_limited_capacity": {
"weighted": {
"Doesn't affect my actions": 0.151,
"More likely to visit": 0.361,
"-1": 0.106,
"Less likely to visit": 0.383
}
},
...
}
|
Example 1. Retrieve the data for a specific signal
(say, locations people would attend locations_would_attend
) in a specific country
, age
group for all gender
s using the
http://p5000-covidsurvey.mit.edu.ezproxy.canberra.edu.au/query
end point
This could be done using the link http://p5000-covidsurvey.mit.edu.ezproxy.canberra.edu.au/query?age=20-30&gender=all&country=US&signal=locations_would_attend
would return the response:
Using the link http://p5000-covidsurvey.mit.edu.ezproxy.canberra.edu.au/query?age=20-30&gender=all&country=US&output=csv&signal=locations_would_attend
would return the same response as a csv file.
Example 2. Retrieve the data for multiple signal
s (say, vaccine_accept
, mismatch_index
, and community_risk_index
) in all countries
, for a specific age
group and gender
This could be done using the link http://p5000-covidsurvey.mit.edu.ezproxy.canberra.edu.au/query?age=31-40&gender=female&country=all&signal={vaccine_accept,mismatch_index,community_risk_index}
would return the response:
xxxxxxxxxx
import requests
import json
# request data from api
response = requests.get("http://p5000-covidsurvey.mit.edu.ezproxy.canberra.edu.au/query?age=20-30&gender=female&country=US&signal=vaccine_accept").text
#convert json data to dic data for use!
jsonData = json.loads(response)
xxxxxxxxxx
library(tidyverse)
library(httr)
library(jsonlite)
# adding url
path <- "http://p5000-covidsurvey.mit.edu.ezproxy.canberra.edu.au/query?age=20-30&gender=female&country=US&signal=vaccine_accept"
# request data from api
request <- GET(url = path)
# make sure the content is encoded with 'UTF-8'
response <- content(request, as = "text", encoding = "UTF-8")
# now we can have a dataframe for use!
coviddata <- fromJSON(response, flatten = TRUE) %>% data.frame()
We hope this API is useful to different research groups. If you use the API and would like to cite it, please use:
Collis, A., Garimella, K., Moehring, A., Rahimian, M.A., Babalola, S., Gobat, N., Shattuck, D., Stolow, J., Eckles, D., and Aral, S. (2020). Global survey on COVID-19 beliefs, behaviors, and norms. Technical report, MIT Sloan School of Management
Credit: Website design taken from the UMD COVID-19 survey API.