|
| 1 | +--- |
| 2 | +title: Contingency Tables |
| 3 | +parent: COVID Symptom Survey |
| 4 | +nav_order: 4 |
| 5 | +--- |
| 6 | + |
| 7 | +# Contingency Tables |
| 8 | +{: .no_toc} |
| 9 | + |
| 10 | +This documentation describes the fine-resolution contingency tables produced by |
| 11 | +grouping [COVID Symptom Survey](./index.md) individual responses by various |
| 12 | +demographic features: |
| 13 | + |
| 14 | +* [Weekly files](https://cmu.box.com/s/xwjulq0pteen52d4upni9ikagu7d8bl2) |
| 15 | +* [Monthly files](https://cmu.box.com/s/vh4gs3j541tt9pqn2pn72bktu0op8tpo) |
| 16 | + |
| 17 | +These contingency tables provide demographic breakdowns of COVID-related topics such as |
| 18 | +vaccine uptake and acceptance. They are more detailed than the |
| 19 | +[coarse aggregates reported in the COVIDcast Epidata API](../api/covidcast-signals/fb-survey.md), |
| 20 | +which are grouped only by geographic region. [Individual response data](survey-files.md) |
| 21 | +for the survey is available, but only to academic or nonprofit researchers who |
| 22 | +sign a Data Use Agreement, whereas these contingency tables are available to the |
| 23 | +general public. |
| 24 | + |
| 25 | +Important updates for data users, including corrections to data or updates on |
| 26 | +data processing delays, are posted as `OUTAGES.txt` in the directory where the |
| 27 | +data is hosted. |
| 28 | + |
| 29 | +## Table of contents |
| 30 | +{: .no_toc .text-delta} |
| 31 | + |
| 32 | +1. TOC |
| 33 | +{:toc} |
| 34 | + |
| 35 | +## Available Data Files |
| 36 | + |
| 37 | +We provide two types of data files, weekly and monthly. Users who need the most |
| 38 | +up-to-date data or are interested in timeseries should use the weekly files, |
| 39 | +while those who want to study groups with smaller sample sizes should use the |
| 40 | +monthly files. Because monthly aggregates include more responses, they have |
| 41 | +lower missingness when grouping by several features at a time. |
| 42 | + |
| 43 | +## Dates |
| 44 | + |
| 45 | +The included files provide estimates for various metrics of interest over a |
| 46 | +period of either a full epiweek (or [MMWR |
| 47 | +week](https://wwwn.cdc.gov/nndss/document/MMWR_week_overview.pdf), a |
| 48 | +standardized numbering of weeks throughout the year) or a full month. |
| 49 | + |
| 50 | +## Aggregation |
| 51 | + |
| 52 | +The aggregates are filtered to only include estimates for a particular group if |
| 53 | +that group includes 100 or more responses. Especially in the weekly aggregates, |
| 54 | +many of the state-level groups have been filtered out due to low sample size. In |
| 55 | +such cases, the state marginal files, which group by a single demographic of |
| 56 | +interest at a time, will likely provide more coverage. |
| 57 | + |
| 58 | +## File Format |
| 59 | + |
| 60 | +### Naming |
| 61 | + |
| 62 | +Each CSV is named as follows: |
| 63 | + |
| 64 | + {date}_{region}_{vars}.csv |
| 65 | + |
| 66 | +Dates are of the form `YYYYmmdd`. `date` refers to the first day of the time |
| 67 | +period survey responses were aggregated over, in the Pacific time zone (UTC - |
| 68 | +7). Unless noted otherwise, the time period is always a complete month or |
| 69 | +epiweek. `region` is the geographic level responses were aggregated over. At the |
| 70 | +moment, only nation-wide and state groupings are available. `vars` is a list all |
| 71 | +other grouping variables used in the aggregation, ordered alphabetically. |
| 72 | + |
| 73 | +### Columns |
| 74 | + |
| 75 | +Within a CSV, the first few columns are the grouping variables, ordered |
| 76 | +alphabetically. Each aggregate reports four columns (unrounded): |
| 77 | + |
| 78 | +* `val_<indicator name>`: the main value of interest, e.g., percent, average, or |
| 79 | + count, estimated using the [survey weights](weights.md) to better match state |
| 80 | + demographics |
| 81 | +* `se_<indicator name>`: the standard error of `val_<indicator name>` |
| 82 | +* `sample_size_<indicator name>`: the number of survey responses used to |
| 83 | + calculate `val_<indicator name>` |
| 84 | +* `represented_<indicator name>`: the number of people in the population that |
| 85 | + `val_<indicator name>` represents over all days in the given time period. This |
| 86 | + is the sum of [survey weights](./weights.md) for all survey responses |
| 87 | + used. |
| 88 | + |
| 89 | +All aggregates using the same set of grouping variables appear in a single CSV. |
| 90 | + |
| 91 | +## Indicators |
| 92 | + |
| 93 | +The files contain [weighted |
| 94 | +estimates](../api/covidcast-signals/fb-survey.md#survey-weighting) of percent of |
| 95 | +respondents who fulfill one or several criteria. Estimates are broken out by |
| 96 | +state, age, gender, race, and ethnicity. |
| 97 | + |
| 98 | +| Indicator | Description | Survey Item | |
| 99 | +| --- | --- | --- | |
| 100 | +| `pct_vaccinated` | Estimated percentage of respondents who have already received a COVID vaccine. <br/> **Earliest date available:** 2021-01-01 | V1 | |
| 101 | +| `pct_accepting` | Estimated percentage of respondents who would definitely or probably choose to get vaccinated, if a vaccine were offered to them today, among respondents who have not yet been vaccinated. <br/> **Earliest date available:** 2021-01-01 | V3 | |
| 102 | +| `pct_concerned_sideeffects` | Estimated percentage of respondents who are very or moderately concerned that they would "experience a side effect from a COVID-19 vaccination." (Asked of all respondents, including those who have already received one or more doses of a COVID-19 vaccine.) <br/> **Earliest date available:** 2021-01-01 | V9 | |
| 103 | +| `pct_hesitant_sideeffects` | Estimated percentage of respondents who are very or moderately concerned that they would "experience a side effect from a COVID-19 vaccination" *and* would "definitely not" or "probably not" get a COVID-19 vaccine if offered. <br/> **Earliest date available:** 2021-01-01 | V9 and V3 | |
| 104 | +| `pct_trust_fam` | Estimated percentage of respondents who would be more likely to get a COVID-19 vaccine if it were recommended to them by friends and family, among respondents who have not yet been vaccinated. <br/> **Earliest date available:** 2021-01-01 | V4 | |
| 105 | +| `pct_trust_healthcare` | Estimated percentage of respondents who would be more likely to get a COVID-19 vaccine if it were recommended to them by local health workers, among respondents who have not yet been vaccinated. <br/> **Earliest date available:** 2021-01-01 | V4 | |
| 106 | +| `pct_trust_who` | Estimated percentage of respondents who would be more likely to get a COVID-19 vaccine if it were recommended to them by the World Health Organization, among respondents who have not yet been vaccinated. <br/> **Earliest date available:** 2021-01-01 | V4 | |
| 107 | +| `pct_trust_govt` | Estimated percentage of respondents who would be more likely to get a COVID-19 vaccine if it were recommended to them by government health officials, among respondents who have not yet been vaccinated. <br/> **Earliest date available:** 2021-01-01 | V4 | |
| 108 | +| `pct_trust_politicians` | Estimated percentage of respondents who would be more likely to get a COVID-19 vaccine if it were recommended to them by politicians, among respondents who have not yet been vaccinated. <br/> **Earliest date available:** 2021-01-01 | V4 | |
| 109 | +| `pct_hesitant_trust_fam` | Estimated percentage of respondents who would be more likely to get a COVID-19 vaccine if it were recommended to them by friends and family, among respondents who have not yet been vaccinated *and* would "definitely not" or "probably not" get a COVID-19 vaccine if offered. <br/> **Earliest date available:** 2021-01-01 | V3 and V4 | |
| 110 | +| `pct_hesitant_trust_healthcare` | Estimated percentage of respondents who would be more likely to get a COVID-19 vaccine if it were recommended to them by local health workers, among respondents who have not yet been vaccinated *and* would "definitely not" or "probably not" get a COVID-19 vaccine if offered. <br/> **Earliest date available:** 2021-01-01 | V3 and V4 | |
| 111 | +| `pct_hesitant_trust_who` | Estimated percentage of respondents who would be more likely to get a COVID-19 vaccine if it were recommended to them by the World Health Organization, among respondents who have not yet been vaccinated *and* would "definitely not" or "probably not" get a COVID-19 vaccine if offered. <br/> **Earliest date available:** 2021-01-01 | V3 and V4 | |
| 112 | +| `pct_hesitant_trust_govt` | Estimated percentage of respondents who would be more likely to get a COVID-19 vaccine if it were recommended to them by government health officials, among respondents who have not yet been vaccinated *and* would "definitely not" or "probably not" get a COVID-19 vaccine if offered. <br/> **Earliest date available:** 2021-01-01 | V3 and V4 | |
| 113 | +| `pct_trust_politicians` | Estimated percentage of respondents who would be more likely to get a COVID-19 vaccine if it were recommended to them by politicians, among respondents who have not yet been vaccinated *and* would "definitely not" or "probably not" get a COVID-19 vaccine if offered. <br/> **Earliest date available:** 2021-01-01 | V3 and V4 | |
| 114 | + |
| 115 | +Note: CSVs for the month of January 2021 only use data from January 6-31 due to |
| 116 | +a [definitional change in a major vaccine item on January 6](./coding.md#new-items-2). |
| 117 | +Indicators based on [item V9 use data starting January 12](./coding.md#new-items-2). |
0 commit comments