Create a BigQuery data source
Search for the Bigquery platform using search bar
Hit add data + to access datasets
Then look for a public dataset
Select Covid Dataset
Now view the datasets
Select NY Times
The New York Times US Coronavirus Database
The New York Times
Data based on reports from state and local health agencies
VIEW DATASET
OVERVIEWSAMPLES
Overview
This is the US Coronavirus data repository from The New York Times . This data includes COVID-19 cases and deaths reported by state and county. The New York Times compiled this data based on reports from state and local health agencies. More information on the data repository is available here . For additional reporting and data visualizations, see The New York Times’ U.S. coronavirus interactive site .
This public dataset is hosted in Google BigQuery and is included in BigQuery's 1TB/mo of free tier processing. This means that each user receives 1TB of free BigQuery processing every month, which can be used to run queries on this public dataset. Watch this short video to learn how to get started quickly using BigQuery to access public datasets. What is BigQuery .
This dataset has significant public interest in light of the COVID-19 crisis. All bytes processed in queries against this dataset will be zeroed out, making this part of the query free. Data joined with the dataset will be billed at the normal rate to prevent abuse. After September 15, queries over these datasets will revert to the normal billing rate.
Users of The New York Times public-use data files must comply with data use restrictions to ensure that the information will be used solely for noncommercial purposes.
Additional details
Type: Datasets
Last updated: 1/14/21
Category: Datasets for COVID-19 research, Science & research
Dataset source: NYT COVID-19 US Dataset
Cloud service: BigQuery
Region: US/EU
Update frequency: Daily
Samples
Try the sample queries below in the BigQuery UI.
Which US counties have the most confirmed cases per capita?
This query determines which counties have the most cases per 100,000 residents. Note that this may differ from similar queries of other datasets because of differences in reporting lag, methodologies, or other dataset differences. Run this query.
How do I calculate the number of new COVID-19 cases per day?
This query determines the total number of new cases in each state for each day available in the dataset by subtracting the previous day's total from most recent day. Run this query.
Click on number of new COVID-19 cases per day
SELECT
b.state_name,
b.date,
MAX(b.confirmed_cases - a.confirmed_cases) AS daily_confirmed_cases
FROM
(SELECT
state_name AS state,
state_fips_code ,
confirmed_cases,
DATE_ADD(date, INTERVAL 1 day) AS date_shift
FROM
`bigquery-public-data.covid19_nyt.us_states`
WHERE
confirmed_cases + deaths > 0) a
JOIN
`bigquery-public-data.covid19_nyt.us_states` b ON
a.state_fips_code = b.state_fips_code
AND a.date_shift = b.date
GROUP BY
b.state_name, date
ORDER BY
date desc
Save results
Click on OPEN IN DRIVE
Change to OPEN WITH SHEETS
Now you have data
No comments:
Post a Comment