BigQuery Covid

 

Create a BigQuery data source



Search for the Bigquery platform using search bar


 

Hit add data + to access datasets

 

 

Then look for a public dataset

 

 

Select Covid Dataset

 

 

Now view the datasets

 

 

Select NY Times 

 

 

The New York Times US Coronavirus Database

The New York Times

Data based on reports from state and local health agencies

VIEW DATASET 

OVERVIEWSAMPLES

Overview

This is the US Coronavirus data repository from The New York Times . This data includes COVID-19 cases and deaths reported by state and county. The New York Times compiled this data based on reports from state and local health agencies. More information on the data repository is available here . For additional reporting and data visualizations, see The New York Times’ U.S. coronavirus interactive site .

This public dataset is hosted in Google BigQuery and is included in BigQuery's 1TB/mo of free tier processing. This means that each user receives 1TB of free BigQuery processing every month, which can be used to run queries on this public dataset. Watch this short video to learn how to get started quickly using BigQuery to access public datasets. What is BigQuery .

This dataset has significant public interest in light of the COVID-19 crisis. All bytes processed in queries against this dataset will be zeroed out, making this part of the query free. Data joined with the dataset will be billed at the normal rate to prevent abuse. After September 15, queries over these datasets will revert to the normal billing rate.

Users of The New York Times public-use data files must comply with data use restrictions to ensure that the information will be used solely for noncommercial purposes.

Additional details

Samples

Try the sample queries below in the BigQuery UI.

Which US counties have the most confirmed cases per capita?

This query determines which counties have the most cases per 100,000 residents. Note that this may differ from similar queries of other datasets because of differences in reporting lag, methodologies, or other dataset differences. Run this query. 

How do I calculate the number of new COVID-19 cases per day?

This query determines the total number of new cases in each state for each day available in the dataset by subtracting the previous day's total from most recent day. Run this query. 

 

Click on  number of new COVID-19 cases per day

 

 

SELECT

 b.state_name,

 b.date,

 MAX(b.confirmed_cases - a.confirmed_cases) AS daily_confirmed_cases

FROM 

  (SELECT

    state_name AS state,

    state_fips_code ,

    confirmed_cases,

    DATE_ADD(date, INTERVAL 1 day) AS date_shift

  FROM

    `bigquery-public-data.covid19_nyt.us_states` 

  WHERE

    confirmed_cases + deaths > 0) a

JOIN

  `bigquery-public-data.covid19_nyt.us_states` b ON 

    a.state_fips_code = b.state_fips_code 

    AND a.date_shift = b.date

GROUP BY

  b.state_name, date

ORDER BY 

  date desc

 

Save results

 

 

Click on OPEN IN DRIVE

 

Change to OPEN WITH SHEETS

 

Now you have data 

 


No comments:

Post a Comment

Office hours tomorrow(Tuesday) 5:00pm-6:00pm, 4/26/2021, 5:13 PM, English, 4/26/2021, 5:13 PM

Your assigned language is: English Classroom blog: googleclouduconn.blogspot.com 4/26/2021, 5:13 PM Office hours tomorrow(Tuesday) 5...