Skip to content

newbiecoder-123/ETL_project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

20 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ETL_project

This project is for UCSD Extension data visualization class. The aim of this project is to perform ETL and merge two datasets - PPP loans and COVID-19 cases for California - to allow researchers to evaluate for a possible link between COVID-19 infection cases and receipt of PPP loans.

Step 1 - Extract the data

Searched Kaggle and downloaded four databases: A. PPP loan B. US county demographics C. US county and covid cases D. US county and corresponding zip codes

Step 2 - ERD

Create Entity Relationship Diagram.

Step 3 - Transform

Cleaned the data using python.

Step 4 - SQL

Created SQL tables.

Step 5 - Load

Connect to PostgreSQL and upload data to tables.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors