This project is for the UCSD Extension data visualization class. Its aim is to perform ETL on two datasets, PPP loans and COVID-19 cases for California, and merge them so that researchers can evaluate a possible link between COVID-19 infection cases and receipt of PPP loans.
Searched Kaggle and downloaded four datasets: A. PPP loans; B. US county demographics; C. US county COVID-19 cases; D. US counties and their corresponding zip codes.
Created an entity relationship diagram (ERD).
Cleaned the data using Python.
Created SQL tables.
Connected to PostgreSQL and uploaded the data to the tables.
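
The steps above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the column names (zip, loan_amount, county, cases), the table names, and the sample rows are all made up for the example and do not reflect the Kaggle schemas. It also uses Python's built-in sqlite3 in-memory database as a stand-in so the sketch runs without a server; the project itself uses PostgreSQL, where a driver such as psycopg2 would play the same role.

```python
import sqlite3

# Made-up illustrative rows. Column names are assumptions, not the Kaggle schemas.
ppp_rows = [
    {"zip": "92101", "loan_amount": "150000"},
    {"zip": "92101 ", "loan_amount": "50000"},
    {"zip": "90001", "loan_amount": "75000"},
    {"zip": "", "loan_amount": "1000"},  # missing zip, dropped during cleaning
]
zip_county_rows = [
    {"zip": "92101", "county": "San Diego"},
    {"zip": "90001", "county": "Los Angeles"},
]
covid_rows = [
    {"county": "san diego ", "cases": "1200"},
    {"county": "Los Angeles", "cases": "3400"},
]


def clean_county(name):
    """Collapse whitespace and normalize capitalization so names match across tables."""
    return " ".join(name.split()).title()


def clean_ppp(rows):
    """Drop rows without a zip code and convert loan amounts to floats."""
    cleaned = []
    for row in rows:
        zip_code = row["zip"].strip()
        if not zip_code:
            continue
        cleaned.append((zip_code, float(row["loan_amount"])))
    return cleaned


def build_database(conn):
    """Create the (hypothetical) tables and load the cleaned rows."""
    # Simplification: one county per zip code. Real zip codes can span counties.
    conn.executescript(
        """
        CREATE TABLE zip_county (zip TEXT PRIMARY KEY, county TEXT NOT NULL);
        CREATE TABLE ppp_loans (zip TEXT NOT NULL, loan_amount REAL NOT NULL);
        CREATE TABLE covid_cases (county TEXT PRIMARY KEY, cases INTEGER NOT NULL);
        """
    )
    conn.executemany(
        "INSERT INTO zip_county VALUES (?, ?)",
        [(r["zip"].strip(), clean_county(r["county"])) for r in zip_county_rows],
    )
    conn.executemany("INSERT INTO ppp_loans VALUES (?, ?)", clean_ppp(ppp_rows))
    conn.executemany(
        "INSERT INTO covid_cases VALUES (?, ?)",
        [(clean_county(r["county"]), int(r["cases"])) for r in covid_rows],
    )
    conn.commit()


def loans_and_cases_by_county(conn):
    """Merge PPP loans with COVID-19 cases, using the zip-to-county table as the bridge."""
    query = """
        SELECT z.county, SUM(p.loan_amount) AS total_loans, c.cases
        FROM ppp_loans AS p
        JOIN zip_county AS z ON z.zip = p.zip
        JOIN covid_cases AS c ON c.county = z.county
        GROUP BY z.county, c.cases
        ORDER BY z.county
    """
    return conn.execute(query).fetchall()


conn = sqlite3.connect(":memory:")
build_database(conn)
result = loans_and_cases_by_county(conn)
for county, total_loans, cases in result:
    print(county, total_loans, cases)
```

The zip-to-county table is what makes the merge possible in this sketch: PPP loans are assumed to be keyed by zip code and COVID-19 cases by county, so loans are first mapped to a county and then summed before being joined to the case counts.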