diff --git a/1. Postgres ETL/README.md b/1. Postgres ETL/README.md index 4fb7a80..7374171 100644 --- a/1. Postgres ETL/README.md +++ b/1. Postgres ETL/README.md @@ -5,7 +5,7 @@ This repo provides the ETL pipeline, to populate the sparkifydb database. * This source of data will be useful in helping Sparkify reach some of its analytical goals, for example, finding out songs that have highest popularity or times of the day which is high in traffic. -## Database Design and ETL Pipeline +## Database Design and ETL Pipeline Schema for Song Play Analysis --- * For the schema design, the STAR schema is used as it simplifies queries and provides fast aggregations of data. @@ -21,4 +21,4 @@ This repo provides the ETL pipeline, to populate the sparkifydb database. --- * First, run create_tables.py to create the data tables using the schema design specified. If tables were created previously, they will be dropped and recreated. -* Next, run etl.py to populate the data tables created. \ No newline at end of file +* Next, run etl.py to populate the data tables created.