This big data project will look at capabilities to run analytical queries on massive datasets. We will use the dataset in a SQL database for this project, and we'll need to ingest and modify the data. We'll use sales and Customer demographics data to perform analysis and answer the following questions:
In this project, we will be using Customer test, Individual test tables from this database.
Customer: This table contain all customer data related information.
Individual: This table contain all Individual data information.
Cust_Ind: This is the final table after joining and performing transformations on above tables.
- Create tables in Azure SQL database.
- Load data in Azure SQL Database.
- Connect tables in databricks and create dataframes.
- Join both the dataframes in databricks.
- Perform transformation and cleaning in databricks using pyspark.
- Load the data back in new table in SQL database.
- Connect data with PowerBI.
- Perform analytics on Sales and Customer demographics data Using PowerBI.


