This is my SQL training repository!
This project explores the NYC Yellow Taxi Trips 2022 dataset using SQL queries on Google Colab.
The goal is to gain insights from real-world transportation data while strengthening my SQL skills.
- Source:
bigquery-public-data.new_york_taxi_trips.tlc_yellow_trips_2022 - Size: ~500+ million records
- Fields: pickup & dropoff time/location, fare, tips, distance, payment type, etc.
Here are some of the data questions I answered using SQL:
- What is the average tip given by passengers during the year 2022?
- In which month was the average total_amount the highest in 2022?
- Display the top 5 vendors (based on vendor_id) with the highest average tip_amount.
- How many trips occurred in December 2022?
- Show the top 10 most frequently used pickup_location_ids. ...and many more. β
All queries can be found in the queries/ folder.