This project explores the rich dataset of the Indian Premier League (IPL) using Python libraries like pandas, matplotlib, and seaborn. The goal is to uncover hidden insights from both match-level and player-level data.
matches.csv: Contains match-level data from IPL 2008 to 2023deliveries.csv: Ball-by-ball delivery-level data for each match
- Removed irrelevant columns (
umpires,method,result_margin,super_over, etc.) - Renamed
match_idand standardizedseason(e.g., 2007/08 → 2008) - Joined datasets using consistent IDs
- Number of matches per season
- Toss decision trends
- Teams with the most wins
- Top run scorers, six-hitters, and fours
- Strike rate and death-over dominance
- Bowlers with most wickets, best economy, and dot balls
- Most runs by a batter against a specific team
- Most dismissals by a bowler to a batsman
- Batting averages vs top teams
- Distribution of dismissal types shown with pie chart
- Grouped rare dismissals under “Others” for clarity
- Bar charts (Top batsmen, bowlers, matches per season)
- Pie chart (Dismissal types)
- Line plots & comparisons
- Heatmaps (optional)
- Python
- pandas
- matplotlib
- seaborn
- Jupyter Notebook
- 📦 IPL-DATA-ANALYSIS
- 📊 cleaned_matches.csv
- 📊 cleaned_deliveries.csv
- 📓 IPL DATA.ipynb
- 📄 README.md