Skip to content

jlnsmith2001/China_WQI_Analysis

Repository files navigation

China WQI Analysis

Data analysis case study using Spreadsheets and Tableau.

  • ✔️ Data Cleaning
  • ✔️ Analysis
  • ✔️ Visualizations
  • ✔️ Final README

DATA DISCLAIMER

This dataset was synthetically generated and does not reflect real-world data. It is meant for exploratory analysis only. Station names, coordinates, and measurements are entirely fictional and were algorithmically randomized within realistic boundaries.

Dataset source

China Water Pollution Monitoring Dataset

Business Task

Analyze water quality data from 2023 to identify which regions in China experienced the highest pollution levels, detect seasonal trends in pollution, and highlight which pollutants were most frequently elevated.

Visualizations include

  • Geographic map showcasing monitoring stations and their average levels of pollution
  • Heat map exploring seasonal trends among provinces
  • Bar graph revealing which pollutants had the highest instances of reaching unsafe levels of contamination
  • Column chart comparing which stations within specific provinces had the highest averages

Presentation and dataset

Powerpoint

Spreadsheet

Visualizations

Geographic map

Heat map

Bar graph

Column chart

Key Insights

  • Most polluted provinces in china included Wuhan, Dali, Yichang, Luoyang, and Zhengzhou
  • Spring and Winter tended to have higher levels of pollution
  • Phosphorous, Ammonia, And Total Nitrogen were the biggest contributors regarding pollution.
  • Stations in Zhengzhou and Yichang had high levels of Phosphorous throughout the entire year.

Tools and Skills used

Google Sheets: Data cleaning through formulas, filters, and helper columns

Google Slides: Thorough explanations and findings

Tableau: Interactive visualizations

Data Cleaning Summary

Created a copy of the raw dataset and cleaned it using Google Sheets. This process consisted of...

  • Filtering out empty/irrelevant rows
  • Renaming columns for clarity
  • Creating helper columns for thresholds and regional validation
  • Removing invalid data entries

This process enabled me to work with clean, structured data so I could focus my attention to answering my business task.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors