📋 CSV-parser

A Python-based CSV data analysis pipeline that parses, cleans, validates, and analyzes tabular data. The project generates a structured, human-readable report with key statistics and insights.

Project Goals

Practice working with real-world CSV data
Build a full data pipeline using pure Python
Understand data validation, cleaning, and analysis
Generate automated analytical reports

Dataset

Source: Student Social Media & Relationships dataset
Format: CSV
Each row represents one student's anonymized survey response

Pipeline Overview

Load CSV data into Python dictionaries
Validate structure and detect missing values
Clean and convert data types
Perform statistical analysis
Generate a formatted text report
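
The first two steps above can be sketched with `csv.DictReader` from the standard library. This is a minimal illustration, not the code in main.py: the helper names `load_rows` and `find_missing`, the inline sample data, and the column names are all assumptions made for the example.

```python
import csv
import io

# Hypothetical sample standing in for Students.csv; column names are assumptions.
SAMPLE_CSV = """Student_ID,Country,Avg_Daily_Usage_Hours
1,India,5.2
2,,3.1
3,Canada,
"""


def load_rows(handle):
    """Read CSV text into a list of dictionaries, one per data row."""
    return list(csv.DictReader(handle))


def find_missing(rows, required):
    """Return (row_number, column) pairs where a required value is absent or blank."""
    missing = []
    for number, row in enumerate(rows, start=1):
        for column in required:
            value = row.get(column)
            if value is None or value.strip() == "":
                missing.append((number, column))
    return missing


rows = load_rows(io.StringIO(SAMPLE_CSV))
problems = find_missing(rows, ["Student_ID", "Country", "Avg_Daily_Usage_Hours"])
print(problems)
```

To read a real file instead of the inline sample, the same helper could be called as `load_rows(open("Students.csv", newline="", encoding="utf-8"))`; the encoding is an assumption about the file.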

🚀 Key Features

CSV parsing using the Python standard library
Data validation and missing-value detection
Safe type conversion with error handling
Statistical analysis (averages, top-N values)
Grouping by categorical fields (country)
Automated report generation
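
A rough sketch of how safe type conversion, averages, top-N values, and a text report might fit together. The function names, sample values, and report layout are hypothetical, and `collections.Counter` is used here for brevity; the project itself may count values differently.

```python
from collections import Counter


def to_float(value, default=None):
    """Convert a raw CSV string to float, returning `default` on failure."""
    try:
        return float(value)
    except (TypeError, ValueError):
        return default


def average(values):
    """Mean of the values that converted successfully, or None if there are none."""
    clean = [v for v in values if v is not None]
    return sum(clean) / len(clean) if clean else None


def top_n(items, n):
    """The n most frequent items together with their counts."""
    return Counter(items).most_common(n)


def format_report(mean_hours, top_items):
    """Build a small plain-text report (layout is an assumption)."""
    mean_text = "n/a" if mean_hours is None else f"{mean_hours:.2f}"
    lines = ["REPORT", f"Average daily usage (hours): {mean_text}", "Most common platforms:"]
    lines += [f"  {name}: {count}" for name, count in top_items]
    return "\n".join(lines)


# Hypothetical raw values as they might come out of the CSV.
raw_hours = ["4.0", "6.0", "", "n/a", "5.0"]
platforms = ["Instagram", "TikTok", "Instagram", "Facebook", "Instagram", "TikTok"]

hours = [to_float(v) for v in raw_hours]   # blank and "n/a" become None
mean_hours = average(hours)
top_platforms = top_n(platforms, 2)
report = format_report(mean_hours, top_platforms)
print(report)
```

Returning `None` for unparseable values, rather than 0, keeps bad cells from silently pulling the average down.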

Technologies Used

Python 3.10+
csv (standard library)
collections (defaultdict)
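
As an illustration of where `defaultdict` helps, grouping rows by country and averaging a numeric field could look roughly like this. The column names and the `average_by_group` helper are assumptions, not taken from main.py.

```python
from collections import defaultdict

# Hypothetical rows in the shape csv.DictReader would produce.
rows = [
    {"Country": "India", "Avg_Daily_Usage_Hours": "5.0"},
    {"Country": "Canada", "Avg_Daily_Usage_Hours": "3.0"},
    {"Country": "India", "Avg_Daily_Usage_Hours": "7.0"},
    {"Country": "Canada", "Avg_Daily_Usage_Hours": ""},  # unparseable, skipped
]


def average_by_group(rows, group_key, value_key):
    """Average `value_key` per distinct `group_key`, skipping rows that fail to parse."""
    groups = defaultdict(list)
    for row in rows:
        try:
            # Parse before touching `groups` so a bad row never creates an empty group.
            value = float(row[value_key])
            group = row[group_key]
        except (KeyError, TypeError, ValueError):
            continue
        groups[group].append(value)
    return {name: sum(values) / len(values) for name, values in groups.items()}


result = average_by_group(rows, "Country", "Avg_Daily_Usage_Hours")
print(result)
```

With `defaultdict(list)` there is no need to check whether a country has been seen before appending to its list.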

How to Run

Clone the repository
Ensure Python 3.10+ is installed
Run:
```
python main.py
```

Project Status

This is a learning mini-project, completed after the first month of my Machine Learning self-study plan. It was developed over several iterations, during which logical and analytical errors were found and corrected.

Possible Improvements

Refactor into multiple modules
Add pandas-based implementation
Add visualizations
Extend to machine learning tasks
