Skip to content

A rule-based automated EDA system that processes CSV datasets and generates an interactive analytics dashboard with data-quality diagnostics, statistical summaries, and visual insights to support exploratory decision-making.

Notifications You must be signed in to change notification settings

Prodasher966/AutoDash_AI_Project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

🚀 AutoDash AI

From a question to a working intelligence system

📌 The Idea (Journey)

It started with a simple but ambitious thought:

“What if I could upload any CSV file__, and a system would automatically understand it, analyze it, generate charts, and explain insights — without the user writing a single line of code?”

At the time, this sounded unrealistic.I wasn’t an AI engineer. I didn’t have advanced ML knowledge.But the idea stayed.

So instead of chasing buzzwords, I broke the problem down:

  • Can a system understand a dataset structurally?

  • Can it analyze columns automatically?

  • Can it generate meaningful visualizations without manual selection?

  • Can it explain insights for both technical and non-technical users?

AutoDash AI is the answer to those questions.

🧠 What is AutoDash AI?

AutoDash AI is an AI-assisted automatic exploratory data analysis (EDA) dashboard.

You upload a CSV file.The system takes over.

It:

  • Analyzes dataset structure

  • Detects column types

  • Handles missing values

  • Generates consistent visualizations

  • Produces layered insights:

    • Statistical (transparent & credible)

    • Human-readable (clear & actionable)

All through a clean Streamlit UI, with zero coding required from the user.

✨ Key Features

📂 CSV-Driven Workflow

  • Upload any CSV file from the UI

  • No configuration, no schema definition

🔍 Automatic Dataset Analysis

  • Row & column detection

  • Duplicate row identification

  • Column type inference

  • Missing value analysis

📊 Smart Visualizations

  • Histogram & boxplots for numeric columns

  • Category distributions for categorical columns

  • Uniform sizing & alignment

  • Clear titles with contextual meaning

  • Auto-saved plots for reuse

🤖 AI-Generated Insights (Phase 3)

Each insight is delivered in two layers:

  1. Human Interpretation – simple, actionable explanation

  2. Technical Explanation – method, statistics & logic (collapsible)

Insights are severity-ranked:

  • 🔴 Critical

  • 🟡 Warning

  • 🟢 Informational

🩺 Dataset Health Score

  • Overall score (0–100)

  • Reflects data quality, missing values, imbalance, and anomalies

🧾 Executive Summary

  • One-paragraph overview of dataset condition

  • Suitable for non-technical stakeholders

🗂️ Project Structure

AutoDash-AI/
│
├── app.py              # Streamlit UI (main user-facing dashboard)
├── main.py             # CLI / pipeline runner (engine-level execution)
├── test.py             # Optional testing & validation
│
├── eda_engine/
│   ├── __init__.py
│   ├── loader.py       # CSV loading & validation
│   ├── analyzer.py     # Dataset analysis logic
│   ├── visualizer.py   # Visualization engine
│   └── insights.py     # Insight & health-score engine
│
├── outputs/
│   └── plots/          # Auto-generated visualizations
│
├── requirements.txt
└── README.md

⚙️ How It Works (Behind the Scenes)

  1. Data Loading

    • CSV is validated and loaded safely
  2. Analysis Engine

    • Identifies numeric & categorical columns

    • Computes missing values, duplicates, distributions

  3. Visualization Engine

    • Chooses appropriate charts automatically

    • Ensures professional consistency (size, spacing, titles)

  4. Insight Engine

    • Detects anomalies, imbalance, missing-data risks

    • Assigns severity levels

    • Produces layered explanations

  5. UI Rendering

    • Streamlit displays metrics, insights, charts, summaries

▶️ How to Run the Project

1️⃣ Install Dependencies

pip install -r requirements.txt

2️⃣ Launch the Dashboard

streamlit run app.py

3️⃣ Upload a CSV File

Sit back. AutoDash AI handles the rest.

🎯 Why This Project Matters

AutoDash AI is not a toy dashboard.

It demonstrates:

  • End-to-end system thinking

  • Modular architecture

  • Real-world data handling

  • UX design for mixed audiences

  • Practical AI-assisted analytics (not hype)

This project is portfolio-worthy because it solves a real problem:

Turning raw data into understanding — automatically.

🚧 Current Status

  • Version: AutoDash AI (v1)

  • Fully functional

  • Feature-complete for its original vision

Future versions may expand into:

  • Time-series detection

  • Correlation analysis

  • PDF / report export

  • Domain-aware insights

But v1 stands strong on its own.

About

A rule-based automated EDA system that processes CSV datasets and generates an interactive analytics dashboard with data-quality diagnostics, statistical summaries, and visual insights to support exploratory decision-making.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages