Read this in 中文 (Chinese)
A demo for a machine learning data analysis platform – not overly powerful, but designed to allow users to train and evaluate common ML models without any coding. Simply upload your data, select a model, and get professional visualizations and metrics analysis in seconds. This demo isn't exhaustive, and we encourage users to modify it to suit their specific needs.
- 🚀 No-code Interface: User-friendly interface for model selection and configuration
- 📊 Data Visualization: Automatically generates model performance visualizations
- 📈 Feature Analysis: Provides feature importance analysis
- 🔄 Multiple ML Models: Supports various classification and regression models
- 📁 Data Flexibility: Works with Excel (.xlsx, .xls) and CSV files
- 📱 Responsive Design: Works on both desktop and mobile devices
- Developed with Python 3.12 and pip (other environments not tested)
- Modern web browser (Chrome, Firefox, Edge, etc.)
-
Clone the repository
git clone https://github.com/somnifex/ml-data-analysis.git cd ml-data-analysis -
Install the required Python packages
pip install -r requirements.txt
-
Start the backend server
cd backend python main.py -
Open the frontend in your browser
-
Navigate to
frontend/index.htmlin your file explorer and open it with your browser -
Or run a simple HTTP server:
cd frontend python -m http.server 8000Then visit
http://localhost:8000in your browser
-
- Upload Data: Use the file upload button to upload your Excel or CSV data file
- Select Model: Choose from various ML models depending on your task type
- Configure Features: Select the feature columns and target variable
- Train Model: Click the "Start Training" button and wait for results
- Review Results: Explore the generated metrics and visualizations
- Neural Network Classifier
- Logistic Regression
- Random Forest Classifier
- Decision Tree Classifier
- Gradient Boosting Classifier
- Support Vector Machine Classifier
- K-Nearest Neighbors Classifier
- XGBoost Classifier (if installed)
- LightGBM Classifier (if installed)
- Neural Network Regressor
- Linear Regression
- Random Forest Regressor
- Decision Tree Regressor
- Gradient Boosting Regressor
- Support Vector Machine Regressor
- K-Nearest Neighbors Regressor
- Elastic Net Regression
- XGBoost Regressor (if installed)
- LightGBM Regressor (if installed)
ml-data-analysis/
├── backend/ # Backend API server
│ ├── main.py # FastAPI server entry point
│ ├── models.py # ML model implementations
│ └── processing.py # Data processing utilities
├── frontend/ # Frontend web interface
│ ├── index.html # Main HTML page
│ ├── script.js # JavaScript logic
│ └── assets/ # Images and resources
└── gendata.py # Data generation utility script
You can generate test data for various ML scenarios using the included script:
python gendata.pyThis will create Excel files with synthetic data for classification, regression, and clustering tasks in a ml_test_data directory.
This project is licensed under the GNU General Public License v3.0 - see LICENSE file for details.
- Built with FastAPI
- Uses scikit-learn and PyTorch for machine learning
- Visualization powered by Matplotlib and Seaborn