Skip to content

Latest commit

Β 

History

History
126 lines (94 loc) Β· 3.78 KB

File metadata and controls

126 lines (94 loc) Β· 3.78 KB

πŸ† AmazonML Price Prediction Project

An advanced machine learning project for predicting product prices using ensemble methods, meta-learning, and computer vision techniques. Achieved 50.12% SMAPE using sophisticated stacking approaches.

πŸš€ Quick Start

# Clone the repository
git clone <repository-url>
cd amazonml-price-prediction

# Install dependencies
pip install -r requirements.txt

# Run the best model (Meta-Learning)
cd models
python meta_learning_model.py

πŸ“ Project Structure

β”œβ”€β”€ models/                          # 🎯 Main optimized models (BEST)
β”‚   β”œβ”€β”€ meta_learning_model.py       # πŸ₯‡ Best: 50.12% SMAPE
β”‚   β”œβ”€β”€ neural_enhanced_model.py     # πŸ₯ˆ 50.45% SMAPE  
β”‚   └── computer_vision_model.py     # πŸ₯‰ ~50% SMAPE
β”‚
β”œβ”€β”€ experiments/                     # πŸ§ͺ Research & development
β”‚   β”œβ”€β”€ ensemble/                    # Ensemble approaches
β”‚   β”œβ”€β”€ optimization/                # Advanced optimizations
β”‚   └── legacy/                      # Earlier experiments
β”‚
β”œβ”€β”€ results/                         # πŸ“Š Model outputs & analysis
β”œβ”€β”€ scripts/                         # πŸ”§ Utility scripts
β”œβ”€β”€ src/                            # πŸ“š Core utilities
β”œβ”€β”€ dataset/                        # πŸ’Ύ Training/test data
└── image_cache/                    # πŸ–ΌοΈ Downloaded images (2,183 files)

πŸ† Model Performance

Model SMAPE Status Features
Meta-Learning 50.12% βœ… Best Advanced stacking + comprehensive features
Neural Enhanced 50.45% πŸ₯ˆ Second Deep learning + feature interactions
Computer Vision ~50% πŸ§ͺ Experimental Image features + text analysis
Ensemble Models 70-80% ❌ Need work Various ensemble attempts

🎯 Target: <48% SMAPE | πŸ“ˆ Best Achievement: 50.12% SMAPE

πŸ”§ Key Features

  • 🧠 Meta-Learning: Advanced stacking with 7 base models
  • πŸ–ΌοΈ Computer Vision: Real image feature extraction (2,183+ images)
  • πŸ“ NLP: Comprehensive text feature engineering
  • ⚑ Ensemble Methods: Multiple ensemble approaches tested
  • πŸ“Š Robust Validation: Cross-validation with proper SMAPE optimization

πŸ“‹ Requirements

  • Python 3.8+
  • pandas, numpy, scikit-learn
  • lightgbm, xgboost
  • PIL (for image processing)
  • See requirements.txt for full list

πŸš€ Usage Examples

Run Best Model

cd models
python meta_learning_model.py

Run Computer Vision Model

cd models  
python computer_vision_model.py

Resume Image Downloads

cd scripts
python download_computer_vision_images.py

πŸ“Š Results & Analysis

  • Training Data: 75,000 samples
  • Test Data: 75,000 samples
  • Image Dataset: 140,587 unique URLs (2,183+ downloaded)
  • Feature Engineering: 600+ features per model
  • Cross-Validation: 5-fold stratified

See results/ folder for detailed performance analysis.

πŸ§ͺ Experimental Work

The experiments/ folder contains extensive research:

  • Ensemble Methods: Gradient boosting combinations
  • Optimization: Advanced hyperparameter tuning
  • Legacy Models: Early development iterations

🀝 Contributing

  1. Fork the repository
  2. Create feature branch (git checkout -b feature/amazing-feature)
  3. Commit changes (git commit -m 'Add amazing feature')
  4. Push to branch (git push origin feature/amazing-feature)
  5. Open Pull Request

πŸ“„ License

This project is licensed under the MIT License - see the LICENSE file for details.

πŸ™ Acknowledgments

  • Advanced machine learning techniques
  • Meta-learning and stacking approaches
  • Computer vision for e-commerce
  • Ensemble method research

⭐ Star this repo if it helped you!