This project tackles the task of predicting house prices using machine learning and also includes a basic recommender system for houses based on features. The workflow consists of structured preprocessing, exploratory data analysis (EDA), feature engineering, ensemble modeling, and a content-based recommendation approach.
House Price Prediction/
βββ Datasets/
β βββ train.csv # Training dataset
β βββ test.csv # Test dataset for prediction
β βββ sample_submission.csv # Sample submission format
βββ Models/
β βββ best_gbr_model.pkl # Trained Gradient Boosting Regressor model
βββ Notebooks/
β βββ 01_data_loading_and_eda.ipynb # Data loading & EDA
β βββ 02_feature_engineering.ipynb # Feature engineering pipeline
β βββ 03_modeling_and_prediction_annotated.ipynb # Modeling and predictions
β βββ 04_recommender_system.ipynb # Recommender system notebook
β βββ submission_*.csv # Generated submission files
βββ Processed/
β βββ X_train.csv # Processed training features
β βββ X_test.csv # Processed test features
βββ submission_ensemble.csv # Final ensemble predictions
βββ submission_gbr.csv # GBR model predictions
- Gradient Boosting Regressor (GBR) β Trained and saved in
best_gbr_model.pkl - Ensemble Predictions β Combined multiple models for improved accuracy
Implemented a content-based recommender system in 04_recommender_system.ipynb, which suggests similar houses based on numerical and categorical features. It uses:
- Feature similarity (cosine distance or Euclidean distance)
- Normalization and vectorization of features
- Nearest neighbor retrieval
git clone https://github.com/your-username/house-price-prediction.git
cd house-price-predictionpip install -r requirements.txtLaunch Jupyter or VSCode and open the notebooks in the Notebooks/ directory.
Install dependencies from requirements.txt:
pip install -r requirements.txtMIT License β free to use with attribution.
Developed by [Your Name]. Contributions welcome!