This project predicts restaurant revenue using operational, categorical, and location features.
- Data cleaning & preprocessing
- One-hot encoding & feature scaling
- Log transformation for skewed target
- Multiple regression models
- Random Forest with cross-validation
- Feature importance & residual analysis
- Best Model: Random Forest Regressor
- Cross-validated RΒ² β 0.89
- Order frequency is the primary driver of revenue
- Franchise presence, city, and category significantly impact revenue
- Follow the notebooks in order from exploratory analysis to final model.
β Best practical model: Random Forest
β Most influential feature: Order count
β Secondary drivers: franchise, city, category
- Python
- Pandas, NumPy
- Scikit-learn
- Matplotlib, Seaborn