← Back to portfolio
Data Science Salary Predictor
Predict expected salary based on job title, education level, and years of experience. Trained on 6,000+ real salary records using ensemble methods.
Estimated Annual Salary (USD)
Model Comparison (R² on test set)
Linear Regression
R² = 0.6514 · MAE = $23,449
Random Forest
R² = 0.9584 · MAE = $4,375
Gradient Boosting ✓
R² = 0.9598 · MAE = $4,576
Trained on 6,000+ salary records. Features: job title, education, years of experience.
Methodology & Limitations
- Data source: Kaggle Salary Dataset — 6,700 records with age, education, job title, and years of experience.
- Title normalization: Senior/Junior/Lead prefixes were stripped — seniority is captured by years of experience instead.
- Filtered to tech/data roles: Only titles related to data science, engineering, analytics, and software development are included (16 titles).
- Limitations: Does not account for location, company size, equity/bonus, negotiation, or industry. Predictions represent broad trends, not individual offers.
- Salary range in training data: $550 – $240,000 (median $133K). Outliers on both ends may skew predictions for edge cases.