Supervised Learning: Regression & Classification
The workhorse algorithms of applied ML — ranked by practical usage.
Supervised learning covers the workhorse algorithms of applied ML — models that learn from labeled examples to predict a target. Below are the most popular families, ranked by practical usage, with what they’re best for and where they’re commonly applied.
- For tabular business data, start with LightGBM/CatBoost/XGBoost, Random Forest, and Logistic/Linear Regression.
- Gradient boosting is widely recognized as a leading practical method for structured prediction and has been prominent in ML competitions.
| # | Algorithm | Best for | Common fields |
|---|---|---|---|
| 1 | Gradient Boosted Trees: XGBoost, LightGBM, CatBoost, GBM | High-performing tabular prediction |
|
| 2 | Random Forest | Strong baseline, robust tabular modeling |
|
| 3 | Logistic Regression | Binary/multiclass classification, interpretable baseline |
|
| 4 | Linear Regression / Ridge / Lasso / Elastic Net | Numeric prediction, interpretable models |
|
| 5 | Decision Tree | Simple interpretable rules |
|
| 6 | Support Vector Machine / SVR | Smaller datasets, high-dimensional data |
|
| 7 | k-Nearest Neighbors | Simple similarity-based prediction |
|
| 8 | Naive Bayes | Fast text classification |
|
| 9 | Generalized Linear Models | Structured statistical modeling |
|
| 10 | Neural Networks / MLPs | Nonlinear supervised learning |
|