Domain
Machine Learning & AI
Skill Profile
XGBoost, LightGBM, CatBoost: hyperparameter tuning, feature importance
Roles
2
where this skill appears
Levels
5
structured growth path
Mandatory requirements
10
the other 0 optional
Machine Learning & AI
Classical Machine Learning
3/17/2026
Choose your current level and compare expectations. The items below show what to cover to advance to the next level.
The table shows how skill depth grows from Junior to Principal. Click a row to see details.
| Role | Required | Description |
|---|---|---|
| Data Scientist | Required | Understands gradient boosting principles and its differences from random forest. Trains XGBoost and LightGBM models on tabular data with basic hyperparameter tuning. Interprets feature importance to explain model results. |
| ML Engineer | Required | Trains XGBoost/LightGBM/CatBoost models with default parameters. Understands gradient boosting concept. Uses feature importance for model analysis. |
| Role | Required | Description |
|---|---|---|
| Data Scientist | Required | Independently trains production-ready gradient boosting models with advanced tuning. Works with XGBoost, LightGBM, and CatBoost, selects the optimal framework. Configures early stopping, regularization, and categorical features handling. |
| ML Engineer | Required | Performs hyperparameter tuning for gradient boosting (learning_rate, max_depth, n_estimators, regularization). Handles categorical features (CatBoost native, target encoding). Configures early stopping and cross-validation. Analyzes SHAP values. |
| Role | Required | Description |
|---|---|---|
| Data Scientist | Required | Designs gradient boosting systems for production: incremental learning, model compression, ONNX export. Optimizes inference speed through tree pruning and quantization. Implements multi-output boosting and custom objective functions for specific business tasks. |
| ML Engineer | Required | Designs production gradient boosting systems. Optimizes inference speed (model pruning, quantization). Builds ensembles from multiple gradient boosting models. Integrates with feature store and model serving. |
| Role | Required | Description |
|---|---|---|
| Data Scientist | Required | Defines gradient boosting standards for the data science team. Establishes reusable training pipelines for tabular data. Coordinates the choice between gradient boosting and deep learning for different task types and data. |
| ML Engineer | Required | Defines gradient boosting usage strategy in ML organization. Evaluates gradient boosting vs deep learning for tabular data. Creates AutoML pipeline. |
| Role | Required | Description |
|---|---|---|
| Data Scientist | Required | Shapes gradient boosting usage strategy at ML platform level. Defines architectural decisions for scalable GBM model training and serving. Evaluates new approaches: differentiable trees, neural-boosting hybrids. |
| ML Engineer | Required | Defines tabular ML strategy for the organization. Researches novel gradient boosting approaches. Publishes results at conferences. |