G1Joshi

catboost

CatBoost gradient boosting with categoricals. Use for tabular ML.

G1Joshi 9 2 Updated 3mo ago
GitHub

Install

npx skillscat add g1joshi/agent-skills/catboost

Install via the SkillsCat registry.

SKILL.md

CatBoost

CatBoost (Yandex) is arguably the easiest boosting library to use because it handles Categorical Features automatically and perfectly without tuning.

When to Use

  • Categorical Data: If you have many strings/IDs, CatBoost is king.
  • Default Params: Works incredibly well out of the box.

Core Concepts

Ordered Boosting

A technique to avoid target leakage (overfitting) during training.

Symmetric Trees

Builds balanced trees, which are faster at inference time.

Best Practices (2025)

Do:

  • Use pool: Pool() is efficient for data loading.
  • Use GPU: CatBoost's GPU implementation is highly optimized.

Don't:

  • Don't One-Hot Encode: Let CatBoost handle it natively.

References