Name: Ml Ensemble
Author: kizabgd123

Ml Ensemble

Build production-grade ML ensembles for Kaggle competitions: stacking, blending, weighted averaging, and rank averaging. Use this skill when the user mentions "ensemble", "stacking", "blending", "combine models", "meta-learner", "level 2 model", "weighted average", "model fusion", "rank average", or asks how to combine LightGBM / XGBoost / CatBoost predictions. Always enforce proper OOF stacking to prevent leakage.

kizabgd1230 星标2026年4月5日

职业
分类: 机器学习

ML Ensemble Builder

Three ensemble strategies ranked by complexity and typical Kaggle gain. Choose based on time remaining and number of base models.

Strategy 1 — Weighted Average (fastest, +0.001–0.003 AUC typical)

Best when: < 3 models, limited time, models have similar CV scores.

import numpy as np

# Load OOF predictions (out-of-fold on train)
oof_lgbm   = np.load("oof/lgbm_oof.npy")
oof_xgb    = np.load("oof/xgb_oof.npy")
oof_cat    = np.load("oof/catboost_oof.npy")

# Load test predictions
test_lgbm  = np.load("preds/lgbm_test.npy")
test_xgb   = np.load("preds/xgb_test.npy")
test_cat   = np.load("preds/catboost_test.npy")

# Grid search weights on OOF
from sklearn.metrics import roc_auc_score
from itertools import product

best_score, best_weights = 0, (1/3, 1/3, 1/3)

for w1, w2 in product(np.arange(0, 1.05, 0.05), repeat=2):
    w3 = 1 - w1 - w2
    if w3 < 0:
        continue
    blend = w1 * oof_lgbm + w2 * oof_xgb + w3 * oof_cat
    score = roc_auc_score(y_train, blend)
    if score > best_score:
        best_score, best_weights = score, (w1, w2, w3)

w1, w2, w3 = best_weights
print(f"Best weights: LGBM={w1:.2f}  XGB={w2:.2f}  CAT={w3:.2f}")
print(f"Ensemble OOF AUC: {best_score:.5f}")

test_blend = w1 * test_lgbm + w2 * test_xgb + w3 * test_cat

Ml Ensemble

kizabgd1230 星标2026年4月5日

职业
分类: 机器学习

Strategy 1 — Weighted Average (fastest, +0.001–0.003 AUC typical)

Best when: < 3 models, limited time, models have similar CV scores.

import numpy as np # Load OOF predictions (out-of-fold on train) oof_lgbm = np.load("oof/lgbm_oof.npy") oof_xgb = np.load("oof/xgb_oof.npy") oof_cat = np.load("oof/catboost_oof.npy") # Load test predictions test_lgbm = np.load("preds/lgbm_test.npy") test_xgb = np.load("preds/xgb_test.npy") test_cat = np.load("preds/catboost_test.npy") # Grid search weights on OOF from sklearn.metrics import roc_auc_score from itertools import product best_score, best_weights = 0, (1/3, 1/3, 1/3) for w1, w2 in product(np.arange(0, 1.05, 0.05), repeat=2): w3 = 1 - w1 - w2 if w3 < 0: continue blend = w1 * oof_lgbm + w2 * oof_xgb + w3 * oof_cat score = roc_auc_score(y_train, blend) if score > best_score: best_score, best_weights = score, (w1, w2, w3) w1, w2, w3 = best_weights print(f"Best weights: LGBM={w1:.2f} XGB={w2:.2f} CAT={w3:.2f}") print(f"Ensemble OOF AUC: {best_score:.5f}") test_blend = w1 * test_lgbm + w2 * test_xgb + w3 * test_cat

Ml Ensemble

ML Ensemble Builder

Strategy 1 — Weighted Average (fastest, +0.001–0.003 AUC typical)

Ml Ensemble

ML Ensemble Builder

Strategy 1 — Weighted Average (fastest, +0.001–0.003 AUC typical)

Strategy 2 — Rank Averaging (robust to scale differences)

Strategy 3 — Stacking / Meta-Learner (most powerful, +0.002–0.008 AUC typical)

Diversity Checklist

Save & Version

Continuous Learning V2

Continuous Learning V2

Continuous Learning V2

Continuous Learning

Continuous Learning

Pytorch Patterns