Ensemble (stacked) — headline

Weighted F1
0.6203
Macro F1
0.2716
Accuracy
70.60%

Per-model comparison

Model Weighted F1 Macro F1 Accuracy
Ensemble (Stacked) ★ 0.6203 0.2716 70.60%
Random Forest 0.6334 0.3009 66.30%
Gradient Boosting 0.6199 0.2694 69.80%
XGBoost 0.5899 0.2998 55.40%

Class distribution

Class Owner range Tier Samples
0 ≤10K Common Indie 4,500
1 35K Niche 2,200
2 75K Growing 1,500
3 150K Established 1,000
4 350K Popular 504
5 ≥750K Breakout Hit 296

Metric definitions

Weighted F1

Harmonic mean of precision and recall, weighted by class support. Ranges 0–1; higher is better.

Macro F1

Unweighted average F1 across all classes. Useful for evaluating performance on minority classes.

Accuracy

Percentage of correctly classified games. A simple overall performance metric.

About the ensemble. The stacked model combines Random Forest, Gradient Boosting, and XGBoost via an XGBoost meta-learner — leveraging the complementary strengths of each base model.