The Power of Ensembles - Why Combine Models?
The intuition: many opinions beat one
If you ask one person for a guess, you can get a bad answer.
If you ask 100 people and average their guesses, the result is often better.
Ensembles do the same for models.
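This averaging effect is easy to simulate. A minimal sketch in plain Python, where the `guess` function and its noise level are invented for illustration:

```python
import random

random.seed(0)
truth = 100.0

def guess():
    """One 'guesser': an unbiased but noisy estimate of the truth."""
    return truth + random.gauss(0, 20)

# Error of a single guess vs. the average of 100 independent guesses
single = guess()
crowd = sum(guess() for _ in range(100)) / 100

print(abs(single - truth))  # typically around 20 (one noisy opinion)
print(abs(crowd - truth))   # typically around 2 (noise averages out)
```

Averaging 100 independent guesses shrinks the typical error by a factor of about 10 (the square root of 100), which is exactly the variance-reduction effect ensembles exploit.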
Why ensembles work
Ensembles improve performance through:
1) Variance reduction (bagging)
- high-variance models (like deep decision trees) can overfit
- averaging many trees reduces sensitivity to noise
2) Bias reduction (boosting)
- weak learners can underfit
- boosting adds learners that fix previous errors
3) Better decision boundaries (model diversity)
If the models make different mistakes, combining them cancels out the individual errors.
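The bagging idea in 1) can be sketched in a few lines. The data, the depth-1 "stump" base learner, and all parameter values below are invented for illustration:

```python
import random

random.seed(1)

# Toy 1-D regression data: y = x plus noise (invented for illustration)
X = [i / 10 for i in range(100)]
y = [x + random.gauss(0, 0.5) for x in X]

def fit_stump(xs, ys):
    """A tiny base learner: split at the median x, predict each side's mean."""
    split = sorted(xs)[len(xs) // 2]
    left = [yi for xi, yi in zip(xs, ys) if xi <= split]
    right = [yi for xi, yi in zip(xs, ys) if xi > split]
    left_mean = sum(left) / len(left)
    right_mean = sum(right) / len(right) if right else left_mean
    return lambda x: left_mean if x <= split else right_mean

def bag(xs, ys, n_models=50):
    """Bagging: fit each base learner on a bootstrap resample, then average."""
    models = []
    for _ in range(n_models):
        # Bootstrap: sample n indices with replacement
        idx = [random.randrange(len(xs)) for _ in range(len(xs))]
        models.append(fit_stump([xs[i] for i in idx], [ys[i] for i in idx]))
    return lambda x: sum(m(x) for m in models) / len(models)

ensemble = bag(X, y)
```

Each bootstrap sample sees a slightly different dataset, so each stump lands in a slightly different place; averaging their predictions smooths out that sensitivity to the particular sample.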
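A matching sketch of the boosting idea in 2): each round fits a weak learner to the residuals of the ensemble so far. Again, the data, candidate thresholds, and learning rate are invented for illustration:

```python
# Toy 1-D data: a step function the weak learners must build up in stages
X = [i / 100 for i in range(100)]
y = [0.0 if x < 0.3 else (1.0 if x < 0.7 else 2.0) for x in X]

def fit_stump(xs, ys):
    """Weak learner: try a few thresholds, keep the best single split."""
    best = None
    for t in [0.1 * k for k in range(1, 10)]:
        left = [yi for xi, yi in zip(xs, ys) if xi <= t]
        right = [yi for xi, yi in zip(xs, ys) if xi > t]
        lm, rm = sum(left) / len(left), sum(right) / len(right)
        err = sum((yi - (lm if xi <= t else rm)) ** 2
                  for xi, yi in zip(xs, ys))
        if best is None or err < best[0]:
            best = (err, t, lm, rm)
    _, t, lm, rm = best
    return lambda x: lm if x <= t else rm

def boost(xs, ys, rounds=30, lr=0.3):
    """Boosting: each new stump is fit to the current residuals."""
    models = []
    pred = [0.0] * len(ys)
    for _ in range(rounds):
        resid = [yi - pi for yi, pi in zip(ys, pred)]
        m = fit_stump(xs, resid)
        models.append(m)
        pred = [pi + lr * m(xi) for pi, xi in zip(pred, xs)]
    return lambda x: sum(lr * m(x) for m in models)

model = boost(X, y)
```

No single stump can represent a two-step function, but because each round targets what the previous rounds got wrong, the sum of many stumps can: this is the bias-reduction effect.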
```mermaid
flowchart LR
    M1[Model 1] --> C[Combine]
    M2[Model 2] --> C
    M3[Model 3] --> C
    C --> P[Better prediction]
```
What “diversity” means
Models should not all make the same mistakes.
How diversity is created:
- different subsamples of data (bagging)
- different feature subsets (random forests)
- sequential focus on errors (boosting)
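A toy illustration of why this matters. The predictions below are fabricated so that each model makes exactly two mistakes, but on different inputs:

```python
from collections import Counter

def vote(predictions):
    """Majority vote: the ensemble is right whenever most members are."""
    return Counter(predictions).most_common(1)[0][0]

# Ground truth for six inputs, and three diverse models' predictions
truth   = [1, 1, 0, 0, 1, 0]
model_a = [1, 1, 0, 0, 0, 1]  # wrong on items 4 and 5
model_b = [0, 1, 0, 1, 1, 0]  # wrong on items 0 and 3
model_c = [1, 0, 1, 0, 1, 0]  # wrong on items 1 and 2

combined = [vote(p) for p in zip(model_a, model_b, model_c)]
accuracy = sum(c == t for c, t in zip(combined, truth)) / len(truth)
print(accuracy)  # 1.0: every mistake is outvoted by the other two models
```

Each individual model is only 67% accurate, yet the vote is perfect, because no mistake is shared. If all three models made the same two mistakes, the vote would inherit them.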
The tradeoff
Ensembles can be:
- less interpretable
- heavier (more compute)
But on many tabular problems, they are the strongest first choice.
Mini-checkpoint
Which seems more likely to generalize?
- one deep tree
- 200 trees averaged
(Usually the averaged forest.)
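You can check this intuition with a small simulation, using a 1-nearest-neighbour "memorizer" as a stand-in for a deep tree that fits the training noise exactly. All data and parameters here are invented:

```python
import random

random.seed(3)

# Noisy training data for the true function f(x) = x
train_x = [i / 50 for i in range(100)]
train_y = [x + random.gauss(0, 1.0) for x in train_x]

def one_nn(xs, ys):
    """'Deep tree' stand-in: memorizes the training data (1-nearest neighbor)."""
    pairs = list(zip(xs, ys))
    return lambda x: min(pairs, key=lambda p: abs(p[0] - x))[1]

def bagged_nn(xs, ys, n=200):
    """200 memorizers, each on a bootstrap resample, averaged."""
    models = []
    for _ in range(n):
        idx = [random.randrange(len(xs)) for _ in range(len(xs))]
        models.append(one_nn([xs[i] for i in idx], [ys[i] for i in idx]))
    return lambda x: sum(m(x) for m in models) / len(models)

single = one_nn(train_x, train_y)
forest = bagged_nn(train_x, train_y)

# Mean squared error against the noiseless truth f(x) = x
test_x = [i / 25 + 0.01 for i in range(50)]
err = lambda f: sum((f(x) - x) ** 2 for x in test_x) / len(test_x)

print(err(single))  # inherits the full noise of each memorized point
print(err(forest))  # noticeably lower: bootstrap averaging cancels noise
```

The single memorizer reproduces the noise of whichever training point is nearest, while the averaged ensemble blends several nearby points and cancels much of that noise, which is the forest's generalization advantage in miniature.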
