
Regularization - Ridge and Lasso Regression

Why regularization exists

Regression models can overfit when:

  • there are many features
  • features are noisy
  • polynomial degree is high

Overfitting often shows up as:

  • very low training error
  • much higher validation/test error

Regularization adds a penalty that discourages overly complex solutions.
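The overfitting pattern above can be reproduced with a quick sketch (the polynomial degree, noise level, and sample sizes here are illustrative assumptions, not from the text):

```python
# Sketch: a high-degree polynomial fit shows low training error but
# much worse validation error (degree/noise are illustrative choices).
import numpy as np
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
X = rng.uniform(-1, 1, size=(40, 1))
y = np.sin(3 * X[:, 0]) + rng.normal(scale=0.3, size=40)

X_tr, X_val, y_tr, y_val = train_test_split(X, y, random_state=0)

# Degree-15 polynomial: far more flexible than the data warrants
model = make_pipeline(PolynomialFeatures(degree=15), LinearRegression())
model.fit(X_tr, y_tr)

print("train R²:", model.score(X_tr, y_tr))
print("val   R²:", model.score(X_val, y_val))
```

The training score looks excellent while the validation score lags behind, which is exactly the gap regularization aims to close.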

Ridge Regression (L2)

Ridge minimizes:

MSE + λ * Σ(wi²)

Effect:

  • shrinks coefficients toward 0
  • usually keeps all features (rarely exactly 0)
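The shrinking effect is easy to see by sweeping λ (via sklearn's `alpha`); the data and the alpha grid below are illustrative assumptions:

```python
# Sketch: larger alpha -> smaller Ridge coefficients (shrinkage).
import numpy as np
from sklearn.linear_model import Ridge

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 5))
y = X @ np.array([3.0, -2.0, 0.5, 0.0, 1.0]) + rng.normal(scale=0.1, size=100)

for alpha in [0.1, 10.0, 1000.0]:
    model = Ridge(alpha=alpha).fit(X, y)
    print(f"alpha={alpha:>6}: ||w|| = {np.linalg.norm(model.coef_):.3f}")
```

As alpha grows, the coefficient vector's norm shrinks toward 0, but the coefficients typically stay nonzero.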

Lasso Regression (L1)

Lasso minimizes:

MSE + λ * Σ(|wi|)

Effect:

  • can push some coefficients exactly to 0
  • performs feature selection
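A small sketch of the sparsity effect (the synthetic data, where only 3 of 10 features matter, and the alpha value are illustrative assumptions):

```python
# Sketch: Lasso zeroes out coefficients of irrelevant features.
import numpy as np
from sklearn.linear_model import Lasso

rng = np.random.default_rng(1)
X = rng.normal(size=(200, 10))
true_w = np.zeros(10)
true_w[:3] = [4.0, -3.0, 2.0]          # only the first 3 features matter
y = X @ true_w + rng.normal(scale=0.5, size=200)

lasso = Lasso(alpha=0.5).fit(X, y)
print(np.round(lasso.coef_, 2))
print("nonzero coefficients:", int(np.sum(lasso.coef_ != 0)))
```

The irrelevant features end up with coefficients of exactly 0, which is the "feature selection" behavior described above.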

  flowchart LR
  A[Linear Regression] --> B[Ridge: shrink weights]
  A --> C[Lasso: shrink + select]

Scikit-learn examples

Ridge and Lasso
from sklearn.linear_model import Ridge, Lasso
 
ridge = Ridge(alpha=1.0)  # alpha is λ
lasso = Lasso(alpha=0.1)

Important: scale features

Regularization is sensitive to feature scale.

Use StandardScaler in a pipeline.
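A minimal sketch of that pipeline (the data, with deliberately mismatched feature scales, and the alpha value are illustrative assumptions):

```python
# Sketch: scale features inside a Pipeline so the Ridge penalty
# treats all features fairly.
import numpy as np
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.linear_model import Ridge

rng = np.random.default_rng(2)
X = rng.normal(size=(100, 3)) * [1, 100, 0.01]   # wildly different scales
y = X @ np.array([1.0, 0.01, 50.0]) + rng.normal(size=100)

model = make_pipeline(StandardScaler(), Ridge(alpha=1.0)).fit(X, y)
print("R² on training data:", model.score(X, y))
```

Because scaling happens inside the pipeline, it is refit on each training fold during cross-validation, avoiding data leakage.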

Choosing λ (alpha)

  • use validation or cross-validation
  • RidgeCV / LassoCV can help
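For example, both estimators can search a candidate grid for you (the data and the alpha grids below are illustrative assumptions):

```python
# Sketch: let cross-validation pick alpha from a candidate grid.
import numpy as np
from sklearn.linear_model import RidgeCV, LassoCV

rng = np.random.default_rng(3)
X = rng.normal(size=(150, 5))
y = X @ np.array([2.0, 0.0, -1.0, 0.0, 3.0]) + rng.normal(size=150)

ridge = RidgeCV(alphas=[0.01, 0.1, 1.0, 10.0]).fit(X, y)
lasso = LassoCV(alphas=[0.01, 0.1, 1.0], cv=5).fit(X, y)

print("ridge picked alpha:", ridge.alpha_)
print("lasso picked alpha:", lasso.alpha_)
```

The chosen value is exposed as `alpha_` after fitting, so you can inspect which strength of penalty won.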

Mini-checkpoint

If you have 1000 features:

  • which regularization might help reduce features automatically?

(Usually Lasso.)

If this helped you, consider buying me a coffee ☕
