Logistic Regression (Binary vs Multiclass)

Why it’s called “regression”

Despite its name, logistic regression is a classification algorithm. The “regression” refers to how it works internally: it computes a linear score (as in linear regression) and passes it through a logistic (sigmoid) function to model a probability.

Binary logistic regression

It predicts:

  • probability of class 1, p = P(y=1|X)

Then:

  • predict 1 if p >= threshold, else 0

  flowchart LR
  Z[Linear score z = wΒ·x + b] --> S[Sigmoid]
  S --> P[p = probability]
  P --> Y[Class via threshold]

The sigmoid intuition

  • large positive z β†’ p close to 1
  • large negative z β†’ p close to 0
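
This intuition is easy to verify directly. A minimal sketch of the sigmoid function (using only the standard library):

```python
import math

def sigmoid(z):
    """Map a real-valued score z to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))

print(sigmoid(0))   # 0.5 exactly at z = 0
print(sigmoid(5))   # large positive z -> close to 1
print(sigmoid(-5))  # large negative z -> close to 0
```

Note that sigmoid(0) = 0.5, which is why 0.5 is the natural default threshold.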

Multiclass logistic regression

Common approaches:

  • One-vs-Rest (OvR): train one classifier per class
  • Softmax (multinomial): one model for all classes

Scikit-learn can do both.

Scikit-learn example

LogisticRegression (binary or multiclass)
from sklearn.linear_model import LogisticRegression

# For multiclass, a common choice. Note: in recent scikit-learn
# versions the multi_class parameter is deprecated, and multinomial
# (softmax) behavior is chosen automatically.
clf = LogisticRegression(max_iter=1000, multi_class="auto")
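
A minimal end-to-end run, assuming scikit-learn is installed. The iris dataset here is just an illustrative three-class example:

```python
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

# Three-class problem: iris species from four measurements
X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=42
)

clf = LogisticRegression(max_iter=1000)
clf.fit(X_train, y_train)

# predict_proba returns one probability per class; each row sums to 1
print(clf.predict_proba(X_test[:1]))
print(clf.score(X_test, y_test))  # accuracy on the held-out set
```

Because there are three classes, scikit-learn handles the multiclass case for you; `predict` simply returns the class with the highest probability.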

Threshold tuning matters

Default threshold is 0.5, but for imbalanced problems you may choose:

  • lower threshold to increase recall
  • higher threshold to increase precision
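
The trade-off above can be demonstrated with `predict_proba`. This sketch uses an illustrative imbalanced dataset (about 10% positives, via `make_classification`) and compares two thresholds:

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import precision_score, recall_score
from sklearn.model_selection import train_test_split

# Imbalanced toy data: roughly 90% class 0, 10% class 1
X, y = make_classification(
    n_samples=2000, weights=[0.9, 0.1], random_state=0
)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

clf = LogisticRegression(max_iter=1000).fit(X_train, y_train)
proba = clf.predict_proba(X_test)[:, 1]  # P(y=1|X) per sample

for threshold in (0.5, 0.3):
    pred = (proba >= threshold).astype(int)
    print(f"t={threshold}: "
          f"precision={precision_score(y_test, pred):.2f}, "
          f"recall={recall_score(y_test, pred):.2f}")
```

Lowering the threshold flags more samples as positive, so recall can only stay the same or go up, typically at the cost of precision.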

Mini-checkpoint

If missing fraud is expensive, do you optimize for precision or recall?

(Usually recall.)

πŸ§ͺ Try It Yourself

Exercise 1 – Train-Test Split

Exercise 2 – Fit a Linear Model

Exercise 3 – Evaluate with MSE
