Regularization in Logistic Regression

The Journey of Regularization in ML Using Logistic Regression · Part 5 of 6

1. Introduction

When training a machine learning model, especially Logistic Regression, we want the model to generalize well to unseen data. However, as we saw with overfitting, a model can:

Fit the training data perfectly
Learn noise instead of the true pattern
Perform poorly on new data

Regularization is a technique that prevents overfitting by penalizing large weights in the model.

2. Types of Regularization

Type	Penalty Term	Effect
L1 (Lasso)	$\lambda \sum$	$w_i$
L2 (Ridge)	$\lambda \sum w_i^2$	Shrinks all weights, smooth the model, and avoids extreme values

2.1. L1 (Lasso) Regularization

Use L1 regularization when you want automatic feature selection.

Typical scenarios:

1. High-dimensional datasets
When the dataset has many features but only some are important. Example: text classification, gene expression data.

2. Sparse solutions are preferred
L1 pushes many weights exactly to zero, effectively removing irrelevant features.

3. Model interpretability is important
Because many coefficients become zero, the remaining features clearly show which variables influence predictions.

Example Applications:

Spam detection (thousands of word features)
Genomics / bioinformatics
NLP feature selection

2.2 L2 Regularization (Ridge)

Use L2 regularization when all features contribute somewhat and you mainly want to reduce overfitting without eliminating features.

Typical scenarios:

1. Multicollinearity (highly correlated features)
L2 distributes weights among correlated variables rather than eliminating them.

2. When most features are useful
Instead of removing features, L2 shrinks their magnitudes smoothly.

3. More stable models
It creates smooth decision boundaries and reduces variance.

Example Applications:

Regression with many correlated predictors
Neural networks (commonly called weight decay)
Financial prediction models

← Prediction Error Decomposition (Bias–Var...

Linked to

Machine Learning (Course folder)
Logistic Regression (Material)
Linear Regression Model and Optimization (Material)
Lecture 03 Linear Regression and Optimization (Material)

By Dr. Adnan Amin · March 9, 2026 · 688 views

★ ★ ★ ★ ★ (4.7)

2 Comments

Sign in to leave a comment.

A

Abdullah Hasnain Qureshi 4 months ago

From this topic, I learned that regularization helps prevent overfitting in machine learning models by penalizing large weights so the model can generalize better to new data. I also learned the difference between L1 (Lasso) and L2 (Ridge) regularization, where L1 can perform feature selection by setting some weights to zero, while L2 reduces the size of weights to create a more stable model.

M

Muhammad Shaheer Siddiqui 4 months ago

I learned that regularization helps reduce overfitting in machine learning models. I also understood that L1 can remove unnecessary features while L2 reduces the size of weights to make the model more stable.