Tuesday, 13 January 2026

📉 Overfitting vs Underfitting: How Models Learn (and Fail)

When a machine learning model performs very well on training data but poorly on new data, we often say:

“The model learned too much… or too little.”

That’s the core idea behind Overfitting and Underfitting — two of the most important concepts to understand if you want to build reliable ML models.

I truly started appreciating this topic when I began checking training vs test performance in code, not just reading definitions.


🧠 What Does "Model Learning" Really Mean?

A model learns by identifying patterns in data.
But learning can go wrong in two ways:

  • The model learns too little → misses important patterns

  • The model learns too much → memorizes noise instead of general rules

These two extremes are called Underfitting and Overfitting.


🔻 Underfitting: When the Model Is Too Simple

Underfitting happens when a model is too simple to capture the underlying pattern in the data.

🔹 Characteristics

  • Poor performance on training data

  • Poor performance on test data

  • High bias, low variance

🔹 Intuition

It’s like studying only the chapter headings before an exam — you never really understand the topic.

🔹 Example

Using linear regression to model a clearly non-linear relationship.
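A quick way to see this in code, using made-up data (y = x², not any real dataset): a straight line simply cannot trace the curve, so even the training score stays low.

```python
import numpy as np
from sklearn.linear_model import LinearRegression

# Synthetic, clearly non-linear data: y = x^2
X = np.linspace(-3, 3, 100).reshape(-1, 1)
y = X.ravel() ** 2

# A straight line cannot capture the U-shape, so even the
# training R2 is poor -- the signature of underfitting.
model = LinearRegression().fit(X, y)
print("Train R2:", model.score(X, y))
```

Note that the model fails on the very data it was trained on: no amount of extra training data fixes this, because the model family itself is too simple.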




🔺 Overfitting: When the Model Learns Too Much

Overfitting happens when a model learns noise and details from training data that don’t generalize.

🔹 Characteristics

  • Very high training accuracy

  • Poor test performance

  • Low bias, high variance

🔹 Intuition

It’s like memorizing answers instead of understanding concepts — works only for known questions.

🔹 Example

A very deep decision tree that fits every training point perfectly.
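A minimal sketch of that failure (the noisy sin data here is made up for illustration): an unrestricted DecisionTreeRegressor nails the training set and stumbles on held-out points.

```python
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeRegressor

# Made-up noisy data: y = sin(x) + noise
rng = np.random.default_rng(42)
X = rng.uniform(-3, 3, 200).reshape(-1, 1)
y = np.sin(X.ravel()) + rng.normal(scale=0.5, size=200)

X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# With no depth limit, the tree memorizes every training point,
# including the noise -- near-perfect train R2, much weaker test R2.
tree = DecisionTreeRegressor(random_state=0).fit(X_train, y_train)
print("Train R2:", round(tree.score(X_train, y_train), 3))
print("Test R2:", round(tree.score(X_test, y_test), 3))
```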




⚖️ The Sweet Spot: Good Fit

A well-trained model:

  • Learns meaningful patterns

  • Ignores noise

  • Performs well on both training and test data




🧮 A Practical View Using Training vs Test Scores

This is where theory becomes real.

```python
print("Train R2:", model.score(X_train, y_train))
print("Test R2:", model.score(X_test, y_test))
```

How to interpret:

  • Low train & low test → Underfitting

  • High train & low test → Overfitting

  • Similar and high scores → Good fit

This simple check already tells you a lot about model behavior.
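For contrast, here is what the "good fit" row looks like in practice, a sketch on made-up data where a depth-limited tree lands similar train and test scores:

```python
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeRegressor

# Made-up noisy data: y = sin(x) + noise
rng = np.random.default_rng(1)
X = rng.uniform(-3, 3, 300).reshape(-1, 1)
y = np.sin(X.ravel()) + rng.normal(scale=0.3, size=300)

X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# Capping the depth stops the tree from memorizing noise,
# so train and test R2 land close together.
model = DecisionTreeRegressor(max_depth=3, random_state=0).fit(X_train, y_train)
print("Train R2:", round(model.score(X_train, y_train), 2))
print("Test R2:", round(model.score(X_test, y_test), 2))
```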


🔧 How Do We Fix Underfitting?

  • Use a more complex model

  • Add more relevant features

  • Reduce regularization

  • Train longer (if applicable)
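For example, the "add more relevant features" fix can be sketched on the synthetic y = x² data: PolynomialFeatures adds an x² column, so the same linear model can now trace the curve.

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures

# Synthetic quadratic data that a plain line underfits
X = np.linspace(-3, 3, 100).reshape(-1, 1)
y = X.ravel() ** 2

# Same linear model, but with an added x^2 feature
linear = LinearRegression().fit(X, y)
poly = make_pipeline(PolynomialFeatures(degree=2), LinearRegression()).fit(X, y)

print("Linear R2:", round(linear.score(X, y), 3))  # underfits
print("Poly R2:", round(poly.score(X, y), 3))      # fits the curve
```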


🛠️ How Do We Fix Overfitting?

  • Collect more data

  • Use regularization (L1 / L2)

  • Reduce model complexity

  • Use cross-validation

  • Apply early stopping (for neural networks)

```python
from sklearn.model_selection import cross_val_score

scores = cross_val_score(model, X, y, cv=5)
print("CV Score:", scores.mean())
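And a minimal sketch of the L2 option from the list (Ridge regression on made-up data; the degree-15 polynomial is deliberately oversized so there is something to shrink):

```python
import numpy as np
from sklearn.linear_model import Ridge
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures

# Made-up small noisy dataset that a degree-15 polynomial overfits
rng = np.random.default_rng(0)
X = np.sort(rng.uniform(-1, 1, 30)).reshape(-1, 1)
y = np.sin(2 * np.pi * X.ravel()) + rng.normal(scale=0.2, size=30)

# Larger alpha -> stronger L2 penalty -> smaller weights, smoother fit
norms, train_scores = {}, {}
for alpha in (1e-6, 1.0):
    model = make_pipeline(
        PolynomialFeatures(degree=15, include_bias=False), Ridge(alpha=alpha)
    ).fit(X, y)
    norms[alpha] = np.linalg.norm(model.named_steps["ridge"].coef_)
    train_scores[alpha] = model.score(X, y)
    print(f"alpha={alpha}: train R2={train_scores[alpha]:.2f}, "
          f"weight norm={norms[alpha]:.1f}")
```

The regularized model gives up some training accuracy in exchange for smaller weights, which is exactly the "discipline" overfitting calls for.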

🧠 Bias–Variance Tradeoff (Simple Explanation)

  • Bias → error due to overly simple assumptions

  • Variance → error due to sensitivity to data

Underfitting → High bias
Overfitting → High variance

Good models balance both.
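One way to see the tradeoff directly (made-up data again) is to sweep a single complexity knob, here tree depth, and watch the train/test gap open up:

```python
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeRegressor

# Made-up noisy data: y = sin(x) + noise
rng = np.random.default_rng(0)
X = rng.uniform(-3, 3, 400).reshape(-1, 1)
y = np.sin(X.ravel()) + rng.normal(scale=0.3, size=400)

X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# depth=1: high bias (both scores low); depth=None: high variance
# (train near-perfect, test lags behind); depth=4: a balance.
results = {}
for depth in (1, 4, None):
    tree = DecisionTreeRegressor(max_depth=depth, random_state=0).fit(X_train, y_train)
    results[depth] = (tree.score(X_train, y_train), tree.score(X_test, y_test))
    print(f"max_depth={depth}: train={results[depth][0]:.2f}, "
          f"test={results[depth][1]:.2f}")
```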




🌱 Why This Concept Matters So Much

Almost every ML problem eventually becomes a question of:

“Is my model learning the right amount?”

Understanding overfitting and underfitting helps you:

  • Debug models faster

  • Choose the right complexity

  • Build models that actually work in production


🧩 Final Thoughts

A model failing is not a bad sign — it’s feedback.

Underfitting tells you the model needs more capacity.
Overfitting tells you the model needs more discipline.

Learning to read these signals is what turns code into intuition.

