Thursday, 22 January 2026

📉 Loss Functions Explained: How Models Know They Are Wrong

Every machine learning model learns by making mistakes.

But how does a model measure those mistakes?

That’s the role of a loss function.

Understanding loss functions is a turning point when learning ML, because this is where predictions, errors, and optimization finally connect.


🧠 What Is a Loss Function?

A loss function quantifies how far a model’s prediction is from the actual value.

In simple terms:

Loss = “How wrong was the model?”

During training, the model tries to minimize this loss.

Mathematically:

Loss = L(y, \hat{y})

where

  • y = actual value

  • \hat{y} = predicted value


🔁 How Loss Fits into the Learning Loop

  1. Model makes a prediction

  2. Loss function measures the error

  3. Optimizer (e.g., Gradient Descent) updates weights

  4. Loss reduces gradually over epochs
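To make this loop concrete, here is a minimal NumPy sketch: a single-weight model trained on made-up numbers (none of this comes from a real dataset), with MSE as the loss and plain gradient descent as the optimizer.

import numpy as np

# Tiny illustrative dataset: y is roughly 3 * x
X = np.array([1.0, 2.0, 3.0, 4.0])
y = np.array([3.1, 5.9, 9.2, 11.8])

w = 0.0      # single weight to learn
lr = 0.01    # learning rate

for epoch in range(100):
    y_hat = w * X                          # 1. model makes a prediction
    loss = np.mean((y - y_hat) ** 2)       # 2. loss function measures the error (MSE)
    grad = -2 * np.mean((y - y_hat) * X)   # 3. gradient of the loss w.r.t. w
    w = w - lr * grad                      #    the optimizer updates the weight
    # 4. loss shrinks gradually over the epochs

print("Learned weight:", w, "Final loss:", loss)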




📊 Loss Functions for Regression

🔹 1. Mean Squared Error (MSE)

MSE = \frac{1}{n} \sum_{i=1}^{n} (y_i - \hat{y}_i)^2

Why it’s used:

  • Penalizes large errors heavily

  • Smooth and differentiable

Limitation:

  • Sensitive to outliers




🔹 2. Mean Absolute Error (MAE)

MAE = \frac{1}{n} \sum_{i=1}^{n} |y_i - \hat{y}_i|

Why it’s used:

  • More robust to outliers

  • Easy to interpret

Trade-off:

  • Less smooth than MSE
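A quick way to feel the difference between the two is to compute both on the same predictions, once with and once without an outlier. The numbers below are made up purely for illustration.

from sklearn.metrics import mean_squared_error, mean_absolute_error

y_true = [10, 12, 11, 13]
y_pred = [11, 12, 10, 13]

# Add a single badly mispredicted point (an outlier) to both lists
y_true_outlier = y_true + [10]
y_pred_outlier = y_pred + [40]

print("MSE without outlier:", mean_squared_error(y_true, y_pred))                   # 0.5
print("MAE without outlier:", mean_absolute_error(y_true, y_pred))                  # 0.5
print("MSE with outlier:", mean_squared_error(y_true_outlier, y_pred_outlier))      # blows up (~180)
print("MAE with outlier:", mean_absolute_error(y_true_outlier, y_pred_outlier))     # grows modestly (~6.4)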



🧮 Loss Functions for Classification

🔹 3. Binary Cross-Entropy (Log Loss)

Used for binary classification problems.

Loss = -[y \log(p) + (1 - y) \log(1 - p)]

Intuition:

  • Penalizes confident wrong predictions heavily

  • Encourages probability calibration
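Here is a tiny sketch of that intuition using scikit-learn's log_loss; the probabilities are made up, and the true label is 1 in both cases.

from sklearn.metrics import log_loss

y_true = [1]

# A mildly wrong prediction vs. a confidently wrong prediction
print(log_loss(y_true, [0.4], labels=[0, 1]))    # ~0.92
print(log_loss(y_true, [0.01], labels=[0, 1]))   # ~4.61 -> confident mistakes cost far more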




🔹 4. Categorical Cross-Entropy

Used when there are multiple classes.

Example:

  • Handwritten digit recognition (0–9)

  • Multi-class text classification

The loss increases when the predicted probability for the correct class is low.
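A minimal sketch of that statement for a single sample with three classes (the probabilities are made up): the per-sample loss is simply the negative log of the probability assigned to the correct class.

import numpy as np

true_class = 2   # the correct class is index 2

probs_good = np.array([0.05, 0.15, 0.80])  # model is fairly sure about the right class
probs_bad  = np.array([0.60, 0.30, 0.10])  # right class gets low probability

# Categorical cross-entropy for one sample: -log(probability of the correct class)
print(-np.log(probs_good[true_class]))  # ~0.22
print(-np.log(probs_bad[true_class]))   # ~2.30 -> loss rises as the correct-class probability drops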


⚙️ Choosing the Right Loss Function

Problem Type                  Loss Function
Linear Regression             MSE / MAE
Binary Classification         Binary Cross-Entropy
Multi-class Classification    Categorical Cross-Entropy
Deep Learning                 Cross-Entropy + Regularization

Choosing the wrong loss can make even a good model fail.
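In scikit-learn the loss is usually baked into the estimator you pick, but some estimators expose it as a parameter. A small sketch follows; the loss names match recent scikit-learn releases (older versions used "squared_loss" and "log"), so treat the exact strings as an assumption.

from sklearn.linear_model import SGDRegressor, SGDClassifier

# Regression: squared error corresponds to MSE
reg = SGDRegressor(loss="squared_error")

# Binary classification: log loss corresponds to binary cross-entropy
clf = SGDClassifier(loss="log_loss")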


🧠 Why Loss Functions Matter More Than Accuracy

Accuracy tells you what happened.
Loss tells you why it happened.

  • Two models can have the same accuracy

  • But very different loss values

Lower loss usually means:

  • Better confidence

  • Better generalization

  • Better learning signal
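Here is a tiny, made-up illustration of the point above: both "models" predict exactly the same classes after thresholding (so identical accuracy), but one assigns better-calibrated probabilities and therefore gets a lower log loss.

from sklearn.metrics import accuracy_score, log_loss

y_true = [1, 0, 1, 1]

probs_a = [0.95, 0.05, 0.90, 0.40]   # confident where it is right, mildly wrong once
probs_b = [0.55, 0.45, 0.60, 0.45]   # barely confident, same single mistake

preds_a = [int(p >= 0.5) for p in probs_a]
preds_b = [int(p >= 0.5) for p in probs_b]

print("Accuracy A:", accuracy_score(y_true, preds_a))  # 0.75
print("Accuracy B:", accuracy_score(y_true, preds_b))  # 0.75 -> identical accuracy
print("Log loss A:", log_loss(y_true, probs_a))        # ~0.28
print("Log loss B:", log_loss(y_true, probs_b))        # ~0.63 -> very different loss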


🌱 Final Thoughts

Loss functions are not just formulas — they are feedback mechanisms.

They guide models on:

  • What to correct

  • How fast to learn

  • When to stop

Once you truly understand loss functions, concepts like gradient descent, regularization, and neural network training become much clearer.

Tuesday, 13 January 2026

📉 Overfitting vs Underfitting: How Models Learn (and Fail)

 When a machine learning model performs very well on training data but poorly on new data, we often say:

“The model learned too much… or too little.”

That’s the core idea behind Overfitting and Underfitting — two of the most important concepts to understand if you want to build reliable ML models.

I truly started appreciating this topic when I began checking training vs test performance in code, not just reading definitions.


🧠 What Does "Model Learning" Really Mean?

A model learns by identifying patterns in data.
But learning can go wrong in two ways:

  • The model learns too little → misses important patterns

  • The model learns too much → memorizes noise instead of general rules

These two extremes are called Underfitting and Overfitting.


🔻 Underfitting: When the Model Is Too Simple

Underfitting happens when a model is too simple to capture the underlying pattern in the data.

🔹 Characteristics

  • Poor performance on training data

  • Poor performance on test data

  • High bias, low variance

🔹 Intuition

It’s like studying only the chapter headings before an exam — you never really understand the topic.

🔹 Example

Using linear regression to model a clearly non-linear relationship.
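A small reproducible sketch of exactly that, using synthetic data (so the numbers are only illustrative): a straight line fitted to a clearly quadratic relationship scores poorly on both splits.

import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import train_test_split

# Synthetic non-linear data: y depends on x squared
rng = np.random.RandomState(0)
X = rng.uniform(-3, 3, size=(200, 1))
y = X[:, 0] ** 2 + rng.normal(scale=0.3, size=200)

X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

linear = LinearRegression().fit(X_train, y_train)
print("Train R2:", linear.score(X_train, y_train))  # low
print("Test R2:", linear.score(X_test, y_test))     # also low -> underfitting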




🔺 Overfitting: When the Model Learns Too Much

Overfitting happens when a model learns noise and details from training data that don’t generalize.

🔹 Characteristics

  • Very high training accuracy

  • Poor test performance

  • Low bias, high variance

🔹 Intuition

It’s like memorizing answers instead of understanding concepts — works only for known questions.

🔹 Example

A very deep decision tree that fits every training point perfectly.
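And a matching sketch for overfitting, again on synthetic data: an unrestricted decision tree memorizes the noisy training points perfectly but does noticeably worse on the test split.

import numpy as np
from sklearn.tree import DecisionTreeRegressor
from sklearn.model_selection import train_test_split

rng = np.random.RandomState(0)
X = rng.uniform(-3, 3, size=(200, 1))
y = np.sin(X[:, 0]) + rng.normal(scale=0.5, size=200)  # noisy signal

X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

# An unrestricted tree keeps splitting until every training point is fit exactly
tree = DecisionTreeRegressor(max_depth=None).fit(X_train, y_train)
print("Train R2:", tree.score(X_train, y_train))  # ~1.0
print("Test R2:", tree.score(X_test, y_test))     # noticeably lower -> overfitting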




⚖️ The Sweet Spot: Good Fit

A well-trained model:

  • Learns meaningful patterns

  • Ignores noise

  • Performs well on both training and test data




🧮 A Practical View Using Training vs Test Scores

This is where theory becomes real.

print("Train R2:", model.score(X_train, y_train)) print("Test R2:", model.score(X_test, y_test))

How to interpret:

  • Low train & low test → Underfitting

  • High train & low test → Overfitting

  • Similar and high scores → Good fit

This simple check already tells you a lot about model behavior.


🔧 How Do We Fix Underfitting?

  • Use a more complex model

  • Add more relevant features

  • Reduce regularization

  • Train longer (if applicable)
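Here is one of those fixes in code, continuing the synthetic curved data from the underfitting example above: adding polynomial features gives the linear model enough capacity to capture the curve (the degree is an illustrative choice).

import numpy as np
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import train_test_split

# Same kind of synthetic curved data as in the underfitting example
rng = np.random.RandomState(0)
X = rng.uniform(-3, 3, size=(200, 1))
y = X[:, 0] ** 2 + rng.normal(scale=0.3, size=200)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

# Polynomial features let the linear model represent curvature
poly_model = make_pipeline(PolynomialFeatures(degree=2), LinearRegression())
poly_model.fit(X_train, y_train)

print("Train R2:", poly_model.score(X_train, y_train))  # both scores rise compared
print("Test R2:", poly_model.score(X_test, y_test))     # with the plain linear fit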


🛠️ How Do We Fix Overfitting?

  • Collect more data

  • Use regularization (L1 / L2)

  • Reduce model complexity

  • Use cross-validation

  • Apply early stopping (for neural networks)

from sklearn.model_selection import cross_val_score

scores = cross_val_score(model, X, y, cv=5)
print("CV Score:", scores.mean())
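For the L1/L2 point in the list, here is a minimal Ridge sketch. It assumes the same X_train / X_test split used in the earlier snippets, and the alpha value is only illustrative.

from sklearn.linear_model import Ridge

# L2 regularization (Ridge) shrinks large weights toward zero;
# Lasso (L1) works similarly but can drop features entirely.
ridge = Ridge(alpha=1.0)   # alpha controls how strong the penalty is
ridge.fit(X_train, y_train)

print("Train R2:", ridge.score(X_train, y_train))
print("Test R2:", ridge.score(X_test, y_test))   # the train/test gap should shrink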

🧠 Bias–Variance Tradeoff (Simple Explanation)

  • Bias → error due to overly simple assumptions

  • Variance → error due to sensitivity to small fluctuations in the training data

Underfitting → High bias
Overfitting → High variance

Good models balance both.




🌱 Why This Concept Matters So Much

Almost every ML problem eventually becomes a question of:

“Is my model learning the right amount?”

Understanding overfitting and underfitting helps you:

  • Debug models faster

  • Choose the right complexity

  • Build models that actually work in production


🧩 Final Thoughts

A model failing is not a bad sign — it’s feedback.

Underfitting tells you the model needs more capacity.
Overfitting tells you the model needs more discipline.

Learning to read these signals is what turns code into intuition.



Sunday, 4 January 2026

📘 Supervised Learning Explained Practically: From Data to Predictions

Supervised Learning is one of the most fundamental concepts in Machine Learning and Data Science.

From spam detection to price prediction, most real-world ML systems are built using this approach.

As I progressed through my Data Science coursework, adding small practical implementations helped me truly understand how theory translates into working models. This blog combines both.


🔍 What Is Supervised Learning?

Supervised Learning is a machine learning approach where the model learns from labeled data.

Each data point has:

  • Input features (X)

  • Known output / label (y)

The model learns a mapping:

f(X) \rightarrow y

so it can make predictions on new, unseen data.


🧠 How Supervised Learning Works (Step-by-Step)

1️⃣ Data Collection & Labeling

Example dataset (House Price Prediction):

Area    Rooms    Price
1000    2        50
1500    3        75

Here:

  • Features → Area, Rooms

  • Label → Price

🐍 Python (loading data)

import pandas as pd

data = pd.read_csv("house_prices.csv")
X = data[["Area", "Rooms"]]
y = data["Price"]

2️⃣ Train–Test Split

We split data to evaluate how well the model generalizes.

from sklearn.model_selection import train_test_split

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42
)

📊 Types of Supervised Learning

🔹 1. Regression (Continuous Output)

Use case: House price prediction, sales forecasting.

🐍 Python Example: Linear Regression

from sklearn.linear_model import LinearRegression

model = LinearRegression()
model.fit(X_train, y_train)
predictions = model.predict(X_test)

🔍 Model Evaluation

from sklearn.metrics import mean_squared_error, r2_score

mse = mean_squared_error(y_test, predictions)
r2 = r2_score(y_test, predictions)
print("MSE:", mse)
print("R2 Score:", r2)

🔹 2. Classification (Categorical Output)

Use case: Spam detection, fraud detection, disease prediction.

🐍 Python Example: Logistic Regression

from sklearn.linear_model import LogisticRegression

# Assumes y_train holds binary class labels here (e.g. spam / not spam),
# not the continuous house prices from the regression example
clf = LogisticRegression()
clf.fit(X_train, y_train)
y_pred = clf.predict(X_test)

🔍 Evaluation Metrics

from sklearn.metrics import accuracy_score, classification_report

print("Accuracy:", accuracy_score(y_test, y_pred))
print(classification_report(y_test, y_pred))

🧮 The Learning Process (Behind the Scenes)

Most supervised models try to minimize a loss function:

Loss = \frac{1}{n} \sum (y - \hat{y})^2

Using Gradient Descent, parameters are updated:

\theta = \theta - \alpha \cdot \nabla Loss

This is what allows the model to gradually improve predictions.
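Here is a compact NumPy sketch of those two formulas for a one-feature linear model. The data and learning rate are made up, and this is not how scikit-learn's LinearRegression solves the problem internally (it uses a closed-form least-squares solution), but it shows the update rule in action.

import numpy as np

# Tiny synthetic dataset: y is roughly 2x + 1
X = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([3.1, 4.9, 7.2, 9.1, 10.8])

theta_w, theta_b = 0.0, 0.0   # parameters (weight and bias)
alpha = 0.01                  # learning rate

for _ in range(2000):
    y_hat = theta_w * X + theta_b
    # Gradients of the MSE loss with respect to each parameter
    grad_w = -2 * np.mean((y - y_hat) * X)
    grad_b = -2 * np.mean(y - y_hat)
    # theta = theta - alpha * gradient
    theta_w -= alpha * grad_w
    theta_b -= alpha * grad_b

print("Learned parameters:", theta_w, theta_b)  # close to 2 and 1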


⚠️ Common Challenges (With Practical Fixes)

1️⃣ Overfitting

Model performs well on training data but poorly on test data.

from sklearn.model_selection import cross_val_score

scores = cross_val_score(model, X_train, y_train, cv=5)
print("Cross-validation score:", scores.mean())

2️⃣ Feature Scaling Issues

Some models need normalized data.

from sklearn.preprocessing import StandardScaler

scaler = StandardScaler()
X_train_scaled = scaler.fit_transform(X_train)
X_test_scaled = scaler.transform(X_test)

3️⃣ Imbalanced Data

Accuracy alone can be misleading.

from sklearn.metrics import precision_score, recall_score

precision = precision_score(y_test, y_pred)
recall = recall_score(y_test, y_pred)

🔬 A Practical Mini Walkthrough: From Data to Prediction

Rather than looking at another real-world story, let’s walk through how supervised learning actually feels when you implement it.

Step 1: Understand the Problem

We want to predict a numerical value based on past data → this is a regression problem.

So immediately, we know:

  • Supervised Learning ✔

  • Regression ✔

  • Loss function like MSE ✔


Step 2: Prepare the Data (What You Really Do First)

In practice, most time goes here.

# Check for missing values
data.isnull().sum()

# Basic feature selection
X = data.drop("Price", axis=1)
y = data["Price"]

This step forces you to think:

Which columns actually help the model learn?


Step 3: Train and Evaluate (The Core Loop)

model = LinearRegression()
model.fit(X_train, y_train)

train_score = model.score(X_train, y_train)
test_score = model.score(X_test, y_test)

print("Train R2:", train_score)
print("Test R2:", test_score)

This comparison immediately tells you:

  • If train >> test → overfitting

  • If both are low → underfitting


Step 4: Interpret Results (Very Important, Often Ignored)

coefficients = pd.DataFrame({
    "Feature": X.columns,
    "Weight": model.coef_
})
print(coefficients)

Now you’re not just predicting — you’re understanding:

  • Which features influence predictions

  • Whether model behavior makes sense logically

This is where Data Science becomes decision-making, not just modeling.


🌱 Why Supervised Learning Still Matters

Even in modern AI systems, supervised learning is:

  • Used in model fine-tuning

  • A core part of reinforcement learning pipelines

  • The backbone of most enterprise ML solutions

Supervised learning is not outdated — it’s foundational.
