Monday, 24 November 2025

📊 Types of Data in Data Science: A Simple & Clear Guide

 When you start learning Data Science or Statistics, one of the first concepts you come across is Types of Data.

This foundation determines which graphs you can use, which statistical tests are valid, and which ML algorithms are likely to work well.

So let’s break it down in a very simple way — the same way I understood it during my Data Science coursework.


🔰 Two Main Types of Data

All data you deal with falls under two big buckets:

1️⃣ Qualitative Data (Categorical)

Non-numerical data — describes qualities, labels, or categories.

2️⃣ Quantitative Data (Numerical)

Data measured using numbers — describes quantity or amount.

Let’s understand each one easily.




🎨 1. Qualitative Data (Categorical)

This data represents categories, labels, or names.
Arithmetic on it (like addition or averaging) is not meaningful, even when the categories are written as numbers.

Qualitative data is of two types:


🔸 A. Nominal Data

✔ Labels with no order
✔ No category ranks above another
✔ Only classification is possible

Examples:

  • Gender (Male/Female/Other)

  • Nationality (Indian, American…)

  • Eye Color

  • Marital Status

  • Mode of Transport

👉 You cannot say one is “higher” or “lower”; the values are only categories.
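
To make this concrete, here is a minimal sketch using only Python's standard library. The `eye_colors` list is made-up sample data. The idea it shows is that counting labels and finding the most frequent one are the operations that make sense for nominal values.

```python
from collections import Counter

# Hypothetical sample of a nominal column (eye color of six people).
eye_colors = ["brown", "blue", "brown", "green", "blue", "brown"]

# Counting how often each label occurs is valid for nominal data.
counts = Counter(eye_colors)

# The mode (most frequent label) is the only "average" that makes sense here.
mode_label, mode_count = counts.most_common(1)[0]

print(counts["brown"], counts["blue"], counts["green"])  # 3 2 1
print(mode_label)                                        # brown
```

Notice there is no mean or sorting step: neither would tell you anything about eye color.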


🔸 B. Ordinal Data

✔ Labels with a meaningful order
✔ But the difference between them is not measurable

Examples:

  • Customer satisfaction rating (1–5)

  • Education level (Primary → Secondary → Graduate → Postgraduate)

  • Letter grades (A, B, C…)

  • Rankings (1st, 2nd, 3rd)

👉 You know the order, but you don’t know how big the difference is.
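
A small sketch of what "order without distance" means in practice, again in plain Python. The `people` list and the helper name `is_higher` are my own illustrative choices. Sorting, comparing, and taking the median all work because they only need the order.

```python
# Ordered scale from the education example, lowest to highest.
EDUCATION_ORDER = ["Primary", "Secondary", "Graduate", "Postgraduate"]
rank = {level: i for i, level in enumerate(EDUCATION_ORDER)}

# Hypothetical sample of five people.
people = ["Graduate", "Primary", "Postgraduate", "Secondary", "Graduate"]

# Sorting is valid because the order is meaningful.
sorted_people = sorted(people, key=rank.get)

# The median is valid too: it needs the order, not the size of the gaps.
median_level = sorted_people[len(sorted_people) // 2]

def is_higher(a, b):
    """Return True when level a ranks above level b."""
    return rank[a] > rank[b]

print(sorted_people)
print(median_level)                       # Graduate
print(is_higher("Graduate", "Primary"))   # True
```

What you should not do is average the rank numbers and read the result as a real quantity, because the step from Primary to Secondary is not guaranteed to equal the step from Graduate to Postgraduate.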


🔢 2. Quantitative Data (Numerical)

Data that represents numbers we can measure, calculate, or compare.

This is divided into two types:


🔸 A. Discrete Data

✔ Whole numbers only
✔ Counts, not measurements
✔ Cannot have decimals

Examples:

  • Number of students in a class

  • Number of employees

  • Number of vehicles

  • Number of products sold

👉 Always countable.


🔸 B. Continuous Data

✔ Can take any value within a range (decimals allowed)
✔ Measurements
✔ Precision is limited only by the measuring instrument

Examples:

  • Height, weight

  • Time taken to finish a task

  • Speed of a vehicle

  • Temperature

  • Share price on the stock market

👉 Values fall anywhere within a range.
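
Here is a rough way to see the discrete vs. continuous difference in code. The function `looks_discrete` and both sample lists are hypothetical, and the check is only a heuristic: a measurement that happens to be rounded to whole numbers would fool it, so treat it as a first hint, not a rule.

```python
def looks_discrete(values):
    """Rough heuristic: True when every value is a whole number."""
    return all(float(v).is_integer() for v in values)

# Hypothetical sample data.
students_per_class = [28, 31, 25, 30]          # counts
heights_cm = [162.5, 170.2, 158.0, 181.7]      # measurements

print(looks_discrete(students_per_class))  # True
print(looks_discrete(heights_cm))          # False
```

In the end, what decides the type is how the value was produced (counting or measuring), not how it is stored.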


🧩 Putting It All Together (Simple Table)

Type         | Sub-Type   | Meaning                  | Examples
-------------|------------|--------------------------|------------------
Qualitative  | Nominal    | Categories without order | Gender, Eye Color
Qualitative  | Ordinal    | Categories with order    | Ratings, Grades
Quantitative | Discrete   | Countable numbers        | Students, Cars
Quantitative | Continuous | Measurable values        | Height, Time

📝 Why Is Understanding Data Types Important?

Because it affects everything in Data Science:

✔ What type of chart you will use
✔ Which statistical test is valid
✔ Which ML model works best
✔ How you preprocess/clean the data

For example:

  • Nominal → One-hot encoding

  • Ordinal → Ordinal (integer) encoding that preserves the order

  • Continuous → Standardization/Normalization

  • Discrete → Often used as-is; scaling is not always needed
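
The three preprocessing ideas above can be sketched in plain Python like this. The function names and sample values are my own, written from scratch for illustration, so this is not any particular library's API.

```python
from statistics import mean, pstdev

def one_hot(values):
    """Nominal: one 0/1 column per category, exactly one 1 per row."""
    categories = sorted(set(values))
    return categories, [[int(v == c) for c in categories] for v in values]

def ordinal_encode(values, order):
    """Ordinal: map each label to its position in an explicit order."""
    position = {label: i for i, label in enumerate(order)}
    return [position[v] for v in values]

def standardize(values):
    """Continuous: rescale to mean 0 and population standard deviation 1."""
    mu, sigma = mean(values), pstdev(values)
    return [(v - mu) / sigma for v in values]

# Hypothetical sample columns.
transport = ["bus", "car", "bus", "train"]
ratings = ["low", "high", "medium"]
times = [10.0, 20.0, 30.0]

categories, encoded = one_hot(transport)
print(categories)   # ['bus', 'car', 'train']
print(encoded)      # [[1, 0, 0], [0, 1, 0], [1, 0, 0], [0, 0, 1]]
print(ordinal_encode(ratings, ["low", "medium", "high"]))  # [0, 2, 1]
print(standardize(times))
```

Note that the order for the ordinal column is passed in explicitly. If you let the code sort the labels alphabetically, "high" would come before "low" and the meaning would be lost. In real projects you would more likely reach for pandas (`get_dummies`) or scikit-learn (`OneHotEncoder`, `OrdinalEncoder`, `StandardScaler`) instead of hand-written helpers.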


🎯 Real Example: Choosing the Right Method

If you’re predicting house prices:

  • Area in sq. ft → Continuous

  • Number of bedrooms → Discrete

  • Location → Nominal

  • Condition (poor/average/good) → Ordinal

The type determines how you handle each feature.
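
As a rough sketch, here is how one house could be turned into a row of numbers for a model. The location names, the function `encode_house`, and the sample values are all hypothetical; only the four features and the condition scale come from the list above.

```python
# Hypothetical category lists for illustration.
LOCATIONS = ["downtown", "suburb", "rural"]      # nominal: no order implied
CONDITION_ORDER = ["poor", "average", "good"]    # ordinal: explicit order

def encode_house(area_sqft, bedrooms, location, condition):
    """Turn one house into a flat list of numbers."""
    # Nominal feature: one 0/1 flag per location.
    location_flags = [int(location == loc) for loc in LOCATIONS]
    # Ordinal feature: position on the poor < average < good scale.
    condition_rank = CONDITION_ORDER.index(condition)
    # Area is left unscaled here, since scaling needs the whole dataset.
    return [float(area_sqft), bedrooms, *location_flags, condition_rank]

row = encode_house(1450.5, 3, "suburb", "good")
print(row)  # [1450.5, 3, 0, 1, 0, 2]
```

Each feature is handled according to its type: the continuous and discrete ones stay numeric, the nominal one becomes flags, and the ordinal one becomes a rank.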


🌟 Conclusion

Understanding data types is the first and most essential step in Data Science.
Once you get this right, every other concept — visualization, encoding, modeling, statistics — becomes so much easier.

If you’re curious about how this fits into the bigger picture, read my post “What is Data Science?”
