When you start learning Data Science or Statistics, one of the first concepts you come across is Types of Data.

This foundation decides which graphs to use, which statistical tests are valid, and which ML algorithms will work best.

So let’s break it down in a very simple way — the same way I understood it during my Data Science coursework.

🔰 Two Main Types of Data

All data you deal with falls under two big buckets:

1️⃣ Qualitative Data (Categorical)

Non-numerical data — describes qualities, labels, or categories.

2️⃣ Quantitative Data (Numerical)

Data measured using numbers — describes quantity or amount.

Let’s understand each one easily.

🎨 1. Qualitative Data (Categorical)

This data represents categories, labels, or names.
You cannot do mathematical operations on it (like addition or average).

Qualitative data is of two types:

🔸 A. Nominal Data

✔ Labels with no order
✔ Categories are equal
✔ Only classification is possible

Examples:

Gender (Male/Female/Other)
Nationality (Indian, American…)
Eye Color
Marital Status
Mode of Transport

👉 You cannot say one is “higher” or “lower” — only categories.

🔸 B. Ordinal Data

✔ Labels with a meaningful order
✔ But difference between them is not measurable

Examples:

Customer satisfaction rating (1–5)
Education level (Primary → Secondary → Graduate → Postgraduate)
Letter grades (A, B, C…)
Rankings (1st, 2nd, 3rd)

👉 You know the order, but you don’t know how big the difference is.

🔢 2. Quantitative Data (Numerical)

Data that represents numbers we can measure, calculate, or compare.

This is divided into two types:

🔸 A. Discrete Data

✔ Whole numbers only
✔ Counts, not measurements
✔ Cannot have decimals

Examples:

Number of students in a class
Number of employees
Number of vehicles
Number of products sold

👉 Always countable.

🔸 B. Continuous Data

✔ Can take any value (decimals allowed)
✔ Measurements
✔ More precise than discrete data

Examples:

Height, weight
Time taken to finish a task
Speed of a vehicle
Temperature
Market share price

👉 Values fall anywhere within a range.

🧩 Putting It All Together (Simple Table)

Type	Sub-Type	Meaning	Examples
Qualitative	Nominal	Categories without order	Gender, Eye Color
Qualitative	Ordinal	Categories with order	Ratings, Grades
Quantitative	Discrete	Countable numbers	Students, Cars
Quantitative	Continuous	Measurable values	Height, Time

📝 Why Understanding Data Types Is Important?

Because it affects everything in Data Science:

✔ What type of chart you will use
✔ Which statistical test is valid
✔ Which ML model works best
✔ How you preprocess/clean the data

For example:

Nominal → One-hot encoding
Ordinal → Label encoding
Continuous → Standardization/Normalization
Discrete → No scaling needed sometimes

🎯 Real Example: Choosing the Right Method

If you’re predicting House Prices ⬇

Area in sq. ft → Continuous
Number of bedrooms → Discrete
Location → Nominal
Condition (poor/average/good) → Ordinal

The type determines how you handle each feature.

🌟 Conclusion

Understanding data types is the first and most essential step in Data Science.
Once you get this right, every other concept — visualization, encoding, modeling, statistics — becomes so much easier.

If you’re curious about how this fits into the bigger picture, you can read my post on What is Data Science?.

TechAstra By Darshana

Monday, 24 November 2025

📊 Types of Data in Data Science: A Simple & Clear Guide

🔰 Two Main Types of Data

1️⃣ Qualitative Data (Categorical)

2️⃣ Quantitative Data (Numerical)

🎨 1. Qualitative Data (Categorical)

🔸 A. Nominal Data

🔸 B. Ordinal Data

🔢 2. Quantitative Data (Numerical)

🔸 A. Discrete Data

🔸 B. Continuous Data

🧩 Putting It All Together (Simple Table)

📝 Why Understanding Data Types Is Important?

🎯 Real Example: Choosing the Right Method

🌟 Conclusion

No comments:

Post a Comment

🏞️ Data Lake vs Data Warehouse vs Lakehouse: Understanding Modern Data Architectures

Labels

Search This Blog

Blog Archive