When you start learning Data Science or Statistics, one of the first concepts you come across is Types of Data.
This foundation decides which graphs to use, which statistical tests are valid, and which ML algorithms will work best.
So let’s break it down in a very simple way — the same way I understood it during my Data Science coursework.
π° Two Main Types of Data
All data you deal with falls under two big buckets:
1️⃣ Qualitative Data (Categorical)
Non-numerical data — describes qualities, labels, or categories.
2️⃣ Quantitative Data (Numerical)
Data measured using numbers — describes quantity or amount.
Let’s understand each one easily.
π¨ 1. Qualitative Data (Categorical)
This data represents categories, labels, or names.
You cannot do mathematical operations on it (like addition or average).
Qualitative data is of two types:
πΈ A. Nominal Data
✔ Labels with no order
✔ Categories are equal
✔ Only classification is possible
Examples:
-
Gender (Male/Female/Other)
-
Nationality (Indian, American…)
-
Eye Color
-
Marital Status
-
Mode of Transport
π You cannot say one is “higher” or “lower” — only categories.
πΈ B. Ordinal Data
✔ Labels with a meaningful order
✔ But difference between them is not measurable
Examples:
-
Customer satisfaction rating (1–5)
-
Education level (Primary → Secondary → Graduate → Postgraduate)
-
Letter grades (A, B, C…)
-
Rankings (1st, 2nd, 3rd)
π You know the order, but you don’t know how big the difference is.
π’ 2. Quantitative Data (Numerical)
Data that represents numbers we can measure, calculate, or compare.
This is divided into two types:
πΈ A. Discrete Data
✔ Whole numbers only
✔ Counts, not measurements
✔ Cannot have decimals
Examples:
-
Number of students in a class
-
Number of employees
-
Number of vehicles
-
Number of products sold
π Always countable.
πΈ B. Continuous Data
✔ Can take any value (decimals allowed)
✔ Measurements
✔ More precise than discrete data
Examples:
-
Height, weight
-
Time taken to finish a task
-
Speed of a vehicle
-
Temperature
-
Market share price
π Values fall anywhere within a range.
π§© Putting It All Together (Simple Table)
| Type | Sub-Type | Meaning | Examples |
|---|---|---|---|
| Qualitative | Nominal | Categories without order | Gender, Eye Color |
| Qualitative | Ordinal | Categories with order | Ratings, Grades |
| Quantitative | Discrete | Countable numbers | Students, Cars |
| Quantitative | Continuous | Measurable values | Height, Time |
π Why Understanding Data Types Is Important?
Because it affects everything in Data Science:
✔ What type of chart you will use
✔ Which statistical test is valid
✔ Which ML model works best
✔ How you preprocess/clean the data
For example:
-
Nominal → One-hot encoding
-
Ordinal → Label encoding
-
Continuous → Standardization/Normalization
-
Discrete → No scaling needed sometimes
π― Real Example: Choosing the Right Method
If you’re predicting House Prices ⬇
-
Area in sq. ft → Continuous
-
Number of bedrooms → Discrete
-
Location → Nominal
-
Condition (poor/average/good) → Ordinal
The type determines how you handle each feature.
π Conclusion
Understanding data types is the first and most essential step in Data Science.
Once you get this right, every other concept — visualization, encoding, modeling, statistics — becomes so much easier.
If you’re curious about how this fits into the bigger picture, you can read my post on What is Data Science?.

No comments:
Post a Comment