Introduction
Page 1: Mean of Grouped Data (The Direct Method)
Welcome to the world of Statistics! In Class IX, you learned how to organize data and find its "center" using the mean, median, and mode for simple, ungrouped lists of numbers. But what happens when you're faced with massive amounts of data, like the exam scores of every student in a city? Listing them one by one is impractical. This is where grouped data comes in.
Imagine a cricket analyst trying to understand a batsman's performance over a year. Instead of looking at every single score (7, 45, 102, 33, ...), they group them: "How many times did he score between 0-20 runs?", "How many times between 21-40?", and so on. This gives a much clearer picture. In this chapter, we'll upgrade our skills to find the mean, median, and mode for exactly this kind of grouped data, starting with the most fundamental measure: the mean.
{{FORMULA: expr=x̄ = (Σfᵢxᵢ) / (Σfᵢ) | symbols=x̄:Mean, Σ:Summation, fᵢ:Frequency of the i-th class, xᵢ:Class mark of the i-th class}}
Definitions & Formulas
Before we dive in, let's clarify the key terms we'll be using for grouped data.
| Variable / Term | Meaning |
|---|---|
| Class Interval | The range into which raw data is grouped (e.g., 10-20, 20-30). |
| Lower Class Limit | The smallest data value that can go into a class. For 10-20, it's 10. |
| Upper Class Limit | The largest data value that can go into a class. For 10-20, it's 20. |
Frequency (fᵢ) | The number of observations that fall within a specific class interval. |
Class Mark (xᵢ) | The midpoint of a class interval. This value represents the entire class. |
Σ (Sigma) | A Greek letter used in mathematics to denote summation or "adding up". |
Mean (x̄) | The average of the data set, calculated for grouped data. |
The Logic: From Ungrouped to Grouped Mean
Finding the mean seems simple: add everything up and divide. But how do you "add up" data when it's in ranges like "40-55"? We can't use the range itself. The logic behind the Direct Method involves a clever and crucial assumption.
-
Recall the Ungrouped Mean: For a simple frequency table where each value
xappearsftimes, the mean is the sum of all(f × x)products, divided by the total number of observations (sum of allf).Mean x̄ = (f₁x₁ + f₂x₂ + ... + fₙxₙ) / (f₁ + f₂ + ... + fₙ) = (Σfᵢxᵢ) / (Σfᵢ) -
The Grouped Data Problem: In a grouped frequency table (e.g., Table 13.2 in your textbook), we have class intervals (like 40-55) and their frequencies (e.g., 7 students). We know 7 students scored between 40 and 55, but we don't know their exact marks.
{{VISUAL: chart: A side-by-side comparison showing a bar graph for ungrouped marks (discrete bars at 10, 20, 36 etc.) and a histogram for the same marks grouped into class intervals like 10-25, 25-40 (adjacent bars).}}
-
The Mid-Point Solution: To solve this, we need a single value to represent each class interval. The most logical choice is the midpoint, which we call the class mark (
xᵢ).Class Mark (xᵢ) = (Upper Class Limit + Lower Class Limit) / 2 -
The Core Assumption: We assume that the frequency of each class is centered around its class mark. For the 40-55 interval with 7 students, we pretend all 7 students scored exactly at the midpoint, which is (40+55)/2 = 47.5. This is an approximation, but it's a very effective one for large datasets.
{{VISUAL: diagram: A number line showing a class interval from 40 to 55, with its midpoint, the class mark 47.5, clearly labeled. An arrow points to 47.5 with the text "Representative value (xᵢ) for all 7 students in this class".}}
-
Applying the Formula: Now that we have a representative
xᵢ(the class mark) for each frequencyfᵢ, we can use the same formula as the ungrouped mean. We are simply replacing the individual observations with the class marks.Mean x̄ = (Σfᵢxᵢ) / (Σfᵢ)
This process gives us a very close estimate of the true mean and is known as the Direct Method.
{{KEY: type=concept | title=The Mid-Point Assumption | text=The Direct Method's accuracy relies on the assumption that the values within each class are evenly distributed, making the class mark (mid-point) a fair representative for all data points in that class. The exact mean can only be found from raw, ungrouped data.}}
Solved Examples
Let's apply the Direct Method to a few problems, increasing in difficulty.
Example 1: Finding Mean Pocket Money (Easy)
Given: The daily pocket money of 20 students in a class is recorded in the table below.
| Pocket Money (in ₹) | 10-20 | 20-30 | 30-40 | 40-50 |
|---|---|---|---|---|
Number of Students (fᵢ) | 4 | 5 | 8 | 3 |
To Find: The mean daily pocket money.
Solution:
- First, we need to find the class mark (
xᵢ) for each interval and then calculate the productfᵢxᵢ. Let's create a calculation table.
| Class Interval | Frequency (fᵢ) | Class Mark (xᵢ) | fᵢxᵢ |
|---|---|---|---|
| 10-20 | 4 | (10+20)/2 = 15 | 4 × 15 = 60 |
| 20-30 | 5 | (20+30)/2 = 25 | 5 × 25 = 125 |
| 30-40 | 8 | (30+40)/2 = 35 | 8 × 35 = 280 |
| 40-50 | 3 | (40+50)/2 = 45 | 3 × 45 = 135 |
| Total | Σfᵢ = 20 | Σfᵢxᵢ = 600 |
-
Now, find the sum of frequencies (
Σfᵢ) and the sum of the products (Σfᵢxᵢ).Σfᵢ = 4 + 5 + 8 + 3 = 20Σfᵢxᵢ = 60 + 125 + 280 + 135 = 600 -
Apply the formula for the mean.
x̄ = (Σfᵢxᵢ) / (Σfᵢ) = 600 / 20x̄ = 30
Final Answer:
The mean daily pocket money is ₹30.
Example 2: Literacy Rate Analysis (Medium)
Given: The following table shows the literacy rate (in percentage) of 35 cities.
| Literacy Rate (%) | 45-55 | 55-65 | 65-75 | 75-85 | 85-95 |
|---|---|---|---|---|---|
Number of Cities (fᵢ) | 3 | 10 | 11 | 8 | 3 |
To Find: The mean literacy rate.
Solution:
- Construct a table to calculate class marks (
xᵢ) and the productfᵢxᵢ.
| Literacy Rate (%) | Freq. (fᵢ) | Class Mark (xᵢ) | fᵢxᵢ |
|---|---|---|---|
| 45-55 | 3 | 50 | 3 × 50 = 150 |
| 55-65 | 10 | 60 | 10 × 60 = 600 |
| 65-75 | 11 | 70 | 11 × 70 = 770 |
| 75-85 | 8 | 80 | 8 × 80 = 640 |
| 85-95 | 3 | 90 | 3 × 90 = 270 |
| Total | Σfᵢ = 35 | Σfᵢxᵢ = 2430 |
-
Sum the
fᵢandfᵢxᵢcolumns. We haveΣfᵢ = 35andΣfᵢxᵢ = 2430. -
Use the mean formula.
x̄ = (Σfᵢxᵢ) / (Σfᵢ) = 2430 / 35 -
Perform the division.
x̄ ≈ 69.43
Final Answer:
The mean literacy rate is approximately 69.43%.
Example 3: Handling Inclusive Classes (Hard)
Given: The weights of 50 wrestlers are recorded as follows. Notice the class intervals are inclusive.
| Weight (in kg) | 100-109 | 110-119 | 120-129 | 130-139 | 140-149 |
|---|---|---|---|---|---|
No. of wrestlers (fᵢ) | 4 | 14 | 21 | 8 | 3 |
To Find: The mean weight.
Solution:
-
The data is in an inclusive format (there's a gap between 109 and 110). We must first convert it to an exclusive format (continuous classes) to find the correct class marks. Subtract 0.5 from the lower limit and add 0.5 to the upper limit of each class.
- 100-109 becomes 99.5 - 109.5
- 110-119 becomes 109.5 - 119.5
- ...and so on.
-
Now, create the calculation table with the new continuous class intervals.
{{VISUAL: chart: A visually annotated table for this example, showing the original inclusive classes (100-109), the conversion to exclusive classes (99.5-109.5), the calculation of class marks (xᵢ = 104.5), and the fᵢxᵢ product column, with Σfᵢ and Σfᵢxᵢ totals highlighted.}}
| Original Interval | Corrected Interval | Freq. (fᵢ) | Class Mark (xᵢ) | fᵢxᵢ |
|---|---|---|---|---|
| 100-109 | 99.5 - 109.5 | 4 | 104.5 | 418.0 |
| 110-119 | 109.5 - 119.5 | 14 | 114.5 | 1603.0 |
| 120-129 | 119.5 - 129.5 | 21 | 124.5 | 2614.5 |
| 130-139 | 129.5 - 139.5 | 8 | 134.5 | 1076.0 |
| 140-149 | 139.5 - 149.5 | 3 | 144.5 | 433.5 |
| Total | Σfᵢ = 50 | Σfᵢxᵢ = 6145.0 |
-
Apply the mean formula with the calculated sums.
x̄ = (Σfᵢxᵢ) / (Σfᵢ) = 6145 / 50 -
Calculate the final value.
x̄ = 122.9
Final Answer:
The mean weight of the wrestlers is 122.9 kg.
Example 4: Finding a Missing Frequency (Tricky)
Given: The mean of the following distribution is 50.
| Class | 0-20 | 20-40 | 40-60 | 60-80 | 80-100 |
|---|---|---|---|---|---|
Frequency (fᵢ) | 17 | p | 32 | 24 | 19 |
To Find: The value of the missing frequency p.
Solution:
- Set up the calculation table as usual, but this time we will have the variable
pin our sums.
| Class | Frequency (fᵢ) | Class Mark (xᵢ) | fᵢxᵢ |
|---|---|---|---|
| 0-20 | 17 | 10 | 170 |
| 20-40 | p | 30 | 30p |
| 40-60 | 32 | 50 | 1600 |
| 60-80 | 24 | 70 | 1680 |
| 80-100 | 19 | 90 | 1710 |
| Total | Σfᵢ = 92 + p | Σfᵢxᵢ = 5160 + 30p |
-
Calculate
ΣfᵢandΣfᵢxᵢin terms ofp.Σfᵢ = 17 + p + 32 + 24 + 19 = 92 + pΣfᵢxᵢ = 170 + 30p + 1600 + 1680 + 1710 = 5160 + 30p -
Now, use the mean formula. We are given that
x̄ = 50.x̄ = (Σfᵢxᵢ) / (Σfᵢ)50 = (5160 + 30p) / (92 + p) -
Solve the linear equation for
p.50 × (92 + p) = 5160 + 30p4600 + 50p = 5160 + 30p50p - 30p = 5160 - 460020p = 560p = 560 / 20 = 28
Final Answer:
The value of the missing frequency p is 28.
Tips & Tricks
| Trick | Description |
|---|---|
| Speedy Class Marks | If all class intervals have the same width (h), calculate the first class mark x₁. Then, find the rest by simply adding h: x₂ = x₁ + h, x₃ = x₂ + h, and so on. |
| Calculation Check | Always verify your Σfᵢ value. It should be equal to the total number of observations given in the problem statement (e.g., "35 cities", "50 wrestlers"). |
| Unit Awareness | Always write the correct units in your final answer (e.g., kg, cm, %, ₹). Forgetting units can cost you marks in an exam. |
Common Mistakes
| ❌ Wrong Approach | ✅ Right Approach | Why it's a Mistake |
|---|---|---|
Using lower or upper class limits for xᵢ. <br> E.g., for 10-20, using 10 or 20. | Using the class mark (midpoint) for xᵢ. <br> E.g., for 10-20, use xᵢ = 15. | The class limits are the boundaries. The class mark is the representative value for the entire interval. |
| Calculating class marks directly from inclusive classes. <br> E.g., for 100-109, using (100+109)/2 = 104.5. | First converting inclusive classes to exclusive (99.5-109.5), then finding the class mark. (99.5+109.5)/2 = 104.5. | While the value is coincidentally the same here, it's conceptually wrong and can fail in complex cases. Always establish continuity first. |
Making a single error in one fᵢxᵢ calculation. | Double-checking each multiplication before summing up the Σfᵢxᵢ column. | A small error in one row will make the entire sum Σfᵢxᵢ incorrect, leading to the wrong final mean. Be methodical. |
Forgetting to add the variable part in sums. <br> E.g., Σfᵢ = 92 instead of 92 + p. | Keeping the variable as part of the algebraic sum. <br> E.g., Σfᵢ = 92 + p. | This is a common algebraic slip-up in "missing frequency" problems. The sum depends on the unknown variable. |
Brain-Teaser Questions
-
The mean age of a group of 30 students is 15 years. If the age of their teacher, 46 years, is also included, what will be the new mean age of the group?
💡 Answer: Initial total age = 30 × 15 = 450 years. New total age (with teacher) = 450 + 46 = 496 years. New number of people = 30 + 1 = 31. New mean = 496 / 31 = 16 years.
-
The class marks of a continuous frequency distribution are given as: 22, 28, 34, 40, 46. What is the class size and the class interval corresponding to the class mark 34?
💡 Answer: The class size (
h) is the difference between consecutive class marks: 28 - 22 = 6. For the class mark 34, the interval extendsh/2on both sides.h/2 = 6/2 = 3. Lower Limit = 34 - 3 = 31. Upper Limit = 34 + 3 = 37. The class interval is 31 - 37. -
If the mean of a distribution is 62 (calculated using the Direct Method from grouped data) and the exact mean from the original ungrouped data was 59.3, what could be a possible reason for this difference?
💡 Answer: The difference arises from the "mid-point assumption". The Direct Method assumes all values in a class interval are concentrated at the midpoint (class mark). In reality, the actual data points within each interval might have been skewed towards the lower or upper end, causing the approximate mean (62) to differ from the exact mean (59.3).
Mini Cheatsheet
| Concept | Formula / Rule |
|---|---|
| Mean (x̄) | x̄ = (Σfᵢxᵢ) / (Σfᵢ) |
| Class Mark (xᵢ) | (Upper Limit + Lower Limit) / 2 |
| Class Interval Conversion | For inclusive a-b, convert to exclusive (a-0.5) - (b+0.5) |
| Sum of Frequencies (Σfᵢ) | Represents the total number of observations, N. |
| Direct Method | The method of finding the mean using fᵢ and xᵢ directly. |
Mean of Grouped Data — Direct Method
Mean of Grouped Data — Direct Method
{{FORMULA: expr=x̄ = Σ(fᵢ xᵢ) / Σfᵢ | symbols=x̄:Mean, fᵢ:Frequency of the i-th class, xᵢ:Class mark of the i-th class, Σ:Summation}}
Concept Introduction
Imagine you're the manager of a popular pizza place. At the end of a busy week, you want to know the average amount a customer spends. You have hundreds of bills, but looking at each one is too slow. Instead, you group them into spending ranges: ₹100-200, ₹200-300, ₹300-400, and so on.
Now, how do you find the average? You don't have the exact bill amounts anymore, just the groups. This is where we need a method for finding the mean of grouped data. We make a smart assumption: we treat everyone in a group as if they spent the mid-point amount. For the ₹100-200 group, we assume everyone spent ₹150. This clever shortcut allows us to calculate a very close estimate of the average spending, giving you valuable business insights quickly.
Definitions & Formulas
Before we jump into calculations, let's be crystal clear on the terms we'll be using. These are the building blocks for finding the mean of any grouped data.
| Variable / Term | Meaning |
|---|---|
| Class Interval | The range into which data is grouped, e.g., 10-25, 25-40. It has a Lower Class Limit and an Upper Class Limit. |
Frequency (fᵢ) | The number of observations that fall into a specific class interval. |
Class Mark (xᵢ) | The mid-point of a class interval. It acts as the representative value for all data points in that class. |
Summation (Σ) | The Greek letter Sigma, which means 'to sum up'. Σfᵢ means summing up all the frequencies. |
Mean (x̄) | The average of the data set. For grouped data, it's an estimated or approximate mean. |
The two key formulas for this method are:
-
Finding the Class Mark (
xᵢ):Class Mark = (Upper Class Limit + Lower Class Limit) / 2 -
Finding the Mean (
x̄) using the Direct Method:x̄ = (Σfᵢ xᵢ) / (Σfᵢ)
The Logic Behind the Method
Why does this method work? It's all based on one central assumption. When we group data, we lose the original individual values. To overcome this, we assume that all data points within a class are centered around its mid-point.
Here's the step-by-step logic:
-
The Problem: We have data in ranges (class intervals), not as individual points. We can't add up the original values because we don't know them anymore.
-
The Assumption: We need a single value to represent each class. The most logical choice is the middle value, the class mark (
xᵢ). We assume that the frequency (fᵢ) for each class is centered around this class mark.{{VISUAL: diagram: A number line from 10 to 25. A large dot is placed exactly in the middle at 17.5, with a label "Class Mark (xᵢ)". Text below reads: "All values in the 10-25 interval are represented by 17.5."}}
-
The Transformation: By calculating the class mark for each interval, we transform the grouped data into a format that looks like ungrouped frequency data. The class marks (
xᵢ) become our new "observations". -
Applying the Old Formula: Now we can use the familiar formula for the mean of a frequency distribution:
Mean = (Sum of all observations) / (Total number of observations). -
The sum of all observations is estimated by
f₁x₁ + f₂x₂ + ... + fₙxₙ, which we write asΣfᵢxᵢ. -
The total number of observations is simply the sum of all frequencies:
f₁ + f₂ + ... + fₙ, which we write asΣfᵢ. -
The Result: Combining these gives us the Direct Method formula for the mean of grouped data:
x̄ = Σ(fᵢ xᵢ) / Σfᵢ.
{{KEY: type=concept | title=The Mid-Point Assumption | text=The Direct Method's accuracy depends on the assumption that frequencies are centered around the class mark. This is why the calculated mean is an approximation of the true mean, but it's a very effective and widely used one.}}
Solved Examples
Let's put this into practice. We'll start easy and gradually increase the difficulty.
Example 1: Basic Calculation (Easy)
Given: The distribution of marks obtained by 40 students in a science test.
| Marks | 0-10 | 10-20 | 20-30 | 30-40 | 40-50 |
|---|---|---|---|---|---|
| No. of Students | 5 | 8 | 15 | 7 | 5 |
To Find: The mean marks of the students using the direct method.
Solution:
-
First, we need to construct a table with columns for Class Interval, Frequency (
fᵢ), Class Mark (xᵢ), and the productfᵢxᵢ. -
Calculate the Class Mark (
xᵢ) for each interval. For0-10,x₁ = (0+10)/2 = 5. For10-20,x₂ = (10+20)/2 = 15, and so on. -
Calculate the product
fᵢxᵢfor each row. For the first row,f₁x₁ = 5 × 5 = 25. -
Complete the table and find the sums
ΣfᵢandΣfᵢxᵢ.Marks (Class Interval) No. of Students (fᵢ) Class Mark (xᵢ) fᵢxᵢ 0 - 10 5 5 25 10 - 20 8 15 120 20 - 30 15 25 375 30 - 40 7 35 245 40 - 50 5 45 225 Total Σfᵢ = 40 Σfᵢxᵢ = 990 -
Apply the mean formula.
x̄ = Σfᵢxᵢ / Σfᵢx̄ = 990 / 40x̄ = 24.75
Final Answer:
The mean marks obtained by the students is 24.75.
Example 2: Data with Decimal Class Marks (Medium)
Given: The daily wages of 50 workers in a factory.
| Daily Wages (in ₹) | 100-120 | 120-140 | 140-160 | 160-180 | 180-200 |
|---|---|---|---|---|---|
| Number of Workers | 12 | 14 | 8 | 6 | 10 |
To Find: The mean daily wages of the workers.
Solution:
-
Set up the calculation table. The class marks will be whole numbers here, but the calculation involves larger numbers.
-
Calculate the class mark for each interval. For
100-120,x₁ = (100+120)/2 = 110. -
Calculate the product
fᵢxᵢfor each row. For the first row,f₁x₁ = 12 × 110 = 1320. -
Fill the table and find the sums.
Daily Wages (₹) Number of Workers (fᵢ) Class Mark (xᵢ) fᵢxᵢ 100 - 120 12 110 1320 120 - 140 14 130 1820 140 - 160 8 150 1200 160 - 180 6 170 1020 180 - 200 10 190 1900 Total Σfᵢ = 50 Σfᵢxᵢ = 7260 -
Use the direct method formula.
x̄ = Σfᵢxᵢ / Σfᵢx̄ = 7260 / 50x̄ = 145.2
Final Answer:
The mean daily wage of the workers is ₹145.20.
Example 3: Finding a Missing Frequency (Hard)
Given: The mean of the following distribution is 18. The data shows the daily pocket allowance of children in a locality.
| Daily Allowance (₹) | 11-13 | 13-15 | 15-17 | 17-19 | 19-21 | 21-23 | 23-25 |
|---|---|---|---|---|---|---|---|
| Number of Children | 7 | 6 | 9 | 13 | f | 5 | 4 |
To Find: The value of the missing frequency, f.
Solution:
-
In this problem, the mean
x̄is given as 18. We need to work backwards to findf. First, let's set up our table as usual. -
Calculate class marks (
xᵢ) and the productsfᵢxᵢ.Daily Allowance (₹) Frequency (fᵢ) Class Mark (xᵢ) fᵢxᵢ 11 - 13 7 12 84 13 - 15 6 14 84 15 - 17 9 16 144 17 - 19 13 18 234 19 - 21 f 20 20f 21 - 23 5 22 110 23 - 25 4 24 96 -
Now, find the sums
ΣfᵢandΣfᵢxᵢin terms off.Σfᵢ = 7 + 6 + 9 + 13 + f + 5 + 4 = 44 + fΣfᵢxᵢ = 84 + 84 + 144 + 234 + 20f + 110 + 96 = 752 + 20f -
Substitute these sums and the given mean into the formula.
x̄ = Σfᵢxᵢ / Σfᵢ18 = (752 + 20f) / (44 + f) -
Now, solve the linear equation for
f.18 × (44 + f) = 752 + 20f792 + 18f = 752 + 20f792 - 752 = 20f - 18f40 = 2ff = 20
Final Answer:
The value of the missing frequency (f) is 20.
Example 4: Unequal Class Sizes (Tricky)
Given: The age distribution of patients treated in a hospital on a particular day.
| Age (in years) | 5-15 | 15-25 | 25-35 | 35-45 | 45-65 |
|---|---|---|---|---|---|
| No. of Patients | 6 | 11 | 21 | 23 | 14 |
To Find: The mean age of the patients.
Solution:
-
Notice that the class sizes are not equal. The first four classes have a size of 10 (
15-5=10), but the last class45-65has a size of 20. The Direct Method works perfectly fine even with unequal class sizes. -
We follow the exact same procedure. Set up the table.
-
Calculate the class marks (
xᵢ) and products (fᵢxᵢ).Age (years) No. of Patients (fᵢ) Class Mark (xᵢ) fᵢxᵢ 5 - 15 6 10 60 15 - 25 11 20 220 25 - 35 21 30 630 35 - 45 23 40 920 45 - 65 14 55 770 Total Σfᵢ = 75 Σfᵢxᵢ = 2600 -
Calculate the mean using the sums.
x̄ = Σfᵢxᵢ / Σfᵢx̄ = 2600 / 75x̄ = 34.666...
Final Answer:
The mean age of the patients is approximately 34.67 years.
Tips & Tricks
Working smart is just as important as working hard. Here are a few tricks to make your calculations faster and more accurate.
| Tip | Description |
|---|---|
| 1. Organize with a Table | Always create a 4-column table: Class Interval, fᵢ, xᵢ, fᵢxᵢ. This structure minimizes calculation errors and keeps your work neat and easy to review. |
| 2. Spot the Pattern in Class Marks | If the class sizes are equal, the class marks will form an arithmetic progression. Calculate the first two (x₁, x₂) and then just keep adding the class size to get the rest. This saves you from repeatedly calculating the average. |
| 3. Handle Large Numbers Carefully | The Direct Method can become tedious with large fᵢ and xᵢ values. Double-check your multiplication. A single error in an fᵢxᵢ value will lead to an incorrect final answer. |
Common Mistakes
Many students make similar small mistakes that can cost them marks. Here’s what to watch out for.
| ❌ Wrong Method | ✅ Right Method |
|---|---|
Using the upper or lower class limit for xᵢ. For class 10-20, using 10 or 20. | Always use the class mark (mid-point). For 10-20, xᵢ = (10+20)/2 = 15. |
| Dividing by the number of class intervals (e.g., 5 rows in the table). | Always divide by the total frequency (Σfᵢ), which is the sum of all observations. |
Calculating Σfᵢ × Σxᵢ instead of Σ(fᵢxᵢ). | First multiply fᵢ and xᵢ for each row, then sum up that entire column to get Σ(fᵢxᵢ). |
Forgetting a missing frequency (f) in the sum Σfᵢ. | Remember to include the variable in the total sum, e.g., Σfᵢ = 44 + f. |
Brain-Teaser Questions
Test your understanding with these slightly challenging problems.
-
The mean weight of a class of 30 students is 45 kg. If the weight of the teacher, who weighs 75 kg, is included, what is the new mean weight of the group?
💡 Answer: Total weight of students = 30 × 45 = 1350 kg. New total weight (with teacher) = 1350 + 75 = 1425 kg. New total number of people = 30 + 1 = 31. New mean = 1425 / 31 ≈ 45.97 kg.
-
The class marks of a continuous distribution are: 22, 28, 34, 40, 46. What is the class size and what are the class intervals?
💡 Answer: The class size (
h) is the difference between consecutive class marks:h = 28 - 22 = 6. The class limits are halfway between the class marks. For the first class mark 22, the lower limit is22 - h/2 = 22 - 3 = 19and the upper limit is22 + h/2 = 22 + 3 = 25. The class intervals are: 19-25, 25-31, 31-37, 37-43, 43-49. -
If the mean of a frequency distribution is 8.1 and
Σfᵢxᵢ = 132 + 5k, andΣfᵢ = 20, find the value ofk.💡 Answer: We use the formula
x̄ = Σfᵢxᵢ / Σfᵢ.8.1 = (132 + 5k) / 208.1 × 20 = 132 + 5k162 = 132 + 5k162 - 132 = 5k30 = 5kk = 6.
Mini Cheatsheet
Here's a quick summary of everything on this page. Screenshot this for your last-minute revision!
| Concept | Formula / Definition |
|---|---|
| Mean (Direct Method) | x̄ = Σ(fᵢ xᵢ) / Σfᵢ |
Class Mark (xᵢ) | (Upper Class Limit + Lower Class Limit) / 2 |
Frequency (fᵢ) | Number of observations in a class. |
Σfᵢ | Total number of observations. |
Σfᵢxᵢ | Sum of the products of frequency and class mark for all classes. |
Mean of Grouped Data — Assumed Mean Method
{{FORMULA: expr=x̄ = a + (Σfᵢdᵢ / Σfᵢ) | symbols=x̄:Mean, a:Assumed Mean, fᵢ:Frequency, dᵢ:Deviation (xᵢ - a)}}
Mean of Grouped Data — Assumed Mean Method
Concept Introduction
In the previous section, we learned the Direct Method to find the mean of grouped data. It works perfectly, but what happens when the data involves very large numbers? Imagine calculating the average monthly salary for 5,000 employees in a factory, where salaries range from ₹18,575 to ₹85,250.
Calculating the product of class marks (xᵢ) and frequencies (fᵢ) would be tedious, time-consuming, and prone to errors. 150 × 45750.50 is not a calculation you want to do by hand!
To simplify this, we use the Assumed Mean Method. The core idea is simple: we assume a convenient value as the mean (usually from the middle of the data), calculate how far off our data points are from this assumption, find the average of these "errors" (deviations), and then adjust our original assumption. It's a clever shortcut to tame large numbers.
Definitions & Formulas
This method introduces a few new terms, primarily the concept of deviation.
| Variable | Meaning | Description |
|---|---|---|
xᵢ | Class Mark | The mid-point of a class interval. Calculated as (Upper Limit + Lower Limit) / 2. |
fᵢ | Frequency | The number of observations in a particular class interval. |
a | Assumed Mean | A value, usually one of the middle xᵢ values, chosen to simplify calculations. |
dᵢ | Deviation | The difference between each class mark and the assumed mean. dᵢ = xᵢ - a. |
Σfᵢdᵢ | Sum of Products | The sum of the products of frequency and deviation for each class. |
Σfᵢ | Total Frequency | The sum of all frequencies, also denoted by N. |
x̄ | Mean | The arithmetic mean of the data. |
The Logic Behind the Formula
Why does this method work? It's a simple algebraic rearrangement of the Direct Method formula. Let's see how.
-
We know the formula for the mean by the Direct Method is:
x̄ = (Σfᵢxᵢ) / (Σfᵢ) -
We define a new term, deviation
dᵢ, for each class markxᵢfrom an assumed meana.dᵢ = xᵢ - a -
This means we can express any class mark
xᵢin terms of the assumed mean and its deviation.xᵢ = a + dᵢ -
Now, let's substitute this expression for
xᵢback into the Direct Method formula.x̄ = (Σfᵢ(a + dᵢ)) / (Σfᵢ) -
Using the distributive property of summation, we can split the numerator.
x̄ = (Σ(fᵢa) + Σ(fᵢdᵢ)) / (Σfᵢ) -
Since
ais a constant,Σ(fᵢa)is the same asa × (Σfᵢ).x̄ = (a × (Σfᵢ) + Σfᵢdᵢ) / (Σfᵢ) -
Finally, we can separate this into two fractions.
x̄ = (a × Σfᵢ / Σfᵢ) + (Σfᵢdᵢ / Σfᵢ) -
The
Σfᵢterms cancel out in the first fraction, leaving us with the final formula for the Assumed Mean Method.x̄ = a + (Σfᵢdᵢ / Σfᵢ)
{{KEY: type=concept | title=Mean of Deviations | text=The term Σfᵢdᵢ / Σfᵢ is the mean of the deviations. So, the formula is simply: Actual Mean = Assumed Mean + Mean of Deviations. This shows how we are "correcting" our initial assumption.}}
Solved Examples
Example 1: Easy Start
Given: The distribution of daily wages of 50 workers in a factory.
| Daily Wages (in ₹) | 100-120 | 120-140 | 140-160 | 160-180 | 180-200 |
|---|---|---|---|---|---|
Number of Workers (fᵢ) | 12 | 14 | 8 | 6 | 10 |
To Find: The mean daily wages using the Assumed Mean method.
Solution:
-
First, we create a table to find the class mark (
xᵢ) for each interval. The class mark is the midpoint:(Lower Limit + Upper Limit) / 2. -
Let's choose an assumed mean (
a). A good choice is one of the middlexᵢvalues. Let's takea = 150. -
Next, we calculate the deviation
dᵢ = xᵢ - afor each class. -
Then, we find the product
fᵢdᵢfor each class. -
Finally, we sum the
fᵢandfᵢdᵢcolumns.
| Class Interval | fᵢ | xᵢ | dᵢ = xᵢ - 150 | fᵢdᵢ |
|---|---|---|---|---|
| 100-120 | 12 | 110 | -40 | -480 |
| 120-140 | 14 | 130 | -20 | -280 |
| 140-160 | 8 | 150 | 0 | 0 |
| 160-180 | 6 | 170 | 20 | 120 |
| 180-200 | 10 | 190 | 40 | 400 |
| Total | Σfᵢ = 50 | Σfᵢdᵢ = -240 |
- Now, we apply the Assumed Mean formula.
x̄ = a + (Σfᵢdᵢ / Σfᵢ)x̄ = 150 + (-240 / 50)x̄ = 150 - 4.8x̄ = 145.2
Final Answer:
The mean daily wages are ₹145.20.
Example 2: Medium (NCERT Context)
Given: The marks obtained by 30 students of Class X, condensed into grouped data.
| Class Interval | 10-25 | 25-40 | 40-55 | 55-70 | 70-85 | 85-100 |
|---|---|---|---|---|---|---|
Number of Students (fᵢ) | 2 | 3 | 7 | 6 | 6 | 6 |
To Find: The mean marks using the Assumed Mean method.
Solution:
- We prepare the calculation table. Let's follow the NCERT example and choose
a = 47.5as the assumed mean.
| Class Interval | fᵢ | xᵢ | dᵢ = xᵢ - 47.5 | fᵢdᵢ |
|---|---|---|---|---|
| 10-25 | 2 | 17.5 | -30 | -60 |
| 25-40 | 3 | 32.5 | -15 | -45 |
| 40-55 | 7 | 47.5 | 0 | 0 |
| 55-70 | 6 | 62.5 | 15 | 90 |
| 70-85 | 6 | 77.5 | 30 | 180 |
| 85-100 | 6 | 92.5 | 45 | 270 |
| Total | Σfᵢ = 30 | Σfᵢdᵢ = 435 |
-
Now, we use the formula
x̄ = a + (Σfᵢdᵢ / Σfᵢ).x̄ = 47.5 + (435 / 30) -
Calculate the mean of deviations.
435 / 30 = 14.5 -
Add this to the assumed mean.
x̄ = 47.5 + 14.5x̄ = 62
Final Answer:
The mean marks obtained by the students is 62.
Example 3: Hard (Larger Numbers)
Given: The distribution of the monthly expenditure of 200 families in a locality.
| Expenditure (in ₹) | 1000-1500 | 1500-2000 | 2000-2500 | 2500-3000 | 3000-3500 | 3500-4000 | 4000-4500 | 4500-5000 |
|---|---|---|---|---|---|---|---|---|
No. of Families (fᵢ) | 24 | 40 | 33 | 28 | 30 | 22 | 16 | 7 |
To Find: The mean monthly expenditure.
Solution:
