CBSE Class 10 Mathematics

Ch 13: Statistics

5 sections AI-powered notes
GET THE FULL EXPERIENCE

This is the chapter notes. Students get the interactive version.

  • Ask Aarav Sir anything — instant voice + chat doubts
  • Interactive lessons with audio narration + visual diagrams
  • Study Lab — paste any photo, PDF, or YouTube link to get it explained

Introduction

Page 1: Mean of Grouped Data (The Direct Method)

Welcome to the world of Statistics! In Class IX, you learned how to organize data and find its "center" using the mean, median, and mode for simple, ungrouped lists of numbers. But what happens when you're faced with massive amounts of data, like the exam scores of every student in a city? Listing them one by one is impractical. This is where grouped data comes in.

Imagine a cricket analyst trying to understand a batsman's performance over a year. Instead of looking at every single score (7, 45, 102, 33, ...), they group them: "How many times did he score between 0-20 runs?", "How many times between 21-40?", and so on. This gives a much clearer picture. In this chapter, we'll upgrade our skills to find the mean, median, and mode for exactly this kind of grouped data, starting with the most fundamental measure: the mean.

{{FORMULA: expr=x̄ = (Σfᵢxᵢ) / (Σfᵢ) | symbols=x̄:Mean, Σ:Summation, fᵢ:Frequency of the i-th class, xᵢ:Class mark of the i-th class}}


Definitions & Formulas

Before we dive in, let's clarify the key terms we'll be using for grouped data.

Variable / TermMeaning
Class IntervalThe range into which raw data is grouped (e.g., 10-20, 20-30).
Lower Class LimitThe smallest data value that can go into a class. For 10-20, it's 10.
Upper Class LimitThe largest data value that can go into a class. For 10-20, it's 20.
Frequency (fᵢ)The number of observations that fall within a specific class interval.
Class Mark (xᵢ)The midpoint of a class interval. This value represents the entire class.
Σ (Sigma)A Greek letter used in mathematics to denote summation or "adding up".
Mean ()The average of the data set, calculated for grouped data.

The Logic: From Ungrouped to Grouped Mean

Finding the mean seems simple: add everything up and divide. But how do you "add up" data when it's in ranges like "40-55"? We can't use the range itself. The logic behind the Direct Method involves a clever and crucial assumption.

  1. Recall the Ungrouped Mean: For a simple frequency table where each value x appears f times, the mean is the sum of all (f × x) products, divided by the total number of observations (sum of all f).

    Mean x̄ = (f₁x₁ + f₂x₂ + ... + fₙxₙ) / (f₁ + f₂ + ... + fₙ) = (Σfᵢxᵢ) / (Σfᵢ)
    
  2. The Grouped Data Problem: In a grouped frequency table (e.g., Table 13.2 in your textbook), we have class intervals (like 40-55) and their frequencies (e.g., 7 students). We know 7 students scored between 40 and 55, but we don't know their exact marks.

{{VISUAL: chart: A side-by-side comparison showing a bar graph for ungrouped marks (discrete bars at 10, 20, 36 etc.) and a histogram for the same marks grouped into class intervals like 10-25, 25-40 (adjacent bars).}}

  1. The Mid-Point Solution: To solve this, we need a single value to represent each class interval. The most logical choice is the midpoint, which we call the class mark (xᵢ).

    Class Mark (xᵢ) = (Upper Class Limit + Lower Class Limit) / 2
    
  2. The Core Assumption: We assume that the frequency of each class is centered around its class mark. For the 40-55 interval with 7 students, we pretend all 7 students scored exactly at the midpoint, which is (40+55)/2 = 47.5. This is an approximation, but it's a very effective one for large datasets.

{{VISUAL: diagram: A number line showing a class interval from 40 to 55, with its midpoint, the class mark 47.5, clearly labeled. An arrow points to 47.5 with the text "Representative value (xᵢ) for all 7 students in this class".}}

  1. Applying the Formula: Now that we have a representative xᵢ (the class mark) for each frequency fᵢ, we can use the same formula as the ungrouped mean. We are simply replacing the individual observations with the class marks.

    Mean x̄ = (Σfᵢxᵢ) / (Σfᵢ)
    

This process gives us a very close estimate of the true mean and is known as the Direct Method.

{{KEY: type=concept | title=The Mid-Point Assumption | text=The Direct Method's accuracy relies on the assumption that the values within each class are evenly distributed, making the class mark (mid-point) a fair representative for all data points in that class. The exact mean can only be found from raw, ungrouped data.}}


Solved Examples

Let's apply the Direct Method to a few problems, increasing in difficulty.

Example 1: Finding Mean Pocket Money (Easy)

Given: The daily pocket money of 20 students in a class is recorded in the table below.

Pocket Money (in ₹)10-2020-3030-4040-50
Number of Students (fᵢ)4583

To Find: The mean daily pocket money.

Solution:

  1. First, we need to find the class mark (xᵢ) for each interval and then calculate the product fᵢxᵢ. Let's create a calculation table.
Class IntervalFrequency (fᵢ)Class Mark (xᵢ)fᵢxᵢ
10-204(10+20)/2 = 154 × 15 = 60
20-305(20+30)/2 = 255 × 25 = 125
30-408(30+40)/2 = 358 × 35 = 280
40-503(40+50)/2 = 453 × 45 = 135
TotalΣfᵢ = 20Σfᵢxᵢ = 600
  1. Now, find the sum of frequencies (Σfᵢ) and the sum of the products (Σfᵢxᵢ).

    Σfᵢ = 4 + 5 + 8 + 3 = 20
    
    Σfᵢxᵢ = 60 + 125 + 280 + 135 = 600
    
  2. Apply the formula for the mean.

    x̄ = (Σfᵢxᵢ) / (Σfᵢ) = 600 / 20
    
    x̄ = 30
    

Final Answer:

The mean daily pocket money is ₹30.

Example 2: Literacy Rate Analysis (Medium)

Given: The following table shows the literacy rate (in percentage) of 35 cities.

Literacy Rate (%)45-5555-6565-7575-8585-95
Number of Cities (fᵢ)3101183

To Find: The mean literacy rate.

Solution:

  1. Construct a table to calculate class marks (xᵢ) and the product fᵢxᵢ.
Literacy Rate (%)Freq. (fᵢ)Class Mark (xᵢ)fᵢxᵢ
45-553503 × 50 = 150
55-65106010 × 60 = 600
65-75117011 × 70 = 770
75-858808 × 80 = 640
85-953903 × 90 = 270
TotalΣfᵢ = 35Σfᵢxᵢ = 2430
  1. Sum the fᵢ and fᵢxᵢ columns. We have Σfᵢ = 35 and Σfᵢxᵢ = 2430.

  2. Use the mean formula.

    x̄ = (Σfᵢxᵢ) / (Σfᵢ) = 2430 / 35
    
  3. Perform the division.

    x̄ ≈ 69.43
    

Final Answer:

The mean literacy rate is approximately 69.43%.

Example 3: Handling Inclusive Classes (Hard)

Given: The weights of 50 wrestlers are recorded as follows. Notice the class intervals are inclusive.

Weight (in kg)100-109110-119120-129130-139140-149
No. of wrestlers (fᵢ)4142183

To Find: The mean weight.

Solution:

  1. The data is in an inclusive format (there's a gap between 109 and 110). We must first convert it to an exclusive format (continuous classes) to find the correct class marks. Subtract 0.5 from the lower limit and add 0.5 to the upper limit of each class.

    • 100-109 becomes 99.5 - 109.5
    • 110-119 becomes 109.5 - 119.5
    • ...and so on.
  2. Now, create the calculation table with the new continuous class intervals.

{{VISUAL: chart: A visually annotated table for this example, showing the original inclusive classes (100-109), the conversion to exclusive classes (99.5-109.5), the calculation of class marks (xᵢ = 104.5), and the fᵢxᵢ product column, with Σfᵢ and Σfᵢxᵢ totals highlighted.}}

Original IntervalCorrected IntervalFreq. (fᵢ)Class Mark (xᵢ)fᵢxᵢ
100-10999.5 - 109.54104.5418.0
110-119109.5 - 119.514114.51603.0
120-129119.5 - 129.521124.52614.5
130-139129.5 - 139.58134.51076.0
140-149139.5 - 149.53144.5433.5
TotalΣfᵢ = 50Σfᵢxᵢ = 6145.0
  1. Apply the mean formula with the calculated sums.

    x̄ = (Σfᵢxᵢ) / (Σfᵢ) = 6145 / 50
    
  2. Calculate the final value.

    x̄ = 122.9
    

Final Answer:

The mean weight of the wrestlers is 122.9 kg.

Example 4: Finding a Missing Frequency (Tricky)

Given: The mean of the following distribution is 50.

Class0-2020-4040-6060-8080-100
Frequency (fᵢ)17p322419

To Find: The value of the missing frequency p.

Solution:

  1. Set up the calculation table as usual, but this time we will have the variable p in our sums.
ClassFrequency (fᵢ)Class Mark (xᵢ)fᵢxᵢ
0-201710170
20-40p3030p
40-6032501600
60-8024701680
80-10019901710
TotalΣfᵢ = 92 + pΣfᵢxᵢ = 5160 + 30p
  1. Calculate Σfᵢ and Σfᵢxᵢ in terms of p.

    Σfᵢ = 17 + p + 32 + 24 + 19 = 92 + p
    
    Σfᵢxᵢ = 170 + 30p + 1600 + 1680 + 1710 = 5160 + 30p
    
  2. Now, use the mean formula. We are given that x̄ = 50.

    x̄ = (Σfᵢxᵢ) / (Σfᵢ)
    
    50 = (5160 + 30p) / (92 + p)
    
  3. Solve the linear equation for p.

    50 × (92 + p) = 5160 + 30p
    
    4600 + 50p = 5160 + 30p
    
    50p - 30p = 5160 - 4600
    
    20p = 560
    
    p = 560 / 20 = 28
    

Final Answer:

The value of the missing frequency p is 28.

Tips & Tricks

TrickDescription
Speedy Class MarksIf all class intervals have the same width (h), calculate the first class mark x₁. Then, find the rest by simply adding h: x₂ = x₁ + h, x₃ = x₂ + h, and so on.
Calculation CheckAlways verify your Σfᵢ value. It should be equal to the total number of observations given in the problem statement (e.g., "35 cities", "50 wrestlers").
Unit AwarenessAlways write the correct units in your final answer (e.g., kg, cm, %, ₹). Forgetting units can cost you marks in an exam.

Common Mistakes

❌ Wrong Approach✅ Right ApproachWhy it's a Mistake
Using lower or upper class limits for xᵢ. <br> E.g., for 10-20, using 10 or 20.Using the class mark (midpoint) for xᵢ. <br> E.g., for 10-20, use xᵢ = 15.The class limits are the boundaries. The class mark is the representative value for the entire interval.
Calculating class marks directly from inclusive classes. <br> E.g., for 100-109, using (100+109)/2 = 104.5.First converting inclusive classes to exclusive (99.5-109.5), then finding the class mark. (99.5+109.5)/2 = 104.5.While the value is coincidentally the same here, it's conceptually wrong and can fail in complex cases. Always establish continuity first.
Making a single error in one fᵢxᵢ calculation.Double-checking each multiplication before summing up the Σfᵢxᵢ column.A small error in one row will make the entire sum Σfᵢxᵢ incorrect, leading to the wrong final mean. Be methodical.
Forgetting to add the variable part in sums. <br> E.g., Σfᵢ = 92 instead of 92 + p.Keeping the variable as part of the algebraic sum. <br> E.g., Σfᵢ = 92 + p.This is a common algebraic slip-up in "missing frequency" problems. The sum depends on the unknown variable.

Brain-Teaser Questions

  1. The mean age of a group of 30 students is 15 years. If the age of their teacher, 46 years, is also included, what will be the new mean age of the group?

    💡 Answer: Initial total age = 30 × 15 = 450 years. New total age (with teacher) = 450 + 46 = 496 years. New number of people = 30 + 1 = 31. New mean = 496 / 31 = 16 years.

  2. The class marks of a continuous frequency distribution are given as: 22, 28, 34, 40, 46. What is the class size and the class interval corresponding to the class mark 34?

    💡 Answer: The class size (h) is the difference between consecutive class marks: 28 - 22 = 6. For the class mark 34, the interval extends h/2 on both sides. h/2 = 6/2 = 3. Lower Limit = 34 - 3 = 31. Upper Limit = 34 + 3 = 37. The class interval is 31 - 37.

  3. If the mean of a distribution is 62 (calculated using the Direct Method from grouped data) and the exact mean from the original ungrouped data was 59.3, what could be a possible reason for this difference?

    💡 Answer: The difference arises from the "mid-point assumption". The Direct Method assumes all values in a class interval are concentrated at the midpoint (class mark). In reality, the actual data points within each interval might have been skewed towards the lower or upper end, causing the approximate mean (62) to differ from the exact mean (59.3).


Mini Cheatsheet

ConceptFormula / Rule
Mean (x̄)x̄ = (Σfᵢxᵢ) / (Σfᵢ)
Class Mark (xᵢ)(Upper Limit + Lower Limit) / 2
Class Interval ConversionFor inclusive a-b, convert to exclusive (a-0.5) - (b+0.5)
Sum of Frequencies (Σfᵢ)Represents the total number of observations, N.
Direct MethodThe method of finding the mean using fᵢ and xᵢ directly.

Mean of Grouped Data — Direct Method

Mean of Grouped Data — Direct Method

{{FORMULA: expr=x̄ = Σ(fᵢ xᵢ) / Σfᵢ | symbols=x̄:Mean, fᵢ:Frequency of the i-th class, xᵢ:Class mark of the i-th class, Σ:Summation}}

Concept Introduction

Imagine you're the manager of a popular pizza place. At the end of a busy week, you want to know the average amount a customer spends. You have hundreds of bills, but looking at each one is too slow. Instead, you group them into spending ranges: ₹100-200, ₹200-300, ₹300-400, and so on.

Now, how do you find the average? You don't have the exact bill amounts anymore, just the groups. This is where we need a method for finding the mean of grouped data. We make a smart assumption: we treat everyone in a group as if they spent the mid-point amount. For the ₹100-200 group, we assume everyone spent ₹150. This clever shortcut allows us to calculate a very close estimate of the average spending, giving you valuable business insights quickly.

Definitions & Formulas

Before we jump into calculations, let's be crystal clear on the terms we'll be using. These are the building blocks for finding the mean of any grouped data.

Variable / TermMeaning
Class IntervalThe range into which data is grouped, e.g., 10-25, 25-40. It has a Lower Class Limit and an Upper Class Limit.
Frequency (fᵢ)The number of observations that fall into a specific class interval.
Class Mark (xᵢ)The mid-point of a class interval. It acts as the representative value for all data points in that class.
Summation (Σ)The Greek letter Sigma, which means 'to sum up'. Σfᵢ means summing up all the frequencies.
Mean ()The average of the data set. For grouped data, it's an estimated or approximate mean.

The two key formulas for this method are:

  1. Finding the Class Mark (xᵢ):

    Class Mark = (Upper Class Limit + Lower Class Limit) / 2
    
  2. Finding the Mean () using the Direct Method:

    x̄ = (Σfᵢ xᵢ) / (Σfᵢ)
    

The Logic Behind the Method

Why does this method work? It's all based on one central assumption. When we group data, we lose the original individual values. To overcome this, we assume that all data points within a class are centered around its mid-point.

Here's the step-by-step logic:

  1. The Problem: We have data in ranges (class intervals), not as individual points. We can't add up the original values because we don't know them anymore.

  2. The Assumption: We need a single value to represent each class. The most logical choice is the middle value, the class mark (xᵢ). We assume that the frequency (fᵢ) for each class is centered around this class mark.

    {{VISUAL: diagram: A number line from 10 to 25. A large dot is placed exactly in the middle at 17.5, with a label "Class Mark (xᵢ)". Text below reads: "All values in the 10-25 interval are represented by 17.5."}}

  3. The Transformation: By calculating the class mark for each interval, we transform the grouped data into a format that looks like ungrouped frequency data. The class marks (xᵢ) become our new "observations".

  4. Applying the Old Formula: Now we can use the familiar formula for the mean of a frequency distribution: Mean = (Sum of all observations) / (Total number of observations).

  5. The sum of all observations is estimated by f₁x₁ + f₂x₂ + ... + fₙxₙ, which we write as Σfᵢxᵢ.

  6. The total number of observations is simply the sum of all frequencies: f₁ + f₂ + ... + fₙ, which we write as Σfᵢ.

  7. The Result: Combining these gives us the Direct Method formula for the mean of grouped data: x̄ = Σ(fᵢ xᵢ) / Σfᵢ.

{{KEY: type=concept | title=The Mid-Point Assumption | text=The Direct Method's accuracy depends on the assumption that frequencies are centered around the class mark. This is why the calculated mean is an approximation of the true mean, but it's a very effective and widely used one.}}

Solved Examples

Let's put this into practice. We'll start easy and gradually increase the difficulty.

Example 1: Basic Calculation (Easy)

Given: The distribution of marks obtained by 40 students in a science test.

Marks0-1010-2020-3030-4040-50
No. of Students581575

To Find: The mean marks of the students using the direct method.

Solution:

  1. First, we need to construct a table with columns for Class Interval, Frequency (fᵢ), Class Mark (xᵢ), and the product fᵢxᵢ.

  2. Calculate the Class Mark (xᵢ) for each interval. For 0-10, x₁ = (0+10)/2 = 5. For 10-20, x₂ = (10+20)/2 = 15, and so on.

  3. Calculate the product fᵢxᵢ for each row. For the first row, f₁x₁ = 5 × 5 = 25.

  4. Complete the table and find the sums Σfᵢ and Σfᵢxᵢ.

    Marks (Class Interval)No. of Students (fᵢ)Class Mark (xᵢ)fᵢxᵢ
    0 - 105525
    10 - 20815120
    20 - 301525375
    30 - 40735245
    40 - 50545225
    TotalΣfᵢ = 40Σfᵢxᵢ = 990
  5. Apply the mean formula.

    x̄ = Σfᵢxᵢ / Σfᵢ
    
    x̄ = 990 / 40
    
    x̄ = 24.75
    

Final Answer:

The mean marks obtained by the students is 24.75.

Example 2: Data with Decimal Class Marks (Medium)

Given: The daily wages of 50 workers in a factory.

Daily Wages (in ₹)100-120120-140140-160160-180180-200
Number of Workers12148610

To Find: The mean daily wages of the workers.

Solution:

  1. Set up the calculation table. The class marks will be whole numbers here, but the calculation involves larger numbers.

  2. Calculate the class mark for each interval. For 100-120, x₁ = (100+120)/2 = 110.

  3. Calculate the product fᵢxᵢ for each row. For the first row, f₁x₁ = 12 × 110 = 1320.

  4. Fill the table and find the sums.

    Daily Wages (₹)Number of Workers (fᵢ)Class Mark (xᵢ)fᵢxᵢ
    100 - 120121101320
    120 - 140141301820
    140 - 16081501200
    160 - 18061701020
    180 - 200101901900
    TotalΣfᵢ = 50Σfᵢxᵢ = 7260
  5. Use the direct method formula.

    x̄ = Σfᵢxᵢ / Σfᵢ
    
    x̄ = 7260 / 50
    
    x̄ = 145.2
    

Final Answer:

The mean daily wage of the workers is ₹145.20.

Example 3: Finding a Missing Frequency (Hard)

Given: The mean of the following distribution is 18. The data shows the daily pocket allowance of children in a locality.

Daily Allowance (₹)11-1313-1515-1717-1919-2121-2323-25
Number of Children76913f54

To Find: The value of the missing frequency, f.

Solution:

  1. In this problem, the mean is given as 18. We need to work backwards to find f. First, let's set up our table as usual.

  2. Calculate class marks (xᵢ) and the products fᵢxᵢ.

    Daily Allowance (₹)Frequency (fᵢ)Class Mark (xᵢ)fᵢxᵢ
    11 - 1371284
    13 - 1561484
    15 - 17916144
    17 - 191318234
    19 - 21f2020f
    21 - 23522110
    23 - 2542496
  3. Now, find the sums Σfᵢ and Σfᵢxᵢ in terms of f.

    Σfᵢ = 7 + 6 + 9 + 13 + f + 5 + 4 = 44 + f
    
    Σfᵢxᵢ = 84 + 84 + 144 + 234 + 20f + 110 + 96 = 752 + 20f
    
  4. Substitute these sums and the given mean into the formula.

    x̄ = Σfᵢxᵢ / Σfᵢ
    
    18 = (752 + 20f) / (44 + f)
    
  5. Now, solve the linear equation for f.

    18 × (44 + f) = 752 + 20f
    
    792 + 18f = 752 + 20f
    
    792 - 752 = 20f - 18f
    
    40 = 2f
    
    f = 20
    

Final Answer:

The value of the missing frequency (f) is 20.

Example 4: Unequal Class Sizes (Tricky)

Given: The age distribution of patients treated in a hospital on a particular day.

Age (in years)5-1515-2525-3535-4545-65
No. of Patients611212314

To Find: The mean age of the patients.

Solution:

  1. Notice that the class sizes are not equal. The first four classes have a size of 10 (15-5=10), but the last class 45-65 has a size of 20. The Direct Method works perfectly fine even with unequal class sizes.

  2. We follow the exact same procedure. Set up the table.

  3. Calculate the class marks (xᵢ) and products (fᵢxᵢ).

    Age (years)No. of Patients (fᵢ)Class Mark (xᵢ)fᵢxᵢ
    5 - 1561060
    15 - 251120220
    25 - 352130630
    35 - 452340920
    45 - 651455770
    TotalΣfᵢ = 75Σfᵢxᵢ = 2600
  4. Calculate the mean using the sums.

    x̄ = Σfᵢxᵢ / Σfᵢ
    
    x̄ = 2600 / 75
    
    x̄ = 34.666...
    

Final Answer:

The mean age of the patients is approximately 34.67 years.

Tips & Tricks

Working smart is just as important as working hard. Here are a few tricks to make your calculations faster and more accurate.

TipDescription
1. Organize with a TableAlways create a 4-column table: Class Interval, fᵢ, xᵢ, fᵢxᵢ. This structure minimizes calculation errors and keeps your work neat and easy to review.
2. Spot the Pattern in Class MarksIf the class sizes are equal, the class marks will form an arithmetic progression. Calculate the first two (x₁, x₂) and then just keep adding the class size to get the rest. This saves you from repeatedly calculating the average.
3. Handle Large Numbers CarefullyThe Direct Method can become tedious with large fᵢ and xᵢ values. Double-check your multiplication. A single error in an fᵢxᵢ value will lead to an incorrect final answer.

Common Mistakes

Many students make similar small mistakes that can cost them marks. Here’s what to watch out for.

❌ Wrong Method✅ Right Method
Using the upper or lower class limit for xᵢ. For class 10-20, using 10 or 20.Always use the class mark (mid-point). For 10-20, xᵢ = (10+20)/2 = 15.
Dividing by the number of class intervals (e.g., 5 rows in the table).Always divide by the total frequency (Σfᵢ), which is the sum of all observations.
Calculating Σfᵢ × Σxᵢ instead of Σ(fᵢxᵢ).First multiply fᵢ and xᵢ for each row, then sum up that entire column to get Σ(fᵢxᵢ).
Forgetting a missing frequency (f) in the sum Σfᵢ.Remember to include the variable in the total sum, e.g., Σfᵢ = 44 + f.

Brain-Teaser Questions

Test your understanding with these slightly challenging problems.

  1. The mean weight of a class of 30 students is 45 kg. If the weight of the teacher, who weighs 75 kg, is included, what is the new mean weight of the group?

    💡 Answer: Total weight of students = 30 × 45 = 1350 kg. New total weight (with teacher) = 1350 + 75 = 1425 kg. New total number of people = 30 + 1 = 31. New mean = 1425 / 31 ≈ 45.97 kg.

  2. The class marks of a continuous distribution are: 22, 28, 34, 40, 46. What is the class size and what are the class intervals?

    💡 Answer: The class size (h) is the difference between consecutive class marks: h = 28 - 22 = 6. The class limits are halfway between the class marks. For the first class mark 22, the lower limit is 22 - h/2 = 22 - 3 = 19 and the upper limit is 22 + h/2 = 22 + 3 = 25. The class intervals are: 19-25, 25-31, 31-37, 37-43, 43-49.

  3. If the mean of a frequency distribution is 8.1 and Σfᵢxᵢ = 132 + 5k, and Σfᵢ = 20, find the value of k.

    💡 Answer: We use the formula x̄ = Σfᵢxᵢ / Σfᵢ. 8.1 = (132 + 5k) / 20 8.1 × 20 = 132 + 5k 162 = 132 + 5k 162 - 132 = 5k 30 = 5k k = 6.

Mini Cheatsheet

Here's a quick summary of everything on this page. Screenshot this for your last-minute revision!

ConceptFormula / Definition
Mean (Direct Method)x̄ = Σ(fᵢ xᵢ) / Σfᵢ
Class Mark (xᵢ)(Upper Class Limit + Lower Class Limit) / 2
Frequency (fᵢ)Number of observations in a class.
ΣfᵢTotal number of observations.
ΣfᵢxᵢSum of the products of frequency and class mark for all classes.

Mean of Grouped Data — Assumed Mean Method

{{FORMULA: expr=x̄ = a + (Σfᵢdᵢ / Σfᵢ) | symbols=x̄:Mean, a:Assumed Mean, fᵢ:Frequency, dᵢ:Deviation (xᵢ - a)}}

Mean of Grouped Data — Assumed Mean Method

Concept Introduction

In the previous section, we learned the Direct Method to find the mean of grouped data. It works perfectly, but what happens when the data involves very large numbers? Imagine calculating the average monthly salary for 5,000 employees in a factory, where salaries range from ₹18,575 to ₹85,250.

Calculating the product of class marks (xᵢ) and frequencies (fᵢ) would be tedious, time-consuming, and prone to errors. 150 × 45750.50 is not a calculation you want to do by hand!

To simplify this, we use the Assumed Mean Method. The core idea is simple: we assume a convenient value as the mean (usually from the middle of the data), calculate how far off our data points are from this assumption, find the average of these "errors" (deviations), and then adjust our original assumption. It's a clever shortcut to tame large numbers.

Definitions & Formulas

This method introduces a few new terms, primarily the concept of deviation.

VariableMeaningDescription
xᵢClass MarkThe mid-point of a class interval. Calculated as (Upper Limit + Lower Limit) / 2.
fᵢFrequencyThe number of observations in a particular class interval.
aAssumed MeanA value, usually one of the middle xᵢ values, chosen to simplify calculations.
dᵢDeviationThe difference between each class mark and the assumed mean. dᵢ = xᵢ - a.
ΣfᵢdᵢSum of ProductsThe sum of the products of frequency and deviation for each class.
ΣfᵢTotal FrequencyThe sum of all frequencies, also denoted by N.
MeanThe arithmetic mean of the data.

The Logic Behind the Formula

Why does this method work? It's a simple algebraic rearrangement of the Direct Method formula. Let's see how.

  1. We know the formula for the mean by the Direct Method is:

    x̄ = (Σfᵢxᵢ) / (Σfᵢ)
    
  2. We define a new term, deviation dᵢ, for each class mark xᵢ from an assumed mean a.

    dᵢ = xᵢ - a
    
  3. This means we can express any class mark xᵢ in terms of the assumed mean and its deviation.

    xᵢ = a + dᵢ
    
  4. Now, let's substitute this expression for xᵢ back into the Direct Method formula.

    x̄ = (Σfᵢ(a + dᵢ)) / (Σfᵢ)
    
  5. Using the distributive property of summation, we can split the numerator.

    x̄ = (Σ(fᵢa) + Σ(fᵢdᵢ)) / (Σfᵢ)
    
  6. Since a is a constant, Σ(fᵢa) is the same as a × (Σfᵢ).

    x̄ = (a × (Σfᵢ) + Σfᵢdᵢ) / (Σfᵢ)
    
  7. Finally, we can separate this into two fractions.

    x̄ = (a × Σfᵢ / Σfᵢ) + (Σfᵢdᵢ / Σfᵢ)
    
  8. The Σfᵢ terms cancel out in the first fraction, leaving us with the final formula for the Assumed Mean Method.

    x̄ = a + (Σfᵢdᵢ / Σfᵢ)
    

{{KEY: type=concept | title=Mean of Deviations | text=The term Σfᵢdᵢ / Σfᵢ is the mean of the deviations. So, the formula is simply: Actual Mean = Assumed Mean + Mean of Deviations. This shows how we are "correcting" our initial assumption.}}

Solved Examples

Example 1: Easy Start

Given: The distribution of daily wages of 50 workers in a factory.

Daily Wages (in ₹)100-120120-140140-160160-180180-200
Number of Workers (fᵢ)12148610

To Find: The mean daily wages using the Assumed Mean method.

Solution:

  1. First, we create a table to find the class mark (xᵢ) for each interval. The class mark is the midpoint: (Lower Limit + Upper Limit) / 2.

  2. Let's choose an assumed mean (a). A good choice is one of the middle xᵢ values. Let's take a = 150.

  3. Next, we calculate the deviation dᵢ = xᵢ - a for each class.

  4. Then, we find the product fᵢdᵢ for each class.

  5. Finally, we sum the fᵢ and fᵢdᵢ columns.

Class Intervalfᵢxᵢdᵢ = xᵢ - 150fᵢdᵢ
100-12012110-40-480
120-14014130-20-280
140-160815000
160-180617020120
180-2001019040400
TotalΣfᵢ = 50Σfᵢdᵢ = -240
  1. Now, we apply the Assumed Mean formula.
    x̄ = a + (Σfᵢdᵢ / Σfᵢ)
    
    x̄ = 150 + (-240 / 50)
    
    x̄ = 150 - 4.8
    
    x̄ = 145.2
    

Final Answer:

The mean daily wages are ₹145.20.

Example 2: Medium (NCERT Context)

Given: The marks obtained by 30 students of Class X, condensed into grouped data.

Class Interval10-2525-4040-5555-7070-8585-100
Number of Students (fᵢ)237666

To Find: The mean marks using the Assumed Mean method.

Solution:

  1. We prepare the calculation table. Let's follow the NCERT example and choose a = 47.5 as the assumed mean.
Class Intervalfᵢxᵢdᵢ = xᵢ - 47.5fᵢdᵢ
10-25217.5-30-60
25-40332.5-15-45
40-55747.500
55-70662.51590
70-85677.530180
85-100692.545270
TotalΣfᵢ = 30Σfᵢdᵢ = 435
  1. Now, we use the formula x̄ = a + (Σfᵢdᵢ / Σfᵢ).

    x̄ = 47.5 + (435 / 30)
    
  2. Calculate the mean of deviations.

    435 / 30 = 14.5
    
  3. Add this to the assumed mean.

    x̄ = 47.5 + 14.5
    
    x̄ = 62
    

Final Answer:

The mean marks obtained by the students is 62.

Example 3: Hard (Larger Numbers)

Given: The distribution of the monthly expenditure of 200 families in a locality.

Expenditure (in ₹)1000-15001500-20002000-25002500-30003000-35003500-40004000-45004500-5000
No. of Families (fᵢ)244033283022167

To Find: The mean monthly expenditure.

Solution:

Stuck on something here?
Aarav Sir explains any part — voice or chat — 24/7.
  1. The numbers are large, so the Assumed Mean method is ideal. Let's create our table. The xᵢ values are 1250, 1750, 2250, 2750, 3250, 3750, 4250, 4750.
  2. Let's choose a middle value for a. We have 8 classes, so either x₄=2750 or x₅=3250 are good choices. Let's pick a = 2750.
Expenditurefᵢxᵢdᵢ = xᵢ - 2750fᵢdᵢ
1000-1500241250-1500-36000
1500-2000401750-1000-40000
2000-2500332250-500-16500
2500-300028275000
3000-350030325050015000
3500-4000223750100022000
4000-4500164250150024000
4500-500074750200014000
TotalΣfᵢ = 200Σfᵢdᵢ = -17500
  1. Apply the formula.

    x̄ = a + (Σfᵢdᵢ / Σfᵢ)
    
    x̄ = 2750 + (-17500 / 200)
    
  2. Simplify the fraction.

    x̄ = 2750 - (175 / 2)
    
    x̄ = 2750 - 87.5
    
    x̄ = 2662.5
    

Final Answer:

The mean monthly expenditure is ₹2662.50.

Example 4: Tricky (Missing Frequency)

Given: The mean of the following distribution is 18.

Class Interval11-1313-1515-1717-1919-2121-2323-25
Frequency (fᵢ)76913p54

To Find: The value of the missing frequency, p.

Solution:

  1. This problem requires us to work backward. We are given x̄ = 18. We'll use the assumed mean method to build an equation involving p.
  2. Let's calculate the class marks (xᵢ): 12, 14, 16, 18, 20, 22, 24.
  3. A perfect choice for the assumed mean a is 18, as it's one of the xᵢ values and is also the given mean. This will simplify calculations. Let a = 18.
Class Intervalfᵢxᵢdᵢ = xᵢ - 18fᵢdᵢ
11-13712-6-42
13-15614-4-24
15-17916-2-18
17-19131800
19-21p2022p
21-23522420
23-25424624
TotalΣfᵢ=44+pΣfᵢdᵢ=2p-40
  1. Now, substitute these sums into the Assumed Mean formula.

    x̄ = a + (Σfᵢdᵢ / Σfᵢ)
    
    18 = 18 + ((2p - 40) / (44 + p))
    
  2. Subtract 18 from both sides.

    0 = (2p - 40) / (44 + p)
    
  3. For a fraction to be zero, its numerator must be zero.

    2p - 40 = 0
    
    2p = 40
    
    p = 20
    

Final Answer:

The value of the missing frequency p is 20.

Tips & Tricks

TechniqueDescription
Choosing 'a' wiselyAlways pick one of the xᵢ values from the middle of the table. This makes one of the dᵢ values zero and balances the positive and negative dᵢ values, keeping Σfᵢdᵢ small and manageable.
Interpreting ΣfᵢdᵢThe sign of Σfᵢdᵢ tells you where the actual mean is relative to your assumed mean. If Σfᵢdᵢ is positive, x̄ > a. If it's negative, x̄ < a. If it's zero, x̄ = a. This is a great quick check.
Common Factor ShortcutIf all your dᵢ values have a common factor (like 10, 50, or the class size h), you can divide all dᵢ by it to make numbers even smaller. Just remember to multiply the final result by that factor before adding it to a. This is the basis for the next method (Step-Deviation).

Common Mistakes to Avoid

❌ Wrong Approach✅ Right ApproachWhy it's a Mistake
Forgetting to add a back. x̄ = Σfᵢdᵢ / Σfᵢ = 435 / 30 = 14.5x̄ = a + (Σfᵢdᵢ / Σfᵢ) = 47.5 + 14.5 = 62The term Σfᵢdᵢ / Σfᵢ is the mean of deviations (the average error), not the actual mean. You must add it back to your initial assumption a.
Incorrectly calculating deviation: dᵢ = a - xᵢ.dᵢ = xᵢ - a.This inverts the signs of all your deviations, leading to a completely wrong result. Remember: it's how far the observation (xᵢ) deviates from the assumed mean (a).
Wrong Class Mark formula: (Upper - Lower) / 2.(Upper + Lower) / 2.Subtracting the limits gives you the range of the interval, not its midpoint. The class mark must be the average of the two limits.
Assuming a must be an xᵢ value.a can be any convenient number, even one not in the data set.While choosing an xᵢ is easiest, any reasonable number will work. The math will always lead to the same final answer, but a bad choice for a can make calculations harder.

Brain-Teaser Questions

  1. In the 'Hard' example above, we chose a = 2750. What would happen to the final answer if you chose the first class mark, a = 1250, instead? Would the calculation be easier or harder?

    💡 Answer: The final answer for the mean would be exactly the same (₹2662.50). However, the calculations for dᵢ and fᵢdᵢ would become much larger because all dᵢ values (xᵢ - 1250) would be positive and large, leading to a larger Σfᵢdᵢ. This makes the calculation more difficult and error-prone.

  2. If, for a given dataset, you calculate Σfᵢdᵢ = 0, what can you conclude about your choice of the assumed mean a?

    💡 Answer: If Σfᵢdᵢ = 0, then the term (Σfᵢdᵢ / Σfᵢ) becomes 0. The formula simplifies to x̄ = a + 0, which means x̄ = a. This implies that your assumed mean was, by chance, the exact actual mean of the data.

  3. Suppose you calculate the mean height of a group of students using a = 160 cm. If every student then grows by exactly 2 cm, and you re-calculate the mean using a new assumed mean of a = 162 cm, how will your new dᵢ values compare to the old dᵢ values?

    💡 Answer: The dᵢ values will remain exactly the same. The new class marks will be xᵢ' = xᵢ + 2. The new assumed mean is a' = a + 2 = 162. The new deviation is dᵢ' = xᵢ' - a' = (xᵢ + 2) - (a + 2) = xᵢ - a = dᵢ. Because both the data and the assumption shifted by the same amount, their difference (deviation) remains constant.

Mini Cheatsheet

ConceptFormula / Description
Class Mark (xᵢ)(Upper Class Limit + Lower Class Limit) / 2
Assumed Mean (a)A convenient value, usually a middle xᵢ, chosen to simplify dᵢ.
Deviation (dᵢ)dᵢ = xᵢ - a
Mean of Deviations ()d̄ = Σfᵢdᵢ / Σfᵢ
Assumed Mean Formulax̄ = a + d̄ or x̄ = a + (Σfᵢdᵢ / Σfᵢ)

Mean of Grouped Data — Step-deviation Method

Mean of Grouped Data — Step-deviation Method

Welcome back! In the previous section, we learned the Assumed Mean Method to simplify our calculations. We subtracted an assumed mean a from each class mark xᵢ to get a smaller deviation dᵢ. But what if the deviations dᵢ are still large numbers? What if they all share a common factor?

This is where the Step-deviation Method comes in. It's a fantastic upgrade to the assumed mean method, designed to make your calculations even simpler and faster, especially when dealing with large data sets with uniform class sizes. Imagine you're a city planner analyzing traffic data. The number of cars passing a junction per hour is huge. Instead of working with these large numbers, you could simplify them into smaller, manageable steps. That's precisely what this method helps us do.

{{FORMULA: expr=x̄ = a + (Σfᵢuᵢ / Σfᵢ) × h | symbols=x̄:Mean, a:Assumed Mean, fᵢ:Frequency, uᵢ:Step Deviation, h:Class Size}}

The core idea is to "step down" the deviations dᵢ by dividing them by a common factor, which is usually the class size h. This gives us a new, even smaller variable uᵢ. We then find the mean of these uᵢ values and adjust it back at the end to find the actual mean of the original data.


Definitions & Formulas

Let's formally define the components of the Step-deviation Method.

VariableMeaningFormula / Explanation
xᵢClass MarkThe mid-point of a class interval. (Upper Limit + Lower Limit) / 2
fᵢFrequencyThe number of observations in a particular class.
hClass SizeThe difference between the upper and lower limits of a class. Must be constant for this method.
aAssumed MeanA value chosen from the class marks (xᵢ), usually one in the middle, to simplify calculations.
dᵢDeviationThe difference between the class mark and the assumed mean. dᵢ = xᵢ - a
uᵢStep DeviationThe deviation divided by the class size. uᵢ = dᵢ / h = (xᵢ - a) / h
MeanThe average of the data set, calculated using the step-deviation formula.

Derivation: How the Formula Works

Understanding the logic behind a formula is key to mastering it. Let's see how the step-deviation formula is derived directly from the assumed mean method.

  1. We start with the relationship from the assumed mean method, where the mean is the assumed mean a plus the mean of the deviations .

    x̄ = a + d̄
    
  2. The mean of the deviations, , is calculated as:

    d̄ = Σfᵢdᵢ / Σfᵢ
    
  3. Now, we introduce our new variable, the step deviation uᵢ. We define it by dividing the deviation dᵢ by the constant class size h.

    uᵢ = dᵢ / h
    
  4. By rearranging this, we can express dᵢ in terms of uᵢ and h.

    dᵢ = uᵢ × h
    
  5. Let's substitute this new expression for dᵢ back into the formula for (from Step 2).

    d̄ = Σfᵢ(uᵢ × h) / Σfᵢ
    
  6. Since h is a constant for all classes, we can take it outside the summation (Σ) sign.

    d̄ = h × (Σfᵢuᵢ / Σfᵢ)
    
  7. Finally, we substitute this new, complete expression for back into our starting equation from Step 1. This gives us the final Step-deviation Method formula.

    x̄ = a + (Σfᵢuᵢ / Σfᵢ) × h
    

This elegant derivation shows that the step-deviation method is not a new concept, but a clever simplification of the assumed mean method.

{{KEY: type=concept | title=When to Use Step-deviation | text=The Step-deviation Method is most effective when the class size h is the same for all class intervals and the deviations dᵢ = xᵢ - a all share h as a common factor. This significantly reduces calculation complexity.}}


Solved Examples

Let's apply our knowledge to solve some problems, starting from easy and moving to more challenging ones.

Example 1: Basic Application (Easy)

Given: The distribution of daily wages of 50 workers in a factory.

Daily Wages (in Rs)100-120120-140140-160160-180180-200
Number of Workers (fᵢ)12148610

To Find: The mean daily wages using the step-deviation method.

Solution:

  1. First, let's create our calculation table. The class size h is 120 - 100 = 20. Let's choose the middle class mark xᵢ = 150 as our assumed mean a.
Class IntervalFrequency (fᵢ)Class Mark (xᵢ)dᵢ = xᵢ - 150uᵢ = dᵢ / 20fᵢuᵢ
100-12012110-40-2-24
120-14014130-20-1-14
140-1608150 (a)000
160-18061702016
180-2001019040220
TotalΣfᵢ = 50Σfᵢuᵢ = -12
  1. We have the necessary sums: Σfᵢ = 50 and Σfᵢuᵢ = -12. We also know a = 150 and h = 20.

  2. Now, apply the step-deviation formula.

    x̄ = a + (Σfᵢuᵢ / Σfᵢ) × h
    
  3. Substitute the values into the formula.

    x̄ = 150 + (-12 / 50) × 20
    
  4. Simplify the expression.

    x̄ = 150 - (12 × 20) / 50
    
    x̄ = 150 - 240 / 50
    
    x̄ = 150 - 4.8
    
    x̄ = 145.2
    

Final Answer: The mean daily wages are Rs 145.20.


Example 2: Data with Decimal Class Marks (Medium)

Given: The concentration of SO₂ in the air (in parts per million, ppm) was measured for 30 localities.

Concentration of SO₂ (ppm)0.00-0.040.04-0.080.08-0.120.12-0.160.16-0.200.20-0.24
Frequency (fᵢ)499242

To Find: The mean concentration of SO₂ in the air.

Solution:

  1. The class size h is 0.04 - 0.00 = 0.04. Let's pick a = 0.10 as the assumed mean.
Concentration (ppm)fᵢxᵢdᵢ = xᵢ - 0.10uᵢ = dᵢ / 0.04fᵢuᵢ
0.00-0.0440.02-0.08-2-8
0.04-0.0890.06-0.04-1-9
0.08-0.1290.10 (a)000
0.12-0.1620.140.0412
0.16-0.2040.180.0828
0.20-0.2420.220.1236
TotalΣfᵢ = 30Σfᵢuᵢ = -1
  1. We have Σfᵢ = 30, Σfᵢuᵢ = -1, a = 0.10, and h = 0.04.

  2. Apply the step-deviation formula.

    x̄ = a + (Σfᵢuᵢ / Σfᵢ) × h
    
  3. Substitute the values.

    x̄ = 0.10 + (-1 / 30) × 0.04
    
  4. Calculate the result.

    x̄ = 0.10 - 0.04 / 30
    
    x̄ = 0.10 - 0.00133...
    
    x̄ ≈ 0.0987
    

Rounding to a reasonable number of decimal places for this context (e.g., 4):

x̄ ≈ 0.0987

Final Answer: The mean concentration of SO₂ is approximately 0.099 ppm.


Example 3: Finding a Missing Frequency (Hard)

Given: The mean pocket allowance of children is Rs 18. The frequency p for the class interval 19-21 is missing.

Daily Allowance (Rs)11-1313-1515-1717-1919-2121-2323-25
Number of Children (fᵢ)76913p54

To Find: The value of the missing frequency p.

Solution:

  1. Here, we are given the mean x̄ = 18. The class size h = 13 - 11 = 2. Let's choose a = 18.
Class Intervalfᵢxᵢdᵢ = xᵢ - 18uᵢ = dᵢ / 2fᵢuᵢ
11-13712-6-3-21
13-15614-4-2-12
15-17916-2-1-9
17-191318 (a)000
19-21p2021p
21-235224210
23-254246312
TotalΣfᵢ=44+pΣfᵢuᵢ=p-20
  1. Let's sum the fᵢ and fᵢuᵢ columns. Σfᵢ = 7 + 6 + 9 + 13 + p + 5 + 4 = 44 + p Σfᵢuᵢ = (-21) + (-12) + (-9) + 0 + p + 10 + 12 = p - 20

  2. Now, we use the step-deviation formula with the known mean x̄ = 18.

    x̄ = a + (Σfᵢuᵢ / Σfᵢ) × h
    
  3. Substitute all known values into the equation.

    18 = 18 + ((p - 20) / (44 + p)) × 2
    
  4. Solve the equation for p. First, subtract 18 from both sides.

    0 = ((p - 20) / (44 + p)) × 2
    
  5. Since (expression) × 2 = 0, the expression itself must be zero.

    (p - 20) / (44 + p) = 0
    
  6. For a fraction to be zero, the numerator must be zero.

    p - 20 = 0
    
    p = 20
    

Final Answer: The value of the missing frequency p is 20.


Example 4: Non-Continuous Class Intervals (Tricky)

Given: The distribution of the lengths of 40 leaves of a plant.

Length (in mm)118-126127-135136-144145-153154-162163-171172-180
Number of Leaves (fᵢ)35912542

To Find: The mean length of the leaves.

Solution:

  1. First, notice the class intervals are not continuous (e.g., 126 is followed by 127). We must make them continuous. The gap is 127 - 126 = 1. We add 1/2 = 0.5 to each upper limit and subtract 0.5 from each lower limit.
Adjusted Class Intervalfᵢ
117.5-126.53
126.5-135.55
135.5-144.59
144.5-153.512
153.5-162.55
162.5-171.54
171.5-180.52
  1. Now we can proceed. The class size h = 126.5 - 117.5 = 9. Let's choose a = 149 as the assumed mean (the class mark of 144.5-153.5).
Class Intervalfᵢxᵢdᵢ = xᵢ - 149uᵢ = dᵢ / 9fᵢuᵢ
117.5-126.53122-27-3-9
126.5-135.55131-18-2-10
135.5-144.59140-9-1-9
144.5-153.512149 (a)000
153.5-162.55158915
162.5-171.541671828
171.5-180.521762736
TotalΣfᵢ = 40Σfᵢuᵢ = -9
  1. We have Σfᵢ = 40, Σfᵢuᵢ = -9, a = 149, and h = 9.

  2. Apply the step-deviation formula.

    x̄ = a + (Σfᵢuᵢ / Σfᵢ) × h
    
  3. Substitute the values.

    x̄ = 149 + (-9 / 40) × 9
    
    x̄ = 149 - 81 / 40
    
    x̄ = 149 - 2.025
    
    x̄ = 146.975
    

Final Answer: The mean length of the leaves is 146.975 mm.


Tips & Tricks

TipDescription
Smart a SelectionAlways choose the class mark xᵢ from the middle of the distribution as your assumed mean a. This makes the uᵢ values symmetric (like -2, -1, 0, 1, 2) and minimizes calculation errors.
uᵢ Pattern CheckFor continuous classes of equal size, the uᵢ column will always be a sequence of consecutive integers centered around 0 (e.g., ..., -2, -1, 0, 1, 2, ...). If your uᵢ values don't follow this pattern, re-check your calculations.
Mental Math for fᵢuᵢThe biggest advantage of this method is that the numbers in the uᵢ and fᵢuᵢ columns are very small. You can often calculate the fᵢuᵢ products mentally, saving a lot of time in exams.

Common Mistakes

❌ Wrong Method✅ Right MethodWhy it Matters
Forgetting to multiply by h at the end: <br> x̄ = a + (Σfᵢuᵢ / Σfᵢ)Multiplying the final term by h: <br> x̄ = a + (Σfᵢuᵢ / Σfᵢ) × hThe uᵢ values were scaled down by h. You must scale the result back up by multiplying by h to get the correct mean.
Calculating dᵢ with class limits: <br> dᵢ = (lower limit) - aCalculating dᵢ with the class mark xᵢ: <br> dᵢ = xᵢ - aThe class mark xᵢ is the single representative value for the entire class interval. All calculations must use xᵢ.
Using uᵢ = (a - xᵢ) / hUsing uᵢ = (xᵢ - a) / hThe formula for deviation is observation - mean. Reversing the subtraction will flip the signs of all uᵢ values and lead to an incorrect answer.
Applying the method when h is not constant.Verifying that the class size h is the same for all intervals before starting.The entire derivation relies on h being a common factor that can be taken outside the summation. If h varies, this method is not applicable.

Brain-Teaser Questions

Test your understanding with these higher-order thinking problems.

  1. If the mean of a distribution calculated by the step-deviation method is 45, what will be the new mean if the assumed mean a was increased by 5, but the original data remains unchanged?

    💡 Answer: The final mean will still be 45. The choice of assumed mean a is just a calculation tool. A different a will change the intermediate Σfᵢuᵢ value, but the final will always be the same for the same data set. If a increases by 5, Σfᵢuᵢ / Σfᵢ will decrease by an amount that, when multiplied by h, exactly cancels out the increase, leaving unchanged.

  2. A student calculates the mean using the step-deviation method. They correctly find Σfᵢ = 50, Σfᵢuᵢ = 20, and choose a = 25. They then realize they used h = 10 but the actual class size was h = 5. What is the correct mean?

    💡 Answer: The values of uᵢ were calculated as (xᵢ - a) / 10. The correct uᵢ values should be (xᵢ - a) / 5, which are exactly double the student's values. Therefore, the correct Σfᵢuᵢ would be 2 × 20 = 40. Now, let's calculate the correct mean: x̄ = a + (correct Σfᵢuᵢ / Σfᵢ) × (correct h) x̄ = 25 + (40 / 50) × 5 x̄ = 25 + (4/5) × 5 x̄ = 25 + 4 = 29. The correct mean is 29.

  3. For a given frequency distribution, the sum of deviations from the assumed mean a = 50 is Σfᵢdᵢ = -200. If the total frequency Σfᵢ = 40 and the class size h = 10, what is the value of Σfᵢuᵢ?

    💡 Answer: We know that uᵢ = dᵢ / h. Therefore, fᵢuᵢ = fᵢdᵢ / h. Summing both sides gives us Σfᵢuᵢ = Σ(fᵢdᵢ / h). Since h is a constant, we can write Σfᵢuᵢ = (1/h) × Σfᵢdᵢ. Σfᵢuᵢ = (1/10) × (-200) = -20.


Mini Cheatsheet

Here's a quick summary of the key formulas from this page. Screenshot this for your last-minute revision!

ConceptFormula
Class Mark (xᵢ)xᵢ = (Upper Limit + Lower Limit) / 2
Deviation (dᵢ)dᵢ = xᵢ - a
Step Deviation (uᵢ)uᵢ = dᵢ / h or uᵢ = (xᵢ - a) / h
Step-deviation Mean (x̄)x̄ = a + (Σfᵢuᵢ / Σfᵢ) × h
Summation (Σ)Σfᵢ means the sum of all frequencies. Σfᵢuᵢ means the sum of all fᵢuᵢ products.

Mean of Grouped Data — Choosing a Method and Examples

Mean of Grouped Data: Choosing the Right Method

Welcome back! So far, we've learned how to organize data into groups and find a representative value for each group, the class mark (xᵢ). We also looked at the Direct Method for calculating the mean. But what happens when the numbers get really big? Calculating fᵢ × xᵢ can become tedious and prone to errors.

Imagine you're an economist analyzing the average monthly income for 10,000 households in a city. The incomes might range from ₹8,500 to ₹1,50,000. Using the Direct Method here would involve multiplying very large numbers, making the calculation a nightmare! This is where statisticians use clever shortcuts to simplify their work without losing accuracy. Today, we'll master two such powerful techniques: the Assumed Mean Method and the Step-Deviation Method. We'll also learn the most crucial skill: when to use which method.

{{FORMULA: expr=x̄ = a + h(Σfᵢuᵢ / Σfᵢ) | symbols=x̄:mean, a:assumed mean, h:class size, fᵢ:frequency, uᵢ:step deviation}}

Definitions & Formulas

Let's organize our toolkit. These are the three methods we'll be working with to find the mean of grouped data.

VariableMeaningFormula
xᵢClass Mark (mid-point of a class interval)(Upper Limit + Lower Limit) / 2
fᵢFrequency of the i-th classGiven in the data
Mean of the dataVaries by method
aAssumed Mean (a convenient xᵢ value)Chosen from the xᵢ column
dᵢDeviation from the assumed meandᵢ = xᵢ - a
hClass Size (must be uniform)Upper Limit - Lower Limit
uᵢStep Deviationuᵢ = (xᵢ - a) / h

The Logic Behind the Shortcuts

Why do these methods work? It's all about shifting the data to make it simpler, calculating the average of the simplified data, and then shifting the result back.

1. The Assumed Mean Method

This method is based on shifting the origin. We "assume" a mean (a), find out how far off our assumption is on average, and then add this "correction factor" back. The NCERT text started this derivation, let's complete it.

  1. We define the deviation of each class mark from our assumed mean a as dᵢ = xᵢ - a.

  2. Rearranging this, we get xᵢ = a + dᵢ. This is the key relationship.

  3. To find the mean , we use the fundamental formula x̄ = (Σfᵢxᵢ) / (Σfᵢ). Let's substitute our expression for xᵢ:

    x̄ = (Σfᵢ(a + dᵢ)) / (Σfᵢ)
    
  4. Distributing fᵢ inside the summation, we get:

    x̄ = (Σ(fᵢa + fᵢdᵢ)) / (Σfᵢ)
    
  5. The sum of two terms is the sum of their sums:

    x̄ = (Σfᵢa + Σfᵢdᵢ) / (Σfᵢ)
    
  6. Since a is a constant, Σfᵢa is just a × Σfᵢ. So we can split the fraction:

    x̄ = (aΣfᵢ / Σfᵢ) + (Σfᵢdᵢ / Σfᵢ)
    
  7. The Σfᵢ terms cancel in the first part, leaving us with the final formula:

    x̄ = a + (Σfᵢdᵢ / Σfᵢ)
    

2. The Step-Deviation Method

This method goes one step further. After shifting the origin (by subtracting a), it also scales the data down (by dividing by h). This is useful when all the deviations dᵢ are multiples of the class size h.

  1. We start from the Assumed Mean formula: x̄ = a + (Σfᵢdᵢ / Σfᵢ).

  2. We define a new, simpler variable uᵢ = dᵢ / h, which means dᵢ = h × uᵢ.

  3. Substitute this expression for dᵢ into the Assumed Mean formula:

    x̄ = a + (Σfᵢ(h × uᵢ)) / (Σfᵢ)
    
  4. Since the class size h is a constant for all terms, we can factor it out of the summation:

    x̄ = a + (h × Σfᵢuᵢ) / (Σfᵢ)
    
  5. This gives us the final, and often simplest, formula for calculation:

    x̄ = a + h(Σfᵢuᵢ / Σfᵢ)
    

    This method is incredibly efficient as you often end up multiplying fᵢ with small integers like -2, -1, 0, 1, 2.

Solved Examples

Let's apply these methods to see them in action. We'll start simple and move to more complex problems.

Example 1: Direct Method (Easy)

Given: The number of wickets taken by a bowler in 10 cricket matches are: 2, 6, 4, 5, 0, 2, 1, 3, 2, 3.

To Find: The mean number of wickets using the direct method for ungrouped data (a quick recap).

Solution:

  1. First, list the observations (xᵢ) and their frequencies (fᵢ).

    • Wickets (xᵢ): 0, 1, 2, 3, 4, 5, 6
    • Frequency (fᵢ): 1, 1, 3, 2, 1, 1, 1
  2. Calculate the total number of matches, Σfᵢ.

    Σfᵢ = 1 + 1 + 3 + 2 + 1 + 1 + 1 = 10
    
  3. Calculate Σfᵢxᵢ.

    Σfᵢxᵢ = (0×1) + (1×1) + (2×3) + (3×2) + (4×1) + (5×1) + (6×1)
    Σfᵢxᵢ = 0 + 1 + 6 + 6 + 4 + 5 + 6 = 28
    
  4. Apply the Direct Method formula x̄ = Σfᵢxᵢ / Σfᵢ.

    x̄ = 28 / 10 = 2.8
    

Final Answer: The mean number of wickets taken is 2.8.

Example 2: Assumed Mean Method (Medium)

Given: The daily wages of 50 workers in a factory are given below.

Daily Wages (in ₹)100-120120-140140-160160-180180-200
Number of workers12148610

To Find: The mean daily wages using the Assumed Mean Method.

Solution:

  1. First, create a table with class intervals, frequencies (fᵢ), and class marks (xᵢ).

    Classfᵢxᵢ
    100-12012110
    120-14014130
    140-1608150
    160-1806170
    180-20010190
    Total50
  2. Choose an Assumed Mean (a). Let's pick the middle value of xᵢ, which is a = 150.

  3. Calculate the deviations dᵢ = xᵢ - a for each class.

    xᵢdᵢ = xᵢ - 150
    110-40
    130-20
    1500
    17020
    19040
  4. Now calculate the product fᵢdᵢ for each class and find their sum Σfᵢdᵢ.

    fᵢdᵢfᵢdᵢ
    12-40-480
    14-20-280
    800
    620120
    1040400
    Σfᵢ=50Σfᵢdᵢ = -240
  5. Apply the Assumed Mean formula: x̄ = a + (Σfᵢdᵢ / Σfᵢ).

    x̄ = 150 + (-240 / 50)
    
  6. Simplify the calculation.

    x̄ = 150 - 4.8 = 145.2
    

Final Answer: The mean daily wage is ₹145.20.

Example 3: Step-Deviation Method (Hard)

Given: The distribution below shows the number of runs scored by some top batsmen of the world in one-day international cricket matches.

Runs Scored3000-40004000-50005000-60006000-70007000-80008000-9000
Batsmen (fᵢ)4189763

To Find: The mean runs scored using the Step-Deviation Method.

Solution:

  1. Identify the class size h. Here, h = 4000 - 3000 = 1000.
  2. Choose an Assumed Mean a. Let's pick a = 6500.
  3. Prepare the calculation table. First, find xᵢ. Then dᵢ = xᵢ - a. Then uᵢ = dᵢ / h. Finally, fᵢuᵢ.
Runsfᵢxᵢdᵢ=xᵢ-6500uᵢ=dᵢ/1000fᵢuᵢ
3000-400043500-3000-3-12
4000-5000184500-2000-2-36
5000-600095500-1000-1-9
6000-700076500000
7000-800067500100016
8000-900038500200026
Total47-45
  1. Calculate Σfᵢ and Σfᵢuᵢ from the table. We have Σfᵢ = 47 and Σfᵢuᵢ = -45.

  2. Apply the Step-Deviation formula: x̄ = a + h(Σfᵢuᵢ / Σfᵢ).

    x̄ = 6500 + 1000 × (-45 / 47)
    
  3. Calculate the value.

    x̄ = 6500 - (45000 / 47)
    x̄ ≈ 6500 - 957.45
    x̄ ≈ 5542.55
    

Final Answer: The mean runs scored is approximately 5542.55.

Example 4: Finding a Missing Frequency (Tricky)

Given: The mean pocket allowance of children is ₹18. The following frequency distribution is for the daily pocket allowance.

Allowance (in ₹)11-1313-1515-1717-1919-2121-2323-25
Children (fᵢ)76913p54

To Find: The value of the missing frequency p.

Solution:

  1. Since the numbers are relatively small and the mean is given, the Direct Method is suitable. First, prepare the table with xᵢ and fᵢxᵢ.
Allowancefᵢxᵢfᵢxᵢ
11-1371284
13-1561484
15-17916144
17-191318234
19-21p2020p
21-23522110
23-2542496
Total44+p752 + 20p
  1. Sum the frequencies to find Σfᵢ.

    Σfᵢ = 7 + 6 + 9 + 13 + p + 5 + 4 = 44 + p
    
  2. Sum the fᵢxᵢ column to find Σfᵢxᵢ.

    Σfᵢxᵢ = 84 + 84 + 144 + 234 + 20p + 110 + 96 = 752 + 20p
    
  3. Use the formula for the mean: x̄ = Σfᵢxᵢ / Σfᵢ. We are given x̄ = 18.

    18 = (752 + 20p) / (44 + p)
    
  4. Now, solve the linear equation for p.

    18(44 + p) = 752 + 20p
    792 + 18p = 752 + 20p
    792 - 752 = 20p - 18p
    40 = 2p
    p = 20
    

Final Answer: The missing frequency p is 20.


{{KEY: type=strategy | title=How to Choose the Right Method | text=

  1. Direct Method: Use when the values of class marks (xᵢ) and frequencies (fᵢ) are small, making fᵢ × xᵢ easy to calculate.
  2. Assumed Mean Method: Use when xᵢ values are large. This method simplifies xᵢ into smaller dᵢ values, making multiplication easier.
  3. Step-Deviation Method: This is the best choice when xᵢ values are large AND the class size (h) is uniform. It converts deviations into very small integers (uᵢ), making calculations the simplest. }}

Tips & Tricks

TechniqueDescription
Smart Assumed MeanAlways choose a to be one of the middle xᵢ values. This ensures that your deviations (dᵢ) are balanced between positive and negative, minimizing the final sum Σfᵢdᵢ.
Sanity CheckThe calculated mean () must lie between the smallest and largest class marks. If your answer is outside this range, you've made a calculation error.
Look for Common FactorsBefore choosing a method, glance at the dᵢ values. If they all share a common factor (usually the class size h), the Step-Deviation Method will save you the most time.

Common Mistakes to Avoid

❌ Wrong Approach✅ Right Approach
Forgetting to add back the assumed mean: Calculating Σfᵢdᵢ / Σfᵢ and stopping there.Always complete the formula: x̄ = a + (Σfᵢdᵢ / Σfᵢ). The a is a crucial part of the answer.
Using class limits (e.g., 10-20) for calculation instead of the class mark (xᵢ).Always calculate the class mark first: xᵢ = (Upper + Lower) / 2. xᵢ represents the entire class.
Making sign errors when calculating fᵢdᵢ or fᵢuᵢ. A single + instead of - can throw off the entire result.Be extra careful with negative numbers. Sum all positive fᵢdᵢ first, then all negative ones, and then combine them.
Using the Step-Deviation method when class sizes are not uniform. The formula uᵢ = (xᵢ - a) / h is only valid if h is constant.If class sizes are unequal, you must use either the Direct Method or the Assumed Mean Method.

Brain-Teaser Questions

  1. If the mean of a distribution is 25, and every observation is increased by 3, what will be the new mean?

    💡 Answer: The new mean will be 25 + 3 = 28. The mean is affected by a change of origin. If every value xᵢ becomes xᵢ + k, the new mean becomes x̄ + k.

  2. In a frequency distribution, the assumed mean a = 55, Σfᵢ = 100, h = 10, and Σfᵢuᵢ = -30. What is the actual mean ?

    💡 Answer: Using the Step-Deviation formula: x̄ = a + h(Σfᵢuᵢ / Σfᵢ). x̄ = 55 + 10 × (-30 / 100) = 55 - 3 = 52. The mean is 52.

  3. The mean of 10 numbers is 20. If one number, 15, is removed, what is the mean of the remaining 9 numbers?

    💡 Answer: Original sum of 10 numbers = Mean × Number of items = 20 × 10 = 200. After removing 15, the new sum = 200 - 15 = 185. The new mean of 9 numbers = 185 / 9 ≈ 20.56.

Mini Cheatsheet

Here's a quick summary of everything on this page. Screenshot this for your last-minute revision!

ConceptFormula / Key Idea
Class Markxᵢ = (Upper Class Limit + Lower Class Limit) / 2
Direct Methodx̄ = (Σfᵢxᵢ) / (Σfᵢ)
Assumed Meanx̄ = a + (Σfᵢdᵢ) / (Σfᵢ), where dᵢ = xᵢ - a
Step-Deviationx̄ = a + h × (Σfᵢuᵢ / Σfᵢ), where uᵢ = (xᵢ - a) / h
When to UseDirect: Small numbers. Assumed Mean: Large xᵢ. Step-Dev: Large xᵢ & uniform h.

In this chapter

  • 1.Introduction
  • 2.Mean of Grouped Data — Direct Method
  • 3.Mean of Grouped Data — Assumed Mean Method
  • 4.Mean of Grouped Data — Step-deviation Method
  • 5.Mean of Grouped Data — Choosing a Method and Examples

Frequently asked questions

What is Introduction?

Welcome to the world of Statistics! In Class IX, you learned how to organize data and find its "center" using the mean, median, and mode for simple, ungrouped lists of numbers. But what happens when you're faced with massive amounts of data, like the exam scores of every student in a city? Listing them one by one is im

What is Mean of Grouped Data — Direct Method?

Imagine you're the manager of a popular pizza place. At the end of a busy week, you want to know the average amount a customer spends. You have hundreds of bills, but looking at each one is too slow. Instead, you group them into spending ranges: ₹100-200, ₹200-300, ₹300-400, and so on.

What is Mean of Grouped Data — Assumed Mean Method?

In the previous section, we learned the **Direct Method** to find the mean of grouped data. It works perfectly, but what happens when the data involves very large numbers? Imagine calculating the average monthly salary for 5,000 employees in a factory, where salaries range from ₹18,575 to ₹85,250.

What is Mean of Grouped Data — Step-deviation Method?

Welcome back! In the previous section, we learned the *Assumed Mean Method* to simplify our calculations. We subtracted an assumed mean `a` from each class mark `xᵢ` to get a smaller deviation `dᵢ`. But what if the deviations `dᵢ` are still large numbers? What if they all share a common factor?

What is Mean of Grouped Data — Choosing a Method and Examples?

Welcome back! So far, we've learned how to organize data into groups and find a representative value for each group, the **class mark** (`xᵢ`). We also looked at the **Direct Method** for calculating the mean. But what happens when the numbers get really big? Calculating `fᵢ × xᵢ` can become tedious and prone to errors

More chapters in CBSE Class 10 Mathematics

Want the full CBSE Class 10 Mathematics experience?

Every chapter. Interactive lessons. AI tutor on tap. Study Lab for any photo or PDF. 7-day free trial — no credit card.

1000s of students
100% NCERT-aligned
Powered by AI

Install Learn Skill

Add to home screen for the best experience