If you're seeing this message, it means we're having trouble loading external resources on our website.

If you're behind a web filter, please make sure that the domains *.kastatic.org and *.kasandbox.org are unblocked.

To log in and use all the features of Khan Academy, please enable JavaScript in your browser.

Statistics and probability

Unit 1: analyzing categorical data, unit 2: displaying and comparing quantitative data, unit 3: summarizing quantitative data, unit 4: modeling data distributions, unit 5: exploring bivariate numerical data, unit 6: study design, unit 7: probability, unit 8: counting, permutations, and combinations, unit 9: random variables, unit 10: sampling distributions, unit 11: confidence intervals, unit 12: significance tests (hypothesis testing), unit 13: two-sample inference for the difference between groups, unit 14: inference for categorical data (chi-square tests), unit 15: advanced regression (inference and transforming), unit 16: analysis of variance (anova).

Please ensure that your password is at least 8 characters and contains each of the following:

  • a special character: @$#!%*?&

logo

Have an account?

pencil-icon

Chapter 2, 2.1-2.5

11th - 12th grade, mathematics.

15 questions

Player avatar

Introducing new   Paper mode

No student devices needed.   Know more

  • 1. Multiple Choice Edit 30 seconds 1 pt The sum of the lower and upper limits of the class divided by 2. midpoint relative frequency cumulative frequency class
  • 2. Multiple Choice Edit 30 seconds 1 pt Class boundaries are numbers that separate classes with forming gaps between them True False
  • 4. Multiple Choice Edit 45 seconds 1 pt 12, 24, 6, 30, 37 Find the mean. 25.6 21.8 27.2 16.5
  • 5. Multiple Choice Edit 30 seconds 1 pt What is not commonly used as central tendency? mean outliar mode median
  • 6. Multiple Choice Edit 30 seconds 1 pt When a vertical line can be drawn through the middle of a graph of the distribution and the resulting halves are approximatley mirror images rectangular graph skewed left symmetric skewed right
  • 7. Multiple Choice Edit 30 seconds 1 pt In the Empirical rule, about 68% of the data lie within one standard deviation of the mean. True False
  • 8. Multiple Choice Edit 30 seconds 1 pt The average of the squares of the deviants is what? population variance sum of squares deviation population standard deviation
  • 9. Multiple Choice Edit 1 minute 1 pt Office rental rates 22,35,18,21,27,16,18,22,16,24,20,17,15,31,24,25,24, 23. What is the standard deviation? 5.3 4.7 6.2 5.1
  • 11. Multiple Choice Edit 30 seconds 1 pt The second quartile is the______ of a data set. Mean Median Mode Range
  • 12. Multiple Choice Edit 30 seconds 1 pt Fractiles are divided into 4 equal parts, how many parts are percentiles divided into? 50 20 200 100
  • 14. Multiple Choice Edit 30 seconds 1 pt A data set that is composed of quantitative entries taken at regular intervals over a period of time. Scatter Plot Time Series Paired data sets Pareto charts

Explore all questions with a free account

Google Logo

Continue with email

Continue with phone

The relative frequency shows the proportion of data points that have each value. The frequency tells the number of data points that have each value.

Answers will vary. One possible histogram is shown:

Find the midpoint for each class. These will be graphed on the x -axis. The frequency values will be graphed on the y -axis values.

  • The 40 th percentile is 37 years.
  • The 78 th percentile is 70 years.

Jesse graduated 37 th out of a class of 180 students. There are 180 – 37 = 143 students ranked below Jesse. There is one rank of 37.

x = 143 and y = 1. x + 0.5 y n x + 0.5 y n (100) = 143 + 0.5 ( 1 ) 180 143 + 0.5 ( 1 ) 180 (100) = 79.72. Jesse’s rank of 37 puts him at the 80 th percentile.

  • For runners in a race it is more desirable to have a high percentile for speed. A high percentile means a higher speed which is faster.
  • 40% of runners ran at speeds of 7.5 miles per hour or less (slower). 60% of runners ran at speeds of 7.5 miles per hour or more (faster).

When waiting in line at the DMV, the 85 th percentile would be a long wait time compared to the other people waiting. 85% of people had shorter wait times than Mina. In this context, Mina would prefer a wait time corresponding to a lower percentile. 85% of people at the DMV waited 32 minutes or less. 15% of people at the DMV waited 32 minutes or longer.

The manufacturer and the consumer would be upset. This is a large repair cost for the damages, compared to the other cars in the sample. INTERPRETATION: 90% of the crash tested cars had damage repair costs of $1700 or less; only 10% had damage repair costs of $1700 or more.

You can afford 34% of houses. 66% of the houses are too expensive for your budget. INTERPRETATION: 34% of houses cost $240,000 or less. 66% of houses cost $240,000 or more.

More than 25% of salespersons sell four cars in a typical week. You can see this concentration in the box plot because the first quartile is equal to the median. The top 25% and the bottom 25% are spread out evenly; the whiskers have the same length.

Mean: 16 + 17 + 19 + 20 + 20 + 21 + 23 + 24 + 25 + 25 + 25 + 26 + 26 + 27 + 27 + 27 + 28 + 29 + 30 + 32 + 33 + 33 + 34 + 35 + 37 + 39 + 40 = 738;

738 27 738 27 = 27.33

The most frequent lengths are 25 and 27, which occur three times. Mode = 25, 27

The data are symmetrical. The median is 3 and the mean is 2.85. They are close, and the mode lies close to the middle of the data, so the data are symmetrical.

The data are skewed right. The median is 87.5 and the mean is 88.2. Even though they are close, the mode lies to the left of the middle of the data, and there are many more instances of 87 than any other number, so the data are skewed right.

When the data are symmetrical, the mean and median are close or the same.

The distribution is skewed right because it looks pulled out to the right.

The mean is 4.1 and is slightly greater than the median, which is four.

The mode and the median are the same. In this case, they are both five.

The distribution is skewed left because it looks pulled out to the left.

The mean and the median are both six.

The mode is 12, the median is 12.5, and the mean is 15.1. The mean is the largest.

The mean tends to reflect skewing the most because it is affected the most by outliers.

For Fredo: z = 0.158  –  0.166 0.012 0.158  –  0.166 0.012 = –0.67

For Karl: z = 0.177  –  0.189 0.015 0.177  –  0.189 0.015 = –0.8

Fredo’s z -score of –0.67 is higher than Karl’s z -score of –0.8. For batting average, higher values are better, so Fredo has a better batting average compared to his team.

  • s x = ∑ f m 2 n − x ¯ 2 = 193157.45 30 − 79.5 2 = 10.88 s x = ∑ f m 2 n − x ¯ 2 = 193157.45 30 − 79.5 2 = 10.88
  • s x = ∑ f m 2 n − x ¯ 2 = 380945.3 101 − 60.94 2 = 7.62 s x = ∑ f m 2 n − x ¯ 2 = 380945.3 101 − 60.94 2 = 7.62
  • s x = ∑ f m 2 n − x ¯ 2 = 440051.5 86 − 70.66 2 = 11.14 s x = ∑ f m 2 n − x ¯ 2 = 440051.5 86 − 70.66 2 = 11.14
  • Number the entries in the table 1–51 (Includes Washington, DC; Numbered vertically)
  • Arrow over to PRB
  • Press 5:randInt(
  • Enter 51,1,8)

Eight numbers are generated (use the right arrow key to scroll through the numbers). The numbers correspond to the numbered states (for this example: {47 21 9 23 51 13 25 4}. If any numbers are repeated, generate a different number by using 5:randInt(51,1)). Here, the states (and Washington DC) are {Arkansas, Washington DC, Idaho, Maryland, Michigan, Mississippi, Virginia, Wyoming}.

Corresponding percents are {30.1, 22.2, 26.5, 27.1, 30.9, 34.0, 26.0, 25.1}.

  • See Table 2.86 and Table 2.87 .
  • Both graphs have a single peak.
  • Both graphs use class intervals with width equal to $50.
  • The couples graph has a class interval with no values.
  • It takes almost twice as many class intervals to display the data for couples.
  • Answers may vary. Possible answers include: The graphs are more similar than different because the overall patterns for the graphs are the same.
  • Check student's solution.
  • Both graphs display 6 class intervals.
  • Both graphs show the same general pattern.
  • Answers may vary. Possible answers include: Although the width of the class intervals for couples is double that of the class intervals for singles, the graphs are more similar than they are different.
  • Answers may vary. Possible answers include: You are able to compare the graphs interval by interval. It is easier to compare the overall patterns with the new scale on the Couples graph. Because a couple represents two individuals, the new scale leads to a more accurate comparison.
  • Answers may vary. Possible answers include: Based on the histograms, it seems that spending does not vary much from singles to individuals who are part of a couple. The overall patterns are the same. The range of spending for couples is approximately double the range for individuals.

Answers will vary.

  • 1 – (0.02+0.09+0.19+0.26+0.18+0.17+0.02+0.01) = 0.06
  • 0.19+0.26+0.18 = 0.63
  • Check student’s solution.

40 th percentile will fall between 30,000 and 40,000

80 th percentile will fall between 50,000 and 75,000

  • more children; the left whisker shows that 25% of the population are children 17 and younger. The right whisker shows that 25% of the population are adults 50 and older, so adults 65 and over represent less than 25%.
  • Answers will vary. Possible answer: State University conducted a survey to see how involved its students are in community service. The box plot shows the number of community service hours logged by participants over the past year.
  • Because the first and second quartiles are close, the data in this quarter is very similar. There is not much variation in the values. The data in the third quarter is much more variable, or spread out. This is clear because the second quartile is so far away from the third quartile.
  • Each box plot is spread out more in the greater values. Each plot is skewed to the right, so the ages of the top 50% of buyers are more variable than the ages of the lower 50%.
  • The BMW 3 series is most likely to have an outlier. It has the longest whisker.
  • Comparing the median ages, younger people tend to buy the BMW 3 series, while older people tend to buy the BMW 7 series. However, this is not a rule, because there is so much variability in each data set.
  • The second quarter has the smallest spread. There seems to be only a three-year difference between the first quartile and the median.
  • The third quarter has the largest spread. There seems to be approximately a 14-year difference between the median and the third quartile.
  • IQR ~ 17 years
  • There is not enough information to tell. Each interval lies within a quarter, so we cannot tell exactly where the data in that quarter is concentrated.
  • The interval from 31 to 35 years has the fewest data values. Twenty-five percent of the values fall in the interval 38 to 41, and 25% fall between 41 and 64. Since 25% of values fall between 31 and 38, we know that fewer than 25% fall between 31 and 35.

The mean percentage, x ¯ = 1328.65 50 = 26.75 x ¯ = 1328.65 50 = 26.75

The median value is the middle value in the ordered list of data values. The median value of a set of 11 will be the 6th number in order. Six years will have totals at or below the median.

  • mean = 1,809.3
  • median = 1,812.5
  • standard deviation = 151.2
  • first quartile = 1,690
  • third quartile = 1,935

Hint: Think about the number of years covered by each time period and what happened to higher education during those periods.

For pianos, the cost of the piano is 0.4 standard deviations BELOW the mean. For guitars, the cost of the guitar is 0.25 standard deviations ABOVE the mean. For drums, the cost of the drum set is 1.0 standard deviations BELOW the mean. Of the three, the drums cost the lowest in comparison to the cost of other instruments of the same type. The guitar costs the most in comparison to the cost of other instruments of the same type.

  • x ¯ = 23.32 x ¯ = 23.32
  • Using the TI 83/84, we obtain a standard deviation of: s x = 12.95. s x = 12.95.
  • The obesity rate of the United States is 10.58% higher than the average obesity rate.
  • Since the standard deviation is 12.95, we see that 23.32 + 12.95 = 36.27 is the obesity percentage that is one standard deviation from the mean. The United States obesity rate is slightly less than one standard deviation from the mean. Therefore, we can assume that the United States, while 34% obese, does not hav e an unusually high percentage of obese people.
  • For graph, check student's solution.
  • 49.7% of the community is under the age of 35.
  • Based on the information in the table, graph (a) most closely represents the data.
  • 174; 177; 178; 184; 185; 185; 185; 185; 188; 190; 200; 205; 205; 206; 210; 210; 210; 212; 212; 215; 215; 220; 223; 228; 230; 232; 241; 241; 242; 245; 247; 250; 250; 259; 260; 260; 265; 265; 270; 272; 273; 275; 276; 278; 280; 280; 285; 285; 286; 290; 290; 295; 302
  • 205.5, 272.5
  • 0.84 std. dev. below the mean

As an Amazon Associate we earn from qualifying purchases.

This book may not be used in the training of large language models or otherwise be ingested into large language models or generative AI offerings without OpenStax's permission.

Want to cite, share, or modify this book? This book uses the Creative Commons Attribution License and you must attribute OpenStax.

Access for free at https://openstax.org/books/introductory-statistics/pages/1-introduction
  • Authors: Barbara Illowsky, Susan Dean
  • Publisher/website: OpenStax
  • Book title: Introductory Statistics
  • Publication date: Sep 19, 2013
  • Location: Houston, Texas
  • Book URL: https://openstax.org/books/introductory-statistics/pages/1-introduction
  • Section URL: https://openstax.org/books/introductory-statistics/pages/2-solutions

© Jun 23, 2022 OpenStax. Textbook content produced by OpenStax is licensed under a Creative Commons Attribution License . The OpenStax name, OpenStax logo, OpenStax book covers, OpenStax CNX name, and OpenStax CNX logo are not subject to the Creative Commons license and may not be reproduced without the prior and express written consent of Rice University.

COMMENTS

  1. Statistics 2.2 Flashcards

    Tail of the graph falls to the left. The peak is arranged to the right while the tail falls to the left. It is negatively skewed. Chap. 2.2: (Organizing & Summarizing Data) Organizing quantitative data - the popular displays. From "Statistics: Informed Decisions Using Data" (3rd ed.)

  2. Homework 2.1-2.2 and 2.3

    A dotplot graph consists of a graph in which each data value is plotted as a point (or dot) along a scale of values. Dots representing equal values are stacked. we use statistical ___ to look for features that reveal some useful or interesting characteristics of the data set. graphs! we use graphs to analyze data.

  3. Elementary Statistics, Chapter 2, Unit 2, Homework Problems

    Study with Quizlet and memorize flashcards containing terms like Heights of adult males are normally distributed. If a large sample of heights of adult males is randomly selected and the heights are illustrated in a histogram, what is the shape of that histogram?, The population of ages at inauguration of all U.S. Presidents who had professions in the military is 62, 46, 68, 64, 57.

  4. Ch. 2 Solutions

    Possible answers include the following: Both graphs have a single peak. Both graphs use class intervals with width equal to $50; Answers may vary. Possible answers include the following: The couples graph has a class interval with no values; It takes almost twice as many class intervals to display the data for couples; Answers may vary.

  5. Statistics and Probability

    Unit 7: Probability. 0/1600 Mastery points. Basic theoretical probability Probability using sample spaces Basic set operations Experimental probability. Randomness, probability, and simulation Addition rule Multiplication rule for independent events Multiplication rule for dependent events Conditional probability and independence.

  6. Ch. 3 Solutions

    2.2 Histograms, Frequency Polygons, and Time Series Graphs; 2.3 Measures of the Location of the Data; 2.4 Box Plots; 2.5 Measures of the Center of the Data; 2.6 Skewness and the Mean, Median, and Mode; 2.7 Measures of the Spread of the Data; 2.8 Descriptive Statistics; Key Terms; Chapter Review; Formula Review; Practice; Homework; Bringing It ...

  7. Ch. 7 Solutions

    2.2 Histograms, Frequency Polygons, and Time Series Graphs; 2.3 Measures of the Location of the Data; 2.4 Box Plots; 2.5 Measures of the Center of the Data; 2.6 Skewness and the Mean, Median, and Mode; 2.7 Measures of the Spread of the Data; 2.8 Descriptive Statistics; Key Terms; Chapter Review; Formula Review; Practice; Homework; Bringing It ...

  8. Mathway

    Free math problem solver answers your statistics homework questions with step-by-step explanations. Mathway. Visit Mathway on the web. Start 7-day free trial on the app. Start 7-day free trial on the app. Download free on Amazon. Download free in Windows Store. get Go. Statistics. Basic Math. Pre-Algebra. Algebra. Trigonometry. Precalculus.

  9. Chapter 2, 2.1-2.5

    Chapter 2, 2.1-2.5 quiz for 11th grade students. Find other quizzes for Mathematics and more on Quizizz for free!

  10. Solved SECTION 2.1 Frequency Distributions and Their Graphs

    Statistics and Probability questions and answers. SECTION 2.1 Frequency Distributions and Their Graphs 53 Constructing a Frequency Distribution and a Relative Frequency Histogram In Exercises 37-40, construct a frequency distribution and a relative frequency histogram for the data set using five classes. Which class has the greatest relative ...

  11. Chapter 2.3 Solutions

    Problem 1BSC. Chapter. CH2.3. Problem. 1BSC. Step-by-step solution. Step 1 of 1. Graphical representation is always better than that of numerical illustration. Therefore, histogram is a better tool to study about a frequency distribution, whether it is normal or approximately normal or non-normal.

  12. Chapter 2 Solutions

    The company has provided with the results supporting its claims which shows that the investing at EndRun is a good option. For example: The company claims that "winners outpace losers by nearly 2-to-1 margin among the investors who have trusted EndRun to select mutual funds for their portfolio" and to support its claim, company has provided with the bar chart which clearly shows that ...

  13. Chapter 8.2 Solutions

    Chapter. CH8.2. Problem. 1AYU. Step-by-step solution. Step 1 of 1. Population size N = 500. Number of house holds have dog = 220. Population proportion.

  14. Ch. 2 Solutions

    s x = ∑ f m 2 n − x ¯ 2 = 440051.5 86 − 70.66 2 = 11.14 s x = ∑ f m 2 n − x ¯ 2 = 440051.5 86 − 70.66 2 = 11.14 75 . Example solution for using the random number generator for the TI-84+ to generate a simple random sample of 8 states.

  15. Stat 301: Homework #2

    Unit 3 exam review - Professor Mitchell Paukner. Unit 3 correlation - Mitchell Paukner. Module 2 point estimate. Module 2 categorical data. Module 2. Stat 301: Homework 1 - Professor Chelsey Green.

  16. Chapter 2.1 Solutions

    1SE. Step-by-step solution. Step 1 of 20. The data represent ages of taste-test participants in years. a) The class width of 15-19 class is, The class width of 15-19 class is . In this case, the class width of each class is equal which is 5. Step 2 of 20.

  17. CH.2 Descriptive Statistics: Tabular and Graphical Displays: Answers

    CH.4 Introduction to Probability: Answer Key to Problem Set 4; CH.3: Descriptive Statistics Numerical Measures Problem Set 3; CH.2 Descriptive Statistics: Tabular and Graphical Displays : Problem Set 2

  18. Statistics milestone 2

    1 inches and 2 inches are both 0 inches from the mean of 2 inches, which is the same as one standard deviation in either direction. This tells us that 68% of the data should lie between 1 inches to 2 inches. 68% 95% 99% 34% 68-95-99 Rule. 16. Report an issue with this question

  19. Chapter 2 Solutions

    Step-by-step solution. Step 1 of 5. a) To find the mean of a data set or distribution, use the formula, where N stands for the number of scores and stands for the sum of all the scores in a data set or distribution. Calculating the mean of this data set with scores yields, Step 2 of 5. b)