Concept explainers
Explain the reason behind preferring standard deviation over
Explanation of Solution
Variability:
Measure of variability infers the amount of dispersion in the dataset, it defines the distance between the data points from the center. That is, it defines the spread of the data set. Variability describes the variation or diversity among the data points in the data set.
Range:
Range of a data set is the difference between maximum and minimum values of the data set.
The general formula to obtain range is given below:
Some of the flaws for using the range to compare the variability of data sets:
- Even though, both the data sets have same range, the data will quickly vary among themselves, and the spread will be different.
- The extreme measures will affect the range very much.
Standard deviation:
Standard deviation is based on how much each observation deviates from a central point of the data set. It is known that, the greater the distances between the individual observations and the central point, the greater the variability in the data set.
Consider
The general formula to obtain standard deviation is given below:
The value of standard deviation is based on all the values of the data set and the range is based on only the maximum and minimum values of the data set.
Since, standard deviation is based on each value in the data set, it is preferable to use standard deviation over range to measure variability in the data set.
Want to see more full solutions like this?
Chapter 2 Solutions
Statistics
- 8 Suppose that a small town has five people with a rare form of cancer. Does this auto- matically mean a huge problem exists that needs to be addressed?arrow_forward1 M&Ms colors come in the following percent- ages: 13 percent brown, 14 percent yellow, 13 percent red, 24 percent blue, 20 percent orange, and 16 percent green. Reach into a bag of M&Ms without looking. a. What's the chance that you pull out a brown or yellow M&M? b. What's the chance that you won't pull out a blue? swarrow_forward11. Prove or disprove: (a) If is a characteristic function, then so is ²; (b) If is a non-negative characteristic function, then so is √√4.arrow_forward
- 4. Suppose that P(X = 1) = P(X = -1) = 1/2, that Y = U(-1, 1) and that X and Y are independent. (a) Show, by direct computation, that X + Y = U(-2, 2). (b) Translate the result to a statement about characteristic functions. (c) Which well-known trigonometric formula did you discover?arrow_forward9. The concentration function of a random variable X is defined as Qx(h) = sup P(x ≤ X ≤x+h), h>0. x (a) Show that Qx+b (h) = Qx(h). (b) Is it true that Qx(ah) =aQx(h)? (c) Show that, if X and Y are independent random variables, then Qx+y (h) min{Qx(h). Qy (h)). To put the concept in perspective, if X1, X2, X, are independent, identically distributed random variables, and S₁ = Z=1Xk, then there exists an absolute constant, A, such that A Qs, (h) ≤ √n Some references: [79, 80, 162, 222], and [204], Sect. 1.5.arrow_forward29 Suppose that a mound-shaped data set has a must mean of 10 and standard deviation of 2. a. About what percentage of the data should lie between 6 and 12? b. About what percentage of the data should lie between 4 and 6? c. About what percentage of the data should lie below 4? 91002 175/1 3arrow_forward
- 2,3, ample and rical t? the 28 Suppose that a mound-shaped data set has a mean of 10 and standard deviation of 2. a. About what percentage of the data should lie between 8 and 12? b. About what percentage of the data should lie above 10? c. About what percentage of the data should lie above 12?arrow_forward27 Suppose that you have a data set of 1, 2, 2, 3, 3, 3, 4, 4, 5, and you assume that this sample represents a population. The mean is 3 and g the standard deviation is 1.225.10 a. Explain why you can apply the empirical rule to this data set. b. Where would "most of the values" in the population fall, based on this data set?arrow_forward30 Explain how you can use the empirical rule to find out whether a data set is mound- shaped, using only the values of the data themselves (no histogram available).arrow_forward
- 5. Let X be a positive random variable with finite variance, and let A = (0, 1). Prove that P(X AEX) 2 (1-A)² (EX)² EX2arrow_forward6. Let, for p = (0, 1), and xe R. X be a random variable defined as follows: P(X=-x) = P(X = x)=p. P(X=0)= 1-2p. Show that there is equality in Chebyshev's inequality for X. This means that Chebyshev's inequality, in spite of being rather crude, cannot be improved without additional assumptions.arrow_forward4. Prove that, for any random variable X, the minimum of EIX-al is attained for a = med (X).arrow_forward
- MATLAB: An Introduction with ApplicationsStatisticsISBN:9781119256830Author:Amos GilatPublisher:John Wiley & Sons IncProbability and Statistics for Engineering and th...StatisticsISBN:9781305251809Author:Jay L. DevorePublisher:Cengage LearningStatistics for The Behavioral Sciences (MindTap C...StatisticsISBN:9781305504912Author:Frederick J Gravetter, Larry B. WallnauPublisher:Cengage Learning
- Elementary Statistics: Picturing the World (7th E...StatisticsISBN:9780134683416Author:Ron Larson, Betsy FarberPublisher:PEARSONThe Basic Practice of StatisticsStatisticsISBN:9781319042578Author:David S. Moore, William I. Notz, Michael A. FlignerPublisher:W. H. FreemanIntroduction to the Practice of StatisticsStatisticsISBN:9781319013387Author:David S. Moore, George P. McCabe, Bruce A. CraigPublisher:W. H. Freeman