In your own words, describe to someone who knows only a little statistics how to recognize when an observation is an outlier. What action(s) should be taken with an outlier? b. Which measure of the center (mean or median) is more resistant to outliers, and what does "resistant to outliers" mean?
In your own words, describe to someone who knows only a little statistics how to recognize when an observation is an outlier. What action(s) should be taken with an outlier? b. Which measure of the center (mean or median) is more resistant to outliers, and what does "resistant to outliers" mean?
In your own words, describe to someone who knows only a little statistics how to recognize when an observation is an outlier. What action(s) should be taken with an outlier? b. Which measure of the center (mean or median) is more resistant to outliers, and what does "resistant to outliers" mean?
a. In your own words, describe to someone who knows only a little statistics how to recognize when an observation is an outlier. What action(s) should be taken with an outlier?
b. Which measure of the center (mean or median) is more resistant to outliers, and what does "resistant to outliers" mean?
Transcribed Image Text:a. In your own words, describe to someone who knows only a little statistics how to recognize when an observation is an outlier. What action(s) should be taken with an outlier?
b. Which measure of the center (mean or median) is more resistant to outliers, and what does "resistant to outliers" mean?
a. Choose the best answer below.
O A. Outliers are observed values far from the main group of data. In a histogram they are separated from the others by space. Outliers must be looked at in closer context to know how to treat them. If they are mistakes, they might
be removed or corrected. If they are not mistakes, you might do the analysis twice, once with and once without the outliers.
B. Outliers are observations that are more than 1.5 interquartile ranges from the median in a data set. They should be looked at in closer context to determine how to treat the values. Sometimes they may be included, but
sometimes they are discarded.
C. Outliers are observed values far from the main group of data. In a histogram they are separated from the others by space. Outliers should be ignored because they can distort any calculations done and are not representative of
typical values.
D. Outliers are the minimum and maximum values observed in a data set. They should be treated as any other value, as the term "outliers" is just another name for the minimum and maximum.
b. Choose the best answer below.
O A. The median is more resistant, which indicates that neither the number nor the magnitude of outliers has any effect on the calculated value of the median.
B. The mean is more resistant, which indicates that it is easier to calculate when there are outliers present than the median is.
O C. The mean is more resistant, which indicates that it usually changes less than the median when comparing data with and without outliers.
D. The median is more resistant, which indicates that it usually changes less than the mean when comparing data with and without outliers.
Definition Definition Middle value of a data set. The median divides a data set into two halves, and it also called the 50th percentile. The median is much less affected by outliers and skewed data than the mean. If the number of elements in a dataset is odd, then the middlemost element of the data arranged in ascending or descending order is the median. If the number of elements in the dataset is even, the average of the two central elements of the arranged data is the median of the set. For example, if a dataset has five items—12, 13, 21, 27, 31—the median is the third item in ascending order, or 21. If a dataset has six items—12, 13, 21, 27, 31, 33—the median is the average of the third (21) and fourth (27) items. It is calculated as follows: (21 + 27) / 2 = 24.
Expert Solution
Step 1
In statistics, three measures of central tendency are widely used. These three measures are also called the measures of location. These are:
Median Mode Mean
These could be defined as:
Mean is generally represented as total sum of all observations divided by number of observations. It tells the on an average value for the whole data set. It is most commonly used measure of central tendency. It includes each and every observation. This makes the mean value sensitive to outliers as it is affected by them.
Median is one of the measure of central tendency which is used to describe the average value of data set.
Mode is the value that happens frequently in the provided data set. Mode represents a most common value of data set and it can be calculated by finding the most occurred value in a date set.