What is Central Tendency in Statistics?

It is a descriptive summary of a data set. It can be defined by using some of the measures. The central tendencies do not provide information regarding individual data from the dataset. However, they give a summary of the data set. The central tendency or measure of central tendency is a central or typical value for a probability distribution.

The central tendency is known as the statistical measure. This statistical measure represents the single value of the data set or entire distribution. The objective of evaluating a central tendency is to provide an accurate description of the entire data in the distribution.

The measure or outcome of the central tendency is a single value. It attempts to explain a set of data by identifying the central position within that set of data. The numerical expressions which represent the characteristics of a group (a large collection of numerical data) are called measures of central tendency. They are also described as measures of central location.

The measures of central tendency are mean, median, and mode. However, in different conditions, some measures of central tendency become more appropriate to use than others.

Mean (Arithmetic)

The most widely known and well-accepted measure of tendency is the mean or average. It is mostly used with continuous data. The mean represents the average value of a dataset. It can be calculated as the quotient of the sum of all the values in the data set by the number of values in the data set. The mean is usually denoted as $\bar{x}$ (pronounced “x-bar”).

Example:

If there are n observations in a data set and they have values $x_{1}, x_{2}, ..., x_{n}$ , then the mean is equal to:

\bar{x} = \frac{x_{1} + x_{2} + ... + x_{n}}{n}

The formula is also written as:

\bar{x} = \frac{\sum_{i = 1}^{n} x_{i}}{n}

Where $\sum$ is Greek capital letter, which means “sum of…” and is pronounced as “sigma”.

A very significant characteristic of the mean is that it involves every value in the set of data as part of the calculation. Additionally, the mean is the lone measure of central tendency where the sum of the deviations of each value calculated from the mean is always zero.

When Not to Use the Mean?

The mean is principally susceptible to the influence of outliers, which could be considered as its one main disadvantage. There are observations that are unusual when compared to the rest of the set of data by being particularly small or large in numerical value. For example, consider the salary of staff at an organization below:

Employee	Salary ($)
1	13000
2	17000
3	15000
4	17500
5	15000
6	12000
7	18500
8	15500
9	86000
10	93000

The mean salary for ten employees is $30,250. However, the data set suggests that this mean value might not be the best way to accurately reflect the typical salary of an employee, as most employees have salaries in the $12000 to $18500 range. The mean is being altered by the two hefty salaries. Therefore, in this situation, there is a need to use other better measures of central tendency instead of mean.

Median

The middle value of a data set is called the median of the data. The median divides the data set into two halves and is called the 50^th percentile. The median is much less affected by outliers and skewed data than the mean. If the number of elements in a dataset is odd, then the middlemost element of the data arranged in ascending or descending order is the median. If the number of elements in a data set is even, the average of the two central elements of the arranged data is the median of the set.

Median with Even Data Set

When the dataset contains an even number of values, then the median value of the dataset can be found by taking the mean of the middle two values. Let’s use the same example of salary of 10 employees and after arranging data in ascending order –

Salary ($)
12000
13000
15000
15000
15500
17000
17500
18500
86000
93000

Two middle values (5^th and 6^th) are 15500 and 17000 and average of it will give the median value i.e. 16250.

Median with Odd Data Set

When the dataset contains an odd number of values, then the middle value of the data set will be the median value. As per the below table, after arranging data in ascending order –

Salary ($)
12000
13000
15000
15000
15500
17500
18500
86000
93000

The middle value (5^thvalue) is 15500 is the median value of the data set.

Mode

The value that occurs most frequently in a data set is called the mode of the data. If no two categories in the given data are the same, then the dataset has no mode. A dataset may have more than one mode if multiple categories repeat an equal number of times. The mode is the only measure of central tendency that is used for categorical variables.

Consider the given dataset 5, 4, 2, 3, 2, 1, 5, 4, 5

Mode
5
5
5
4
4
3
2
2
1

Since the mode represents the most common value. Therefore, the most recurrently occurring value in the given data set is 5.

On a histogram or bar chart, the element with the highest bar represents the mode. Therefore, the mode is sometimes considered the most popular option.

“The histogram representing the mode of a data”

Consider the example given below:

“The bar graph representing the preferred modes of transport”

In this particular data set, the preferred mode of transport is the bus.

Why is Mode Rarely used with Continuous data?

The mode is particularly problematic with continuous data because it is more likely not to have any value that is more frequent than the other.

For example, consider the data set consisting of the weights of 30 people. How likely is it that that two or more people with exactly the same weight (e.g., 55.4 kg) are present in the same sample? The answer would be that it is perhaps highly unlikely. Though many people might be close, it is impossible to find two people with exactly the same weight (to the nearest 0.1 kg), with such a small sample (30 people) and a large range of possible weights. This is why the mode is very rarely used with continuous data.

Other Limitations of Using Mode

One of the major limitations with the mode is that it is not unique. So it leaves with problems when having two or more values that share the highest frequency, such as below:

Summary of When to Use Mean, Median and Mode

The below table will help to choose the best measures of central tendency with respect to different types of variables.

Type of Variable	The Best Measure of Central Tendency
Nominal	Mode
Ordinal	Median
Interval/Ratio (not skewed)	Mean
Interval/Ratio (skewed)	Median

Formula

Arithmetic mean: $\bar{x} = \frac{x_{1} + x_{2} + ... + x_{n}}{n}$

Context and Applications

Measures of central tendency are useful for:
School and college-level education
Post-graduation course in mathematics
Data analysis courses
Engineering courses

Want more help with your statistics homework?

We've got you covered with step-by-step solutions to millions of textbook problems, subject matter experts on standby 24/7 when you're stumped, and more.

Check out a sample statistics Q&A solution here!

*Response times may vary by subject and question complexity. Median response time is 34 minutes for paid subscribers and may be longer for promotional offers.

Search. Solve. Succeed!

Study smarter access to millions of step-by step textbook solutions, our Q&A library, and AI powered Math Solver. Plus, you get 30 questions to ask an expert each month.

Tagged in

Math Statistics

Descriptive Statistics

Centre, Spread, and Shape of a Distribution

Mean, Median, Mode Homework Questions from Fellow Students

Browse our recently answered Mean, Median, Mode homework questions.

Q: We consider the one-period model studied in class as an example. Namely, we assumethat the current…

Q: Construct a model of population flow between metropolitan and nonmetropolitan areas of a given…

Q: You draw two cards from a standard deck of 52 cards, but before you draw the second card, you put…

Q: 23 வ dous biops Which marginal probabilities that you find in a two-way table should sum to 1? 著

Q: Pls help asap

Q: solve part a on paper

Q: Review a classmate's Main Post. 1. State if you agree or disagree with the choices made for…

Q: Suppose that the average length of stay in Europe for American tourists is 17 days, with standard…

Q: 3. A bag of Skittles contains five colors: red, orange, green, yellow, and purple. The probabilities…

Q: 7% of all Americans live in poverty. If 40 Americans are randomly selected, find the probability…

Q: If a uniform distribution is defined over the interval from 6 to 10, then answer the followings:…

Q: C4 Q6 V1: Randomly collected student data in the dataset STATISTICSSTUDENTSSURVEYFORR contains the…

Q: 1) and let Xt is stochastic process with WSS and Rxlt t+t) 1) E (X5) = \ 1 2 Show that E (X5 = X 3 =…

Q: Problem 4. Margrabe formula and the Greeks (20 pts) In the homework, we determined the Margrabe…

Q: Three players (one divider and two choosers) are going to divide a cake fairly using the lone…

Q: Need help with the following questions on statistics.

Q: 310015 K Question 9, 5.2.28-T Part 1 of 4 HW Score: 85.96%, 49 of 57 points Points: 1 Save of 6…

Q: We consider a (European) call option on a stock with expiration in 3 months and strike price $10.…

Q: I need help with this problem and an explanation of the solution for the image described below.…

Q: 2 50 Describe the relationship between X and Y shown by the scatterplot in the following figure.…

Q: Business Discuss

Q: For unemployed persons in the United States, the average number of months of unemployment at the end…

Q: DATA TABLE VALUES Meal Price ($) 22.78 31.90 33.89 22.77 18.04 23.29 35.28…

Q: The class will include a data exercise where students will be introduced to publicly available data…

Q: (b) State Fubini's Theorem without proof. Theorem to demonstrate that

Q: We consider a one-period market with the following properties: the current stock priceis S0 = 4. At…

Q: Given the following sample data values: 7, 12, 15, 9, 15, 13, 12, 10, 18,12 Find the following: a) Σ…

Q: According to health professionals, a person’s weight is expected to increase with age. To examine…

Q: Please help with this following question I'm not too sure if question (a) and (b) are correct…

Q: Consider an event X comprised of three outcomes whose probabilities are 9/18, 1/18,and 6/18.…

Q: y of 45 home- televisions u find that 010020 le own one, ee, and 1 owns y histogram of 4 Suppose…

Q: The college hiking club is having a fundraiser to buy new equipment for fall and winter outings. The…

Q: What percentage of the general U.S. population have bachelor's degrees? Suppose that the Statistical…

Q: = Consider the hypothesis test Ho: μ₁ = μ₂ against H₁ μ₁ μ2. Suppose that sample sizes are n₁ = 15…

Q: The following data show the year to date percent change (YTD % Change) for 30 stock-market indexes…

Q: 1 M&Ms colors come in the following percent- ages: 13 percent brown, 14 percent yellow, 13 percent…

Q: Exercise 4.2 Prove that, if A and B are independent, then so are A and B, Ac and B, and A and B.

Q: Question 6 The data shown in Table 3 are and R values for 24 samples of size n = 5 taken from a…

Q: 30. (a) What is meant by the term "product measur"? AND

Q: Obtain the linear equation for trend for time series with St² = 140, Ey = 16.91 and Σty= 62.02, m n…

Q: Peggy conducted a study to identify the randomness of rainy days in fall. For 15 days, she recorded…

Q: 2 Make a histogram from this data set of test scores: 72, 79, 81, 80, 63, 62, 89, 99, 50, 78, 87,…

Q: Table of hours of television watched per week: 11 15 24 34 36 22 20 30 12 32 24 36 42 36 42 26 37 39…

Q: Cycles to failure Position in ascending order 0.5 f(x)) (x;) Problem 44 Marsha, a renowned cake…

Q: Information for questions 4 • • Please Download "wages" from Canvas (the link to this dataset is…

Q: 13 Can the mean of a data set be higher than most of the values in the set? If so, how? Can the…

Q: Consider the hypotheses: Hop=po H₁ppo where 2 is known. Derive a general expression for determining…

Q: If 40 percent of university students purchase their textbooks online, in a random sample of five…

Q: A poll before the elections showed that in a given sample 79% of people vote for candidate C. How…

Q: Question 4. We consider a CRR model with So == 5 and up and down factors u = 1.03 and d = 0.96. We…

Search. Solve. Succeed!

Study smarter access to millions of step-by step textbook solutions, our Q&A library, and AI powered Math Solver. Plus, you get 30 questions to ask an expert each month.

Tagged in

Math Statistics