The motion picture industry is an extremely competitive business. Dozens of movie studios produce hundreds of movies each year, many of which cost hundreds of millions of dollars to produce and distribute. Some of these movies will go on to earn hundreds of millions of dollars in box office revenues, while others will earn much less than their production cost.
Data from 50 of the top box-office-receipt-generating movies are provided in the file Top50Movies. The following table shows the first 10 movies contained in this data set. The categorical variables included in the data set for each movie are the rating and genre. Quantitative variables for the movie’s release year, inflation- and noninflation-adjusted box-office receipts in the United States, budget, and the world box-office receipts are also included.
Managerial Report
Use the data-visualization methods presented in this chapter to explore these data and discover relationships between the variables. Include the following in your report:
- 1. Create a scatter chart to examine the relationship between the year released and the inflation-adjusted U.S. box office receipts. Include a trendline for this scatter chart. What does the scatter chart indicate about inflation-adjusted U.S. box office receipts over time for these top 50 movies?
- 2. Create a scatter chart to examine the relationship between the budget and the noninflation-adjusted world box office receipts. (Note: You may have to adjust the data in Excel to ignore the missing budget data values to create your scatter chart. You can do this by first sorting the data using Budget and then creating a scatter chart using only the movies that include data for Budget.) What does this scatter chart indicate about the relationship between the movie’s budget and the world box office receipts?
- 3. Create a frequency distribution, percent frequency distribution, and histogram for inflation-adjusted U.S. box office receipts. Use bin sizes of $100 million. Interpret the results. Do any data points appear to be outliers in this distribution?
- 4. Create a PivotTable for these data. Use the PivotTable to generate a crosstabulation for movie genre and rating. Determine which combinations of genre and rating are most represented in the top 50 movie data. Now filter the data to consider only movies released in 1980 or later. What combinations of genre and rating are most represented for movies after 1980? What does this indicate about how the preferences of moviegoers may have changed over time?
- 5. Use the PivotTable to display the average inflation-adjusted U.S. box-office receipts for each genre–rating pair for all movies in the data set. Interpret the results.
1. Create a scatter chart to examine the relationship between the year released and the inflation-adjusted U.S. box office receipts.
2. Create a scatter chart to examine the relationship between the budget and the noninflation-adjusted world box office receipts.
3. Construct a frequency distribution, percent frequency distribution, and histogram for inflation-adjusted U.S. box office receipts. Interpret the results.
4. Construct a crosstabulation for movie genre and rating. Find the combinations of genre and rating that are most represented in the top 50 movie data. Find the combinations of genre and rating that are most represented for movies after 1980.
5. Use a PivotTable to compute the average inflation-adjusted U.S. box-office receipts for each genre-rating pair for all movies in the data set. Interpret the results.
Answer to Problem 1C
1. The scatter chart of the year released and the inflation-adjusted U.S. box office receipts is as follows:
2. The scatter chart to examine the relationship between the budget and the noninflation-adjusted world box office receipts is as follows:
3. The frequency distribution and percent frequency distribution for inflation-adjusted U.S. box office receipts are given below:
The histogram for inflation-adjusted U.S. box office receipts, created using Excel, is as follows:
4. The crosstabulation for movie genre and rating for top 50 movies is given below:
The crosstabulation for movie genre and rating for the movies released after 1980 is as follows:
5. The average inflation-adjusted U.S. box-office receipts for each genre-rating pair for all movies in the data set are given below:
Explanation of Solution
1.
Step-by-step procedure to obtain the scatter chart of the year released and the inflation-adjusted U.S. box office receipts using Excel:
- Select the data of Year Released and U.S. Box Office Receipts (Inflation Adjusted Millions $).
- Select Insert.
- Choose Scatter under Charts.
- In Chart Elements, check Trendline.
Thus, the scatter chart of the year released and the inflation-adjusted U.S. box office receipts is obtained.
From the scatter chart, it is clear that there is a slight decrease in inflation-adjusted U.S. box office receipts over the years. However, no clear linear pattern is observed.
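The trendline in the scatter chart is a least-squares line, and the same fit can be sketched outside Excel. The following Python snippet is only an illustration: the original solution uses Excel, the function name `fit_trendline` is hypothetical, and the five data points are made up for the example rather than taken from Top50Movies.

```python
# Hypothetical (year, receipts) values in millions of dollars.
# These are NOT values from the Top50Movies file.
years = [1940, 1960, 1980, 2000, 2020]
receipts = [950.0, 750.0, 700.0, 650.0, 450.0]


def fit_trendline(x, y):
    """Return (slope, intercept) of the least-squares line y = slope * x + intercept."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sum of cross-deviations and sum of squared x-deviations.
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


slope, intercept = fit_trendline(years, receipts)
print(f"slope = {slope:.2f} per year, intercept = {intercept:.1f}")
```

A negative slope corresponds to a downward-sloping trendline. The slope alone does not show how tightly the points follow the line, so the chart still has to be inspected for scatter.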
2.
Step-by-step procedure to obtain the scatter chart to examine the relationship between the budget and the noninflation-adjusted world box office receipts using Excel:
- Select the data of Budget (Non-Inflation Adjusted Millions $) and World Box Office Receipts (Non-Inflation Adjusted Millions $).
- Select Insert.
- Choose Scatter under Charts.
- In Chart Elements, check Trendline.
Thus, the scatter chart to examine the relationship between the budget and the noninflation-adjusted world box office receipts is obtained.
From the output, it is clear that as the budget increases, the noninflation-adjusted world box office receipts also increase.
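The problem statement notes that movies with missing budget values must be excluded before charting. A minimal Python sketch of that filtering step, followed by a correlation coefficient as one way to quantify the direction of the relationship, is shown below. The original solution uses Excel; the rows, the use of `None` for a missing budget, and the helper `pearson_r` are assumptions made for this illustration.

```python
from math import sqrt

# Hypothetical (budget, world receipts) rows in millions of dollars.
# None marks a missing budget. These are NOT values from Top50Movies.
movies = [
    (10.0, 300.0),
    (None, 450.0),
    (50.0, 500.0),
    (100.0, 900.0),
    (None, 700.0),
    (200.0, 1500.0),
]

# Keep only the rows that include a budget, as the exercise suggests.
complete = [(b, w) for b, w in movies if b is not None]


def pearson_r(pairs):
    """Return the sample correlation coefficient for a list of (x, y) pairs."""
    n = len(pairs)
    mean_x = sum(x for x, _ in pairs) / n
    mean_y = sum(y for _, y in pairs) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in pairs)
    sxx = sum((x - mean_x) ** 2 for x, _ in pairs)
    syy = sum((y - mean_y) ** 2 for _, y in pairs)
    return sxy / sqrt(sxx * syy)


r = pearson_r(complete)
print(f"{len(complete)} movies with budget data, r = {r:.3f}")
```

A value of `r` close to 1 would be consistent with receipts rising as the budget rises; the scatter chart remains the primary evidence.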
3.
Step-by-step procedure to create the frequency distribution and percent frequency distribution for inflation-adjusted U.S. box office receipts using Excel:
- Select Insert > PivotTable.
- In Select a table or range, select the data of U.S. Box Office Receipts (Inflation Adjusted Millions $) and click OK.
- In PivotTable Fields, move U.S. Box Office Receipts (Inflation Adjusted Millions $) to Rows and Σ Values.
- Right-click a value under Row Labels and select Group.
- Enter 100 in By and click OK.
- Click on U.S. Box Office Receipts (Inflation Adjusted Millions $) from Σ Values.
- Select Value Field settings.
- In Summarize value field by, choose Count and click OK.
- Again, move U.S. Box Office Receipts (Inflation Adjusted Millions $) to Rows and Σ Values.
- Click on U.S. Box Office Receipts (Inflation Adjusted Millions $) from Σ Values.
- Select Value Field settings.
- In Show Values As, choose % of Grand Total and click OK.
Thus, the frequency distribution and percent frequency distribution are obtained.
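The grouping that the PivotTable performs can also be sketched in a few lines of Python. This is an illustration only: the receipts below are hypothetical, the function name `frequency_table` is an assumption, and bins are treated as left-closed intervals of width 100, which may label boundaries differently from Excel.

```python
from collections import Counter

# Hypothetical inflation-adjusted receipts in millions of dollars.
# These are NOT values from the Top50Movies file.
receipts = [412.0, 455.5, 478.0, 520.0, 599.9, 610.0, 845.0, 1120.0]
BIN_WIDTH = 100


def frequency_table(values, width):
    """Return rows of (bin_start, bin_end, count, percent) for non-empty bins.

    A value v falls in the bin [start, start + width).
    """
    counts = Counter(int(v // width) * width for v in values)
    total = len(values)
    return [
        (start, start + width, counts[start], 100.0 * counts[start] / total)
        for start in sorted(counts)
    ]


table = frequency_table(receipts, BIN_WIDTH)
for start, end, count, percent in table:
    print(f"{start}-{end}: {count} ({percent:.1f}%)")
```

Empty bins are omitted from the result. A bin that is separated from the rest by several empty bins is a candidate outlier region.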
Step-by-step procedure to obtain the histogram for inflation-adjusted U.S. box office receipts using Excel:
- Select the data of class interval and percent frequency.
- Select Insert.
- Choose Clustered Column under Charts.
- Click on a bar in the graph.
- In Format Data Series, enter Gap width as 0%.
Thus, the histogram is obtained.
From the distribution table and histogram, it is clear that the frequency is highest for the lowest values of inflation-adjusted U.S. box office receipts. As the value of inflation-adjusted U.S. box office receipts increases, the frequency decreases. The frequency is very low (2%) for inflation-adjusted U.S. box office receipts from 1,393 to 1,593.5. Values in this range appear to be outliers.
4.
Step-by-step procedure to obtain the crosstabulation for movie genre and rating for the top 50 movies using Excel:
- Select Insert > PivotTable.
- In Select a table or range, select the data of Rating and Genre and click OK.
- In PivotTable Fields, move Rating to Rows, Genre to Columns, and Genre to Σ Values.
- Click on Genre from Σ Values.
- Select Value Field settings.
- In Summarize value field by, choose Count and click OK.
Thus, the crosstabulation for movie genre and rating for top 50 movies is obtained.
From the crosstabulation of movie genre and rating for top 50 movies, it is observed that the combination of G and Animated (=8) is most represented in the top 50 movie data.
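A crosstabulation is a count of how often each rating and genre pair occurs, so it can be sketched with a counter keyed by the pair. The snippet below is an illustration in Python, not the Excel procedure from the solution, and the seven rows are hypothetical rather than taken from Top50Movies.

```python
from collections import Counter

# Hypothetical (rating, genre) rows. These are NOT rows from Top50Movies.
movies = [
    ("G", "Animated"),
    ("G", "Animated"),
    ("PG", "SciFi/Fantasy"),
    ("PG-13", "Action"),
    ("G", "Drama"),
    ("PG-13", "Action"),
    ("G", "Animated"),
]

# Each key is one cell of the crosstabulation; each value is that cell's count.
crosstab = Counter(movies)

# The most represented combination is the cell with the largest count.
most_common_pair, count = crosstab.most_common(1)[0]
print(most_common_pair, count)
```

Pairs that never occur are simply absent from the counter, which corresponds to the blank cells of the PivotTable.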
Step-by-step procedure to obtain the crosstabulation for movie genre and rating for the movies released after 1980 using Excel:
- Select the data and choose Filter under Sort & Filter.
- Click on the drop-down box in the Year Released column.
- Select Number Filters and choose Greater than.
- In Is greater than, enter 1980.
- Select Insert > PivotTable.
- In Select a table or range, select the filtered data of Rating and Genre and click OK.
- In PivotTable Fields, move Rating to Rows, Genre to Columns, and Genre to Σ Values.
- Click on Genre from Σ Values.
- Select Value Field settings.
- In Summarize value field by, choose Count and click OK.
Thus, the crosstabulation for movie genre and rating for the movies released after 1980 is obtained.
From the crosstabulation of movie genre and rating for the movies released after 1980, it is observed that the combination of PG-13 and SciFi/Fantasy (=6) is most represented.
Also, over time, the number of dramas among the top movies has decreased, while the ratings G and PG have become more common.
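The filter-then-count procedure for movies released after 1980 can be sketched as follows. This Python snippet is an illustration with hypothetical rows, and `crosstab_after` is an assumed helper name. It mirrors the "Greater than 1980" filter used in the steps, so a movie released in 1980 itself is excluded; changing `>` to `>=` would include it.

```python
from collections import Counter

# Hypothetical (year released, rating, genre) rows.
# These are NOT rows from the Top50Movies file.
movies = [
    (1939, "G", "Drama"),
    (1977, "PG", "SciFi/Fantasy"),
    (1980, "PG", "SciFi/Fantasy"),
    (1994, "G", "Animated"),
    (2009, "PG-13", "SciFi/Fantasy"),
    (2012, "PG-13", "SciFi/Fantasy"),
    (2015, "PG-13", "Action"),
]


def crosstab_after(rows, year):
    """Count (rating, genre) pairs for rows released strictly after `year`."""
    return Counter(
        (rating, genre) for released, rating, genre in rows if released > year
    )


recent = crosstab_after(movies, 1980)
for (rating, genre), count in recent.most_common():
    print(rating, genre, count)
```

Comparing this filtered table with the unfiltered one shows which combinations gained or lost representation over time.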
5.
Step-by-step procedure to construct the average inflation-adjusted U.S. box-office receipts for each genre-rating pair for all movies in the data set using Excel:
- Select Insert > PivotTable.
- In Select a table or range, select the data of U.S. Box Office Receipts (Inflation Adjusted Millions $), Rating, and Genre and click OK.
- In PivotTable Fields, move Rating to Rows, Genre to Columns, and U.S. Box Office Receipts (Inflation Adjusted Millions $) to Σ Values.
- Click on U.S. Box Office Receipts (Inflation Adjusted Millions $) from Σ Values.
- Select Value Field settings.
- In Summarize value field by, choose Average and click OK.
Thus, the average inflation-adjusted U.S. box-office receipts for genre-rating pair for all movies in the data set is constructed.
From the table, it is clear that the average U.S. box-office receipts are the highest for the genre-rating pair of G and Drama. Also, it is the lowest for G and Action.
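Averaging a value field within each rating and genre cell amounts to a grouped mean. A minimal Python sketch of that calculation is given below. It is not the Excel procedure from the solution; the rows are hypothetical and `average_by_pair` is an assumed helper name.

```python
from collections import defaultdict

# Hypothetical (rating, genre, inflation-adjusted receipts in millions) rows.
# These are NOT rows from the Top50Movies file.
movies = [
    ("G", "Drama", 1600.0),
    ("G", "Animated", 500.0),
    ("G", "Animated", 700.0),
    ("PG-13", "Action", 450.0),
    ("PG-13", "Action", 550.0),
]


def average_by_pair(rows):
    """Return {(rating, genre): mean receipts} for every pair that occurs."""
    totals = defaultdict(lambda: [0.0, 0])  # pair -> [running sum, row count]
    for rating, genre, receipts in rows:
        entry = totals[(rating, genre)]
        entry[0] += receipts
        entry[1] += 1
    return {pair: total / n for pair, (total, n) in totals.items()}


averages = average_by_pair(movies)
highest = max(averages, key=averages.get)
lowest = min(averages, key=averages.get)
print("highest:", highest, averages[highest])
print("lowest:", lowest, averages[lowest])
```

A cell that contains only one movie, such as the single drama in this toy data, reports that movie's receipts as its average, so a very high or very low cell average should be read together with the cell counts.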
Chapter 3 Solutions
Essentials of Business Analytics (MindTap Course List)