Concept explainers
The motion picture industry is an extremely competitive business. Dozens of movie studios produce hundreds of movies each year, many of which cost hundreds of millions of dollars to produce and distribute. Some of these movies will go on to earn hundreds of millions of dollars in box office revenues, while others will earn much less than their production cost.
Data from 50 of the top box-office-receipt-generating movies are provided in the file Top50Movies. The following table shows the first 10 movies contained in this data set. The categorical variables included in the data set for each movie are the rating and genre. Quantitative variables for the movie’s release year, inflation- and noninflation-adjusted box-office receipts in the United States, budget, and the world box-office receipts are also included.
Managerial Report
Use the data-visualization methods presented in this chapter to explore these data and discover relationships between the variables. Include the following in your report:
- 1. Create a scatter chart to examine the relationship between the year released and the inflation-adjusted U.S. box office receipts. Include a trendline for this scatter chart. What does the scatter chart indicate about inflation-adjusted U.S. box office receipts over time for these top 50 movies?
- 2. Create a scatter chart to examine the relationship between the budget and the noninflation-adjusted world box office receipts. (Note: You may have to adjust the data in Excel to ignore the missing budget data values to create your scatter chart. You can do this by first sorting the data using Budget and then creating a scatter chart using only the movies that include data for Budget.) What does this scatter chart indicate about the relationship between the movie’s budget and the world box office receipts?
- 3. Create a frequency distribution, percent frequency distribution, and histogram for inflation-adjusted U.S. box office receipts. Use bin sizes of $100 million. Interpret the results. Do any data points appear to be outliers in this distribution?
- 4. Create a PivotTable for these data. Use the PivotTable to generate a crosstabulation for movie genre and rating. Determine which combinations of genre and rating are most represented in the top 50 movie data. Now filter the data to consider only movies released in 1980 or later. What combinations of genre and rating are most represented for movies after 1980? What does this indicate about how the preferences of moviegoers may have changed over time?
- 5. Use the PivotTable to display the average inflation-adjusted U.S. box-office receipts for each genre–rating pair for all movies in the data set. Interpret the results.
1. Give a scatter chart to examine the relationship between the year released and the inflation-adjusted U.S. box office receipts.
2. Give a scatter chart to examine the relationship between the budget and the noninflation-adjusted world box office receipts.
3. Construct a frequency distribution, percent frequency distribution, and histogram for inflation-adjusted U.S. box office receipts. Give interpretation of the results.
4. Construct crosstabulation for movie genre and rating. Find the combinations of genre and rating that are most represented in the top 50 movie data. Find the combinations of genre that are most represented for movies after 1980.
5. Construct the average inflation-adjusted U.S. box-office receipts for genre-rating pair for all movies in the data set using a PivotTable. Give interpretation.
Answer to Problem 1C
1. The scatter chart of the year released and the inflation-adjusted U.S. box office receipts are as follows:
2. The scatter chart to examine the relationship between the budget and the noninflation-adjusted world box office receipts is as follows:
3. The frequency distribution and percent frequency distribution for inflation-adjusted U.S. box office receipts are given below:
The histogram for inflation-adjusted U.S. box office receipts using an Excel:
4. The crosstabulation for movie genre and rating for top 50 movies is given below:
The crosstabulation for movie genre and rating for the movies released after 1980 is as follows:
5. The average inflation-adjusted U.S. box-office receipts for genre-rating pair for all movies in the data set is given below:
Explanation of Solution
1.
Step-by-step procedure to obtain the scatter chart of the year released and the inflation-adjusted U.S. box office receipts using an Excel:
- Select the data of Year Released and U.S. Box Office Receipts (Inflation Adjusted Millions $).
- Select Insert.
- Choose Scatter under Charts.
- In Chart Elements, check Trendline.
Thus, the scatter chart of the year released and the inflation-adjusted U.S. box office receipts is obtained.
From the scatter chart, it is clear that there is slight decrease in inflation over years. However, there is no clear linear pattern observed.
2.
Step-by-step procedure to obtain the scatter chart to examine the relationship between the budget and the noninflation-adjusted world box office receipts using an Excel:
- Select the data of Budget (Non-Inflation Adjusted Millions $) and U.S. Box Office Receipts (Non-Inflation Adjusted Millions $).
- Select Insert.
- Choose Scatter under Charts.
- In Chart Elements, check Trendline.
Thus, the scatter chart to examine the relationship between the budget and the noninflation-adjusted world box office receipts is obtained.
From the output, it is clear that as the budget increases, the noninflation-adjusted world box office receipts also increase.
3.
Step-by-step procedure to create frequency distribution and percent frequency distribution for inflation-adjusted U.S. box office receipts using an Excel:
- Select Insert > PivotTable.
- In Select a table or range, select the data of U.S. Box Office Receipts (Inflation Adjusted Millions $) and click OK.
- In PivotTable Fields, move U.S. Box Office Receipts (Inflation Adjusted Millions $) to Rows and Σ Values.
- Right click on a value from Row Labels.
- Enter 100 in By.
- Click on U.S. Box Office Receipts (Inflation Adjusted Millions $) from Σ Values.
- Select Value Field settings.
- In Summarize value field by, choose Count and click OK.
- Again, move U.S. Box Office Receipts (Inflation Adjusted Millions $) to Rows and Σ Values.
- Click on U.S. Box Office Receipts (Inflation Adjusted Millions $) from Σ Values.
- Select Value Field settings.
- In Show Values As, choose % of Grand Total and click OK.
Thus, the frequency distribution and percent frequency distribution are obtained.
Step-by-step procedure to obtain histogram for inflation-adjusted U.S. box office receipts using an Excel:
- Select the data of class interval and percent frequency.
- Select Insert.
- Choose Clustered Column under Charts.
- Click on a bar in the graph.
- In Format Data Series, enter Gap width as 0%.
Thus, the histogram is obtained.
From the distribution table and histogram, it is clear that the frequency for the lowest inflation-adjusted U.S. box office receipts value is the highest. As the value of inflation-adjusted U.S. box office receipts increases, the frequency decreases. The frequency is very low (2%) for the inflation-adjusted U.S. box office receipts value from 1,393 to 1,593.5. This values seem to be outlier.
4.
Step-by-step procedure to obtain crosstabulation for movie genre and rating for top 50 movies using an Excel:
- Select Insert > PivotTable.
- In Select a table or range, select the data of Rating and Genre and click OK.
- In PivotTable Fields, move Rating to Rows, Genre to Columns, and Genre to Σ Values.
- Click on Genre from Σ Values.
- Select Value Field settings.
- In Summarize value field by, choose Count and click OK.
Thus, the crosstabulation for movie genre and rating for top 50 movies is obtained.
From the crosstabulation of movie genre and rating for top 50 movies, it is observed that the combination of G and Animated (=8) is most represented in the top 50 movie data.
Step-by-step procedure to obtain crosstabulation for movie genre and rating for the movies released after 1980 using an Excel:
- Select the data and choose Filter under Sort & Filter.
- Click on the drop-down box in Year Release column.
- Select Number Filters and choose Greater than.
- In Is greater than, enter 1980.
- Select Insert > PivotTable.
- In Select a table or range, select the filtered data of Rating and Genre and click OK.
- In PivotTable Fields, move Rating to Rows, Genre to Columns, and Genre to Σ Values.
- Click on Genre from Σ Values.
- Select Value Field settings.
- In Summarize value field by, choose Count and click OK.
Thus, the crosstabulation for movie genre and rating for the movies released after 1980 is obtained.
From the crosstabulation of movie genre and rating for the movies released after 1980, it is observed that the combination of PG-13 and SciFi/Fantasy (=6) is most represented.
Also, over the time changes, the number of dramas released became reduced. The rating of G and PG becomes high.
5.
Step-by-step procedure to construct the average inflation-adjusted U.S. box-office receipts for genre-rating pair for all movies in the data set using an Excel:
- Select Insert > PivotTable.
- In Select a table or range, select the filtered data of U.S. Box Office Receipts (Inflation Adjusted Millions $), Rating, and Genre and click OK.
- In PivotTable Fields, move Rating to Rows, Genre to Columns, and U.S. Box Office Receipts (Inflation Adjusted Millions $) to Σ Values.
- Click on Genre from Σ Values.
- Select Value Field settings.
- In Summarize value field by, choose Average and click OK.
Thus, the average inflation-adjusted U.S. box-office receipts for genre-rating pair for all movies in the data set is constructed.
From the table, it is clear that the average U.S. box-office receipts are the highest for the genre-rating pair of G and Drama. Also, it is the lowest for G and Action.
Want to see more full solutions like this?
Chapter 3 Solutions
ESSEN OF BUSINESS ANALYTICS (LL) BOM
- A recent survey of 400 americans asked whether or not parents do too much for their young adult children. The results of the survey are shown in the data file. a) Construct the frequency and relative frequency distributions. How many respondents felt that parents do too much for their adult children? What proportion of respondents felt that parents do too little for their adult children? b) Construct a pie chart. Summarize the findingsarrow_forwardThe average number of minutes Americans commute to work is 27.7 minutes (Sterling's Best Places, April 13, 2012). The average commute time in minutes for 48 cities are as follows: Click on the datafile logo to reference the data. DATA file Albuquerque 23.3 Jacksonville 26.2 Phoenix 28.3 Atlanta 28.3 Kansas City 23.4 Pittsburgh 25.0 Austin 24.6 Las Vegas 28.4 Portland 26.4 Baltimore 32.1 Little Rock 20.1 Providence 23.6 Boston 31.7 Los Angeles 32.2 Richmond 23.4 Charlotte 25.8 Louisville 21.4 Sacramento 25.8 Chicago 38.1 Memphis 23.8 Salt Lake City 20.2 Cincinnati 24.9 Miami 30.7 San Antonio 26.1 Cleveland 26.8 Milwaukee 24.8 San Diego 24.8 Columbus 23.4 Minneapolis 23.6 San Francisco 32.6 Dallas 28.5 Nashville 25.3 San Jose 28.5 Denver 28.1 New Orleans 31.7 Seattle 27.3 Detroit 29.3 New York 43.8 St. Louis 26.8 El Paso 24.4 Oklahoma City 22.0 Tucson 24.0 Fresno 23.0 Orlando 27.1 Tulsa 20.1 Indianapolis 24.8 Philadelphia 34.2 Washington, D.C. 32.8 a. What is the mean commute time for…arrow_forwardMorningstar tracks the total return for a large number of mutual funds. The following table shows the total return and the number of funds for four categories of mutual funds. Click on the datafile logo to reference the data. DATA file Type of Fund Domestic Equity Number of Funds Total Return (%) 9191 4.65 International Equity 2621 18.15 Hybrid 1419 2900 11.36 6.75 Specialty Stock a. Using the number of funds as weights, compute the weighted average total return for these mutual funds. (to 2 decimals) % b. Is there any difficulty associated with using the "number of funds" as the weights in computing the weighted average total return in part (a)? Discuss. What else might be used for weights? The input in the box below will not be graded, but may be reviewed and considered by your instructor. c. Suppose you invested $10,000 in this group of mutual funds and diversified the investment by placing $2000 in Domestic Equity funds, $4000 in International Equity funds, $3000 in Specialty Stock…arrow_forward
- The days to maturity for a sample of five money market funds are shown here. The dollar amounts invested in the funds are provided. Days to Maturity 20 Dollar Value ($ millions) 20 12 30 7 10 5 6 15 10 Use the weighted mean to determine the mean number of days to maturity for dollars invested in these five money market funds (to 1 decimal). daysarrow_forwardc. What are the first and third quartiles? First Quartiles (to 1 decimals) Third Quartiles (to 4 decimals) × ☑ Which companies spend the most money on advertising? Business Insider maintains a list of the top-spending companies. In 2014, Procter & Gamble spent more than any other company, a whopping $5 billion. In second place was Comcast, which spent $3.08 billion (Business Insider website, December 2014). The top 12 companies and the amount each spent on advertising in billions of dollars are as follows. Click on the datafile logo to reference the data. DATA file Company Procter & Gamble Comcast Advertising ($billions) $5.00 3.08 2.91 Company American Express General Motors Advertising ($billions) $2.19 2.15 ETET AT&T Ford Verizon L'Oreal 2.56 2.44 2.34 Toyota Fiat Chrysler Walt Disney Company J.P Morgan a. What is the mean amount spent on advertising? (to 2 decimals) 2.55 b. What is the median amount spent on advertising? (to 3 decimals) 2.09 1.97 1.96 1.88arrow_forwardMartinez Auto Supplies has retail stores located in eight cities in California. The price they charge for a particular product in each city are vary because of differing competitive conditions. For instance, the price they charge for a case of a popular brand of motor oil in each city follows. Also shown are the number of cases that Martinez Auto sold last quarter in each city. City Price ($) Sales (cases) Bakersfield 34.99 501 Los Angeles 38.99 1425 Modesto 36.00 294 Oakland 33.59 882 Sacramento 40.99 715 San Diego 38.59 1088 San Francisco 39.59 1644 San Jose 37.99 819 Compute the average sales price per case for this product during the last quarter? Round your answer to two decimal places.arrow_forward
- Consider the following data and corresponding weights. xi Weight(wi) 3.2 6 2.0 3 2.5 2 5.0 8 a. Compute the weighted mean (to 2 decimals). b. Compute the sample mean of the four data values without weighting. Note the difference in the results provided by the two computations (to 3 decimals).arrow_forwardExpert only,if you don't know it don't attempt it, no Artificial intelligence or screen shot it solvingarrow_forwardFor context, the image provided below is a quesion from a Sepetember, 2024 past paper in statistical modelingarrow_forward
- For context, the images attached below (the question and the related figure) is from a january 2024 past paperarrow_forwardFor context, the image attached below is a question from a June 2024 past paper in statisical modelingarrow_forwardFor context, the images attached below are a question from a June, 2024 past paper in statistical modelingarrow_forward
- Glencoe Algebra 1, Student Edition, 9780079039897...AlgebraISBN:9780079039897Author:CarterPublisher:McGraw HillElementary Geometry for College StudentsGeometryISBN:9781285195698Author:Daniel C. Alexander, Geralyn M. KoeberleinPublisher:Cengage LearningHolt Mcdougal Larson Pre-algebra: Student Edition...AlgebraISBN:9780547587776Author:HOLT MCDOUGALPublisher:HOLT MCDOUGAL
- Functions and Change: A Modeling Approach to Coll...AlgebraISBN:9781337111348Author:Bruce Crauder, Benny Evans, Alan NoellPublisher:Cengage Learning