Heavenly Chocolates manufactures and sells quality chocolate products at its plant and retail store located in Saratoga Springs, New York. Two years ago, the company developed a web site and began selling its products over the Internet. Web-site sales have exceeded the company’s expectations, and management is now considering strategies to increase sales even further. To learn more about the web-site customers, a sample of 50 Heavenly Chocolate transactions was selected from the previous month’s sales. Data showing the day of the week each transaction was made, the type of browser the customer used, the time spent on the web site, the number of web pages viewed, and the amount spent by each of the 50 customers are contained in the file named Heavenly Chocolates. A portion of the data is shown in the table that follows:
Heavenly Chocolates would like to use the sample data to determine whether online shoppers who spend more time and view more pages also spend more money during their visit to the web site. The company would also like to investigate the effect that the day of the week and the type of browser have on sales.
Managerial Report
Use the methods of
- 1. Graphical and numerical summaries for the length of time the shopper spends on the web site, the number of pages viewed, and the
mean amount spent per transaction. Discuss what you learn about Heavenly Chocolates’ online shoppers from these numerical summaries. - 2. Summarize the frequency, the total dollars spent, and the mean amount spent per transaction for each day of week. Discuss the observations you can make about Heavenly Chocolates’ business based on the day of the week?
- 3. Summarize the frequency, the total dollars spent, and the mean amount spent per transaction for each type of browser. Discuss the observations you can make about Heavenly Chocolates’ business based on the type of browser?
- 4. Develop a
scatter diagram , and compute the samplecorrelation coefficient to explore the relationship between the time spent on the web site and the dollar amount spent. Use the horizontal axis for the time spent on the web site. Discuss your findings. - 5. Develop a scatter diagram, and compute the sample correlation coefficient to explore the relationship between the number of web pages viewed and the amount spent. Use the horizontal axis for the number of web pages viewed. Discuss your findings.
- 6. Develop a scatter diagram, and compute the sample correlation coefficient to explore the relationship between the time spent on the web site and the number of pages viewed. Use the horizontal axis to represent the number of pages viewed. Discuss your findings.
1. Provide the graphical and numerical summaries for the length of time that the shopper spends on the website, the number of pages viewed, and the mean amount spent per transaction. Explain that is observed from the numerical summaries.
2. Give a summary for the frequency, the total dollars spent, and the mean amount spent per transaction for each day of week. Give interpretation.
3. Give a summary for the frequency, the total dollars spent, and the mean amount spent per transaction for each type of browser. Give interpretation.
4. Give a scatter diagram and calculate the sample correlation coefficient to explore the relationship between the time spent on the web site and the dollar amount spent. Give interpretation.
5. Give a scatter diagram and calculate the sample correlation coefficient to explore the relationship between the number of web pages viewed and the dollar amount spent. Give interpretation.
6. Give a scatter diagram and calculate the sample correlation coefficient to explore the relationship between the time spent on the web site and the number of pages viewed. Give interpretation.
Answer to Problem 1C
1. Numerical summaries:
The frequency distribution and percent frequency distribution for the length of time that the shopper spends on the website are given below:
The frequency distribution and percent frequency distribution for the number of pages viewed are given below:
The frequency distribution and percent frequency distribution for the amount spent are given below:
The mean amount spent per transaction is obtained as 68.13.
Graphical summary:
The histogram for the length of time that the shopper spends on the website is shown below:
The bar chart for the number of pages viewed is shown below:
The histogram for the amount spent is shown below:
2. The frequency, the total dollars spent, and the mean amount spent per transaction for each day of week are as follows:
3. The frequency, the total dollars spent, and the mean amount spent per transaction for each type of browser are as follows:
4. The scatter diagram of the time spent on the web site and the dollar amount spent are shown below:
The sample correlation coefficient that explores the relationship between the time spent on the web site and the dollar amount spent is 0.58.
5. The scatter diagram of the number of web pages viewed and the dollar amount spent are given below:
The sample correlation coefficient that explores the relationship between the number of web pages viewed and the dollar amount spent is 0.72.
6. The scatter diagram of the time spent on the web site and the number of pages viewed are shown below:
The sample correlation coefficient that explores the relationship between the time spent on the web site and the number of pages viewed is 0.60.
Explanation of Solution
1.
Numerical summary:
Step-by-step procedure to create frequency distribution and percent frequency distribution for the length of time the shopper spends on the website using an Excel:
- Select Insert > PivotTable.
- In Select a table or range, select the data of Time (min) and click OK.
- In PivotTable Fields, move Time (min) to Rows and Σ Values.
- Right click on a value from Row Labels and select Group.
- Enter 5 in By.
- Click on Time (min) from Σ Values.
- Select Value Field settings.
- In Summarize value field by, choose Count and click OK.
- Again, move Time (min) to Rows and Σ Values.
- Click on Time (min) from Σ Values.
- Select Value Field settings.
- In Show Values As, choose % of Grand Total and click OK.
Thus, the frequency distribution and percent frequency distribution for the length of time the shopper spends on the website are obtained.
Step-by-step procedure to create frequency distribution and percent frequency distribution for the number of pages viewed using an Excel:
- Select Insert > PivotTable.
- In Select a table or range, select the data of Pages Viewed and click OK.
- In PivotTable Fields, move Pages Viewed to Rows and Σ Values.
- Right click on a value from Row Labels and select Group.
- Enter 5 in By.
- Click on Pages Viewed from Σ Values.
- Select Value Field settings.
- In Summarize value field by, choose Count and click OK.
- Again, move Pages Viewed to Rows and Σ Values.
- Click on Pages Viewed from Σ Values.
- Select Value Field settings.
- In Show Values As, choose % of Grand Total and click OK.
Thus, the frequency distribution and percent frequency distribution for the number of pages viewed are obtained.
Step-by-step procedure to obtain the mean amount spent per transaction using an Excel:
- In cell F52, enter “=AVERAGE(F2:F51)”.
- Click Enter.
Thus, the mean amount spent per transaction is obtained as 68.13.
From the numerical summaries, it is clear that the highest percent frequency for length of time that the shopper spends on the website is 9.3 hours to 14.3 hours. Also, the highest percent frequency for the number of pages viewed is 4.
Graphical summaries:
Step-by-step procedure to obtain a histogram for the length of time that the shopper spends on the website using an Excel:
- Select the data of class interval and percent frequency.
- Select Insert.
- Choose Clustered Column under Charts.
- Click on a bar in the graph.
- In Format Data Series, enter Gap width as 0%.
Thus, the histogram for the length of time that the shopper spends on the website is obtained.
Step-by-step procedure to obtain a bar chart for the number of pages viewed using an Excel:
- Select the data of class interval and frequency.
- Select Insert.
- Choose Clustered Column under Charts.
Thus, the bar chart for the number of pages viewed is obtained.
Step-by-step procedure to obtain a histogram for the amount spent using Excel:
- Select the data of class interval and percent frequency.
- Select Insert.
- Choose Clustered Column under Charts.
- Click on a bar in the graph.
- In Format Data Series, enter Gap width as 0%.
Thus, the histogram for the amount spent is obtained.
2.
Step-by-step procedure to obtain the frequency, the total dollars spent, and the mean amount spent per transaction for each day of week using an Excel:
- Select Insert > PivotTable.
- In Select a table or range, select the data of Customer, Day, and Amount Spent ($) and click OK.
- In PivotTable Fields, move Day to Rows and Customer and Amount Spent ($) to Σ Values.
- Click on Amount Spent ($) from Σ Values.
- Select Value Field settings.
- In Summarize value field by, choose Sum and click OK.
- Again, move Amount Spent ($) to Σ Values.
- Click on Amount Spent ($) from Σ Values.
- Select Value Field settings.
- In Summarize value field by, choose Average and click OK.
Thus, the frequency, the total dollars spent, and the mean amount spent per transaction for each day of week are obtained.
It is clear from the output that the average amount spent on Monday is higher than other days. However, the sum of amount spent is higher on Friday. This is because the number of customers on Friday is more than the number of customers on Monday.
3.
Step-by-step procedure to obtain the frequency, the total dollars spent, and the mean amount spent per transaction for each type of browser using an Excel:
- Select Insert > PivotTable.
- In Select a table or range, select the data of Customer, Browser, and Amount Spent ($) and click OK.
- In PivotTable Fields, move Browser to Rows and Customer and Amount Spent ($) to Σ Values.
- Click on Amount Spent ($) from Σ Values.
- Select Value Field settings.
- In Summarize value field by, choose Sum and click OK.
- Again, move Amount Spent ($) to Σ Values.
- Click on Amount Spent ($) from Σ Values.
- Select Value Field settings.
- In Summarize value field by, choose Average and click OK.
Thus, the frequency, the total dollars spent, and the mean amount spent per transaction for each type of browser are obtained.
It is clear from the output that the average amount spent for Firefox is higher than other browsers. However, the sum of amount spent is higher for chrome. This is because that the number of customers using Chrome is higher than the number of customers using Firefox.
4.
Step-by-step procedure to obtain a scatter diagram of the time spent on the web site and the dollar amount spent using an Excel:
- Select the data of Time (min) and Amount Spent ($).
- Select Insert.
- Choose Scatter under Charts.
Thus, the scatter diagram of the time spent on the web site and the dollar amount spent are obtained.
Step-by-step procedure to obtain the sample correlation coefficient that explores the relationship between the time spent on the web site and the dollar amount spent using an Excel:
- In an empty cell, type “=CORREL(D2:D51, F2:F51)”.
- Click Enter.
The sample correlation coefficient that explores the relationship between the time spent on the web site and the dollar amount spent is obtained as 0.58.
It is clear that there is a positive linear correlation between the time spent on the web site and the dollar amount spent. That is, as time spent on the web site increases, the dollar amount spent also increases.
5.
Step-by-step procedure to obtain a scatter diagram of the number of web pages viewed and the dollar amount spent using an Excel:
- Select the data of Pages Viewed and Amount Spent ($).
- Select Insert.
- Choose Scatter under Charts.
Thus, the scatter diagram of the number of web pages viewed and the dollar amount spent is obtained.
Step-by-step procedure to obtain the sample correlation coefficient that explores the relationship between the number of web pages viewed and the dollar amount spent using an Excel:
- In an empty cell, type “=CORREL(E2:E51, F2:F51)”.
- Click Enter.
The sample correlation coefficient that explores the relationship between the number of web pages viewed and the dollar amount spent is 0.72.
It is clear that there is a positive linear correlation between the number of web pages viewed and the dollar amount spent. That is, as number of web pages viewed increases, the dollar amount spent also increases.
6.
Step-by-step procedure to obtain the scatter diagram of the time spent on the web site and the number of pages viewed using an Excel:
- Select the data of Time (min) and Pages Viewed.
- Select Insert.
- Choose Scatter under Charts.
Thus, the scatter diagram of the time spent on the web site and the number of pages viewed are obtained.
Step-by-step procedure to obtain the sample correlation coefficient that explores the relationship between the time spent on the web site and the number of pages viewed using an Excel:
- In an empty cell, type “=CORREL(D2:D51, E2:E51)”.
- Click Enter.
The sample correlation coefficient that explores the relationship between the time spent on the web site and the number of pages viewed is 0.60.
It is clear that there is a positive linear correlation between the time spent on the web site and the number of pages viewed. That is, as the number of pages viewed increases, the time spent on the web site also increases.
Want to see more full solutions like this?
Chapter 2 Solutions
Mindtap Business Analytics, 1 Term (6 Months) Printed Access Card For Camm/cochran/fry/ohlmann/anderson/sweeney/williams' Essentials Of Business Analytics, 2nd
- y of 45 home- televisions u find that 010020 le own one, ee, and 1 owns y histogram of 4 Suppose that you have a loaded die. You roll it several times and record the outcomes, which are shown in the following figure. Histogram for Loaded Die 444% 34.00 48% 6% 2% Frequency 20 20 15 155 10 5- ம 0 1 2 3 4 Outcome 5 6 a. Make a relative frequency histogram of these results. b. You can make a relative frequency histo- gram from a frequency histogram; can you go the other direction?arrow_forwardCalculate the mean for Study Hours and Test Scores. Compute the covariance between the two variables using the formula: Calculate the standard deviation for Study Hours (X) and Test Scores (Y). Determine the correlation coefficient Interpret the results: What does the calculated r-value indicate about the relationship between study hours and test scores?arrow_forwardFor unemployed persons in the United States, the average number of months of unemployment at the end of December 2009 was approximately seven months (Bureau of Labor Statistics, January 2010). Suppose the following data are for a particular region in upstate New York. The values in the first column show the number of months unemployed and the values in the second column show the corresponding number of unemployed persons. Months Unemployed Number Unemployed 1 1029 2 1686 3 2269 4 2675 5 3487 6 4652 7 4145 8 3587 9 2325 10 1120 Let x be a random variable indicating the number of months a person is unemployed. a. Use the data to develop an empirical discrete probability distribution for x (to 4 decimals). (x) f(x) 1 2 3 4 5 6 7 8 9 10 b. Show that your probability distribution satisfies the conditions for a valid discrete probability distribution. The input in the box below will not be graded, but may be reviewed and considered by your instructor. blank c. What is the probability that a…arrow_forward
- West Virginia has one of the highest divorce rates in the nation, with an annual rate of approximately 5 divorces per 1000 people (Centers for Disease Control and Prevention website, January 12, 2012). The Marital Counseling Center, Inc. (MCC) thinks that the high divorce rate in the state may require them to hire additional staff. Working with a consultant, the management of MCC has developed the following probability distribution for x = the number of new clients for marriage counseling for the next year. Excel File: data05-19.xls x 10 f(x) .05 20 30 .10 .10 40 .20 50 60 .35 .20 a. Is this probability distribution valid? - Select your answer- Explain. f(x) Σf(x) Select your answer Select your answer b. What is the probability MCC will obtain more than 30 new clients (to 2 decimals)? c. What is the probability MCC will obtain fewer than 20 new clients (to 2 decimals)? d. Compute the expected value and variance of x. Expected value Variance clients per year squared clients per yeararrow_forwardFor unemployed persons in the United States, the average number of months of unemployment at the end of December 2009 was approximately seven months (Bureau of Labor Statistics, January 2010). Suppose the following data are for a particular region in upstate New York. The values in the first column show the number of months unemployed and the values in the second column show the corresponding number of unemployed persons. Months Unemployed Number Unemployed 1 1029 2 1686 3 2269 4 2675 5 3487 6 4652 7 4145 8 3587 9 2325 10 1120 Let x be a random variable indicating the number of months a person is unemployed. a. Use the data to develop an empirical discrete probability distribution for x (to 4 decimals). (x) f(x) 1 2 3 4 5 6 7 8 9 10 b. Show that your probability distribution satisfies the conditions for a valid discrete probability distribution. The input in the box below will not be graded, but may be reviewed and considered by your instructor. c. What is the probability that a person…arrow_forwardIn Gallup's Annual Consumption Habits Poll, telephone interviews were conducted for a random sample of 1014 adults aged 18 and over. One of the questions was "How many cups of coffee, if any, do you drink on an average day?" The following table shows the results obtained (Gallup website, August 6, 2012). Excel File: data05-23.xls Number of Cups per Day Number of Responses 0 365 264 193 3 4 or more 91 101 Define a random variable x = number of cups of coffee consumed on an average day. Let x = 4 represent four or more cups. Round your answers to four decimal places. a. Develop a probability distribution for x. x 0 1 2 3 4 f(x) b. Compute the expected value of x. cups of coffee c. Compute the variance of x. cups of coffee squared d. Suppose we are only interested in adults that drink at least one cup of coffee on an average day. For this group, let y = the number of cups of coffee consumed on an average day. Compute the expected value of y. Compare it to the expected value of x. The…arrow_forward
- In Gallup's Annual Consumption Habits Poll, telephone interviews were conducted for a random sample of 1014 adults aged 18 and over. One of the questions was "How many cups of coffee, if any, do you drink on an average day?" The following table shows the results obtained (Gallup website, August 6, 2012). Excel File: data05-23.xls Number of Cups per Day Number of Responses 0 365 264 193 2 3 4 or more 91 101 Define a random variable x = number of cups of coffee consumed on an average day. Let x = 4 represent four or more cups. Round your answers to four decimal places. a. Develop a probability distribution for x. x 0 1 2 3 f(x) b. Compute the expected value of x. cups of coffee c. Compute the variance of x. cups of coffee squared d. Suppose we are only interested in adults that drink at least one cup of coffee on an average day. For this group, let y = the number of cups of coffee consumed on an average day. Compute the expected value of y. Compare it to the expected value of x. The…arrow_forwardA technician services mailing machines at companies in the Phoenix area. Depending on the type of malfunction, the service call can take 1, 2, 3, or 4 hours. The different types of malfunctions occur at about the same frequency. Develop a probability distribution for the duration of a service call. Duration of Call x f(x) 1 2 3 4 Which of the following probability distribution graphs accurately represents the data set? Consider the required conditions for a discrete probability function, shown below.Does this probability distribution satisfy equation (5.1)?Does this probability distribution satisfy equation (5.2)? What is the probability a service call will take three hours? A service call has just come in, but the type of malfunction is unknown. It is 3:00 P.M. and service technicians usually get off at 5:00 P.M. What is the probability the service technician will have to work overtime to fix the machine today?arrow_forwardA psychologist determined that the number of sessions required to obtain the trust of a new patient is either 1, 2, or 3. Let x be a random variable indicating the number of sessions required to gain the patient's trust. The following probability function has been proposed. x f(x) for x = 1, 2, or 3 a. Consider the required conditions for a discrete probability function, shown below. f(x) ≥0 Σf(x) = 1 (5.1) (5.2) Does this probability distribution satisfy equation (5.1)? Select Does this probability distribution satisfy equation (5.2)? Select b. What is the probability that it takes exactly 2 sessions to gain the patient's trust (to 3 decimals)? c. What is the probability that it takes at least 2 sessions to gain the patient's trust (to 3 decimals)?arrow_forward
- A technician services mailing machines at companies in the Phoenix area. Depending on the type of malfunction, the service call can take 1, 2, 3, or 4 hours. The different types of malfunctions occur at about the same frequency. Develop a probability distribution for the duration of a service call. Which of the following probability distribution graphs accurately represents the data set? Consider the required conditions for a discrete probability function, shown below.Does this probability distribution satisfy equation (5.1)?Does this probability distribution satisfy equation (5.2)? What is the probability a service call will take three hours? A service call has just come in, but the type of malfunction is unknown. It is 3:00 P.M. and service technicians usually get off at 5:00 P.M. What is the probability the service technician will have to work overtime to fix the machine today?arrow_forwardWest Virginia has one of the highest divorce rates in the nation, with an annual rate of approximately 5 divorces per 1000 people (Centers for Disease Control and Prevention website, January 12, 2012). The Marital Counseling Center, Inc. (MCC) thinks that the high divorce rate in the state may require them to hire additional staff. Working with a consultant, the management of MCC has developed the following probability distribution for x = the number of new clients for marriage counseling for the next year. Excel File: data05-19.xls 10 20 f(x) .05 .10 11 30 40 50 60 .10 .20 .35 .20 a. Is this probability distribution valid? Yes Explain. greater than or equal to 0 f(x) Σf(x) equal to 1 b. What is the probability MCC will obtain more than 30 new clients (to 2 decimals)? c. What is the probability MCC will obtain fewer than 20 new clients (to 2 decimals)? d. Compute the expected value and variance of x. Expected value Variance clients per year squared clients per yeararrow_forwardReconsider the patient satisfaction data in Table 1. Fit a multiple regression model using both patient age and severity as the regressors. (a) Test for significance of regression. (b) Test for the individual contribution of the two regressors. Are both regressor variables needed in the model? (c) Has adding severity to the model improved the quality of the model fit? Explain your answer.arrow_forward
- Glencoe Algebra 1, Student Edition, 9780079039897...AlgebraISBN:9780079039897Author:CarterPublisher:McGraw HillBig Ideas Math A Bridge To Success Algebra 1: Stu...AlgebraISBN:9781680331141Author:HOUGHTON MIFFLIN HARCOURTPublisher:Houghton Mifflin HarcourtHolt Mcdougal Larson Pre-algebra: Student Edition...AlgebraISBN:9780547587776Author:HOLT MCDOUGALPublisher:HOLT MCDOUGAL