This problem does not require the use of data mining software and focuses on knowledge of concepts and basic calculations. The Boleyn sisters, a pair of entrepreneurs who recently sold their start-up for a multi-million-dollar sum, are looking for alternate investments for their newfound fortune. They are considering an investment in wine, similar to how some people invest in rare coins and fine art. To learn more about the properties of fine wine, they have collected data on 13 different characteristics of 178 wines. They have applied k-means clustering to these data for a range of k values and generated the following table of average silhouette values and elbow chart. k Average Silhouette Value 2 0.259 3 0.285 4 0.260 5 0.231 6 0.194 7 0.206 8 0.197 9 0.156 10 0.202 An elbow chart has a horizontal axis labeled "Number of Clusters" with values from 0 to 11 and a vertical axis labeled "Total Within-Cluster Sum of Squares" with values from 0 to 2,600. The elbow chart has 10 points connected by line segments. A pattern goes down and right becoming less steep from (1, 2,500) to (10, 880). The points are as follows.  (1, 2,500)  (2, 1,730)  (3, 1,390)  (4, 1,250)  (5, 1,190)  (6, 1,120)  (7, 1,050)  (8, 1,000)  (9, 950)  (10, 880) (a) Which value of k appears to be the most appropriate to categorize these wines? k =   (b) Because the average silhouette values are low for all values of k, the Boleyn sisters are concerned about the quality of their clustering results. What experiments could they conduct to potentially improve the clusters?

MATLAB: An Introduction with Applications
6th Edition
ISBN:9781119256830
Author:Amos Gilat
Publisher:Amos Gilat
Chapter1: Starting With Matlab
Section: Chapter Questions
Problem 1P
icon
Related questions
Question
This problem does not require the use of data mining software and focuses on knowledge of concepts and basic calculations.
The Boleyn sisters, a pair of entrepreneurs who recently sold their start-up for a multi-million-dollar sum, are looking for alternate investments for their newfound fortune. They are considering an investment in wine, similar to how some people invest in rare coins and fine art. To learn more about the properties of fine wine, they have collected data on 13 different characteristics of 178 wines. They have applied k-means clustering to these data for a range of k values and generated the following table of average silhouette values and elbow chart.
k Average Silhouette Value
2 0.259
3 0.285
4 0.260
5 0.231
6 0.194
7 0.206
8 0.197
9 0.156
10 0.202
An elbow chart has a horizontal axis labeled "Number of Clusters" with values from 0 to 11 and a vertical axis labeled "Total Within-Cluster Sum of Squares" with values from 0 to 2,600. The elbow chart has 10 points connected by line segments. A pattern goes down and right becoming less steep from (1, 2,500) to (10, 880). The points are as follows. 
  • (1, 2,500) 
  • (2, 1,730) 
  • (3, 1,390) 
  • (4, 1,250) 
  • (5, 1,190) 
  • (6, 1,120) 
  • (7, 1,050) 
  • (8, 1,000) 
  • (9, 950) 
  • (10, 880)
(a)
Which value of k appears to be the most appropriate to categorize these wines?
k =  
(b)
Because the average silhouette values are low for all values of k, the Boleyn sisters are concerned about the quality of their clustering results. What experiments could they conduct to potentially improve the clusters?
 
Expert Solution
trending now

Trending now

This is a popular solution!

steps

Step by step

Solved in 3 steps with 1 images

Blurred answer
Knowledge Booster
Optimization models
Learn more about
Need a deep-dive on the concept behind this application? Look no further. Learn more about this topic, statistics and related others by exploring similar questions and additional content below.
Similar questions
  • SEE MORE QUESTIONS
Recommended textbooks for you
MATLAB: An Introduction with Applications
MATLAB: An Introduction with Applications
Statistics
ISBN:
9781119256830
Author:
Amos Gilat
Publisher:
John Wiley & Sons Inc
Probability and Statistics for Engineering and th…
Probability and Statistics for Engineering and th…
Statistics
ISBN:
9781305251809
Author:
Jay L. Devore
Publisher:
Cengage Learning
Statistics for The Behavioral Sciences (MindTap C…
Statistics for The Behavioral Sciences (MindTap C…
Statistics
ISBN:
9781305504912
Author:
Frederick J Gravetter, Larry B. Wallnau
Publisher:
Cengage Learning
Elementary Statistics: Picturing the World (7th E…
Elementary Statistics: Picturing the World (7th E…
Statistics
ISBN:
9780134683416
Author:
Ron Larson, Betsy Farber
Publisher:
PEARSON
The Basic Practice of Statistics
The Basic Practice of Statistics
Statistics
ISBN:
9781319042578
Author:
David S. Moore, William I. Notz, Michael A. Fligner
Publisher:
W. H. Freeman
Introduction to the Practice of Statistics
Introduction to the Practice of Statistics
Statistics
ISBN:
9781319013387
Author:
David S. Moore, George P. McCabe, Bruce A. Craig
Publisher:
W. H. Freeman