Collaborate with your team to complete the following requirements: Prepare a document bearing the course title, date, names of all team members, faculty member's name, and title “Week 3 Team Assignment.” Download the SkiResorts.xlsx data file. Import it into a data frame in RStudio. Name the data frame descriptively. Issue the command: set.seed(7) Create a 3-cluster k-means model that includes all the numeric attributes in the data set (columns 4 through 11). Create a centroid table comprised of the cluster sizes and the centers for each attribute. Name this data object descriptively. Create a new data frame comprised of the original data set and the cluster number for each ski resort. Name this data frame descriptively. In your team assignment document, write a report of your k-means cluster analysis. Address all the following points in your report: Using images and specific values from the centroid table, discuss the average attribute values of small, medium, and large resorts. Discuss the average number of lifts across the 3 different clusters. What about this attribute is different from the other attributes’ means? Examine the lift ticket price averages. Discuss how elevation, snowfall, acreage, and other variables appear to impact the cost of tickets. The cluster containing the smallest, least expensive ski areas has more resorts than the other two clusters combined. Discuss the average attributes of this largest cluster of ski destinations. Examine the 64 resorts that are classified into the third cluster. Discuss the attributes that best characterize these resorts. Evaluate the ticket price attribute for these resorts. Although the average ticket price for these resorts is over $106 per day, there are some resorts that cost less than $50. Examine the means for these resorts and discuss why you believe these resorts were categorized alongside the other cluster of 3 resorts despite their lower cost. Discuss as a group what additional insights your team has made using the k-means analysis of this data. Summarize your group discussion, including where team members may have disagreed. Discuss as a group how k-means data analysis tools can be used by different organizations and how valuable each team member found this tool to be. Conclude with a summary of this group discussion, including where team members may have disagreed.
Collaborate with your team to complete the following requirements: Prepare a document bearing the course title, date, names of all team members, faculty member's name, and title “Week 3 Team Assignment.” Download the SkiResorts.xlsx data file. Import it into a data frame in RStudio. Name the data frame descriptively. Issue the command: set.seed(7) Create a 3-cluster k-means model that includes all the numeric attributes in the data set (columns 4 through 11). Create a centroid table comprised of the cluster sizes and the centers for each attribute. Name this data object descriptively. Create a new data frame comprised of the original data set and the cluster number for each ski resort. Name this data frame descriptively. In your team assignment document, write a report of your k-means cluster analysis. Address all the following points in your report: Using images and specific values from the centroid table, discuss the average attribute values of small, medium, and large resorts. Discuss the average number of lifts across the 3 different clusters. What about this attribute is different from the other attributes’ means? Examine the lift ticket price averages. Discuss how elevation, snowfall, acreage, and other variables appear to impact the cost of tickets. The cluster containing the smallest, least expensive ski areas has more resorts than the other two clusters combined. Discuss the average attributes of this largest cluster of ski destinations. Examine the 64 resorts that are classified into the third cluster. Discuss the attributes that best characterize these resorts. Evaluate the ticket price attribute for these resorts. Although the average ticket price for these resorts is over $106 per day, there are some resorts that cost less than $50. Examine the means for these resorts and discuss why you believe these resorts were categorized alongside the other cluster of 3 resorts despite their lower cost. Discuss as a group what additional insights your team has made using the k-means analysis of this data. Summarize your group discussion, including where team members may have disagreed. Discuss as a group how k-means data analysis tools can be used by different organizations and how valuable each team member found this tool to be. Conclude with a summary of this group discussion, including where team members may have disagreed.
Database System Concepts
7th Edition
ISBN:9780078022159
Author:Abraham Silberschatz Professor, Henry F. Korth, S. Sudarshan
Publisher:Abraham Silberschatz Professor, Henry F. Korth, S. Sudarshan
Chapter1: Introduction
Section: Chapter Questions
Problem 1PE
Related questions
Question
Collaborate with your team to complete the following requirements:
- Prepare a document bearing the course title, date, names of all team members, faculty member's name, and title “Week 3 Team Assignment.”
- Download the SkiResorts.xlsx data file. Import it into a data frame in RStudio. Name the data frame descriptively.
- Issue the command: set.seed(7)
- Create a 3-cluster k-means model that includes all the numeric attributes in the data set (columns 4 through 11).
- Create a centroid table comprised of the cluster sizes and the centers for each attribute. Name this data object descriptively.
- Create a new data frame comprised of the original data set and the cluster number for each ski resort. Name this data frame descriptively.
- In your team assignment document, write a report of your k-means cluster analysis. Address all the following points in your report:
- Using images and specific values from the centroid table, discuss the average attribute values of small, medium, and large resorts.
- Discuss the average number of lifts across the 3 different clusters. What about this attribute is different from the other attributes’ means?
- Examine the lift ticket price averages. Discuss how elevation, snowfall, acreage, and other variables appear to impact the cost of tickets.
- The cluster containing the smallest, least expensive ski areas has more resorts than the other two clusters combined. Discuss the average attributes of this largest cluster of ski destinations.
- Examine the 64 resorts that are classified into the third cluster. Discuss the attributes that best characterize these resorts. Evaluate the ticket price attribute for these resorts. Although the average ticket price for these resorts is over $106 per day, there are some resorts that cost less than $50. Examine the means for these resorts and discuss why you believe these resorts were categorized alongside the other cluster of 3 resorts despite their lower cost.
- Discuss as a group what additional insights your team has made using the k-means analysis of this data. Summarize your group discussion, including where team members may have disagreed.
- Discuss as a group how k-means data analysis tools can be used by different organizations and how valuable each team member found this tool to be. Conclude with a summary of this group discussion, including where team members may have disagreed.
AI-Generated Solution
AI-generated content may present inaccurate or offensive content that does not represent bartleby’s views.
Unlock instant AI solutions
Tap the button
to generate a solution
Knowledge Booster
Learn more about
Need a deep-dive on the concept behind this application? Look no further. Learn more about this topic, computer-science and related others by exploring similar questions and additional content below.Recommended textbooks for you
Database System Concepts
Computer Science
ISBN:
9780078022159
Author:
Abraham Silberschatz Professor, Henry F. Korth, S. Sudarshan
Publisher:
McGraw-Hill Education
Starting Out with Python (4th Edition)
Computer Science
ISBN:
9780134444321
Author:
Tony Gaddis
Publisher:
PEARSON
Digital Fundamentals (11th Edition)
Computer Science
ISBN:
9780132737968
Author:
Thomas L. Floyd
Publisher:
PEARSON
Database System Concepts
Computer Science
ISBN:
9780078022159
Author:
Abraham Silberschatz Professor, Henry F. Korth, S. Sudarshan
Publisher:
McGraw-Hill Education
Starting Out with Python (4th Edition)
Computer Science
ISBN:
9780134444321
Author:
Tony Gaddis
Publisher:
PEARSON
Digital Fundamentals (11th Edition)
Computer Science
ISBN:
9780132737968
Author:
Thomas L. Floyd
Publisher:
PEARSON
C How to Program (8th Edition)
Computer Science
ISBN:
9780133976892
Author:
Paul J. Deitel, Harvey Deitel
Publisher:
PEARSON
Database Systems: Design, Implementation, & Manag…
Computer Science
ISBN:
9781337627900
Author:
Carlos Coronel, Steven Morris
Publisher:
Cengage Learning
Programmable Logic Controllers
Computer Science
ISBN:
9780073373843
Author:
Frank D. Petruzella
Publisher:
McGraw-Hill Education