Question
k-means question from scikit-learn. Thank you.

When we consider clustering techniques, which of the following apply? (Select ALL answers that are correct; there may be more than one.)

☐ The k-means++ algorithm chooses only one of the centroids completely randomly
☐ The optimal value of k is selected based on the highest observed accuracy
☐ The goal in k-means is to minimize variance within clusters
☐ We cluster using the training data and then validate the clusters with the testing data
☐ We must have a target variable in the data that specifies the cluster labels
☐ The k-means (or k-means++) algorithm selects the optimal value of k when it performs clustering
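
The statements in the question can be checked against a small sketch. What follows is a minimal, standard-library-only illustration of k-means with k-means++ seeding; it is not scikit-learn's implementation, and the function names `kmeans_pp_init` and `kmeans` as well as the toy data are hypothetical. Under those assumptions it shows that k is supplied by the caller, that no target labels are passed in, that only the first seed is drawn uniformly at random (the remaining seeds are drawn with probability proportional to squared distance from the nearest seed already chosen), and that the quantity being reduced is the within-cluster sum of squared distances, which scikit-learn's `KMeans` reports as `inertia_`.

```python
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two points given as tuples."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans_pp_init(points, k, rng):
    """k-means++ seeding. Assumes `points` holds at least k distinct points."""
    # Only this first centroid is chosen uniformly at random.
    centroids = [rng.choice(points)]
    while len(centroids) < k:
        # Every later centroid is sampled with probability proportional to
        # the squared distance to the nearest centroid chosen so far.
        d2 = [min(sq_dist(p, c) for c in centroids) for p in points]
        centroids.append(rng.choices(points, weights=d2, k=1)[0])
    return centroids


def assign(points, centroids):
    """Index of the nearest centroid for every point."""
    return [
        min(range(len(centroids)), key=lambda j: sq_dist(p, centroids[j]))
        for p in points
    ]


def kmeans(points, k, n_iter=100, seed=0):
    """Lloyd's algorithm. k comes from the caller; no target labels are used."""
    rng = random.Random(seed)
    centroids = kmeans_pp_init(points, k, rng)
    for _ in range(n_iter):
        labels = assign(points, centroids)
        new_centroids = []
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                # Move the centroid to the mean of its assigned points.
                new_centroids.append(
                    tuple(sum(c) / len(members) for c in zip(*members))
                )
            else:
                # Keep an empty cluster's centroid where it was.
                new_centroids.append(centroids[j])
        if new_centroids == centroids:
            break
        centroids = new_centroids
    labels = assign(points, centroids)
    # Objective: within-cluster sum of squared distances.
    inertia = sum(sq_dist(p, centroids[lab]) for p, lab in zip(points, labels))
    return centroids, labels, inertia


# Toy data (hypothetical): two well-separated groups, no label column.
points = [(1.0, 1.0), (1.2, 0.8), (0.8, 1.1), (8.0, 8.0), (8.2, 7.9), (7.9, 8.3)]

for k in (1, 2, 3):
    _, labels, inertia = kmeans(points, k)
    print(k, labels, round(inertia, 3))
```

Because the objective generally keeps shrinking as k grows, the algorithm cannot pick k by itself; k is usually chosen afterwards with a heuristic such as the elbow method or the silhouette score. There is also no "accuracy" to maximize and no train/test validation in the supervised sense, since the data carries no ground-truth cluster labels.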