School
Slippery Rock University of Pennsylvania
Course
230
Subject
Industrial Engineering
Date
Dec 6, 2023
Uploaded by DrClover9807
True
The best-pruned tree is the smallest, least complex tree with the smallest validation error.
False
When a target variable is categorical, the CART algorithm produces a ___________ tree to predict the
class memberships of new cases.
Classification
To measure impurity in a regression tree, mean square error (MSE) is used.
True
The overall MSE for the split at Age = 25 is $22987.29 and for the split at Age = 23 is $21983.40. Of the
two, Age = 25 is slightly higher and therefore has a lower level of impurity for constructing a regression tree.
False
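The comparison above can be sketched in code: the split with the lower overall MSE has the lower impurity. This is a minimal illustration that assumes the overall MSE of a split is the size-weighted average of the MSEs of its two partitions. The ages, spending values, and function names are hypothetical and unrelated to the dollar figures in the question.

```python
def mse(values):
    """Mean squared error of the values around their own mean."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values) / len(values)


def split_mse(ages, targets, split):
    """Size-weighted MSE of the two partitions created by a split point.

    Assumption: cases with age below the split go left, the rest go right.
    """
    left = [t for a, t in zip(ages, targets) if a < split]
    right = [t for a, t in zip(ages, targets) if a >= split]
    n = len(targets)
    return len(left) / n * mse(left) + len(right) / n * mse(right)


# Hypothetical data: age and a numeric target such as annual spending.
ages = [20, 22, 24, 26, 28, 30]
spend = [100.0, 110.0, 105.0, 200.0, 210.0, 205.0]

# The candidate split with the smallest overall MSE is the least impure.
best = min([23, 25], key=lambda s: split_mse(ages, spend, s))
print(best, split_mse(ages, spend, 23), split_mse(ages, spend, 25))
```

With this made-up data the split at 25 wins because it separates the low and high spenders cleanly. In the question, the split at Age = 23 has the lower MSE, so it is the less impure of the two.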
Before constructing a decision tree, one of the first steps is identifying possible splits of the predictor
variable
True
Based on the following 20 sorted values for age, what are the possible split points? {20, 22, 24, 26, 28, 31,
32, 33, 35, 40, 42, 43, 45, 47, 49, 50, 52, 53, 55, 57}
{21, 23, 25, 27, 29.5, 31.5, 32.5, 34, 37.5, 41, 42.5, 44, 46, 48, 49.5, 51, 52.5, 54, 56}
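The answer can be reproduced by taking the midpoint of each pair of consecutive sorted values. The sketch below assumes the 20 ages include 32, which is implied by the 31.5 and 32.5 in the answer. The function name is illustrative.

```python
def split_points(sorted_values):
    """Midpoints between consecutive distinct values of a sorted list."""
    return [
        (a + b) / 2
        for a, b in zip(sorted_values, sorted_values[1:])
        if a != b  # identical neighbours produce no split
    ]


ages = [20, 22, 24, 26, 28, 31, 32, 33, 35, 40,
        42, 43, 45, 47, 49, 50, 52, 53, 55, 57]

points = split_points(ages)
print(len(points), points)
```

Twenty distinct values give 19 candidate split points, one fewer than the number of values.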
Which option is not one of the three common strategies used in creating ensemble models?
Bootstrapping
If the performance measures are based on a cutoff value of 0.5, then if we lower the cutoff value, more
cases will be in the target class, resulting in different performance measurement values. What chart can
be used to review the data that are independent of the cutoff value?
All options are independent of the cutoff value
If predictor variables are highly correlated, then repeated sampling of the training data, and a random
selection of features are used to construct trees. This is an example of which strategy?
Random forest
When using k-means clustering, the number of clusters is specified at the end of the analysis to remove
overlapping clusters.
False
When evaluating large data sets, it is customary to cluster them using k-means because it requires less
computation in each iteration than hierarchical clustering methods.
True
When using R, after the data is imported, the set.seed function is used to set the random seed and the k
function sets the k parameter to preselect the number of clusters.
False
The k-means clustering algorithm follows a general process. Which one of the following is not one of the
steps in that process?
Reassign each observation to the nearest observation point
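For contrast with the incorrect step above, here is a minimal sketch of the usual k-means loop: choose k up front, pick initial centers, assign each observation to the nearest cluster center (not to the nearest observation), recompute the centers, and repeat until the assignments stop changing. The data points, the seed, and the function name are hypothetical. This is an illustration of the general idea rather than what any particular R function does internally.

```python
import random


def kmeans(points, k, seed=1, max_iter=100):
    """Plain k-means on a list of equal-length numeric tuples."""
    rng = random.Random(seed)
    centers = rng.sample(points, k)  # k is fixed before the analysis starts
    assignment = None
    for _ in range(max_iter):
        # Assign each observation to the nearest cluster center.
        new_assignment = [
            min(range(k),
                key=lambda c: sum((p - q) ** 2 for p, q in zip(pt, centers[c])))
            for pt in points
        ]
        if new_assignment == assignment:
            break  # no observation changed cluster, so stop
        assignment = new_assignment
        # Recompute each center as the mean of its current members.
        for c in range(k):
            members = [pt for pt, a in zip(points, assignment) if a == c]
            if members:
                centers[c] = tuple(sum(dim) / len(members) for dim in zip(*members))
    return assignment, centers


# Hypothetical two-dimensional observations forming two obvious groups.
points = [(1.0, 1.0), (1.5, 2.0), (1.0, 1.5),
          (8.0, 8.0), (9.0, 8.5), (8.5, 9.0)]
assignment, centers = kmeans(points, k=2)
print(assignment, centers)
```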
Grouping observations into internally homogeneous clusters, each with characteristics that distinguish it
from the other clusters, is called cluster analysis.
True
The most commonly used approach for hierarchical clustering is divisive clustering
False
Ward's method uses a different algorithm that minimizes the dissimilarity within clusters by using the
error sum of squares.
True
When using R for Agglomerative Clustering, the plot function is used to create the dendrogram as well as
a banner plot. What function is used to split these results into distinct clusters?
cutree
In understanding the association rules, it is best to think of them as an If-Then statement
True
Under the association rule, a lift ratio between 0 and 1 indicates a positive association
False
If-Then logical statements are constructed with the If portion being the consequent and the Then being
the antecedent
False
The marketing department is examining data pulled from the retail stores for the month of
December. In this time period, three items are of interest: sound bars, LED under-counter lights, and
shelving units. To test whether the third item is purchased when the other two are, a
confidence of 0.575 was calculated, with an expected confidence of 0.10. Calculate the lift ratio.
5.75
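The arithmetic behind this answer is the confidence of the rule divided by its expected confidence. A small sketch, with an illustrative function name:

```python
def lift_ratio(confidence, expected_confidence):
    """Lift ratio of an association rule: confidence / expected confidence."""
    return confidence / expected_confidence


lift = lift_ratio(0.575, 0.10)
print(round(lift, 2))

# A lift above 1 suggests a positive association between antecedent and
# consequent; a lift between 0 and 1 suggests a negative one.
is_positive_association = lift > 1
```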
Aimee's bookstore had a 45% increase in profits on Wednesday, June 12th, over the previous year's
sales. Without the presence of a holiday, events in the area, or sale promotion, this business event is
considered random.
True
Quantitative forecasts can be criticized because biases such as optimism and overconfidence may
skew the results.
False
In a 3-period moving average, when a new observation becomes available, the highest numerical
observation is dropped
False
Sydney is evaluating monthly sales for her Etsy account. Based on the given data y(1) = 4,321; y(2) = 3,876;
y(3) = 4,190, what is her 3-period moving average?
4,129
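The calculation is the mean of the three most recent observations. The sketch below reproduces it and also shows that, when a new observation arrives, the oldest value leaves the window, not the largest. The fourth value and the function name are hypothetical.

```python
def moving_average_forecast(series, m=3):
    """Average of the m most recent observations in the series."""
    if len(series) < m:
        raise ValueError("need at least m observations")
    return sum(series[-m:]) / m


sales = [4321, 3876, 4190]
forecast = moving_average_forecast(sales)
print(forecast)

# Hypothetical new observation: the window slides forward and drops
# the oldest value, 4321.
sales.append(4500)
next_forecast = moving_average_forecast(sales)
print(next_forecast)
```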
Mark is using a 3-period moving average to forecast the number of filters needed for the fourth quarter.
Using the following data, what is the forecasted amount?
39 Filters
When a time series is expected to grow by fixed amounts each time period, then the linear trend model
should be used
True