True

docx

School

Slippery Rock University of Pennsylvania *

*We aren’t endorsed by this school

Course

230

Subject

Industrial Engineering

Date

Dec 6, 2023

Type

docx

Pages

3

Uploaded by DrClover9807

Report
True the best-pruned tree is the smallest set, least complex tree, with the smallest validation error. False When a target variable is categorical, the CART algorithm produces a ___________ tree to predict the class memberships of new cases. Classification To measure impurity in a regression tree, mean square error (MSE) is used. True The overall MSE split for Age = 25 is $22987.29 and for Age = 23 is $21983.40. Of the two presented, Age = 25 is slightly higher and has a lower level of impurity for constructing a regression tree False Before constructing a decision tree, one of the first steps is identifying possible splits of the predictor variable True Based on the following sorts 20 values for age, what are the possible split points? {20, 22, 24, 26, 28, 31, 33, 35, 40, 42, 43, 45, 47, 49, 50, 52, 53, 55, 57} {21, 23, 25, 27, 29.5, 31.5, 32.5, 34, 37.5, 41, 42.5, 44, 46, 48, 49.5, 51, 52.5, 54, 56} Which option is not one of the three common strategies used in creating ensemble models? Bootstrapping If the performance measures are based on a cutoff value of 0.5, then if we lower the cutoff value, more cases will be in the target class, resulting in different performance measurement values. What chart can be used to review the data that are independent of the cutoff value? All options are independent of the cutoff value If predictor variables are highly correlated, then repeated sampling of the training data, and a random selection of features are used to construct trees. This is an example of which strategy? random Forest When using k-means clustering, the number of clusters are specified at the end of the analysis to remove overlapping clusters. False When evaluating large data sets, it is customary to cluster large data sets using the k-means to reduce the computation of measures during each iteration compared to hierarchical clustering methods True
When using R, after the data is imported, set.seed function is used to set the random seed and the k function sets the k parameters to preselect the number of clusters False In the k-Means Clustering Method, there is a general process of how k-means clustering algorithm can be classified. Which one of the following is not one of the general processes? Reassign each observation to the nearest observation point The forming of groups into internally homogeneous groups where each has a unique characteristic, different from other groups, is called cluster analysis. True The most commonly used approach for hierarchical clustering is divisive clustering False The Ward's method is the use of a different algorithm to minimize the dissimilarity within clusters by using error sum of squares True When using R for Agglomerative Clustering, the plot function is used to create the dendrogram as well as a banner plot. What function is used to split these results into distinct clusters? cutree In understanding the association rules, it is best to think of them as an If-Then statement True Under the association rule, a lift ratio between 0 and 1 indicates a positive association False If-Then logical statements are constructed with the If portion being the consequent and the Then being the antecedent False The marketing department is examining the data pulled from the retail stores over the month of December. In this time period, three items are of interest, Sound Bars, LED under counter lights, and shelving units. In researching if two of the items are purchased, if the third will be also, the following confidence level was calculated at 0.575, with an expected confidence of 0.10. Calculate the lift ratio. 5.75 Aimee's bookstore had a 45% increase in profits on Wednesday, June 12th, over the previous year's sales. Without the presence of a holiday, events in the area, or sale promotion, this business event is considered random. True
The use of quantitative forecast can be criticized because biases in optimism and overconfidence may skew the results False In a 3-period moving average, when a new observation becomes available, the highest numerical observation is dropped False Sydney is evaluating monthly sales for her Etsy account. Based on the given data y(1)=4,321; y(2)=3876; y(3)=4190, what is her 3-period moving average? 4,129 Mark is using a 3-period moving average to forecast the number of filters needed for the fourth quarter. Using the following data, what is the forecasted amount? 39 Filters When a time series is expected to grow by fixed amounts each time period, then the linear trend model should be used True
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help