Quiz_-_2

pdf

School

University at Buffalo *

*We aren’t endorsed by this school

Course

587

Subject

Statistics

Date

Jan 9, 2024

Type

pdf

Pages

3

Uploaded by UltraPrairieDog3802

Report
1. What is the primary goal of exploratory data analysis (EDA)? A. To make predictions and build machine learning models. B. To communicate the results of a statistical analysis. C. To summarise data and generate insights. D. To clean and preprocess the data. Correct Answer : To summarise data and generate insights. 2. Which is not an EDA operation? A. Plotting histograms B. Finding Statistics C. Removing Null Values D. Finding the correlation between the features Correct Answer : Removing Null Values 3.: What is the primary objective of outlier detection in EDA? A. To identify extreme values that do not follow the data's distribution. B. To remove all data points that fall outside the expected range. C. To prove that the dataset is incorrect and needs to be discarded. D. To improve the accuracy of statistical tests. Correct Answer : To identify extreme values that do not follow the data's distribution. 4. Which EDA technique is used to discover the relationship between two numerical variables and visualise their correlation? A. Box plots B. Pie charts C. Scatter plots D. Histograms Correct Answer : Scatter plots
5. What is the formula for a simple linear regression model?, A. y = β0 + β1x1 + β2x2 + … + ϵ B. y = β0 + β1x + β2x2 + … C. z = x2 D. y = β0 + β1x Correct Answer: y = β0 + β1x 6. What is the best way to choose the K value for K-means? A. Hit and trial method B. Line of best fit C. Linear Regression D. Random Sampling Correct Answer : Hit and trial method 7. How do we quantify the uncertainty of our model?(Select all that apply) A. R2 B. p-values C. Cross Validation D. Least Squares Estimation Correct Answer: R2 and p-values, Cross Validation 8. By using which algorithm we can perform the following task - Divide your dataset in order to apply different models to each segment. A. Regression B. Cluster Correct Answer : Cluster 9. What is a potential issue with k-means? A. Inability to interpret results B. Inability to find the global minimum C. Inability to find local minimum D. Inability to re-assign points Correct Answer : Inability to find the global minimum
10. What is an example of an optimization algorithm? A. R2 B. p-values C. Least Squares Estimation D. Cross Validation Correct Answer: Least Squares Estimation
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help