1. Anscombe's quartet shows us that datasets with the same statistical characteristics (e.g., mean, variance) may, in fact, consist of very different raw data values. Visualizing these datasets helps us recognize their differences.
True
False
2. For running ML models, you can use categorical variables as-is.
True
False
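Many model implementations expect numeric inputs, so a categorical column is commonly encoded before fitting. As a minimal sketch in plain Python, one-hot encoding could look like the following; the helper `one_hot` and the colour values are hypothetical and only illustrate the idea.

```python
def one_hot(values):
    """One-hot encode a list of category labels (hypothetical helper)."""
    # Fix a column order by sorting the distinct labels.
    categories = sorted(set(values))
    # One row per input value, with a 1 in the column of its category.
    rows = [[1 if v == c else 0 for c in categories] for v in values]
    return categories, rows


# Illustrative data, not taken from the question.
cats, encoded = one_hot(["red", "green", "red", "blue"])
print(cats)
for row in encoded:
    print(row)
```

Each original value becomes a row of 0s and 1s, one column per category, which a numeric model can consume.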
3. Specify one reason why linear regression cannot be used for classification problems.
4. Imagine there are two datasets, A and B, with 1000 rows each. You join these datasets to create a dataset C. Which type of join will ensure that you have the maximum number of rows in C?
Inner Join
Full Outer Join
Information is insufficient
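To see how the row count of C depends on the join type and on how the keys overlap, here is a minimal sketch in plain Python. It assumes A and B are joined on a single key column; the helper `join_row_counts` and the example key ranges are hypothetical, not part of the question.

```python
from collections import Counter


def join_row_counts(keys_a, keys_b):
    """Return (inner_rows, full_outer_rows) for an equi-join on one key column."""
    count_a = Counter(keys_a)
    count_b = Counter(keys_b)
    # Each key present on both sides yields (copies in A) * (copies in B) rows.
    inner = sum(n * count_b[k] for k, n in count_a.items() if k in count_b)
    # A full outer join also keeps every unmatched row from either side.
    only_a = sum(n for k, n in count_a.items() if k not in count_b)
    only_b = sum(n for k, n in count_b.items() if k not in count_a)
    return inner, inner + only_a + only_b


# Assumed key layouts for two 1000-row datasets.
identical = join_row_counts(range(1000), range(1000))
disjoint = join_row_counts(range(1000), range(1000, 2000))
half_overlap = join_row_counts(range(1000), range(500, 1500))
print(identical, disjoint, half_overlap)
```

Under these assumptions the full outer count is never smaller than the inner count, while the exact number of rows depends on how many keys match and whether keys repeat.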
5. Visualizing data using a pie chart is an excellent idea because people can easily tell apart small angular differences between pie slices.
True
False
6. Imagine it is March 2021, and you would like to predict who will win the Oscar for Best Actress. Name one ML model you could use from the models you learned as part of this course.
7. Which of the following is not part of data preprocessing?
Data cleaning and/or statistical preprocessing
Feature selection
Feature reduction
Building machine learning models on the data
8. What type of data visualization would you use to show GDP growth between 1990 and 2020?
9. Imagine it is 1993, a time when almost all households had landlines. You want to know the average number of people in each household in your city. You randomly pick out 500 phone numbers from the phone book and conduct a phone survey. Would this be a good sample? Explain in one sentence.
Please tell me the answers to these questions using RStudio.