A researcher wanted to use a random sample of 20 Spotify Songs to estimate the average Duration (in minutes) and average Tempo (in bpm) for the population of all songs on Spotify. The researcher does not have any additional information about the population of all Spotify Songs. The data set is called SongSample1. a) Based on the above scenario which method of statistical inference do you believe the

MATLAB: An Introduction with Applications
6th Edition
ISBN:9781119256830
Author:Amos Gilat
Publisher:Amos Gilat
Chapter1: Starting With Matlab
Section: Chapter Questions
Problem 1P
icon
Related questions
Question
A
B
с
1 Track Nar Track Arti Year
2
Hells Bells AC/DC
3 Barbie Gir Aqua
Heat Of Th Asia
4
5
Digital Lov Daft Punk
6 Where Th David Gue
Let Me Cle DJ Kool
7
9
8 Oye Mi Ca Gloria Este
When The Greta Van
10 Starving Hailee Ste
11 Be My Lov La Bouche
12 Immigrant Led Zeppe
13 Big Fool Levi Belle
14 Into the Se Liam Davis
15 Friedhof d Manuellse
16 One Thing Marshmel
17 Kids MGMT
18 Sixteen
19 Seasons Shaggy
20 Donde Est Shakira
21 Just Got P: ZZ Top
22
Rick Ross
D
Genre
1980 rock
2009 pop
1982 pop
2001 pop
2012 pop
1996 rap
1989 latin
2018 rap
2016 pop
2009 pop
1970 rock
2020 r&b
2011 rock
2015 rap
2019 latin
2007 rock
2012 rap
2017 latin
1995 latin
1972 rock
E
Mode
Minor
Minor
Major
Major
Major
Major
Major
Major
Major
Minor
Major
Major
Minor
Major
Minor
Major
Major
Major
Major
Major
G
F
Duration Tempo
5.204883 106.767
3.250667 129.967
3.793783 136.136
5.022883 124.726
3.247333
129.884
4.836
103.145
4.12845 119.588
3.716
96.004
99.989
134.774
2.4375 112.937
3.026917
105.009
7.645333
85.013
2.91845
81.161
2.4831 105.052
122.961
3.031333
3.995783
5.047333
8.25955
86.98
3.23805 97.022
3.86155 109.044
4.458
99.93
Transcribed Image Text:A B с 1 Track Nar Track Arti Year 2 Hells Bells AC/DC 3 Barbie Gir Aqua Heat Of Th Asia 4 5 Digital Lov Daft Punk 6 Where Th David Gue Let Me Cle DJ Kool 7 9 8 Oye Mi Ca Gloria Este When The Greta Van 10 Starving Hailee Ste 11 Be My Lov La Bouche 12 Immigrant Led Zeppe 13 Big Fool Levi Belle 14 Into the Se Liam Davis 15 Friedhof d Manuellse 16 One Thing Marshmel 17 Kids MGMT 18 Sixteen 19 Seasons Shaggy 20 Donde Est Shakira 21 Just Got P: ZZ Top 22 Rick Ross D Genre 1980 rock 2009 pop 1982 pop 2001 pop 2012 pop 1996 rap 1989 latin 2018 rap 2016 pop 2009 pop 1970 rock 2020 r&b 2011 rock 2015 rap 2019 latin 2007 rock 2012 rap 2017 latin 1995 latin 1972 rock E Mode Minor Minor Major Major Major Major Major Major Major Minor Major Major Minor Major Minor Major Major Major Major Major G F Duration Tempo 5.204883 106.767 3.250667 129.967 3.793783 136.136 5.022883 124.726 3.247333 129.884 4.836 103.145 4.12845 119.588 3.716 96.004 99.989 134.774 2.4375 112.937 3.026917 105.009 7.645333 85.013 2.91845 81.161 2.4831 105.052 122.961 3.031333 3.995783 5.047333 8.25955 86.98 3.23805 97.022 3.86155 109.044 4.458 99.93
Investigation 1: Appropriateness of Inference
For the following scenario, answer the questions below. Please note, do not conduct inference
in this problem; just answer each question.
A researcher wanted to use a random sample of 20 Spotify Songs to estimate the average
Duration (in minutes) and average Tempo (in bpm) for the population of all songs on Spotify.
The researcher does not have any additional information about the population of all Spotify
Songs. The data set is called Song Sample1.
a) Based on the above scenario, which method of statistical inference do you believe the
researcher will consider to complete this estimation? Answer this question in one
sentence and provide a reason for your choice.
b) If we attempt to conduct statistical inference using the collected sample, what are the two
parameters of interest to the researcher? Use the correct symbols and describe the
parameters in context in one sentence each.
c) Check the following conditions necessary to consider conducting inference using theory-
based methods and the t-distribution. There are three to consider: (1) Was a random
sample collected; (2) Is the population where the sample comes from normal; and (3) Is
the sample size greater than or equal to 30? Check each of these conditions in one
sentence.
d) For the Duration variable, construct a frequency histogram in Rguroo. Remember to
properly title and label the graph. Copy and paste this graph into your document.
e) Describe the shape of the Duration histogram in one sentence.
f)
For the Duration variable, construct one horizontal boxplots in Rguroo. Remember to
properly title and label the graph. Copy and paste this graph into your document.
g) Does the Duration boxplot show any outliers? Answer this question in one sentence and
identify any outliers if they are present.
h) For the Tempo variable, construct a frequency histogram in Rguroo. Remember to
properly title and label the graph. Copy and paste this graph into your document.
i)
Describe the shape of the Tempo histogram in one sentence.
j)
For the Tempo variable, construct one horizontal boxplots in Rguroo. Remember to
properly title and label the graph. Copy and paste this graph into your document.
k) Does the Tempo boxplot show any outliers? Answer this question in one sentence and
identify any outliers if they are present.
2
1) Considering your interpretation of each variable’s graphs, is theory-based inference using
the t-distribution appropriate for either variable (or both variables, or neither variable)?
Answer the question and provide a reason for your response.
m) If theory-based inference is not appropriate for one or both of the variables, present
another possibility if the researcher still wanted to conduct statistical inference. Use one
to three complete sentences in your response.
Transcribed Image Text:Investigation 1: Appropriateness of Inference For the following scenario, answer the questions below. Please note, do not conduct inference in this problem; just answer each question. A researcher wanted to use a random sample of 20 Spotify Songs to estimate the average Duration (in minutes) and average Tempo (in bpm) for the population of all songs on Spotify. The researcher does not have any additional information about the population of all Spotify Songs. The data set is called Song Sample1. a) Based on the above scenario, which method of statistical inference do you believe the researcher will consider to complete this estimation? Answer this question in one sentence and provide a reason for your choice. b) If we attempt to conduct statistical inference using the collected sample, what are the two parameters of interest to the researcher? Use the correct symbols and describe the parameters in context in one sentence each. c) Check the following conditions necessary to consider conducting inference using theory- based methods and the t-distribution. There are three to consider: (1) Was a random sample collected; (2) Is the population where the sample comes from normal; and (3) Is the sample size greater than or equal to 30? Check each of these conditions in one sentence. d) For the Duration variable, construct a frequency histogram in Rguroo. Remember to properly title and label the graph. Copy and paste this graph into your document. e) Describe the shape of the Duration histogram in one sentence. f) For the Duration variable, construct one horizontal boxplots in Rguroo. Remember to properly title and label the graph. Copy and paste this graph into your document. g) Does the Duration boxplot show any outliers? Answer this question in one sentence and identify any outliers if they are present. h) For the Tempo variable, construct a frequency histogram in Rguroo. Remember to properly title and label the graph. Copy and paste this graph into your document. i) Describe the shape of the Tempo histogram in one sentence. j) For the Tempo variable, construct one horizontal boxplots in Rguroo. Remember to properly title and label the graph. Copy and paste this graph into your document. k) Does the Tempo boxplot show any outliers? Answer this question in one sentence and identify any outliers if they are present. 2 1) Considering your interpretation of each variable’s graphs, is theory-based inference using the t-distribution appropriate for either variable (or both variables, or neither variable)? Answer the question and provide a reason for your response. m) If theory-based inference is not appropriate for one or both of the variables, present another possibility if the researcher still wanted to conduct statistical inference. Use one to three complete sentences in your response.
Expert Solution
trending now

Trending now

This is a popular solution!

steps

Step by step

Solved in 5 steps

Blurred answer
Similar questions
Recommended textbooks for you
MATLAB: An Introduction with Applications
MATLAB: An Introduction with Applications
Statistics
ISBN:
9781119256830
Author:
Amos Gilat
Publisher:
John Wiley & Sons Inc
Probability and Statistics for Engineering and th…
Probability and Statistics for Engineering and th…
Statistics
ISBN:
9781305251809
Author:
Jay L. Devore
Publisher:
Cengage Learning
Statistics for The Behavioral Sciences (MindTap C…
Statistics for The Behavioral Sciences (MindTap C…
Statistics
ISBN:
9781305504912
Author:
Frederick J Gravetter, Larry B. Wallnau
Publisher:
Cengage Learning
Elementary Statistics: Picturing the World (7th E…
Elementary Statistics: Picturing the World (7th E…
Statistics
ISBN:
9780134683416
Author:
Ron Larson, Betsy Farber
Publisher:
PEARSON
The Basic Practice of Statistics
The Basic Practice of Statistics
Statistics
ISBN:
9781319042578
Author:
David S. Moore, William I. Notz, Michael A. Fligner
Publisher:
W. H. Freeman
Introduction to the Practice of Statistics
Introduction to the Practice of Statistics
Statistics
ISBN:
9781319013387
Author:
David S. Moore, George P. McCabe, Bruce A. Craig
Publisher:
W. H. Freeman