Statistics Assignment
School: Seneca College
Course: 130
Subject: Statistics
Date: Feb 20, 2024
Uploaded by DoctorBravery103687
/* 1. Performing a One-Sample t-Test */
libname DATALIB '/home/u63731080/DATALIB';

data DATALIB.normtemp;
    set '/home/u63731080/DATALIB/normtemp.sas7bdat';
run;

/* 1.a. Look at the distribution of the continuous variables in the data set using PROC UNIVARIATE, including producing histograms and insets with means, standard deviations and sample size. */
title 'Distribution Analysis Using PROC UNIVARIATE';
ods select histogram;
proc univariate data=DATALIB.normtemp noprint;
    var BodyTemp HeartRate;
    histogram BodyTemp HeartRate / normal kernel;
    inset N MEAN STD / position=ne;
run;

/* 1.b. Perform a one-sample t-test to determine whether the mean of body temperatures (the variable BodyTemp in DATALIB.NormTemp) is 98.6, using PROC TTEST. */
title 'One-Sample t-test using PROC TTEST to test whether Mean BodyTemp=98.6';
proc ttest data=DATALIB.normtemp h0=98.6 plots(only shownull)=interval;
    var BodyTemp;
run;

/* Questions
1) What is the value of the t statistic and the corresponding p-value? (2 points)
Answer: The t value is -5.45, and the p-value is <.0001.
2) Do you reject or fail to reject the null hypothesis at the 0.05 level that the average temperature is 98.6 degrees? (2 points)
Answer: We reject the null hypothesis at the 0.05 level, because the p-value (<.0001) is below 0.05. */

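The t statistic reported in question 1 follows the usual one-sample formula. As a sketch, with $\bar{x}$, $s$, and $n$ standing for the sample mean, sample standard deviation, and sample size of BodyTemp (their values are not quoted in this assignment, so they are left symbolic):

```latex
t = \frac{\bar{x} - \mu_0}{s / \sqrt{n}}, \qquad \mu_0 = 98.6, \qquad \mathrm{df} = n - 1
```

Read this way, t = -5.45 says the sample mean lies about 5.45 estimated standard errors below 98.6, which is consistent with the very small two-sided p-value.
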
/* 2. Using PROC TTEST for Comparing */
/* Analyze the data using PROC TTEST. Assess whether the treatment group improved more than the control group. */
libname DATALIB '/home/u63731080/DATALIB';

/* Keep only the analysis variables. Note: DATALIB.german refers to the same file as the SET path, so this step overwrites german.sas7bdat in place. */
data DATALIB.german;
    set '/home/u63731080/DATALIB/german.sas7bdat' (keep=Change Group);
run;

title 'Comparing Two groups: Treatment and Control';
proc ttest data=DATALIB.german plots=(interval qq);
    class Group;
    var Change;
run;

/* Questions
a. Do the two groups appear to be approximately normally distributed? (2 points)
Answer: Yes, the plots show evidence that supports approximate normality in both groups.
b. Do the two groups have approximately equal variances? (2 points)
Answer: The p-value for the equality of variances test is greater than 0.05, so we do not reject the null hypothesis of equal variances. This supports the assumption that the two groups have approximately equal variances.
c. Does the new teaching technique seem to result in significantly different change scores compared with the standard technique? (2 points)
Answer: The p-value for the Pooled (equal variance) test for the difference between the two means is greater than 0.05, so the two group means are not significantly different. There is not enough evidence to say that the new teaching technique differs significantly from the standard technique. */
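
For reference, the quantities behind answers b and c can be written out. This is a sketch of the standard textbook formulas, not output from the assignment: the subscripts 1 and 2 are assumed to denote the two levels of Group, and $\bar{x}_i$, $s_i^2$, and $n_i$ are the per-group mean, variance, and size of Change, none of which are quoted in the source.

```latex
% Equality-of-variances test (folded F), as used in answer b
F' = \frac{\max(s_1^2,\, s_2^2)}{\min(s_1^2,\, s_2^2)}

% Pooled variance and pooled t statistic, as used in answer c
s_p^2 = \frac{(n_1 - 1)\, s_1^2 + (n_2 - 1)\, s_2^2}{n_1 + n_2 - 2}, \qquad
t = \frac{\bar{x}_1 - \bar{x}_2}{s_p \sqrt{\dfrac{1}{n_1} + \dfrac{1}{n_2}}}, \qquad
\mathrm{df} = n_1 + n_2 - 2
```

The pooled test is the appropriate row to read only because the folded F test did not reject equal variances. Had it rejected, the unequal-variance (Satterthwaite) row would be the one to report.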