ECON312 Problem Set 4 (1) Empirical Application— Dummy Variable
pdf
keyboard_arrow_up
School
Egerton University *
*We aren’t endorsed by this school
Course
312
Subject
Statistics
Date
Nov 24, 2024
Type
Pages
3
Uploaded by CaptainViper3543
Question I: Empirical Application— Dummy Variable (30 points) Use the GPA2 data set in the R folder for this exercise. (a) Consider the following equation: = 0 + 1 + 2 2 + 3 +
𝑐???
𝑐???
𝑐???
𝑐???
𝑐??? ?𝑎
?𝑎
?𝑎
?𝑎
?𝑎
𝛽
𝛽
ℎ
?𝑖𝑧?
?𝑖𝑧?
?𝑖𝑧?
?𝑖𝑧?
?𝑖𝑧?
𝛽
ℎ
?𝑖𝑧?
?𝑖𝑧?
?𝑖𝑧?
?𝑖𝑧?
?𝑖𝑧?
𝛽
ℎ
????𝑐
????𝑐
????𝑐
????𝑐
????𝑐
𝛽
?𝑎?
?𝑎?
?𝑎?
?𝑎?
?𝑎?
𝛽
? ??𝑎??
??𝑎??
??𝑎??
??𝑎??
??𝑎??
𝛽
𝑎?
𝑎?
𝑎?
𝑎?
𝑎?
????
????
????
????
????
?
𝑐???
𝑐???
𝑐???
𝑐???
𝑐??? ?𝑎
?𝑎
?𝑎
?𝑎
?𝑎
4 + 5 + 6 ℎ
+ where is cumulative college grade point average; is size of high school graduating class, in hundreds; is ℎ
?𝑖𝑧?
?𝑖𝑧?
?𝑖𝑧?
?𝑖𝑧?
?𝑖𝑧?
ℎ
????𝑐
????𝑐
????𝑐
????𝑐
????𝑐
academic percentile in graduating class; is combined SAT score; is a ?𝑎?
?𝑎?
?𝑎?
?𝑎?
?𝑎?
? ??𝑎??
??𝑎??
??𝑎??
??𝑎??
??𝑎??
binary gender variable; and is a binary, which is one for students athletes. 𝑎?
𝑎?
𝑎?
𝑎?
𝑎?
????
????
????
????
????
ℎ
What are your expectations for the coefficients in this equation? Which ones are you
unsure about? Answer
Size of High School Graduating Class (hsiz): It's unclear how the size of the high school graduating class would affect college GPA. It could be positive if smaller classes allow for more individualized attention or negative if competition is higher in smaller classes.
Academic Percentile in Graduating Class (hsperc): Higher academic percentile might be associated with a higher college GPA.
SAT Score (sat): A positive relationship is expected; higher SAT scores may indicate better preparation for college.
Gender (fem): The effect could be uncertain without additional information. It may depend on various factors like gender-based academic performance trends.
Athlete Status (athlete): Athletes may have different time commitments, potentially affecting their GPA.
(b) Estimate the equation in part (a). What is the estimated GPA differential between athletes and non-athletes? Is it statistically significant?
The R Code are as follows:
# Assuming df is your dataset
model <- lm(colgpa ~ hsiz + I(hsiz^2) + hsperc + sat + fem + athlete, data = df)
summary(model)
Check the coefficient for the athlete variable to find the estimated GPA differential between athletes and non-athletes. Assess its statistical significance based on the p-value.
(c) Drop from the model and re-estimate the equation. Now, what is the estimated ?𝑎?
?𝑎?
?𝑎?
?𝑎?
?𝑎?
effect of being an athlete? Discuss why the estimate is different than that obtained in
part (b).
R code
model_no_sat <- lm(colgpa ~ hsiz + I(hsiz^2) + hsperc + fem + athlete, data = df)
summary(model_no_sat)
Compare the coefficient for athlete with the previous model. Discuss why it might differ (d) In the model from part (a), allow the effect of being an athlete to differ by gender and test the null hypothesis that there is no ceteris paribus differences between women athletes and women non-athletes.
R code
model_interaction <- lm(colgpa ~ hsiz + I(hsiz^2) + hsperc + sat + fem * athlete, data = df)
summary(model_interaction)
Look for the interaction term (fem * athlete) and assess its significance.
(e)
Does the effect of on differ by gender? Justify your answer.
?𝑎?
?𝑎?
?𝑎?
?𝑎?
?𝑎?
𝑐???
𝑐???
𝑐???
𝑐???
𝑐??? ?𝑎
?𝑎
?𝑎
?𝑎
?𝑎
R code
model_gender_interaction <- lm(colgpa ~ hsiz + I(hsiz^2) + hsperc + sat * fem + athlete, data = df)
summary(model_gender_interaction)
Check if the interaction term (sat * fem) is significant.
(f) ) Use the Chow Test to test whether there are differences in the coefficients of the model in part (a) (excluding the dummy variable) across genders. (Hint: look
? ??𝑎??
??𝑎??
??𝑎??
??𝑎??
??𝑎??
at the R-document to learn about how to run regressions on a subset of data
Chow Test for Gender Coefficient Differences:
R code
# Assuming you have a gender variable (gen) in your dataset
model_male <- lm(colgpa ~ hsiz + I(hsiz^2) + hsperc + sat + athlete, data = subset(df, gen == "Male"))
model_female <- lm(colgpa ~ hsiz + I(hsiz^2) + hsperc + sat + athlete, data = subset(df, gen == "Female"))
chow.test(model_male, model_female)
This tests whether there are differences in coefficients between genders.
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
Related Questions
Determine the values of the three principal components for the following observation.
arrow_forward
Percent of SmokersThe data in NutritionStudy include information on nutrition and health habits of a sample of 315 people. One of the variables is Smoke, indicating whether a person smokes or not (yes or no). Use technology to test whether the data provide evidence that the proportion of smokers is different from 20% given that 43 identify themselves as smokers.
Clearly state the null and alternative hypotheses. Your answer should be an expression composed of symbols: =,≠,<,>,μ,μ1,μ2,p,p1,p2,0.20,ρ,p^,p^1,p^2,r.
arrow_forward
Suppose you have collected data on 1000 customers who visited your company's website last week. For each customer, the time spent on the page and the total amount of customer purchases during the visit were recorded. You want to explore the relationship between these two variables. What is the explanatory variable and what is the response variable? to. Explanatory: Time spent on the page Answer: Total amount of purchases b. It is not possible since neither explains c. Answer: Time spent on the page Explanatory: Total amount of purchases
arrow_forward
Do various occupational groups differ in their diets? A British study of this question compared 98
drivers and 83 conductors of London double-decker buses. The conductors' jobs require more
physical activity. The article reporting the study gives the data as "Mean daily consumption (±
se)." Some of the study results appear below.
Drivers
Conductors
Total calories
2821 +44
2844 +48
Alcohol (grams) 0.24 0.06 0.39 +0.11
(a) Give x and s for each of the alcohol measurements.
que e
quate
arrow_forward
Can you identify the explanatory variable and the response? Let me know which response is correct.
arrow_forward
Which of the following is not equivalent to the other three?
Choose the correct answer below.
O Dependent variable
Independent variable
000
O Predictor variable
O Explanatory variable
arrow_forward
What is the Independent variable and the Dependent
variable in the example above?
An experimenter wants to study the relationship between type of
milk and infant growth in underdeveloped countries. She
randomly assigns 300 infants to either a breast-feeding group or
an infant formula group. She then weighs the infants
days for the first four weeks of life.
three
every
What is the Independent variable and the Dependent
variable in the example above?
A researcher was interested in finding out if type of learning was
related to class performance. She randomly assigns 100 General
Psychology into 1 of 2 groups: 50 students use computerized
instruction exercises; the other 50 spend an equal amount of
time reviewing the text and lecture notes. At the end of the
semester the researcher compares final exam scores in both
groups.
What is the Independent variable and the Dependent
variable in the example above?
arrow_forward
1. Effect size and Cohen's D:
a. What is an effect size? What info does it tell you, when is it calculated, etc.?
b. What is Cohen’s d? What info does it tell you about the independent variable and dependent variable? (think of how you write the sentences explaining your numbers)
arrow_forward
Explanatory and Response Matching
For the following research question, identify the type for each of the relevant variables.
Research Question: For MAT 111, how are midterm exam scores related to final exam scores?
Choose the answers for each Group of answer choices attached as image.
Midterm Exam Score
Final Exam Score
arrow_forward
What is the major problem with R 2 and why Adj R 2 is preferred?
arrow_forward
What is Ha? (H0 is the mean VIQ scores for the younger, middle-aged, and older subjects are all not different from each other) (Note: 1= 5-25 Years, 2= 26-59 Years, 3= 60-92 Years)
Multiple Choices:
1. H0 is not true
2. The mean VIQ scores for the younger, middle-aged, and older subjects 3. are all different
3. The mean VIQ scores for the younger, middle-aged, and older subjects are not equal to one another
4. The mean VIQ scores for the younger, middle-aged, and older subjects are not the same as each other
arrow_forward
SEE MORE QUESTIONS
Recommended textbooks for you

Big Ideas Math A Bridge To Success Algebra 1: Stu...
Algebra
ISBN:9781680331141
Author:HOUGHTON MIFFLIN HARCOURT
Publisher:Houghton Mifflin Harcourt

Algebra: Structure And Method, Book 1
Algebra
ISBN:9780395977224
Author:Richard G. Brown, Mary P. Dolciani, Robert H. Sorgenfrey, William L. Cole
Publisher:McDougal Littell

Holt Mcdougal Larson Pre-algebra: Student Edition...
Algebra
ISBN:9780547587776
Author:HOLT MCDOUGAL
Publisher:HOLT MCDOUGAL

Glencoe Algebra 1, Student Edition, 9780079039897...
Algebra
ISBN:9780079039897
Author:Carter
Publisher:McGraw Hill
Related Questions
- Determine the values of the three principal components for the following observation.arrow_forwardPercent of SmokersThe data in NutritionStudy include information on nutrition and health habits of a sample of 315 people. One of the variables is Smoke, indicating whether a person smokes or not (yes or no). Use technology to test whether the data provide evidence that the proportion of smokers is different from 20% given that 43 identify themselves as smokers. Clearly state the null and alternative hypotheses. Your answer should be an expression composed of symbols: =,≠,<,>,μ,μ1,μ2,p,p1,p2,0.20,ρ,p^,p^1,p^2,r.arrow_forwardSuppose you have collected data on 1000 customers who visited your company's website last week. For each customer, the time spent on the page and the total amount of customer purchases during the visit were recorded. You want to explore the relationship between these two variables. What is the explanatory variable and what is the response variable? to. Explanatory: Time spent on the page Answer: Total amount of purchases b. It is not possible since neither explains c. Answer: Time spent on the page Explanatory: Total amount of purchasesarrow_forward
- Do various occupational groups differ in their diets? A British study of this question compared 98 drivers and 83 conductors of London double-decker buses. The conductors' jobs require more physical activity. The article reporting the study gives the data as "Mean daily consumption (± se)." Some of the study results appear below. Drivers Conductors Total calories 2821 +44 2844 +48 Alcohol (grams) 0.24 0.06 0.39 +0.11 (a) Give x and s for each of the alcohol measurements. que e quatearrow_forwardCan you identify the explanatory variable and the response? Let me know which response is correct.arrow_forwardWhich of the following is not equivalent to the other three? Choose the correct answer below. O Dependent variable Independent variable 000 O Predictor variable O Explanatory variablearrow_forward
- What is the Independent variable and the Dependent variable in the example above? An experimenter wants to study the relationship between type of milk and infant growth in underdeveloped countries. She randomly assigns 300 infants to either a breast-feeding group or an infant formula group. She then weighs the infants days for the first four weeks of life. three every What is the Independent variable and the Dependent variable in the example above? A researcher was interested in finding out if type of learning was related to class performance. She randomly assigns 100 General Psychology into 1 of 2 groups: 50 students use computerized instruction exercises; the other 50 spend an equal amount of time reviewing the text and lecture notes. At the end of the semester the researcher compares final exam scores in both groups. What is the Independent variable and the Dependent variable in the example above?arrow_forward1. Effect size and Cohen's D: a. What is an effect size? What info does it tell you, when is it calculated, etc.? b. What is Cohen’s d? What info does it tell you about the independent variable and dependent variable? (think of how you write the sentences explaining your numbers)arrow_forwardExplanatory and Response Matching For the following research question, identify the type for each of the relevant variables. Research Question: For MAT 111, how are midterm exam scores related to final exam scores? Choose the answers for each Group of answer choices attached as image. Midterm Exam Score Final Exam Scorearrow_forward
- What is the major problem with R 2 and why Adj R 2 is preferred?arrow_forwardWhat is Ha? (H0 is the mean VIQ scores for the younger, middle-aged, and older subjects are all not different from each other) (Note: 1= 5-25 Years, 2= 26-59 Years, 3= 60-92 Years) Multiple Choices: 1. H0 is not true 2. The mean VIQ scores for the younger, middle-aged, and older subjects 3. are all different 3. The mean VIQ scores for the younger, middle-aged, and older subjects are not equal to one another 4. The mean VIQ scores for the younger, middle-aged, and older subjects are not the same as each otherarrow_forward
arrow_back_ios
arrow_forward_ios
Recommended textbooks for you
- Big Ideas Math A Bridge To Success Algebra 1: Stu...AlgebraISBN:9781680331141Author:HOUGHTON MIFFLIN HARCOURTPublisher:Houghton Mifflin HarcourtAlgebra: Structure And Method, Book 1AlgebraISBN:9780395977224Author:Richard G. Brown, Mary P. Dolciani, Robert H. Sorgenfrey, William L. ColePublisher:McDougal LittellHolt Mcdougal Larson Pre-algebra: Student Edition...AlgebraISBN:9780547587776Author:HOLT MCDOUGALPublisher:HOLT MCDOUGAL
- Glencoe Algebra 1, Student Edition, 9780079039897...AlgebraISBN:9780079039897Author:CarterPublisher:McGraw Hill

Big Ideas Math A Bridge To Success Algebra 1: Stu...
Algebra
ISBN:9781680331141
Author:HOUGHTON MIFFLIN HARCOURT
Publisher:Houghton Mifflin Harcourt

Algebra: Structure And Method, Book 1
Algebra
ISBN:9780395977224
Author:Richard G. Brown, Mary P. Dolciani, Robert H. Sorgenfrey, William L. Cole
Publisher:McDougal Littell

Holt Mcdougal Larson Pre-algebra: Student Edition...
Algebra
ISBN:9780547587776
Author:HOLT MCDOUGAL
Publisher:HOLT MCDOUGAL

Glencoe Algebra 1, Student Edition, 9780079039897...
Algebra
ISBN:9780079039897
Author:Carter
Publisher:McGraw Hill