5-3 Assignment - Means, Test of Hypothesis
School
Southern New Hampshire University
Course
240 APPLIE
Subject
Finance
Date
Jun 3, 2024
Hypothesis Testing for Regional Real Estate Company
Colleen Del Valle
Southern New Hampshire University
Introduction
The purpose of this analysis is to examine real estate data from the Pacific Region to determine whether the average cost per square foot of a home is less than $280. To generate a random sample, I used the blank column G and typed the formula =RAND() into cell G2. The formula returned a random number, and I then copied it down to the last row of the data set.
Hypothesis Test Setup
The population parameter is the mean cost per square foot in the Pacific Region (µ).
Null hypothesis, H₀: µ = $280 per square foot.
Alternative hypothesis, H₁: µ < $280 per square foot.
For testing purposes, I will use a left-tailed test, because a left-tailed test is used when the alternative hypothesis asserts that the parameter is less than the value stated in the null hypothesis.
Data Analysis Preparations
Descriptive Statistics
Sample Size: 750
Sample Mean: $262
Sample Median: $203
Standard Deviation: 162.490563
The histogram above shows the cost per square foot for the Pacific Region sample, with cost per square foot on the x-axis and frequency (the number of homes) on the y-axis. The shape of the histogram is multimodal, because it has more than two "mounds," and it is skewed to the right, because that is where the tail is. The center of the distribution is not in the middle of the plot but toward the left, with a sample mean of about $262 per square foot, and the spread is wide, with a standard deviation of about $162.49. The conditions for the test are met: the sample was randomly selected, and the sample size of 750 is large enough for the sampling distribution of the mean to be approximately normal despite the skew. The sample mean is also below $280, which is consistent with the direction of the alternative hypothesis. The test significance level is α = 0.05.
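The checks described above can be sketched in a few lines of Python. This is only an illustration built from the summary statistics in the table: the function name `check_conditions` and the minimum sample size of 30 are assumptions (30 is a common rule of thumb, not a value from the assignment), and a mean above the median is only a hint of right skew, not proof of it.

```python
def check_conditions(n: int, mean: float, median: float, min_n: int = 30) -> dict:
    """Rough checks on the summary statistics before running a t-test.

    min_n = 30 is an assumed rule-of-thumb threshold for a "large" sample.
    """
    return {
        # A large sample lets the sample mean be treated as approximately normal.
        "large_sample": n >= min_n,
        # A mean above the median is a heuristic sign of a right-skewed distribution.
        "right_skew_hint": mean > median,
    }


# Summary statistics taken from the Descriptive Statistics table.
checks = check_conditions(n=750, mean=262, median=203)
print(checks)
```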
Calculations
The sample mean for the cost per square foot is $262 (rounded), and the standard error is $5.93, which is the standard deviation of 162.490563 divided by the square root of the sample size of 750. To determine the test statistic, subtract the target of 280 from the sample mean and divide by the standard error: (sample mean - 280)/5.93. The value computed in Excel was -2.96910878; with the rounded figures shown here, (262 - 280)/5.93 is about -3.04, and the difference is consistent with rounding of the sample mean. To calculate the p-value, you need to determine which type of test to use; as stated above, we will use a left-tailed test. For a left-tailed test you also need the degrees of freedom, which is the sample size minus 1: 750 - 1 = 749. For the p-value, input =T.DIST(test statistic, degrees of freedom, 1). Here, =T.DIST(-2.96910878, 749, 1) equals 0.00154099, which is the p-value.
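As a cross-check on the spreadsheet steps, the same calculation can be sketched in Python using only the standard library. This is not the method used in the assignment, which relied on Excel's T.DIST: the function names `t_pdf`, `t_cdf`, and `left_tailed_t_test` are hypothetical, and the left-tail probability is approximated by integrating the Student's t density with Simpson's rule. The inputs are the rounded values from the Descriptive Statistics table, so the test statistic will differ slightly from the Excel figure.

```python
import math


def t_pdf(x: float, df: int) -> float:
    """Density of Student's t distribution with df degrees of freedom."""
    log_norm = (math.lgamma((df + 1) / 2) - math.lgamma(df / 2)
                - 0.5 * math.log(df * math.pi))
    return math.exp(log_norm - (df + 1) / 2 * math.log1p(x * x / df))


def t_cdf(t: float, df: int, steps: int = 2000) -> float:
    """Left-tail probability P(T <= t), via Simpson's rule on [0, |t|]."""
    a = abs(t)
    h = a / steps  # steps must be even for Simpson's rule
    total = t_pdf(0.0, df) + t_pdf(a, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    area = total * h / 3
    # The density is symmetric around 0, so half the mass lies on each side.
    return 0.5 + area if t >= 0 else 0.5 - area


def left_tailed_t_test(sample_mean, target, sample_sd, n):
    """Return (standard error, t statistic, degrees of freedom, p-value)."""
    se = sample_sd / math.sqrt(n)
    t_stat = (sample_mean - target) / se
    df = n - 1
    return se, t_stat, df, t_cdf(t_stat, df)


# Rounded summary statistics from the table; the Excel run used unrounded values.
se, t_stat, df, p_value = left_tailed_t_test(262, 280, 162.490563, 750)
print(f"SE={se:.2f}, t={t_stat:.3f}, df={df}, p={p_value:.5f}")

# The Excel call =T.DIST(-2.96910878, 749, 1) corresponds to:
print(t_cdf(-2.96910878, 749))
```

Calling `t_cdf` with the test statistic reported from Excel and 749 degrees of freedom gives a quick way to confirm that the spreadsheet formula was entered with the right arguments.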
In the curve graph above, I have labeled the test statistic (t-stat) and the p-value.
Test Decision
When comparing the p-value to the significance level, the p-value is smaller: the p-value is 0.00154099 and the significance level is 0.05. Because the p-value is less than 0.05, we reject the null hypothesis based on the evidence above.
Conclusion
After viewing the data given to me by the sales representative, who stated that homes in the Pacific Region cost less than $280 per square foot, I am confident in agreeing with him. The sample mean cost per square foot is about $262, the test statistic is -2.96910878, and the calculated p-value is 0.00154099. Because the p-value is below the 0.05 significance level, the null hypothesis was rejected in favor of the alternative hypothesis that the mean cost per square foot is lower than $280. Once the modifications are made to the advertisement, it will be good to run.