Module 4 Assignment for Applied Statistics

docx

School

Southern New Hampshire University *

*We aren’t endorsed by this school

Course

240

Subject

Economics

Date

Jan 9, 2024

Type

docx

Pages

7

Uploaded by CommodoreResolveHedgehog35

Report
Median Housing Price Prediction Model for D. M. Pan National Real Estate Company 1 Report: Housing Price Prediction Model for D. M. Pan National Real Estate Company Caitlynne Moreland Southern New Hampshire University
Median Housing Price Model for D. M. Pan National Real Estate Company 2 Introduction The purpose of this report is to determine whether or not you can determine the listing price of a home by considering its square footage. While trying to compare these two variables against each other, linear regression should be used. This is due to the fact that linear regression can be used to determine how strong the relationship between two variables is. In this case, how much does the listing price increase compared to the square feet of a home? When using linear regression for our two variables, I expect the pattern to increase in a positive way. If square footage goes up, so should the listing price. With what is trying to be accomplished, it is safe to say that our predictor variable would be square feet while the response variable would be listing price. This is due to the fact that the listing price is determined and affected by the square feet of the home. With that, we can use square feet to predict the listing price. Data Collection To obtain a random sample of 50 houses in Excel, I simply inserted a random column and entered the equation = rand ( ). I then used the bottom right corner to drag the equation down, applying it to every row with house data. Once a number was assigned to each row, I clicked data, then sort, and selected sort by random. Lastly, I deleted any row past the needed 50 samples. 0 1000 2000 3000 4000 5000 6000 $0 $100,000 $200,000 $300,000 $400,000 $500,000 $600,000 $700,000 $800,000 $900,000 Listing Price in Relation to Square Feet Square Feet Listing Price
Median Housing Price Model for D. M. Pan National Real Estate Company 3 My predictor variable for my sample is square feet, while my response variable is the listing price. Region State County Listing Price Square Feet West South Central TX Kerr 262,900 1,888 Mid-Atlantic NJ Ocean 547,400 4,178 East North Central IL Knox 205,100 1,740 New England MA Norfolk 394,600 1,949 East South Central TN Hawkins 269,000 2,385 Mid-Atlantic MD Baltimore 311,800 1,921 New England MA Middlesex 791,100 5,230 West South Central OK Comanche 252,100 1,806 East South Central KY Bullitt 319,300 2,527 Pacific CA Monterey 417,200 1,213 East North Central IN Wayne 203,800 1,441 West South Central LA Iberia 188,700 1,511 West North Central KS Riley 356,900 2,261 South Atlantic NC Buncombe 446,000 2,190 West South Central LA Ascension 303,200 2,030 West North Central MO St. Charles 430,300 2,302 Northeast PA Crawford 292,400 1,435 South Atlantic SC Berkeley 351,900 1,913 Pacific WA Clark 460,700 1,922 Mid-Atlantic VA Bedford 242,600 1,224 South Atlantic GA Houston 355,300 2,306 West South Central TX Liberty 206,400 1,822 Pacific CA San Bernardino 465,500 1,873 Pacific CA Santa Cruz 405,100 1,955 South Atlantic FL Hillsborough 362,900 1,994 Northeast PA Chester 786,800 5,290 East South Central AL Elmore 262,700 2,313 Pacific CA Nevada 377,400 1,614 East North Central IL Stephenson 235,600 1,682 Mountain MT Cascade 309,800 1,598 New England MA Norfolk 313,400 1,806 Mountain NM Curry 528,000 3,720 South Atlantic NC Franklin 329,700 1,871 East South Central AL Tuscaloosa 259,000 1,895 Mountain NM Eddy 599,300 3,636 Northeast NY Cayuga 523,300 3,141 East South Central MS Oktibbeha 264,400 2,135 New England MA Berkshire 422,800 2,511
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help
Median Housing Price Model for D. M. Pan National Real Estate Company 4 Northeast PA Lycoming 185,600 1,283 Northeast NY Warren 236,500 1,156 Mid-Atlantic NJ Monmouth 524,900 4,428 Northeast NY Otsego 225,700 1,104 Mid-Atlantic MD Cecil 357,700 1,779 Mountain NM Bernalillo 280,100 1,484 Mid-Atlantic MD Baltimore 354,100 1,686 East South Central MS Lafayette 605,300 5,248 New England MA Franklin 385,200 2,247 Mid-Atlantic DC District of Columbia 690,000 5,227 Northeast PA Beaver 315,300 2,010 West South Central LA St. Martin 252,600 1,845 Data Analysis Sample Statistics Listing Price Square Feet Area Mean $369,348 2,315 Median $340,800 1,922 Std Dev $145,830 1,117
Median Housing Price Model for D. M. Pan National Real Estate Company 5 The data collected from my sample shows no unusual characteristics. My scatterplot follows a positive linear regression with very few outliers. For the few points that can be considered outliers, they still tend to follow the trendline and, therefore aren’t really unusual. The center seems to not be too high or low, and the same with the spread. National Statistics Listing Price Square Feet Area Mean $342,365 2,111 Median $318,000 1,881 Std Dev $125,914 921 When comparing my data with that of the national population, it is relatively close. My mean, median, and standard deviation for both the listing price and square feet area are higher but not by much. With this comparison, I’ve come to the conclusion that my sample is a great representative of the national housing market sales. Develop Regression Model With the scatter plot from my sample, the regression model is definitely appropriate. It follows a relatively perfect linear model with most data points being close to the trendline. The strength of my model is strong. My direction is positive, which is to be expected. There are no significant gaps between data plots. There are a few plots that can be outliers but follow the trendline well. Due to this, I wouldn’t see it appropriate to remove them. Typically, you would want to remove outliers due to their negative effect on correlation by weakening it, 0 1000 2000 3000 4000 5000 6000 $0 $100,000 $200,000 $300,000 $400,000 $500,000 $600,000 $700,000 $800,000 $900,000 f(x) = 113.98 x + 105549.76 Listing Price in Relation to Square Feet Square Feet Listing Price
Median Housing Price Model for D. M. Pan National Real Estate Company 6 but when using the = correl ( ) equation in Excel, I discovered it to be strong. The correlation coefficient for my scatterplot is 0.874, which when compared to the scatterplot confirms the assumption of a strong model. Determine the Line of Best Fit My regression equation is y = 113.98x +105550 with x being square footage and y being the listing price. The slope of this equation is 113.98, which represents the change in listing price when there is a unit change in square feet area. For example, an increase of 100 square feet will increase the price by $11,398. This is calculated by 113.98 * 100. The intercept is 105,550. This represents what the listing price would be if the was a home with 0 square footage, therefore the price of the land. To determine the strength of my equation, I need to square the value of the correlation coefficient. This is represented as r-squared = 0.874 * 0.874. R-squared ends up equaling 0.763. This means 76.3% of the variation and listing price is explained by this model, which is really good. This shows that the linear regression equation is strong. With this equation, we can predict how much a home of 1,500 square feet should be listed. Just replace x with 1,500. y = 113.98 * 1500 + 105550 = 170970 + 105550 = $276,520 Conclusions In conclusion, with the information obtained throughout my research, I’ve concluded that the listing price of a home can be determined by its square footage. My sample provided results close to the national housing market sales, which shows how reliable the data is. My results
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help
Median Housing Price Model for D. M. Pan National Real Estate Company 7 surprised me a little with how strong the equation and correlation were. If I happened to get a random sample with less ideal outliers, my data would be less reliable due to the removal of some samples to get a stronger correlation. To solve different problems in the future, this model is something that should be used. The linear regression model is definitely useful when trying to determine the strength of the relationship between two variables. It’s very useful for making predictions. Something worth researching in the future would be other factors that help determine the listing price of a house. For example, “What effect does the region have on the price of a home when using square feet as a predictor variable?"