M6_R Practise_CharanThota

docx

School

Northeastern University *

*We aren’t endorsed by this school

Course

6010

Subject

Statistics

Date

Jan 9, 2024

Type

docx

Pages

7

Uploaded by KidDragonfly3195

Report
1
Avocado Dataset Regression Analysis Introduction The data set is obtained from the Hass Avocado Board in May of 2018. In this data set contains details of avocado like price, type, year, volume and any more. We are going to perform the regression analysis. In this dependent variable is total volume and independent variables are average price and total bags. Data Analysis 1. Import and summary of an avocado data set We have imported avocado.data.csv data set in R markdown. The data set contains the 13 columns. 2. Create a subset for year Observation: In this we create the year for 4 subsets such as for each year 2015, 2016, 2017 and 2018 using the mutate. Four regression line in the graph. 2
3. Linearity and Histogram of a total volume and bags Observation: From the histogram, we can the total volume is normal distributed. The relation between them is less obvious, although it still appears to be linear. We can proceed with the regression analysis. 3
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help
4. Regression Model for dependent variable Total Volume Observation: There is essentially no likelihood that this impact is attributable to chance for both factors. The residuals exhibit no skew in regression analysis; thus, we can declare our model fulfils the homoscedasticity condition. 5. Regression plot model 4
The above graph we show the regression of the four different year and we have 4 four different regression lines in it. Individual Subset graphs 5
Observation: In this can made separate data frame with the year as a 4 such as 2016,2015,2017 and 2018. Finally added the regression lines. Summary We observed substantial relation between total volume and total bags from the above graphs. References 1. Tutorialspoint. (2021). R - Linear Regression. Tutorialspoint. https://www.tutorialspoint.com/r/r_linear_regression.htm 2. Peng, R. S. D. K. (2020, December 20). 4.1 Basic Plotting With ggplot2 | Mastering Software Development in R. Bookdown. https://bookdown.org/rdpeng/RProgDA/basic-plotting-with- ggplot2.html 6
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help
3. I. Kabacoff, R. (2021). Quick-R: Correlations. Statmethods. https://www.statmethods.net/stats/correlations.html Appendix 1. For data analysis I used the data set named as avocado_prices_20211213.csv 2. I attached the R markdown file which is named as M6_RPractice_CharanThota.Rmd 7