Station-2-Notebook

pdf

School

Pennsylvania State University *

*We aren’t endorsed by this school

Course

300

Subject

Statistics

Date

Jan 9, 2024

Type

pdf

Pages

Uploaded by DeaconPencilApe14

Station 2 Allan Julian R Notebooks This is an R Markdown Notebook. When you execute code within the notebook, the results appear beneath the code. Try executing this chunk by clicking the Run button (sideways green arrow) within the chunk or by placing your cursor inside it and pressing Ctrl+Enter . TASK–Run the code below: plot (cars) 5 10 15 20 25 0 20 40 60 80 100 120 speed dist When you save the notebook, an HTML file containing the code and output will be saved alongside it. Click Preview (it may be under the Knit dropdown button), or press Ctrl+Shift+K to preview the HTML file). TASK–create an HTML file from this notebook: The preview shows you a rendered HTML copy of the contents of the editor. Note that Preview does not run any R code chunks. Instead, the output of the chunk when it was last run in the editor is displayed. Packages R is incredibly flexible and has tools for almost anything you want to do. Many of these tools are contained in packages that must be loaded into your workspace. The first time you want to use a package you have to 1

install it on your computer using the command ‘install.packages(“package_name”)’ and then load it into your workspace using the command ‘library(package_name)’. After the first time, you only need to call the library and you do not need to use install.packages. Our class will typically use the following packages: • mosaic • ggformula • Stat2Data • Lock5Data • tidyverse • tinytex TASK–Under the Packages tab in the session window, scroll through and check to see which packages of those above are not yet installed. For any that aren’t, install them by clicking the Install tab and entering their names in the popup window. Note that these names are case-sensitive . After installing the packages, you can load them for use by either checking their boxes in the Packages tab or by using the library command below for each package you want to use in the current session. Here is an example using the mosaic package. library (mosaic) ## Registered S3 method overwritten by ' mosaic ' : ## method from ## fortify.SpatialPolygonsDataFrame ggplot2 ## ## The ' mosaic ' package masks several functions from core packages in order to add ## additional features. The original behavior of these functions should not be affected by this. ## ## Attaching package: ' mosaic ' ## The following objects are masked from ' package:dplyr ' : ## ## count, do, tally ## The following object is masked from ' package:Matrix ' : ## ## mean ## The following object is masked from ' package:ggplot2 ' : ## ## stat ## The following objects are masked from ' package:stats ' : ## ## binom.test, cor, cor.test, cov, fivenum, IQR, median, prop.test, ## quantile, sd, t.test, var ## The following objects are masked from ' package:base ' : ## ## max, mean, min, prod, range, sample, sum library (tidyverse) # loads the mosaic package into your workspace ## -- Attaching core tidyverse packages ------------------------ tidyverse 2.0.0 -- ## v forcats 1.0.0 v stringr 1.5.0 ## v lubridate 1.9.2 v tibble 3.2.1 2

## v purrr 1.0.2 v tidyr 1.3.0 ## v readr 2.1.4 ## -- Conflicts ------------------------------------------ tidyverse_conflicts() -- ## x mosaic::count() masks dplyr::count() ## x purrr::cross() masks mosaic::cross() ## x mosaic::do() masks dplyr::do() ## x tidyr::expand() masks Matrix::expand() ## x dplyr::filter() masks stats::filter() ## x dplyr::lag() masks stats::lag() ## x tidyr::pack() masks Matrix::pack() ## x mosaic::stat() masks ggplot2::stat() ## x mosaic::tally() masks dplyr::tally() ## x tidyr::unpack() masks Matrix::unpack() ## i Use the conflicted package (<http://conflicted.r-lib.org/>) to force all conflicts to become errors TASK–Repeat the above steps to install and load ggformula, Stat2Data, Lock5Data, tinytex, and tidyverse packages. At this point you should have all six packages loaded and ready to use. Use data from an R package A great thing about R is that not only does it allow you to analyze data, but you can also access tons of datasets that come cleaned and formatted within R packages. Two of the packages you loaded above, Stat2Data and Lock5Data, are primarily datasets. TASK–Use the code below to load the dataset diamonds, which is part of the ggplot2 package and included in tidyverse. data (diamonds) Inspecting the data source Now you’re ready to learn a little bit about the diamonds data set. TASK: Edit the bullet list to add a short description in your own words describing what each function does. • glimpse() : this function. . . • head() : this function. . . • names() : this function. . . # Inspecting the data source glimpse (diamonds) ## Rows: 53,940 ## Columns: 10 ## $ carat <dbl> 0.23, 0.21, 0.23, 0.29, 0.31, 0.24, 0.24, 0.26, 0.22, 0.23, 0.~ ## $ cut <ord> Ideal, Premium, Good, Premium, Good, Very Good, Very Good, Ver~ ## $ color <ord> E, E, E, I, J, J, I, H, E, H, J, J, F, J, E, E, I, J, J, J, I,~ ## $ clarity <ord> SI2, SI1, VS1, VS2, SI2, VVS2, VVS1, SI1, VS2, VS1, SI1, VS1, ~ ## $ depth <dbl> 61.5, 59.8, 56.9, 62.4, 63.3, 62.8, 62.3, 61.9, 65.1, 59.4, 64~ ## $ table <dbl> 55, 61, 65, 58, 58, 57, 57, 55, 61, 61, 55, 56, 61, 54, 62, 58~ ## $ price <int> 326, 326, 327, 334, 335, 336, 336, 337, 337, 338, 339, 340, 34~ ## $ x <dbl> 3.95, 3.89, 4.05, 4.20, 4.34, 3.94, 3.95, 4.07, 3.87, 4.00, 4.~ ## $ y <dbl> 3.98, 3.84, 4.07, 4.23, 4.35, 3.96, 3.98, 4.11, 3.78, 4.05, 4.~ ## $ z <dbl> 2.43, 2.31, 2.31, 2.63, 2.75, 2.48, 2.47, 2.53, 2.49, 2.39, 2.~ 3

Your preview ends here