Descriptive Statistics Assignment-R-1

pdf

School

University of San Diego *

*We aren’t endorsed by this school

Course

330

Subject

English

Date

Apr 3, 2024

Type

pdf

Pages

2

Uploaded by CoachIronTurkey21

Report
E NGINEERING P ROBABILITY & S TATISTICS 3 Credit Hours Descriptive Statistics and Plots Case OBJECTIVES By the end of this case, you should be able to 1. Utilize descriptive statistics and plots on data using R 2. Perform exploratory analysis on an unknown dataset using descriptive statistics and plots to derive conclusions on dataset. Software R/R Studio Download both R and R Studio from here: https://posit.co/download/rstudio-desktop/ Deliverables The deliverables should be submitted on Canvas by the due date . A Word or PDF Document with the analysis of the data set which includes the following: o Descriptive Statistics and plots of data from R o Interpretations and conclusions for all statistics and plots R script file INTRODUCTION Engineers use statistical software to assist in the laborious and extensive calculations. The statistical packages can easily create a wide variety of graphs to display your data. This assignment is designed to enhance your skills and develop your ability to explore an unknown dataset and present your statistical results using R. Procedure You will perform analysis on a census dataset called state.x77 which is located on Blackboard . 1. Create a histogram using one of the continuous data columns of the spreadsheet (choose one column only). a. The plot should have a suitable number of bins to represent the variation in the data. The plot should also be formatted and labeled appropriately. b. Provide interpretations for the plot. 2. Calculate the following descriptive statistics for the selected data column for each of the regions. You need to calculate these by region. Present the results in a table format. a. Mean and Median b. Variance, Standard Deviation, and Range 3. Create a Box Plot to compare the selected data for the different regions in the United States.
a. The plot should be formatted and labeled appropriately. b. Provide interpretations for the plot. 4. Make an overall conclusion based on the above information. Compare and contrast the different regions. Be sure to discuss the central tendency and variation for each region. Download the R script and data set on Blackboard. state.x77: a dataset (matrix) with 50 rows and 8 columns giving the following statistics in the respective columns. State: list of states in alphabetical order Population: population estimate as of July 1, 1975 Region: regional location of state Income: per capita income (1974) Illiteracy: illiteracy (1970, percent of population) Life Exp: life expectancy in years (1969 71) Murder: murder and non-negligent manslaughter rate per 100,000 population (1976) HS Grad: percent high-school graduates (1970) Frost: mean days with minimum temperature below freezing (1931 60) in capital or large city Area: land area in square miles A Subset of the Census data State Region Population Income Illiteracy Life Exp Alabama South 3615 3624 2.1 69.05 Alaska West 365 6315 1.5 69.31 Arizona West 2212 4530 1.8 70.55 Arkansas South 2110 3378 1.9 70.66 California West 21198 5114 1.1 71.71
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help