DAT 375 Module 3 Journal

docx

School

Southern New Hampshire University *

*We aren’t endorsed by this school

Course

375

Subject

Statistics

Date

Feb 20, 2024

Type

docx

Pages

5

Uploaded by CommodoreDonkey3266

Report
Module 3 - Journal *** warning – this assignment only got a C- due to the harsh grading of the professor. Be warned. Module 3 – Journal – Setting Parameters and Extracting Data Professor Bradley DAT 375 – Data Analytics Southern New Hampshire University 01/28/2024
Module 2 - Journal As a data analyst for an insurance company, I have been tasked to find the three boroughs in New York City that have the highest accident rates. Define the Parameters Before defining the parameters, I need to be familiar with what data/columns are in the table. In MySQL – when the table is highlighted we can see the names of the columns, but since SQL is not my strength – Excel is – I ran the following query to see how many lines of data were in the table and if it would be possible to export it into Excel. (There were 5000 rows – something definitely easy to use in Excel. 2
Module 2 - Journal Once the data is downloaded into Excel, I noticed that while there was 5000 data lines, there was only data in 2000 lines. So I removed the extra 3000 lines and created a pivot table that isolated the Year, the borough and the number of violations in that time period. Once the data was visualized into the pivot table, it was easy to see that the data had some small violations listed in the years 2016 and 2019, however the bulk of the data contained 2020. The outliers in 2016 will be removed, but the 2019 ones will be kept because they are within the last two months of the year. I’ve also included a graphic representation of the crash data for easier analysis. 3
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help
Module 2 - Journal Determine the three boroughs with the highest accident rates Based on the pivot tables above, the boroughs with the highest number of accidents are Brooklyn, Manhattan and Queens. Full disclosure – while this is a holisitic number of accidents per borough, it doesn’t take into account the population of each borough – which is not available in the data – to get a true representation of what the actual accident rate is. Extract the data from these three boroughs with the highest accident rates From the data provided, the three boroughs had the highest violations of Speed not reasonable & prudent, following too closely and moved from lane unsafely (respectively) and these three violations represented 32.5% of all citations across the top three boroughs. 4
Module 2 - Journal 5