DAT 520 Module Three Lab Worksheet_AlexBunker

docx

School

Southern New Hampshire University *

*We aren’t endorsed by this school

Course

520

Subject

Electrical Engineering

Date

Apr 3, 2024

Type

docx

Pages

6

Uploaded by Bun8991

Report
DAT 520 Module Three Lab Worksheet Decision Trees in Power BI Overview In this lab, you will construct a decision tree using a bottom-up methodology in Power BI. You will break down its structure, interpret its results, and articulate a response to the proposed research question. Scenario In the spring of 1912, one of the most infamous maritime catastrophes occurred. The RMS Titanic, en route to New York City on its maiden voyage, struck an iceberg in the Atlantic Ocean and sank, resulting in the death of over 1500 individuals. This incident that occurred over 100 years ago is still considered the deadliest maritime accident to occur outside of warfare. This tragedy has captivated the minds of people worldwide and even sparked multi-million-dollar movies to tell its tale. This fascination has led many to ask, "Would I have survived the sinking of the Titanic?" Leveraging decision modeling within Power BI, we hope to understand more about what would have increased your odds of survival on that fateful day in the North Atlantic. Instructions Construct a decision tree leveraging the provided data set for Module 3, which provides insight into key variables that influenced the survivability of those who embarked on the Titanic in 1912. Effectively describe the model’s structure (nodes, branches, etc.), answer the questions posed below, and provide screenshots when prompted. Please note: This assignment will be submitted and graded in Brightspace. uCertify Instructions 1. Navigate to uCertify lab 5.2.1 Decision Trees in Power BI. 2. Open Power BI desktop. 3. Select Get Data and choose Text/CSV. 4. Navigate to the desktop and select the DAT-520 Data Files folder. a. Open Module 3. b. Select the data set Titanic.csv. c. Select the titanic tab. 5. Select Transform Data. a. Delete the columns: PassengerId , Parch , Embark b. Change the name of the "2urvived" variable to "Survived." c. Remove null values from the Age column. d. Transform Fare to a two-decimal variable. e. Close and Apply Changes. f. Create new calculated columns leveraging z-scores: i. Create a column called z.Age using the following formula: z.Age = (‘titanic’ [Age] - average (‘titanic’ [Age] ))/ STDEV.P (‘titanic’ [Age] ) ii. Create a column called z.Fare using the following formula: z.Fare = (‘titanic’ [Fare] - average (‘titanic’ [Fare] ))/ STDEV.P (‘titanic’ [Fare] )
iii. For Practice, Create a column called z.Sib using the following formula: z.Sib = (‘ titanic’ [SibSp] - AVERAGE (‘titanic’ [SibSp] ))/ STDEV.P (‘titanic’ [SibSp] ) 6. Select the Decision Tree visualization. a. Select Get more visuals from the more options button. b. Sign in with your SNHU credentials if necessary. c. Select Decision Tree. d. Enable required scripts and programs. 7. Construct your decision model. a. Select Survived as your target variable i. Click the drop-down carrot next to Survived and select Do not summarize , as you will use this as a binary predictor. b. Select z.Age and z.Fare as your input variables (in that order). c. Expand the visualization into Focus mode. Questions 1. Provide a screenshot of your existing decision tree.
a. Describe what you are seeing in terms of the model’s breakdown from the root node down through each level. You start to see the breakdown of how age and money played a part in your survival. The nodes on the “No” side increase twice as much as the nodes on the “Yes” side, based off the first node. b. What criteria leads you to a 3% survival rate? z.Fare was more than 0.36, z.Age less than 1.2, z.Fare less than 0.82 but more than 0.49. z.Fare was less than 0.36, z.Age less than -1.8 but less than -2.1. 2. Add the variable "sibsp" (siblings) to the model as an input and provide a screenshot of your new model.
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help
a. Does adding this variable improve the model’s root error? No b. Does it add additional complexity to the model? Yes c. What are your initial impressions of this model versus the original model? The model with siblings added as a variable increased the complexity of the model and added a larger variance in the end outcome.
3. Add the variable Sex to the model as an input variable. a. Does adding this variable improve the model’s root error? Yes, by .01. from 0.30 to 0.29 b. Does it add additional complexity to the model? No, it reduces the complexity greatly. c. What are your initial impressions of this model versus the original model? Much more condensed and simply. 4. Filter your visualization so that the model only displays the results for those whose Fares were less than $25. a. Provide a screenshot of your new model. b. If you were a female (0 = Male, 1 = Female) and paid less than $13 (hint: use your filters), what were the chances of you surviving during tragedy onboard the Titanic? 60%
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help