3-2 Module Three Lab

docx

School

Southern New Hampshire University *

*We aren’t endorsed by this school

Course

520

Subject

Electrical Engineering

Date

Feb 20, 2024

Type

docx

Pages

3

Uploaded by DeanRiver5511

Report
DAT 520 Module Three Lab Worksheet Decision Trees in Power BI Overview In this lab, you will construct a decision tree using a bottom-up methodology in Power BI. You will break down its structure, interpret its results, and articulate a response to the proposed research question. Scenario In the spring of 1912, one of the most infamous maritime catastrophes occurred. The RMS Titanic, en route to New York City on its maiden voyage, struck an iceberg in the Atlantic Ocean and sank, resulting in the death of over 1500 individuals. This incident that occurred over 100 years ago is still considered the deadliest maritime accident to occur outside of warfare. This tragedy has captivated the minds of people worldwide and even sparked multi-million-dollar movies to tell its tale. This fascination has led many to ask, "Would I have survived the sinking of the Titanic?" Leveraging decision modeling within Power BI, we hope to understand more about what would have increased your odds of survival on that fateful day in the North Atlantic. Instructions Construct a decision tree leveraging the provided data set for Module 3, which provides insight into key variables that influenced the survivability of those who embarked on the Titanic in 1912. Effectively describe the model’s structure (nodes, branches, etc.), answer the questions posed below, and provide screenshots when prompted. Please note: This assignment will be submitted and graded in Brightspace. uCertify Instructions 1. Navigate to uCertify lab 5.2.1 Decision Trees in Power BI. 2. Open Power BI desktop. 3. Select Get Data and choose Text/CSV. 4. Navigate to the desktop and select the DAT-520 Data Files folder. a. Open Module 3. b. Select the data set Titanic.csv. c. Select the titanic tab. 5. Select Transform Data. a. Delete the columns: PassengerId , Parch , Embark b. Change the name of the "2urvived" variable to "Survived." c. Remove null values from the Age column. d. Transform Fare to a two-decimal variable. e. Close and Apply Changes. f. Create new calculated columns leveraging z-scores: i. Create a column called z.Age using the following formula: z.Age = (‘titanic’ [Age] - average (‘titanic’ [Age] ))/ STDEV.P (‘titanic’ [Age] ) ii. Create a column called z.Fare using the following formula: z.Fare = (‘titanic’ [Fare] - average (‘titanic’ [Fare] ))/ STDEV.P (‘titanic’ [Fare] )
iii. For Practice, Create a column called z.Sib using the following formula: z.Sib = (‘ titanic’ [SibSp] - AVERAGE (‘titanic’ [SibSp] ))/ STDEV.P (‘titanic’ [SibSp] ) 6. Select the Decision Tree visualization. a. Select Get more visuals from the more options button. b. Sign in with your SNHU credentials if necessary. c. Select Decision Tree. d. Enable required scripts and programs. 7. Construct your decision model. a. Select Survived as your target variable i. Click the drop-down carrot next to Survived and select Do not summarize , as you will use this as a binary predictor. b. Select z.Age and z.Fare as your input variables (in that order). c. Expand the visualization into Focus mode. Questions 1. Provide a screenshot of your existing decision tree. [Insert screenshot.] a. Describe what you are seeing in terms of the model’s breakdown from the root node down through each level. [Insert text.] b. What criteria leads you to a 3% survival rate? [Insert text.] 2. Add the variable "sibsp" (siblings) to the model as an input and provide a screenshot of your new model. [Insert screenshot.] a. Does adding this variable improve the model’s root error? [Insert text.] b. Does it add additional complexity to the model? [Insert text.] c. What are your initial impressions of this model versus the original model? [Insert text.] 3. Add the variable Sex to the model as an input variable. a. Does adding this variable improve the model’s root error? [Insert text.] b. Does it add additional complexity to the model? [Insert text.]
c. What are your initial impressions of this model versus the original model? [Insert text.] 4. Filter your visualization so that the model only displays the results for those whose Fares were less than $25. a. Provide a screenshot of your new model. [Insert screenshot.] b. If you were a female (0 = Male, 1 = Female) and paid less than $13 (hint: use your filters), what were the chances of you surviving during tragedy onboard the Titanic? [Insert text.]
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help