DAT 520 Module Three Lab Worksheet Adriana Carroll
School: Southern New Hampshire University
Course: 320
Subject: Electrical Engineering
Date: Feb 20, 2024
DAT 520 Module Three Lab Worksheet
Decision Trees in Power BI
Overview
In this lab, you will construct a decision tree using a bottom-up methodology in Power BI. You will break down its structure, interpret its results, and articulate a response to the proposed research question.
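As a rough picture of the structure this lab asks you to break down, the Python sketch below stores a tiny tree as nested dictionaries and walks it from the root node to a leaf. It is an illustration only, not the Power BI visual's implementation: the feature names match the worksheet, but every threshold and survival rate is a hypothetical placeholder.

```python
# A decision tree as nested dicts: internal nodes test one feature against a
# threshold, and leaves hold an estimated survival rate.
# All thresholds and rates below are hypothetical placeholders.
tree = {
    "feature": "Fare", "threshold": 52.0,
    "left": {                      # branch taken when Fare < 52.0
        "feature": "Age", "threshold": 6.5,
        "left": {"leaf": 0.70},    # Age < 6.5
        "right": {"leaf": 0.30},   # Age >= 6.5
    },
    "right": {"leaf": 0.65},       # branch taken when Fare >= 52.0
}


def predict(node, passenger, path=()):
    """Walk from the root to a leaf; return (rate, branch conditions taken)."""
    if "leaf" in node:
        return node["leaf"], list(path)
    feature, threshold = node["feature"], node["threshold"]
    if passenger[feature] < threshold:
        step = f"{feature} < {threshold}"
        return predict(node["left"], passenger, path + (step,))
    step = f"{feature} >= {threshold}"
    return predict(node["right"], passenger, path + (step,))


rate, path = predict(tree, {"Fare": 8.05, "Age": 30.0})
print(" -> ".join(path), "=> estimated survival rate", rate)
```

Reading a tree in the visual works the same way: start at the root, follow the branch whose condition your passenger satisfies, and stop at the leaf.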
Scenario
In the spring of 1912, one of the most infamous maritime catastrophes occurred. The RMS Titanic, en route to New York City on its maiden voyage, struck an iceberg in the Atlantic Ocean and sank, resulting in the deaths of more than 1,500 people. More than 100 years later, the incident is still regarded as one of the deadliest peacetime maritime disasters.
This tragedy has captivated the minds of people worldwide and even sparked multi-million-dollar movies to tell its tale. This fascination has led many to ask, "Would I have survived the sinking of the Titanic?" Leveraging decision modeling within Power BI, we hope to understand more about what would have increased your odds of survival on that fateful day in the North Atlantic.
Instructions
Construct a decision tree leveraging the provided data set for Module 3, which provides insight into key variables that influenced the survivability of those who embarked on the Titanic in 1912. Effectively describe the model’s structure (nodes, branches, etc.), answer the questions posed below, and provide screenshots when prompted. Please note: This assignment will be submitted and graded in Brightspace.
uCertify Instructions
• Navigate to uCertify lab 5.2.1 Decision Trees in Power BI.
• Open Power BI desktop.
• Select Get Data and choose Text/CSV.
• Navigate to the desktop and select the DAT-520 Data Files folder.
• Open Module 3.
• Select the data set Titanic.csv.
• Select the titanic tab.
• Select Transform Data.
• Delete the columns: PassengerId, Parch, Embark.
• Change the name of the "2urvived" variable to "Survived."
• Remove null values from the Age column.
• Transform Fare to a two-decimal variable.
• Close and Apply Changes.
• Create new calculated columns leveraging z-scores:
  • Create a column called z.Age using the following formula: z.Age = ('titanic'[Age] - AVERAGE('titanic'[Age])) / STDEV.P('titanic'[Age])
  • Create a column called z.Fare using the following formula: z.Fare = ('titanic'[Fare] - AVERAGE('titanic'[Fare])) / STDEV.P('titanic'[Fare])
  • For practice, create a column called z.Sib using the following formula: z.Sib = ('titanic'[SibSp] - AVERAGE('titanic'[SibSp])) / STDEV.P('titanic'[SibSp])
• Select the Decision Tree visualization.
  • Select Get more visuals from the more options button.
  • Sign in with your SNHU credentials if necessary.
  • Select Decision Tree.
  • Enable required scripts and programs.
• Construct your decision model.
  • Select Survived as your target variable.
  • Click the drop-down caret next to Survived and select Do not summarize, as you will use this as a binary predictor.
  • Select z.Age and z.Fare as your input variables (in that order).
• Expand the visualization into Focus mode.
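The data-preparation and z-score steps above can also be sketched outside Power BI. The Python snippet below is a minimal illustration, not part of the lab: the sample rows are invented placeholders, and only the column names come from the worksheet. `statistics.pstdev` is the population standard deviation, the same quantity DAX `STDEV.P` computes.

```python
from statistics import mean, pstdev

# Hypothetical sample rows; only the column names come from the worksheet.
rows = [
    {"PassengerId": 1, "Age": 22.0, "Fare": 7.25, "SibSp": 1,
     "Parch": 0, "Embark": "S", "2urvived": 0},
    {"PassengerId": 2, "Age": 38.0, "Fare": 71.2833, "SibSp": 1,
     "Parch": 0, "Embark": "C", "2urvived": 1},
    {"PassengerId": 3, "Age": None, "Fare": 8.05, "SibSp": 0,
     "Parch": 0, "Embark": "S", "2urvived": 0},
    {"PassengerId": 4, "Age": 30.0, "Fare": 13.0, "SibSp": 0,
     "Parch": 0, "Embark": "S", "2urvived": 1},
]

DROPPED_COLUMNS = {"PassengerId", "Parch", "Embark"}


def clean(raw_rows):
    """Mirror the Transform Data steps: drop columns, rename, filter, round."""
    cleaned = []
    for row in raw_rows:
        if row["Age"] is None:          # remove null values from Age
            continue
        out = {k: v for k, v in row.items() if k not in DROPPED_COLUMNS}
        out["Survived"] = out.pop("2urvived")   # rename the target column
        out["Fare"] = round(out["Fare"], 2)     # two-decimal Fare
        cleaned.append(out)
    return cleaned


def add_z_score(table, column, name):
    """Add (value - mean) / population standard deviation as a new column."""
    values = [r[column] for r in table]
    mu, sigma = mean(values), pstdev(values)
    for r in table:
        r[name] = (r[column] - mu) / sigma


titanic = clean(rows)
add_z_score(titanic, "Age", "z.Age")
add_z_score(titanic, "Fare", "z.Fare")
add_z_score(titanic, "SibSp", "z.Sib")

for r in titanic:
    print(r["Age"], round(r["z.Age"], 3), r["Fare"], round(r["z.Fare"], 3))
```

A z-scored column always has mean 0 and population standard deviation 1, which is a quick way to check that the calculated columns in Power BI were entered correctly.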
Questions
• Provide a screenshot of your existing decision tree.
• Describe what you are seeing in terms of the model’s breakdown from the root node down through each level.
Using the survival variable as a binary variable, each portion of the decision tree uses age and fare to predict a yes/no survival percentage.
• What criteria lead you to a 3% survival rate?
Fare > 52, Age <= 45, Fare < 76, and Fare <= 59
Fare < 52, Age <= 6.5, and Age < 2.5
Fare < 52, Age <= 6.5, and Age > 2.5
• Add the variable "sibsp" (siblings) to the model as an input and provide a screenshot of your new model.
  • Does adding this variable improve the model’s root error?
  No, the root error remains at 0.30.
  • Does it add additional complexity to the model?
  It simply adds a new variable to consider when viewing the binary output.
  • What are your initial impressions of this model versus the original model?
  The initial impression is not exciting; the outputs appear to remain relatively the same, with no new decision breaks to consider.
• Add the variable Sex to the model as an input variable.
  • Does adding this variable improve the model’s root error?
  Root error has adjusted to 0.29.
  • Does it add additional complexity to the model?
  It looks more complex, with a break at the start of the decision tree on sex, but it results in fewer outcomes.
  • What are your initial impressions of this model versus the original model?
  The initial impression of the new model is that it adds a new layer to consider, but sex doesn't appear to play a massive role other than dividing the results.
• Filter your visualization so that the model only displays the results for those whose Fares were less than $25.
  • Provide a screenshot of your new model.
  • If you were a female (0 = Male, 1 = Female) and paid less than $13 (hint: use your filters), what were the chances of you surviving the tragedy onboard the Titanic?
  6%
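To make "root error" and the filtered survival question concrete, here is a small Python sketch. It assumes the visual's root error is the misclassification rate of a tree with no splits (every passenger gets the majority label), which is how R's rpart reports "Root node error"; whether the Power BI visual defines it identically is an assumption. The passenger records are invented placeholders, so the numbers it prints are not the lab's answers.

```python
from collections import Counter

# Hypothetical passengers as (Sex, Fare, Survived), Sex coded 0 = male, 1 = female.
passengers = [
    (1, 7.75, 1), (1, 12.50, 0), (1, 10.50, 1), (0, 8.05, 0),
    (0, 7.25, 0), (0, 26.00, 0), (1, 30.00, 1), (0, 13.00, 1),
    (0, 9.50, 0), (0, 52.00, 0),
]


def root_error(labels):
    """Share of rows misclassified when every row gets the majority label."""
    majority_count = Counter(labels).most_common(1)[0][1]
    return 1 - majority_count / len(labels)


def survival_rate(records, sex, max_fare):
    """Survival share among passengers of one sex who paid less than max_fare."""
    subset = [survived for (s, fare, survived) in records
              if s == sex and fare < max_fare]
    return sum(subset) / len(subset) if subset else None


labels = [survived for (_, _, survived) in passengers]
print(f"root error: {root_error(labels):.2f}")
print(f"female, fare < 13: {survival_rate(passengers, sex=1, max_fare=13):.2f}")
```

Under that assumption the root error depends only on the Survived column, so adding an input variable changes it only if the set of rows entering the model changes.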