ECS171 Winter 2024 Midterm Study Guide

Cheat sheet

Sigmoid and derivative of sigmoid:
σ(x) = 1 / (1 + e^(-x))
σ'(x) = (d/dx) σ(x) = σ(x)(1 - σ(x))

ReLU and derivative of ReLU:
f(x) = max(0, x) = { x if x > 0; 0 otherwise }
f'(x) = { 1 if x > 0; 0 otherwise }

Gradient descent weight update:
w_(t+1) = w_t - α·∇J(w_t)

Errors:
MSE = (1/N) Σ_{i=1}^{N} (y_i - ŷ_i)²
SSE = Σ_{i=1}^{N} (y_i - ŷ_i)²

Variance:
Var = σ² = Σ_{i=1}^{N} (x_i - x̄)² / N

Chain rule:
If h(x) = f(g(x)), then h'(x) = f'(g(x)) · g'(x)

Performance metrics:
Precision = TP / (TP + FP)
Recall (True Positive Rate) = TP / (TP + FN)
F1 score = 2 * (Precision * Recall) / (Precision + Recall)
Accuracy = (TP + TN) / (TP + TN + FP + FN)
(True Negative Rate) TNR = TN / (TN + FP)
(False Positive Rate) FPR = FP / (FP + TN)
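These cheat-sheet functions are easy to sanity-check numerically; a minimal NumPy sketch (the function names are ours, not from the course):

```python
import numpy as np

def sigmoid(x):
    # σ(x) = 1 / (1 + e^(-x))
    return 1.0 / (1.0 + np.exp(-x))

def sigmoid_deriv(x):
    # σ'(x) = σ(x) * (1 - σ(x))
    s = sigmoid(x)
    return s * (1.0 - s)

def relu(x):
    # f(x) = max(0, x)
    return np.maximum(0.0, x)

def relu_deriv(x):
    # f'(x) = 1 if x > 0, else 0
    return np.where(np.asarray(x) > 0, 1.0, 0.0)

print(sigmoid(0.0), sigmoid_deriv(0.0))   # 0.5 0.25
print(relu(-2.0), relu(3.0))              # 0.0 3.0
print(relu_deriv(-2.0), relu_deriv(3.0))  # 0.0 1.0
```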
Q1: In machine learning, what is the dropout technique, and how does it help prevent overfitting in neural networks?
Answer: Dropout is a regularization technique in neural networks where, during training, a random set of neurons are "dropped out" (turned off) with a certain probability. This helps to prevent overfitting by making the network more robust and less reliant on specific neurons. For example, consider a neural network with dropout applied to a hidden layer. If the dropout rate is set to 0.5, then during training, for each forward and backward pass, half of the neurons in that layer will be randomly deactivated. The network therefore has to learn to be effective even when only a fraction of its neurons is active. This prevents the network from becoming overly specialized in recognizing patterns in the training data and helps it generalize better to unseen data.
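To make the mechanics concrete, here is a minimal sketch of the common "inverted dropout" formulation (the helper name and the scaling convention are our assumptions, not something specified in the question):

```python
import numpy as np

rng = np.random.default_rng(0)

def dropout(activations, rate=0.5, training=True):
    # Zero each unit with probability `rate` during training and scale the
    # survivors by 1/(1-rate), so the expected activation is unchanged and
    # no rescaling is needed at test time.
    if not training:
        return activations
    mask = rng.random(activations.shape) >= rate  # keep with prob. 1-rate
    return activations * mask / (1.0 - rate)

h = np.ones(10)              # activations of a 10-unit hidden layer
print(dropout(h, rate=0.5))  # roughly half the entries are zeroed
```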
Q2: Given the hours of exercising per week measured in hours (x1) and the time since the last Covid-19 infection measured in weeks (x2), the model predicts the probability that the person will be re-infected with Covid-19 in the next 5 months. The model follows the linear regression ŷ = 0.4 - 0.05·x1 + 0.07·x2. For each of the data pairs in the training and testing sets, do the following:
(a) Compute the predicted output of the given regression model for each set of inputs
(b) Compute the bias (SSE) for each set of inputs
(c) Compute the variance.
(d) Is the model overfit, underfit, or a good fit? Justify your answer using the following metrics for the base case: SSE_train = 0.050, SSE_test = 0.049, Variance = 0.01
Where needed, round your answer to 4 d.p.
Training set:
x1     x2     y      ŷ      (y - ŷ)²
0.0    4.0    0.70
3.0    10.0   0.95
2.0    3.0    0.60
5.0    1.0    0.15
8.0    5.5    0.25
12.0   7.5    0.23
10.0   4.0    0.20
3.0    2.0    0.30

Testing set:

x1     x2     y      ŷ      (y - ŷ)²
1.0    9.0    0.95
9.0    6.0    0.20
7.0    3.0    0.25
5.0    5.0    0.45

Answer:
Training set:

x1     x2     y      ŷ      (y - ŷ)²
0.0    4.0    0.70   0.68   0.0004
3.0    10.0   0.95   0.95   0
2.0    3.0    0.60   0.51   0.0081
5.0    1.0    0.15   0.22   0.0049
8.0    5.5    0.25   0.385  0.0182
12.0   7.5    0.23   0.325  0.0090
10.0   4.0    0.20   0.18   0.0004
3.0    2.0    0.30   0.39   0.0081
Testing set:

x1     x2     y      ŷ      (y - ŷ)²
1.0    9.0    0.95   0.98   0.0009
9.0    6.0    0.20   0.37   0.0289
7.0    3.0    0.25   0.26   0.0001
5.0    5.0    0.45   0.50   0.0025

SSE_train = sum of all (y - ŷ)² = 0.0492
SSE_test = sum of all (y - ŷ)² = 0.0324
Variance = 0.0492 - 0.0324 = 0.0168
Both SSE_train and SSE_test are lower than in the base case (0.050 and 0.049), and the test error is below the training error, so the model is not overfitting: it is a better fit than the base case.
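A few lines of Python reproduce the predictions and totals above (a sketch; the `sse` helper is ours):

```python
# Each tuple is (x1, x2, y); the model is y_hat = 0.4 - 0.05*x1 + 0.07*x2.
train = [(0.0, 4.0, 0.70), (3.0, 10.0, 0.95), (2.0, 3.0, 0.60),
         (5.0, 1.0, 0.15), (8.0, 5.5, 0.25), (12.0, 7.5, 0.23),
         (10.0, 4.0, 0.20), (3.0, 2.0, 0.30)]
test = [(1.0, 9.0, 0.95), (9.0, 6.0, 0.20),
        (7.0, 3.0, 0.25), (5.0, 5.0, 0.45)]

def sse(data):
    total = 0.0
    for x1, x2, y in data:
        y_hat = 0.4 - 0.05 * x1 + 0.07 * x2
        total += (y - y_hat) ** 2
    return total

print(round(sse(train), 4))              # ~0.0492
print(round(sse(test), 4))               # ~0.0324
print(round(sse(train) - sse(test), 4))  # variance: ~0.0168
```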
Q3: Find the coefficients of a polynomial of degree 2 which give the lowest mean squared error.
y_predicted = ax^2 + bx + c, x = [1, 2, 3], y_actual = [4, 13, 20]
Coefficient Set 1: a = 1, b = 2, c = 3
Coefficient Set 2: a = 3, b = 5, c = 2
Coefficient Set 3: a = 1, b = 1, c = 5
Answer: Calculate y_predicted for all 3 values of x for each coefficient set.
For Coefficient Set 1 (a = 1, b = 2, c = 3):
y_predicted when x=1 is: 1*(1*1) + 2*1 + 3 = 6
y_predicted when x=2 is: 1*(2*2) + 2*2 + 3 = 11
y_predicted when x=3 is: 1*(3*3) + 2*3 + 3 = 18
MSE for Coefficient Set 1 = ((4-6)**2 + (13-11)**2 + (20-18)**2) / 3 = 4
For Coefficient Set 2 (a = 3, b = 5, c = 2):
y_predicted when x=1 is: 3*(1*1) + 5*1 + 2 = 10
y_predicted when x=2 is: 3*(2*2) + 5*2 + 2 = 24
y_predicted when x=3 is: 3*(3*3) + 5*3 + 2 = 44
MSE for Coefficient Set 2 = ((4-10)**2 + (13-24)**2 + (20-44)**2) / 3 = (36 + 121 + 576) / 3 = 244.33
For Coefficient Set 3 (a = 1, b = 1, c = 5):
y_predicted when x=1 is: 1*(1*1) + 1*1 + 5 = 7
y_predicted when x=2 is: 1*(2*2) + 1*2 + 5 = 11
y_predicted when x=3 is: 1*(3*3) + 1*3 + 5 = 17
MSE for Coefficient Set 3 = ((4-7)**2 + (13-11)**2 + (20-17)**2) / 3 = (9 + 4 + 9) / 3 = 7.33
The lowest MSE is achieved for Coefficient Set 1, so that is the best choice of coefficients.
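This comparison is quick to verify in code (a sketch; the dictionary layout is ours):

```python
# Quick check of the three candidate coefficient sets from Q3.
x = [1, 2, 3]
y_actual = [4, 13, 20]
coeff_sets = {1: (1, 2, 3), 2: (3, 5, 2), 3: (1, 1, 5)}

for label, (a, b, c) in coeff_sets.items():
    y_pred = [a * xi**2 + b * xi + c for xi in x]
    mse = sum((ya - yp) ** 2 for ya, yp in zip(y_actual, y_pred)) / len(x)
    print(f"set {label}: y_pred={y_pred}, MSE={mse:.2f}")
# set 1: y_pred=[6, 11, 18], MSE=4.00
# set 2: y_pred=[10, 24, 44], MSE=244.33
# set 3: y_pred=[7, 11, 17], MSE=7.33
```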
Use the graph below to answer Q4-6:

Q4: The neural network is designed with ReLU added after each hidden neuron as the activation function, and a logistic (sigmoid) applied at the end before we get the predicted values (y_hat). Use the table below to determine which class the point x1 = 7, x2 = 10, x3 = 9 belongs to. (Assume that if an output has a value over the threshold τ = 0.9, the point is classified into that class.)
W_1a = 0.4   W_a1 = 0.2   b_a = 0.7
W_1b = -0.7  W_a2 = 0.5   b_b = 0.4
W_2a = 0.6   W_b1 = -0.1  b_1 = 0.9
W_2b = -0.5  W_b2 = 0.6   b_2 = -0.8
W_3a = 0.3
W_3b = 0.7
Answer:
H_a = 7*0.4 + 10*0.6 + 9*0.3 + 0.7 = 12.2
H_b = 7*(-0.7) + 10*(-0.5) + 9*0.7 + 0.4 = -3.2
O_1 = max(0, 12.2)*0.2 + max(0, -3.2)*(-0.1) + 0.9 = 3.34
y_hata = σ(O_1) = 1/(1 + e^(-3.34)) = 0.9658
O_2 = max(0, 12.2)*0.5 + max(0, -3.2)*0.6 + (-0.8) = 5.3
y_hatb = σ(O_2) = 1/(1 + e^(-5.3)) = 0.9950
Both outputs exceed the threshold τ = 0.9, so this data point belongs to both category a and category b.
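The forward pass is small enough to script end to end (a sketch; the helper lambdas are ours, weight names follow the table):

```python
import math

W_1a, W_2a, W_3a, b_a = 0.4, 0.6, 0.3, 0.7
W_1b, W_2b, W_3b, b_b = -0.7, -0.5, 0.7, 0.4
W_a1, W_b1, b_1 = 0.2, -0.1, 0.9
W_a2, W_b2, b_2 = 0.5, 0.6, -0.8

relu = lambda z: max(0.0, z)
sigmoid = lambda z: 1.0 / (1.0 + math.exp(-z))

x1, x2, x3 = 7, 10, 9
H_a = W_1a * x1 + W_2a * x2 + W_3a * x3 + b_a    # 12.2
H_b = W_1b * x1 + W_2b * x2 + W_3b * x3 + b_b    # -3.2
O_1 = relu(H_a) * W_a1 + relu(H_b) * W_b1 + b_1  # 3.34
O_2 = relu(H_a) * W_a2 + relu(H_b) * W_b2 + b_2  # 5.3
y_hata, y_hatb = sigmoid(O_1), sigmoid(O_2)      # ~0.9658, ~0.9950
print(y_hata > 0.9, y_hatb > 0.9)                # True True -> both classes
```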
Q5: If the data point from Q4 has y_a = 1 and y_b = 0, update the weights w_a1, w_a2, w_1a, and w_3a, using learning rate η = 0.2. Assume we use SSE for the loss in this question.
Answer: ∂error/∂w_a1 = ∂error/∂y_hata*∂y_hata/∂O_1*∂O_1/∂w_a1
=-(y_a-y_hata)*y_hata*(1-y_hata)*max(0,H_a) = -(1-0.9658)*0.9658*(1-0.9658)*max(0,12.2) = -0.01378 W_a1new = 0.2-0.2*(-0.01378) =0.2027 ∂error/∂w_a2 = ∂error/∂y_hatb*∂y_hatb/∂O_2*∂O_2/∂w_a2
=-(y_b-y_hatb)*y_hatb*(1-y_hatb)*max(0,H_a) = -(0-0.9950)*0.9950*(1-0.9950)*max(0,12.2)
= 0.06039 W_a2new = 0.5-0.2*0.06039 =0.4879 ∂error/∂w_1a = ∂error/∂ha_out*∂ha_out/∂h_a*∂h_a/∂w_1a
=(-(y_a-y_hata) * y_hata * (1-y_hata) * w_a1 + (-(y_b-y_hatb) * y_hatb * (1-y_hatb)*w_a2)*1*X1 = (-(1-0.9658)*0.9658*(1-0.9658)*0.2+-(0-0.9950)*0.9950*(1-0.9950)*0.5) *1*7 = 0.01574 W_1anew = 0.4-0.2*0.01574 =0.3969 ∂error/∂w_3a = ∂error/∂ha_out*∂ha_out/∂h_a*∂h_a/∂w_3a
=(-(y_a-y_hata) * y_hata * (1-y_hata) * w_a1 + -(y_b-y_hatb) * y_hatb * (1-y_hatb)*w_a2)*1*X3 = (-(1-0.9658)*0.9658*(1-0.9658)*0.2+-(0-0.9950)*0.9950*(1-0.9950)*0.5) *1*9 = 0.02024 W_3anew = 0.3-0.2*0.02024 =0.2959 Q6: Similarly, update all the biases. Answer: ∂error/∂b_1 = ∂error/∂y_hata*∂y_hata/∂O_1*∂O_1/∂b_1
= -(1-0.9658)*0.9658*(1-0.9658)*1 = - 0.00112 B_1new = 0.9 - 0.2*(-0.00112) = 0.9002 ∂error/∂b_2 = ∂error/∂y_hatb*∂y_hatb/∂O_2*∂O_2/∂b_2
= -(0-0.9950)*0.9950*(1-0.9950)*1 = 0.00496 B_2new = -0.8 - 0.2*(0.00496) = -0.8010 ∂error/∂b_a = ∂error/∂ha_out*∂ha_out/∂h_a*∂h_a/∂b_a
= (-(1-0.9658)*0.9658*(1-0.9658)*0.2 + -(0-0.9950)*0.9950*(1-0.9950)*0.5)*1*1 = 0.00225
B_anew = 0.7 - 0.2*(0.00225) = 0.6996
∂error/∂b_b = ∂error/∂hb_out*∂hb_out/∂h_b*∂h_b/∂b_b
= (-(1-0.9658)*0.9658*(1-0.9658)*(-0.1) + -(0-0.9950)*0.9950*(1-0.9950)*0.6)*0*1 = 0 (the factor 0 is ReLU'(H_b), since H_b = -3.2 < 0)
B_bnew = 0.4 - 0.2*(0) = 0.4
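The Q5 and Q6 updates can be cross-checked with a short script. Variable names (`d1`, `d2`, `d_ha`) are ours, and values are carried at full precision, so the last digit can differ slightly from the hand-rounded numbers above:

```python
import math

sigmoid = lambda z: 1.0 / (1.0 + math.exp(-z))
relu = lambda z: max(0.0, z)
relu_prime = lambda z: 1.0 if z > 0 else 0.0

x1, x3 = 7, 9
H_a, H_b = 12.2, -3.2                           # hidden pre-activations (Q4)
y_hata, y_hatb = sigmoid(3.34), sigmoid(5.3)    # outputs from Q4
y_a, y_b, eta = 1, 0, 0.2

# Output deltas for SSE loss + sigmoid output: -(y - y_hat)*y_hat*(1 - y_hat)
d1 = -(y_a - y_hata) * y_hata * (1 - y_hata)
d2 = -(y_b - y_hatb) * y_hatb * (1 - y_hatb)

print(0.2 - eta * d1 * relu(H_a))   # w_a1: ~0.2028 (Q5: 0.2027)
print(0.5 - eta * d2 * relu(H_a))   # w_a2: ~0.4880 (Q5: 0.4879)
d_ha = (d1 * 0.2 + d2 * 0.5) * relu_prime(H_a)  # error reaching hidden unit a
print(0.4 - eta * d_ha * x1)        # w_1a: ~0.3969
print(0.3 - eta * d_ha * x3)        # w_3a: ~0.2960 (Q5: 0.2959)
print(0.9 - eta * d1)               # b_1:  ~0.9002
print(-0.8 - eta * d2)              # b_2:  ~-0.8010
print(0.7 - eta * d_ha)             # b_a:  ~0.6996
d_hb = (d1 * -0.1 + d2 * 0.6) * relu_prime(H_b)  # ReLU'(H_b) = 0
print(0.4 - eta * d_hb)             # b_b:  0.4 (unchanged)
```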
Q7: In logistic regression, what is the hypothesis function and what does the predicted output of the hypothesis function represent, given an input data point x(1)?
Answer: The hypothesis function is a sigmoid function of the linear combination of the weights and input variables: z(i) = Σ_j w_j·x_j(i) = wᵀx(i), and g(x(i); w) = 1/(1 + e^(-wᵀx(i))). The predicted output g(x(1); w) represents the probability that the given input x(1) belongs to the positive category (such as the probability of going for a hike).
Q8: Joshua claims that in Machine Learning, most of the data are used in testing because accuracy in predicting unseen data is more important than the accuracy of the seen data in the training set. Do you agree with this claim? Justify your answer.
Answer: Disagree. Most of the data is used for training, because the model learns its parameters from the training set: a model that cannot fit the training data well has no chance of predicting unseen data well. The test set only needs to be large enough to give a reliable estimate of generalization performance.
Q9: Can the OLS method be used to train a logistic regression model as a common practice? Explain your answer.
Answer: Ordinary Least Squares (OLS) is a commonly used method for training linear regression models, where the goal is to predict a continuous numerical output. However, OLS is not typically used to train logistic regression models, which are used for binary or multi-class classification tasks. Here's why OLS is not a common practice for training logistic regression models:
Different objectives: OLS is designed to minimize the sum of squared differences between the predicted and actual numerical target values, which works well for regression problems.
Logistic regression, on the other hand, aims to model the probability of a binary outcome or a categorical outcome in multiple classes. It uses the logistic (sigmoid) function to map input features to a probability value between 0 and 1.
Different error metrics: OLS minimizes the mean squared error (MSE) or similar metrics that are appropriate for regression tasks. Logistic regression uses a different loss function, typically the logistic loss (or log loss), which is appropriate for classification problems.
Non-linearity: Logistic regression models the relationship between the features and the binary outcome using a sigmoid (logistic) function, which introduces non-linearity into the model. This non-linearity is crucial for capturing the probability distribution and decision boundary for classification tasks. OLS, which is designed for linear regression, assumes a linear relationship between the features and the target variable, which is not suitable for capturing the non-linear nature of classification problems.
To train a logistic regression model, you typically use techniques like maximum likelihood estimation or gradient-based optimization methods (e.g., gradient descent) to find the optimal coefficients that maximize the likelihood of the observed class labels given the input features.
In summary, OLS is not a common practice for training logistic regression models because they have different objectives, error metrics, and modeling requirements. Logistic regression is specifically designed for classification tasks, and it employs a different mathematical approach to achieve this goal.
Q10: Show mathematically how to obtain the weight of a linear regression model with attribute X using the OLS method.
Answer: In linear regression, the goal is to find the weights (coefficients) of the model that minimize the sum of squared differences between the predicted values and the actual target values. This can be done using the Ordinary Least Squares (OLS) method. Assuming you have a dataset with N data points, where each data point has a feature X and a target variable y, the linear regression model is represented as:
y = b0 + b1*X
where:
y is the target variable for a data point.
X is the feature or attribute. b0 is the intercept (bias) term. b1 is the weight or coefficient associated with the feature X.
The OLS method aims to find the values of b0 and b1 that minimize the sum of squared differences between the predicted values and the actual target values for all N data points. This is expressed as the loss function:
L(b0, b1) = Σ_{i=1}^{N} (y_i - b0 - b1·x_i)²
To find the optimal values of b0 and b1, you differentiate the loss function with respect to b0 and b1 and set the derivatives equal to zero to solve for the optimal coefficients. Here are the partial derivatives:
Partial derivative with respect to b0: ∂L/∂b0 = -2 Σ_{i=1}^{N} (y_i - b0 - b1·x_i) = 0. Solving this equation for b0 gives the optimal intercept term.
Partial derivative with respect to b1: ∂L/∂b1 = -2 Σ_{i=1}^{N} x_i·(y_i - b0 - b1·x_i) = 0. Solving this equation for b1 gives the optimal weight or coefficient associated with the feature X.
The solutions for b0 and b1 that make the derivatives equal to zero are the values that minimize the loss function and provide the best-fitting linear regression model for the given data. Finally:
b1 = Σ_{i=1}^{N} (x_i - x̄)(y_i - ȳ) / Σ_{i=1}^{N} (x_i - x̄)²
b0 = ȳ - b1·x̄
Q11: What is the number of neurons in the input layer of an ANN if the number of attributes in the dataset is 3?
Answer: In an Artificial Neural Network (ANN), the number of neurons in the input layer is determined by the number of attributes or features in your dataset. Each attribute corresponds to one neuron in the input layer. So, if your dataset has 3 attributes, you should have 3 neurons in the input layer. Each neuron in the input layer takes one of the features as input and passes it to the subsequent layers in the neural network.
Q12: Classify the feature based on the weight and threshold given below:

Feature 1    Classification
0.5
0.7
1.1
1.5
1.3

The weight is as follows: Weight combination 1: w1 = 0.5. Threshold: t = 0.6. Use the sigmoid function to compute the probability.
Answer:

Feature 1    Classification
0.5          0
0.7          0
1.1          1
1.5          1
1.3          1

The weighted sum for Weight combination 1 is: weighted sum = w1 * x, where w1 = 0.5. Using the sigmoid function:
sigmoid(x) = 1 / (1 + exp(-x))
We can compute the weighted sum and probability of classification as follows:
Observation 1: weighted sum = w1 * x1 = 0.5 * 0.5 = 0.25; sigmoid(weighted sum) = 1 / (1 + exp(-0.25)) = 0.562
Observation 2: weighted sum = w1 * x1 = 0.5 * 0.7 = 0.35; sigmoid(weighted sum) = 1 / (1 + exp(-0.35)) = 0.587
Observation 3: weighted sum = w1 * x1 = 0.5 * 1.1 = 0.55; sigmoid(weighted sum) = 1 / (1 + exp(-0.55)) = 0.634
Observation 4: weighted sum = w1 * x1 = 0.5 * 1.5 = 0.75; sigmoid(weighted sum) = 1 / (1 + exp(-0.75)) = 0.679
Observation 5: weighted sum = w1 * x1 = 0.5 * 1.3 = 0.65; sigmoid(weighted sum) = 1 / (1 + exp(-0.65)) = 0.657
If sigmoid(weighted sum) >= t, classify as 1 (positive); if sigmoid(weighted sum) < t, classify as 0 (negative).
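The whole classification can be done in one loop (a sketch; variable names are ours):

```python
import math

w1, t = 0.5, 0.6
features = [0.5, 0.7, 1.1, 1.5, 1.3]

for x in features:
    p = 1.0 / (1.0 + math.exp(-w1 * x))  # sigmoid of the weighted sum
    label = 1 if p >= t else 0
    print(f"x={x}: p={p:.3f} -> class {label}")
# x=0.5: p=0.562 -> class 0
# x=0.7: p=0.587 -> class 0
# x=1.1: p=0.634 -> class 1
# x=1.5: p=0.679 -> class 1
# x=1.3: p=0.657 -> class 1
```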
Q13: Given a dataset D that has 6 data points shown in the table below, you are going to develop a least mean square regression model in the form
ŷ = w0 + w1·x1 + w2·x2
where the weights W = [w0, w1, w2] make the model have minimum error.

x1    x2    y
4     1     2
2     8     -14
1     0     1
3     2     -1
1     4     -7
6     7     -8

Use the gradient descent method to update the weights W of the regression model. The current weights are W_0 = [0.1, 0.2, 0.3], learning rate α = 0.02. Include all your calculations for one round of weight updates. Round to 2 decimal places.
Hint: use the formula
∂J/∂w_j = -Σ_{i=1}^{N} (y_i - ŷ_i)·x_ij
to calculate the gradient of the cost function, and to update the weights use the formula
W_(t+1) = W_t - α·∇J(W_t)
Answer:
ŷ = w0 + w1·x1 + w2·x2 = [1.2, 2.9, 0.3, 1.3, 1.5, 3.4]
∂J/∂w0 = -Σ_{i=1}^{N} (y_i - ŷ_i) = 37.60
∂J/∂w1 = -Σ_{i=1}^{N} (y_i - ŷ_i)·x_i1 = 113.70
∂J/∂w2 = -Σ_{i=1}^{N} (y_i - ŷ_i)·x_i2 = 252.80
∇J(W_0) = [∂J/∂w0, ∂J/∂w1, ∂J/∂w2] = [37.60, 113.70, 252.80]
W_1 = W_0 - α·∇J(W_0) = [-0.65, -2.07, -4.76]
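One full round of these updates can be verified with NumPy (a sketch; the bias-column trick is our implementation choice):

```python
import numpy as np

# Data from Q13: columns are x1, x2; targets are y.
X = np.array([[4, 1], [2, 8], [1, 0], [3, 2], [1, 4], [6, 7]], dtype=float)
y = np.array([2, -14, 1, -1, -7, -8], dtype=float)
w = np.array([0.1, 0.2, 0.3])               # [w0, w1, w2]
alpha = 0.02

Xb = np.hstack([np.ones((len(X), 1)), X])   # prepend a column of 1s for w0
y_hat = Xb @ w                              # [1.2, 2.9, 0.3, 1.3, 1.5, 3.4]
grad = -(y - y_hat) @ Xb                    # [37.6, 113.7, 252.8]
w_new = w - alpha * grad
print(np.round(w_new, 2))                   # [-0.65 -2.07 -4.76]
```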
Q14: What is the difference between Batch GD (Gradient Descent) and Stochastic GD? Why is Stochastic more widely used compared to Batch GD and Newton's method? Answer: BGD uses the entire dataset to compute gradients in each iteration. SGD uses one random data point in each iteration. BGD is computationally expensive, especially for large datasets. SGD is more efficient as it processes only one data point at a time. Newton's method can be very computationally expensive due to Hessian computations. Therefore, SGD is more widely used than BGD and Newton's method because it strikes a balance between efficiency and effectiveness.
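The contrast is easy to see in code; a sketch for a linear model with SSE loss (function names and data layout are our own):

```python
import numpy as np

rng = np.random.default_rng(0)

def batch_gd_step(w, X, y, alpha):
    # Batch GD: one update uses the gradient over the entire dataset.
    grad = -(y - X @ w) @ X
    return w - alpha * grad

def sgd_step(w, X, y, alpha):
    # Stochastic GD: one update uses a single randomly drawn data point,
    # so each step is much cheaper (but noisier).
    i = rng.integers(len(X))
    grad = -(y[i] - X[i] @ w) * X[i]
    return w - alpha * grad
```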
Q15: In batch gradient descent, if the number of batches is equal to the number of observations in the training dataset (i.e., each batch contains a single observation), the gradient descent approach is the same as "stochastic gradient descent". Answer: True
Q16: In the context of training artificial neural networks, which of the following best describes the role of gradient descent?
a) It's a type of activation function applied to the neurons.
b) It's the process of adding layers to the neural network.
c) It's an optimization algorithm used to minimize the error by adjusting the weights.
d) It's a method to regularize the network and prevent overfitting.
Answer: c) It's an optimization algorithm used to minimize the error by adjusting the weights.
Q17: Which of the following statements about L1 and L2 regularization is correct?
A. L1 regularization adds the squared value of the loss to the weight update
B. L2 regularization adds the absolute value of the weight to the loss function
C. L1 regularization adds the absolute value of the weight to the loss function
D. L2 regularization adds the squared value of the loss to the weight update
Answer: C
Q18: True or False: The graph on the right is an example of overfitting.
Answer: True
Q19: True or False: Newton's method completely outperforms gradient descent in any setting because it can always find the minimum loss function value with fewer weight updates.
Answer: False
Q20: Which of the following is the correct formula to find the weights of a linear regression model with the OLS method?
Answer: A. W = (XᵀX)⁻¹XᵀY (the normal equation)
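A quick numerical check of the normal equation (shown here with NumPy on the Q13 data; the variable names are ours):

```python
import numpy as np

# OLS closed form: W = (X^T X)^(-1) X^T Y, using the Q13 data as an example.
X = np.array([[4, 1], [2, 8], [1, 0], [3, 2], [1, 4], [6, 7]], dtype=float)
y = np.array([2, -14, 1, -1, -7, -8], dtype=float)
Xb = np.hstack([np.ones((len(X), 1)), X])  # prepend a bias column for w0

W = np.linalg.inv(Xb.T @ Xb) @ Xb.T @ y    # [w0, w1, w2]
print(np.round(W, 4))
```

In practice, `np.linalg.solve(Xb.T @ Xb, Xb.T @ y)` is preferred over forming the explicit inverse, for numerical stability.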
Q21:
A company conducted a study to analyze the performance of two machine learning models, Model A and Model B, on a dataset of 500 instances. The confusion matrices for both models are as follows:
Model A:
Predicted Positive Predicted Negative
Actual Positive 50 10
Actual Negative 20 420
Model B:
Predicted Positive Predicted Negative
Actual Positive 60 40
Actual Negative 30 370
Using these confusion matrices, calculate and compare the following performance metrics for Model A and Model B: Accuracy, Precision for the positive class, Recall (Sensitivity) for the positive class, F1 score for the positive class.
Based on these metrics, determine which model (A or B) performed better on the dataset
Answer:
Accuracy:
For Model A: (TP + TN) / (TP + TN + FP + FN) = (50 + 420) / (50 + 10 + 20 + 420) = 470 / 500 = 0.94 or 94%.
For Model B: (TP + TN) / (TP + TN + FP + FN) = (60 + 370) / (60 + 40 + 30 + 370) = 430 / 500 = 0.86 or 86%.
Precision for the positive class:
For Model A: TP / (TP + FP) = 50 / (50 + 20) = 50 / 70 = 0.7143 or 71.43%.
For Model B: TP / (TP + FP) = 60 / (60 + 30) = 60 / 90 = 0.6667 or 66.67%.
Recall (Sensitivity) for the positive class:
For Model A: TP / (TP + FN) = 50 / (50 + 10) = 50 / 60 = 0.8333 or 83.33%.
For Model B: TP / (TP + FN) = 60 / (60 + 40) = 60 / 100 = 0.6 or 60%.
F1 score for the positive class:
For Model A: 2 * (precision * recall) / (precision + recall) = 2 * (0.8333 * 0.7143) / (0.8333 + 0.7143) = 0.7692 or 76.92%.
For Model B: 2 * (precision * recall) / (precision + recall) = 2 * (0.6 * 0.6667) / (0.6 + 0.6667) = 0.6316 or 63.16%.
Based on these metrics, we can conclude that Model A performed better than Model B on the dataset. Model A achieved higher accuracy, precision, recall, and F1 score for the positive class compared to Model B.
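These calculations are easy to automate; a minimal helper (ours) reproduces all four metrics for both models:

```python
# Compute Q21's metrics from confusion-matrix counts.
def metrics(tp, fn, fp, tn):
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return {
        "accuracy": (tp + tn) / (tp + tn + fp + fn),
        "precision": precision,
        "recall": recall,
        "f1": 2 * precision * recall / (precision + recall),
    }

print(metrics(50, 10, 20, 420))  # Model A: 0.94, 0.7143, 0.8333, 0.7692
print(metrics(60, 40, 30, 370))  # Model B: 0.86, 0.6667, 0.6000, 0.6316
```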