Backward Step (i.e. back propagation) and Gradient Descent When training a neural network model we use back propagation to update the weight/bias parameters using gradient descent (or some variation of it). Gradient descent utilizes the chain rule, since each layer of a neural network can be described by a function, and the activation function (and additional multiple layers) can be described by a composition of functions. Suppose that we have the following functions: fi (x, w, b) = w·x + f2 (:) = ReLU (·) f (:) = f2 (fi (•)) note: the dot is used a placeholder here, so you could assume it to be u, or x, or any variable) Assume that we are going to carry out back propagation and a gradient descent step. Suppose that Ne = 0.5, and that x = 0.9, and that the learning rate is equal to 0.01. You can assume that b > 0. What will be the value of wt+1? note: be sure to carry out your answer to at least two (2) decimal places, i.e. 1e-2 precision)
Backward Step (i.e. back propagation) and Gradient Descent When training a neural network model we use back propagation to update the weight/bias parameters using gradient descent (or some variation of it). Gradient descent utilizes the chain rule, since each layer of a neural network can be described by a function, and the activation function (and additional multiple layers) can be described by a composition of functions. Suppose that we have the following functions: fi (x, w, b) = w·x + f2 (:) = ReLU (·) f (:) = f2 (fi (•)) note: the dot is used a placeholder here, so you could assume it to be u, or x, or any variable) Assume that we are going to carry out back propagation and a gradient descent step. Suppose that Ne = 0.5, and that x = 0.9, and that the learning rate is equal to 0.01. You can assume that b > 0. What will be the value of wt+1? note: be sure to carry out your answer to at least two (2) decimal places, i.e. 1e-2 precision)
Computer Networking: A Top-Down Approach (7th Edition)
7th Edition
ISBN:9780133594140
Author:James Kurose, Keith Ross
Publisher:James Kurose, Keith Ross
Chapter1: Computer Networks And The Internet
Section: Chapter Questions
Problem R1RQ: What is the difference between a host and an end system? List several different types of end...
Related questions
Question
5
Expert Solution
This question has been solved!
Explore an expertly crafted, step-by-step solution for a thorough understanding of key concepts.
Step by step
Solved in 3 steps with 1 images
Recommended textbooks for you
Computer Networking: A Top-Down Approach (7th Edi…
Computer Engineering
ISBN:
9780133594140
Author:
James Kurose, Keith Ross
Publisher:
PEARSON
Computer Organization and Design MIPS Edition, Fi…
Computer Engineering
ISBN:
9780124077263
Author:
David A. Patterson, John L. Hennessy
Publisher:
Elsevier Science
Network+ Guide to Networks (MindTap Course List)
Computer Engineering
ISBN:
9781337569330
Author:
Jill West, Tamara Dean, Jean Andrews
Publisher:
Cengage Learning
Computer Networking: A Top-Down Approach (7th Edi…
Computer Engineering
ISBN:
9780133594140
Author:
James Kurose, Keith Ross
Publisher:
PEARSON
Computer Organization and Design MIPS Edition, Fi…
Computer Engineering
ISBN:
9780124077263
Author:
David A. Patterson, John L. Hennessy
Publisher:
Elsevier Science
Network+ Guide to Networks (MindTap Course List)
Computer Engineering
ISBN:
9781337569330
Author:
Jill West, Tamara Dean, Jean Andrews
Publisher:
Cengage Learning
Concepts of Database Management
Computer Engineering
ISBN:
9781337093422
Author:
Joy L. Starks, Philip J. Pratt, Mary Z. Last
Publisher:
Cengage Learning
Prelude to Programming
Computer Engineering
ISBN:
9780133750423
Author:
VENIT, Stewart
Publisher:
Pearson Education
Sc Business Data Communications and Networking, T…
Computer Engineering
ISBN:
9781119368830
Author:
FITZGERALD
Publisher:
WILEY