4 Backpropagation Consider the following network with sigmoid activation functions in the hidden and output neurons, and binary cross entropy loss function. Assume we initialize the weights as follows: W11 = 1, W12 = 0.5, W21 = 0.1, W 22 = 0.2, W 13 = 1, W23 = 0.5. The biases for the hidden nodes are initialized as bl = 0.1, 62 = 0.1, and for the the output node is initialized as 63 = 0.5. Backward pass: Calculate the derivative of the loss w.r.t W11. What's the value of the derivative for the initialized weights, input 1 = 0.1, input 2 = 0.2, and label = 1? How do you update W11 using gradient descent, based on the derivative you derived in the previous part and learning rate ŋ = 0.1? What is the value of the loss using the updated W11? How did it change after the update?

Database System Concepts
7th Edition
ISBN:9780078022159
Author:Abraham Silberschatz Professor, Henry F. Korth, S. Sudarshan
Publisher:Abraham Silberschatz Professor, Henry F. Korth, S. Sudarshan
Chapter1: Introduction
Section: Chapter Questions
Problem 1PE
icon
Related questions
Question
4 Backpropagation
Consider the following network with sigmoid activation functions in the hidden and output neurons,
and binary cross entropy loss function. Assume we initialize the weights as follows: W11 = 1, W12 =
0.5, W21 = 0.1, W22 = 0.2, W13 = 1, W23 = 0.5. The biases for the hidden nodes are initialized as
bl = 0.1, 62 = 0.1, and for the the output node is initialized as 63 = 0.5.
Backward pass: Calculate the derivative of the loss w.r.t W11. What's the value
of the derivative for the initialized weights, input 1 = 0.1, input 2 = 0.2, and label = 1?
How do you update W11 using gradient descent, based on the derivative you
derived in the previous part and learning rate 7 = 0.1? What is the value of the loss using
the updated W11? How did it change after the update?
Transcribed Image Text:4 Backpropagation Consider the following network with sigmoid activation functions in the hidden and output neurons, and binary cross entropy loss function. Assume we initialize the weights as follows: W11 = 1, W12 = 0.5, W21 = 0.1, W22 = 0.2, W13 = 1, W23 = 0.5. The biases for the hidden nodes are initialized as bl = 0.1, 62 = 0.1, and for the the output node is initialized as 63 = 0.5. Backward pass: Calculate the derivative of the loss w.r.t W11. What's the value of the derivative for the initialized weights, input 1 = 0.1, input 2 = 0.2, and label = 1? How do you update W11 using gradient descent, based on the derivative you derived in the previous part and learning rate 7 = 0.1? What is the value of the loss using the updated W11? How did it change after the update?
Input 1
W11
Hidden 1
W13
W21
Output
(3)
W12
W23
Input 2
Hidden 2
W22
Transcribed Image Text:Input 1 W11 Hidden 1 W13 W21 Output (3) W12 W23 Input 2 Hidden 2 W22
Expert Solution
steps

Step by step

Solved in 2 steps with 3 images

Blurred answer
Knowledge Booster
Use of XOR function
Learn more about
Need a deep-dive on the concept behind this application? Look no further. Learn more about this topic, computer-science and related others by exploring similar questions and additional content below.
Similar questions
  • SEE MORE QUESTIONS
Recommended textbooks for you
Database System Concepts
Database System Concepts
Computer Science
ISBN:
9780078022159
Author:
Abraham Silberschatz Professor, Henry F. Korth, S. Sudarshan
Publisher:
McGraw-Hill Education
Starting Out with Python (4th Edition)
Starting Out with Python (4th Edition)
Computer Science
ISBN:
9780134444321
Author:
Tony Gaddis
Publisher:
PEARSON
Digital Fundamentals (11th Edition)
Digital Fundamentals (11th Edition)
Computer Science
ISBN:
9780132737968
Author:
Thomas L. Floyd
Publisher:
PEARSON
C How to Program (8th Edition)
C How to Program (8th Edition)
Computer Science
ISBN:
9780133976892
Author:
Paul J. Deitel, Harvey Deitel
Publisher:
PEARSON
Database Systems: Design, Implementation, & Manag…
Database Systems: Design, Implementation, & Manag…
Computer Science
ISBN:
9781337627900
Author:
Carlos Coronel, Steven Morris
Publisher:
Cengage Learning
Programmable Logic Controllers
Programmable Logic Controllers
Computer Science
ISBN:
9780073373843
Author:
Frank D. Petruzella
Publisher:
McGraw-Hill Education