The cost function of a general neural network is defined as J(ŷ,y) 1 m L(VW), y() The loss function L(ỹ(¹), y() is defined by the logistic loss function L(¹),y) = [ylogy) + (1-y)log (1 - ¹)] Please list the stochastic gradient descent update rule, batch gradient descent update rule, and mini-batch gradient descent update rule. Explain the main difference of these three update rules.

Database System Concepts
7th Edition
ISBN:9780078022159
Author:Abraham Silberschatz Professor, Henry F. Korth, S. Sudarshan
Publisher:Abraham Silberschatz Professor, Henry F. Korth, S. Sudarshan
Chapter1: Introduction
Section: Chapter Questions
Problem 1PE
icon
Related questions
Question
The cost function of a general neural network is defined as
J(ŷ,y) =
m
// [4 (9(0), y(1)
The loss function L(ŷ), y() is defined by the logistic loss function
Ly, y) = [ylogy) + (1-y)log (1 - ¹)]
Please list the stochastic gradient descent update rule, batch gradient descent update rule, and
mini-batch gradient descent update rule. Explain the main difference of these three update
rules.
Transcribed Image Text:The cost function of a general neural network is defined as J(ŷ,y) = m // [4 (9(0), y(1) The loss function L(ŷ), y() is defined by the logistic loss function Ly, y) = [ylogy) + (1-y)log (1 - ¹)] Please list the stochastic gradient descent update rule, batch gradient descent update rule, and mini-batch gradient descent update rule. Explain the main difference of these three update rules.
Given a neural network, its structure is shown below. z." is the output of the linear part of ith
neuron in layer l; a¹ = g(z) is the output of the activation part of jth neuron in layer I and
g(z) is the activation function.
X₁
x₂
1
Xn
[1]
[1][1]
za
z[¹]|a²²]
Z3
[1] [¹]
a4
XXIS
[2][2]
z₁ a
[2][2]
z₂a₂
[2]
[3][3]
Z₁9₁
Transcribed Image Text:Given a neural network, its structure is shown below. z." is the output of the linear part of ith neuron in layer l; a¹ = g(z) is the output of the activation part of jth neuron in layer I and g(z) is the activation function. X₁ x₂ 1 Xn [1] [1][1] za z[¹]|a²²] Z3 [1] [¹] a4 XXIS [2][2] z₁ a [2][2] z₂a₂ [2] [3][3] Z₁9₁
Expert Solution
trending now

Trending now

This is a popular solution!

steps

Step by step

Solved in 4 steps with 2 images

Blurred answer
Knowledge Booster
Use of XOR function
Learn more about
Need a deep-dive on the concept behind this application? Look no further. Learn more about this topic, computer-science and related others by exploring similar questions and additional content below.
Similar questions
  • SEE MORE QUESTIONS
Recommended textbooks for you
Database System Concepts
Database System Concepts
Computer Science
ISBN:
9780078022159
Author:
Abraham Silberschatz Professor, Henry F. Korth, S. Sudarshan
Publisher:
McGraw-Hill Education
Starting Out with Python (4th Edition)
Starting Out with Python (4th Edition)
Computer Science
ISBN:
9780134444321
Author:
Tony Gaddis
Publisher:
PEARSON
Digital Fundamentals (11th Edition)
Digital Fundamentals (11th Edition)
Computer Science
ISBN:
9780132737968
Author:
Thomas L. Floyd
Publisher:
PEARSON
C How to Program (8th Edition)
C How to Program (8th Edition)
Computer Science
ISBN:
9780133976892
Author:
Paul J. Deitel, Harvey Deitel
Publisher:
PEARSON
Database Systems: Design, Implementation, & Manag…
Database Systems: Design, Implementation, & Manag…
Computer Science
ISBN:
9781337627900
Author:
Carlos Coronel, Steven Morris
Publisher:
Cengage Learning
Programmable Logic Controllers
Programmable Logic Controllers
Computer Science
ISBN:
9780073373843
Author:
Frank D. Petruzella
Publisher:
McGraw-Hill Education