2. LSTM Bookmark this page The diagram below shows a single LSTM unit that consists of Input, Output, and Forget gates The behavior of such a unit as a recurrent neural network is specified by a set of update equations. These equations define how the gates, "memory cell" and the "visible state" h, are updated in response to input ar, and previous states - hr-1. For the LSTM unit, sigmoid (Whi+Wa+by) sigmoid (Whe+W+b) sigmoid (Wh1+W*+b) =f1+ tanh (W+Wa+b) tank (c) where symbol Ⓒ stands for element-wise multiplication. The adjustable parameters in this unit are matrices WWE W W W W W We, as well as the offset parameter vectors by, b,, b, and b. By changing these parameters, we change how the unit evolves as a function of inputs ar To keep things simple, in this problem we assume that C, and h are all scalars. Concretely, suppose that the parameters are given by =0 by-100 =-100 100 b 100 =50 We Calculate the values at each time-step and enter them below as an array [ho, ha, ha, ha, ha.hs 0 100 b = 0, We run this unit with initial conditions = 0 and e-= 0, and in response to the following input sequence: 10, 0, 1, 1, 1, 0] (For example, ₁ = 0, 0, 1, and so on). LSTM states (Please round h, to the closest integer in every time-step. It h±0.5, then round it to 0. For ease of calculation, assume that sigmoid (r) 1 and tanh (r) 1 for > 1, and sigmoid (z) = 0 and Submit You have used 0 of 5 attempts Save LSTM states 2 Now, we run the same model again with the same parameters and same initial conditions as in the previous question. The only difference is that our input sequence in now: [1, 1, 0, 1, 11. Calculate the values he at each time-step and enter them below as an array [hos hy, ha, ha, ha (Please round h, to the closest integer in every time-step. If hy±0.5, then round it to 0. For ease of calculation, assume that sigmoid (a) 1 and tanh (2) 1 for a 1, and sigmoid (r) = 0) and tanh(2)=−1 for r<-1)

Database System Concepts
7th Edition
ISBN:9780078022159
Author:Abraham Silberschatz Professor, Henry F. Korth, S. Sudarshan
Publisher:Abraham Silberschatz Professor, Henry F. Korth, S. Sudarshan
Chapter1: Introduction
Section: Chapter Questions
Problem 1PE
icon
Related questions
Question

2. LSTM

Bookmark this page

The diagram below shows a single LSTM unit that consists of Input, Output, and Forget gates

The behavior of such a unit as a recurrent neural network is specified by a set of update equations. These equations define how the gates, "memory cell" and the "visible state" h, are updated in response to input ar, and previous states - hr-1. For the LSTM unit,

sigmoid (Whi+Wa+by)

sigmoid (Whe+W+b)

sigmoid (Wh1+W*+b)

=f1+ tanh (W+Wa+b)

tank (c)

where symbol Ⓒ stands for element-wise multiplication. The adjustable parameters in this unit are matrices WWE

W W W W W We, as well as the offset parameter vectors by, b,, b, and b. By changing these

parameters, we change how the unit evolves as a function of inputs ar To keep things simple, in this problem we assume that C, and h are all scalars. Concretely, suppose that the

parameters are given by

=0

by-100

=-100

100 b

100

=50

We

Calculate the values at each time-step and enter them below as an array [ho, ha, ha, ha, ha.hs

0 100 b = 0,

We run this unit with initial conditions = 0 and e-= 0, and in response to the following input sequence: 10, 0, 1, 1, 1,

0] (For example, ₁ = 0, 0, 1, and so on).

LSTM states

(Please round h, to the closest integer in every time-step. It h±0.5, then round it to 0. For ease of calculation, assume that sigmoid (r) 1 and tanh (r) 1 for > 1, and sigmoid (z) = 0 and

Submit

You have used 0 of 5 attempts

Save

LSTM states 2

Now, we run the same model again with the same parameters and same initial conditions as in the previous question. The only difference is that our input sequence in now: [1, 1, 0, 1, 11.

Calculate the values he at each time-step and enter them below as an array [hos hy, ha, ha, ha

(Please round h, to the closest integer in every time-step. If hy±0.5, then round it to 0. For ease of calculation, assume that sigmoid (a) 1 and tanh (2) 1 for a 1, and sigmoid (r) = 0) and

tanh(2)=−1 for r<-1) 

Expert Solution
trending now

Trending now

This is a popular solution!

steps

Step by step

Solved in 3 steps

Blurred answer
Knowledge Booster
Use of XOR function
Learn more about
Need a deep-dive on the concept behind this application? Look no further. Learn more about this topic, computer-science and related others by exploring similar questions and additional content below.
Similar questions
Recommended textbooks for you
Database System Concepts
Database System Concepts
Computer Science
ISBN:
9780078022159
Author:
Abraham Silberschatz Professor, Henry F. Korth, S. Sudarshan
Publisher:
McGraw-Hill Education
Starting Out with Python (4th Edition)
Starting Out with Python (4th Edition)
Computer Science
ISBN:
9780134444321
Author:
Tony Gaddis
Publisher:
PEARSON
Digital Fundamentals (11th Edition)
Digital Fundamentals (11th Edition)
Computer Science
ISBN:
9780132737968
Author:
Thomas L. Floyd
Publisher:
PEARSON
C How to Program (8th Edition)
C How to Program (8th Edition)
Computer Science
ISBN:
9780133976892
Author:
Paul J. Deitel, Harvey Deitel
Publisher:
PEARSON
Database Systems: Design, Implementation, & Manag…
Database Systems: Design, Implementation, & Manag…
Computer Science
ISBN:
9781337627900
Author:
Carlos Coronel, Steven Morris
Publisher:
Cengage Learning
Programmable Logic Controllers
Programmable Logic Controllers
Computer Science
ISBN:
9780073373843
Author:
Frank D. Petruzella
Publisher:
McGraw-Hill Education