3. Suppose we have four training examples under the two-category case, i.e. D* = {(x₁,w₁) |1 ≤ i ≤ 4} where x₁ = (1, 2), x₂ = (2, 2), x3 = (1, 1)¹, x₁ = (2,0) and w₁ = 62 = -1, W3 = w₁ = 1. Furthermore, linear discriminant function g(x) = wx+b is adopted to learn from the training examples and the -5²,Wi. · g(x₁). criterion function to be minimized is set as (w. b) 11

Database System Concepts
7th Edition
ISBN:9780078022159
Author:Abraham Silberschatz Professor, Henry F. Korth, S. Sudarshan
Publisher:Abraham Silberschatz Professor, Henry F. Korth, S. Sudarshan
Chapter1: Introduction
Section: Chapter Questions
Problem 1PE
icon
Related questions
Question
100%

The question is detailled and explained in the picture attached 

**Problem Statement:**

Suppose we have four training examples under the two-category case, i.e., 

\[
D^* = \{(x_i, \omega_i) \mid 1 \leq i \leq 4\}
\]

where 

\[
x_1 = (1, 2)^t, \quad x_2 = (2, 2)^t, \quad x_3 = (1, 1)^t, \quad x_4 = (2, 0)^t
\]

and 

\[
\omega_1 = \omega_2 = -1, \quad \omega_3 = \omega_4 = 1.
\]

Furthermore, linear discriminant function 

\[
g(x) = w^t x + b 
\]

is adopted to learn from the training examples and the criterion function to be minimized is set as 

\[
J(w, b) = -\sum_{i=1}^{4} \omega_i \cdot g(x_i).
\]

Given the initial model 

\[
w_0 = (-2, 2)^t \quad \text{and} \quad b_0 = -3.
\]

If gradient descent techniques are utilized to minimize the criterion function, what is the resulting discriminant function after three gradient descent steps with learning rate 

\[
\eta = 0.1 
\]

and threshold 

\[
\epsilon = 10^{-6}?
\]
Transcribed Image Text:**Problem Statement:** Suppose we have four training examples under the two-category case, i.e., \[ D^* = \{(x_i, \omega_i) \mid 1 \leq i \leq 4\} \] where \[ x_1 = (1, 2)^t, \quad x_2 = (2, 2)^t, \quad x_3 = (1, 1)^t, \quad x_4 = (2, 0)^t \] and \[ \omega_1 = \omega_2 = -1, \quad \omega_3 = \omega_4 = 1. \] Furthermore, linear discriminant function \[ g(x) = w^t x + b \] is adopted to learn from the training examples and the criterion function to be minimized is set as \[ J(w, b) = -\sum_{i=1}^{4} \omega_i \cdot g(x_i). \] Given the initial model \[ w_0 = (-2, 2)^t \quad \text{and} \quad b_0 = -3. \] If gradient descent techniques are utilized to minimize the criterion function, what is the resulting discriminant function after three gradient descent steps with learning rate \[ \eta = 0.1 \] and threshold \[ \epsilon = 10^{-6}? \]
Expert Solution
steps

Step by step

Solved in 3 steps with 43 images

Blurred answer
Knowledge Booster
Temporal Difference Learning
Learn more about
Need a deep-dive on the concept behind this application? Look no further. Learn more about this topic, computer-science and related others by exploring similar questions and additional content below.
Recommended textbooks for you
Database System Concepts
Database System Concepts
Computer Science
ISBN:
9780078022159
Author:
Abraham Silberschatz Professor, Henry F. Korth, S. Sudarshan
Publisher:
McGraw-Hill Education
Starting Out with Python (4th Edition)
Starting Out with Python (4th Edition)
Computer Science
ISBN:
9780134444321
Author:
Tony Gaddis
Publisher:
PEARSON
Digital Fundamentals (11th Edition)
Digital Fundamentals (11th Edition)
Computer Science
ISBN:
9780132737968
Author:
Thomas L. Floyd
Publisher:
PEARSON
C How to Program (8th Edition)
C How to Program (8th Edition)
Computer Science
ISBN:
9780133976892
Author:
Paul J. Deitel, Harvey Deitel
Publisher:
PEARSON
Database Systems: Design, Implementation, & Manag…
Database Systems: Design, Implementation, & Manag…
Computer Science
ISBN:
9781337627900
Author:
Carlos Coronel, Steven Morris
Publisher:
Cengage Learning
Programmable Logic Controllers
Programmable Logic Controllers
Computer Science
ISBN:
9780073373843
Author:
Frank D. Petruzella
Publisher:
McGraw-Hill Education