* In problem 1 and 2, we consider a two-state Markov decision process (S, A, P, R, X) where S =state space, A = {a¹, a²} is an action space, P is a transition probability matrix such thatP(1|1, a¹) = 0.9, P(2|1, a¹) = 0.1,P(1|2, a¹) = 0, P(2|2, a¹) = 1,R is the reward such thatP(1|1, a²) = P(2|1, a²) = 0.5,P(1|2, a²) = 0.2, P(2|2, a²) = 0.8,R(1, a¹) = 1, R(1, a²) = 3, R(2, a¹) = −1, R(2, a²) = 0{1,2} is aand X = [0, 1) is a discounted factor.2. Assume the discounted factor X = 0.9. Find all optimal Markovian deterministic stationary policies.

To find all optimal Markovian deterministic stationary policies, we can use the policy iteration…

Answered: * In problem 1 and 2, we consider a…

A First Course in Probability (10th Edition)

10th Edition

ISBN:9780134753119

Author:Sheldon Ross

Publisher:Sheldon Ross

Chapter1: Combinatorial Analysis

Section: Chapter Questions

Problem 1.1P: a. How many different 7-place license plates are possible if the first 2 places are for letters and...

See similar textbooks

Similar questions

3. Consider an undiscounted Markov decision process with three states 1, 2, 3, with respec- tive rewards -1, -2,0 for each visit to that state. In states 1 and 2, there are two possible actions: a and b. The transitions are as follows: • In state 1, action a moves the agent to state 2 with probability 0.8 and makes the agent stay put with probability 0.2. In state 2, action a moves the agent to state 1 with probability 0.8 and makes the agent stay put with probability 0.2. . In either state 1 or state 2, action b moves the agent to state 3 with probability 0.1 and makes the agent stay put with probability 0.9. Find the optimal policy that minimises the expected total cost and find the corresponding value function.
2. Consider a Markov chain (Xn) with state space S = {1, 2, 3, 4, 5} and transition matrix 0.5 0.5 0 0 0 0.4 0.6 0 0 0 P = 0 0.3 0.3 0.4 0 0 0 0 0.6 0.4 000 0.6 0.4 One stationary distribution for this Markov chain is (4, 5, 0, 0, 0). Suppose the Markov chain (X) is started from the initial distribution λ = (0.1 0 0.7 0.2 0). What are the limiting probabilities limn→∞ P(X = i) for each i € S?
5. consider the example below, where the states are Condition State and the transition matrix is 0 1 23 2 Πο Good as new Operable-minimum deterioration Operable-major deterioration Inoperable and replaced by a good-as-new machine we found that the steady-state probabilities are 2 13' T1= = P = 78314 00 1 0 7 13' HARTNO LELBLINO 1 16 1 1 1 1 16 8 8 0 2 2 = π2 2 13' I3 = 2 13 (a) Find the expected recurrence time for state 0 (i.e., the expected length of time a machine can be used before it must be replaced) by solving a linear system for Moo, M10, 20, and μ30. (b) Find the expected recurrence time for state O directly by the formula Moo 1 πο =
9. |Show that if X,, X2,... is a Markov chain, then it is statistically determined by its |second-order probability masses: P{X, = Xị , X, = x}, for any i, j = 1, 2,..
60. The following is the transition probability matrix of a Markov chain with states 1, 2, 3, 4 P= .4 .2 .25 .2 .3 .2 .1 .2 .2 .4 25 50 .1 .4 .3 If Xo = 1 (a) find the probability that state 3 is entered before state 4; (b) find the mean number of transitions until either state 3 or state 4 is entered.
Can someone please help me with this question. I am having so much trouble.
8. List the Gauss–Markov conditions required for applying a t & F-tests.
7. Player A and player B have a payoff matrix shown below. If A (row player) has an optimal strategy of (424) and B has an optimal strategy of (1/3 1/3 1/3), what is the expected value of the game? P = 2 5 -2 -3 1 2-2 4 3

Recommended textbooks for you

A First Course in Probability (10th Edition)

Probability

ISBN:

9780134753119

Author:

Sheldon Ross

Publisher:

PEARSON

A First Course in Probability

Probability

ISBN:

9780321794772

Author:

Sheldon Ross

Publisher:

PEARSON

A First Course in Probability (10th Edition)

Probability

ISBN:

9780134753119

Author:

Sheldon Ross

Publisher:

PEARSON

A First Course in Probability

Probability

ISBN:

9780321794772

Author:

Sheldon Ross

Publisher:

PEARSON

SEE MORE TEXTBOOKS