readme

md

School

Georgia Institute Of Technology *

*We aren’t endorsed by this school

Course

7642

Subject

Electrical Engineering

Date

Feb 20, 2024

Type

md

Pages

1

Uploaded by jellogecko

Report
# Project 1 - Random Walk The purpose of this report is to replicate and analyze the experiments conducted by Richard S. Sutton in the paper titled 'Learning to Predict by the Methods of Temporal Differences' (1988) The paper introduces a class of learning methods for prediction problems named temporal-difference (TD) methods. ### Required libraries Python Matlplotlib Numpy ### To run the project Create a python environment and make sure to install Numpy, and Matplotlib. ### For generating figure 3 Run this command from the terminal python Sutton88.py generate_fig3 ### For generating figure 4 Run this command from the terminal python Sutton88.py generate_fig4 ### For generating figure 5 Run this command from the terminal python Sutton88.py generate_fig5 ### For generating figure 6 This figure shows the weight updates for each training set for TD(0) Run this command from the terminal python Sutton88.py generate_fig6 ### For generating figure 7 This figure shows the weight updates for each training set for TD(1) Run this command from the terminal python Sutton88.py generate_fig7 ### For generating figure 8 This figure shows the weight updates for each training set for TD(0.8) Run this command from the terminal python Sutton88.py generate_fig8
Discover more documents: Sign up today!
Unlock a world of knowledge! Explore tailored content for a richer learning experience. Here's what you'll get:
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help