For this exercise we take a look at n-step methods, which generalize the Monte Carlo and TD learning algorithms. The environment under examination is the inverted pendulum, a popular toy example in control theory.
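As a reminder, the common ingredient of all these methods is the n-step return, which accumulates n discounted rewards before bootstrapping from the current action-value estimate (see Sutton & Barto, Chapter 7):

$$
G_{t:t+n} = R_{t+1} + \gamma R_{t+2} + \dots + \gamma^{n-1} R_{t+n} + \gamma^{n} Q_{t+n-1}(S_{t+n}, A_{t+n})
$$

For $n = 1$ this is the familiar one-step TD (Sarsa) target, while for $n \geq T - t$ no bootstrapping takes place and the full Monte Carlo return is recovered. In particular, the exercise covers the following steps; a short code sketch of each is given after the list.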
- discretization of continuous state spaces to make such systems amenable to tabular RL algorithms
- on-policy epsilon-greedy control using n-step Sarsa
- off-policy control using the n-step tree-backup algorithm (with an epsilon-greedy behavior policy)
- hyperparameter optimization for the Q(σ) algorithm
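First, discretization. A minimal sketch, assuming the pendulum state is the pair (angle, angular velocity); the bin counts, velocity bounds, and action count below are illustrative placeholders, not the exercise's prescribed values:

```python
import numpy as np

N_BINS = (32, 32)  # assumed resolution per state dimension
# inner bin edges; np.digitize then yields indices in 0 .. N_BINS[d] - 1
theta_bins = np.linspace(-np.pi, np.pi, N_BINS[0] + 1)[1:-1]
theta_dot_bins = np.linspace(-8.0, 8.0, N_BINS[1] + 1)[1:-1]

def discretize(state):
    """Map a continuous state to a discrete index tuple for a tabular Q-table."""
    theta, theta_dot = state
    return (np.digitize(theta, theta_bins), np.digitize(theta_dot, theta_dot_bins))

# usage: index a tabular action-value array with the discretized state
q_table = np.zeros(N_BINS + (3,))  # 3 discretized actions, also an assumption
print(q_table[discretize((0.1, -0.5))])  # action values for that state
```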
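Next, a compact sketch of on-policy n-step Sarsa following the episodic formulation in Sutton & Barto, Section 7.2. It assumes the `discretize` helper above and a generic `env` whose `step` returns `(observation, reward, done)`; that interface is an assumption, not the exercise's actual API:

```python
import numpy as np

def n_step_sarsa_episode(env, q, n=4, alpha=0.1, gamma=0.99, eps=0.1):
    """Run one episode and update the tabular q in place (n-step Sarsa)."""
    rng = np.random.default_rng()

    def eps_greedy(s):
        if rng.random() < eps:
            return int(rng.integers(q.shape[-1]))
        return int(np.argmax(q[s]))

    s = discretize(env.reset())                  # assumed reset/step interface
    states, actions, rewards = [s], [eps_greedy(s)], [0.0]
    T, t = np.inf, 0
    while True:
        if t < T:
            obs, r, done = env.step(actions[t])
            states.append(discretize(obs))
            rewards.append(r)
            if done:
                T = t + 1
            else:
                actions.append(eps_greedy(states[t + 1]))
        tau = t - n + 1                          # time whose estimate is updated
        if tau >= 0:
            # n-step return: discounted rewards plus a bootstrapped tail
            G = sum(gamma ** (i - tau - 1) * rewards[i]
                    for i in range(tau + 1, int(min(tau + n, T)) + 1))
            if tau + n < T:
                G += gamma ** n * q[states[tau + n] + (actions[tau + n],)]
            sa = states[tau] + (actions[tau],)
            q[sa] += alpha * (G - q[sa])
        if tau == T - 1:
            break
        t += 1
```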
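The tree-backup return replaces sampled actions by an expectation under the target policy, which is what makes the method off-policy without importance sampling. A sketch of the recursive n-step target (Sutton & Barto, Section 7.5), assuming trajectory lists with the same layout as in the Sarsa sketch and an array `pi` where `pi[s]` holds the target policy's action probabilities; both are assumptions for illustration:

```python
import numpy as np

def tree_backup_target(q, pi, states, actions, rewards, tau, n, T, gamma=0.99):
    """Recursive n-step tree-backup return G_{tau:tau+n}."""
    def g(t, steps_left):
        if t + 1 == T:                           # episode ends: G = R_T
            return rewards[t + 1]
        s1, a1 = states[t + 1], actions[t + 1]
        expected = float(np.dot(pi[s1], q[s1]))  # E_pi[Q(S_{t+1}, .)]
        if steps_left == 1:
            return rewards[t + 1] + gamma * expected
        # replace the leaf of the action actually taken by the deeper return
        expected -= pi[s1][a1] * q[s1][a1]
        return rewards[t + 1] + gamma * (expected
                                         + pi[s1][a1] * g(t + 1, steps_left - 1))
    return g(tau, n)
```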
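Finally, Q(σ) interpolates between Sarsa (σ = 1, full sampling) and tree backup (σ = 0, full expectation), so σ itself is a natural hyperparameter to tune. A hypothetical grid search, assuming a `run_q_sigma` training routine that returns the mean evaluation return; the name and signature are illustrative placeholders:

```python
results = {}
for sigma in (0.0, 0.25, 0.5, 0.75, 1.0):
    # run_q_sigma stands in for the exercise's actual training routine
    results[sigma] = run_q_sigma(env, sigma=sigma, n=4, alpha=0.1, eps=0.1)

best_sigma = max(results, key=results.get)
print(f"best sigma: {best_sigma} (mean return {results[best_sigma]:.1f})")
```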