DEPARTMENT OF COMPUTING

Course Home | Syllabus | Assignments | Schedule | Notes | Downloads | [print]

CS 4320: Machine Learning

Assignment: Temporal Difference Q-Function (Reinforcement Learning)

Train a reinforcement agent to perform in the MountainCar-v0 environment.

It is expected that you will use the CartPole example code as a starting point for your code development.

Train a Q-Function agent to obtain the highest score on 20 scoring epochs.

Consider hyper parameters such as gamma, epsilon-chance-factor, learning policies, neural network architecture, modified training rewards, serial training sessions, and any other potentially useful modifications to the Q-Function or training process.

Create a report that includes:

Required Steps

Last Updated 03/20/2023