| Class | Description |
|---|---|
| QLearning | The update equation for TD Q-learning is $Q(s,a) \leftarrow Q(s,a) + \alpha \left( r(s) + \gamma \max_{a'} Q(s',a') - Q(s,a) \right)$, where $\alpha$ is the learning rate and $\gamma$ is the discount factor. The update is applied whenever action $a$ is executed in state $s$, leading to state $s'$. |
| Simple2dRLearn | Performs simple reinforcement learning over multiple state dimensions. |
| SimpleRLearn | Performs simple reinforcement learning. |
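
The TD Q-learning update described for `QLearning` can be sketched as a small table-based learner. This is an illustrative sketch only, not the actual `QLearning` class: the class name `QLearningSketch`, its constructor parameters, and the methods `update`, `maxQ`, and `get` are all hypothetical, and states and actions are assumed to be small integer indices. The reward is passed in by the caller, standing in for `r(s)`.

```java
/**
 * Hypothetical sketch of a tabular TD Q-learning update.
 * Not the API of the QLearning class listed above.
 */
public class QLearningSketch {
    private final double[][] q;   // q[state][action], initialised to 0
    private final double alpha;   // learning rate
    private final double gamma;   // discount factor

    public QLearningSketch(int numStates, int numActions, double alpha, double gamma) {
        this.q = new double[numStates][numActions];
        this.alpha = alpha;
        this.gamma = gamma;
    }

    /** Largest Q-value over all actions available in the given state. */
    public double maxQ(int state) {
        double best = q[state][0];
        for (int a = 1; a < q[state].length; a++) {
            best = Math.max(best, q[state][a]);
        }
        return best;
    }

    /** Apply one update after taking action a in state s and reaching sNext. */
    public void update(int s, int a, double reward, int sNext) {
        double tdTarget = reward + gamma * maxQ(sNext);
        q[s][a] += alpha * (tdTarget - q[s][a]);
    }

    public double get(int s, int a) {
        return q[s][a];
    }

    public static void main(String[] args) {
        QLearningSketch learner = new QLearningSketch(2, 2, 0.5, 0.9);
        // All Q-values start at 0, so the target is just the reward.
        learner.update(0, 1, 1.0, 1);
        System.out.println(learner.get(0, 1)); // prints 0.5
    }
}
```

With every Q-value starting at zero, the first update moves `Q(0,1)` halfway toward the reward because `alpha` is 0.5. Later updates also pull in the discounted best value of the successor state through `maxQ`.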