Class | Description |
---|---|
QLearning |
The update equation for TD Q-learning is:
Q(s,a)= Q(s,a) + alpha * (r(s) + gamma * Max(s', all actions) - Q(s,a))
which is calculated whenever action a is executed in state s leading to state s'.
|
Simple2dRLearn |
This class is used to perform a simple reinforcement learning with multiple state dimensions.
|
SimpleRLearn |
This class is used to perform a simple reinforcement learning.
|