Epoch
Reward reviver
Reward reviver function that returns modified reward value
Reset environment.
Sample an action.
Agent
Sampled action
Set new state.
Returns current state.
Current state
Do action and returns new state.
Actions to be performed by the agent
Agent
state, reward, done
Do actioin without changing environment and returns new state.
state, reward, done
Empty environment