Static
BLACKStatic
EMPTYStatic
OTHERStatic
OWNStatic
WHITEEpoch
Reward
Reward object
Reset environment.
Sample an action.
Agent
Sampled action
Set new state.
New state
Agent
Returns current state.
Agent
Current state
Do action and returns new state.
Actions to be performed by the agent
Agent
state, reward, done
Do actioin without changing environment and returns new state.
Environment state
Actions to be performed by the agent
Agent
state, reward, done
Reversi environment