Environment
Resolution of actions
Network layers
Optimizer of the network
DQN Method
New method name
Returns an action.
Current states
Optional
greedy_rate: number
Greedy rate
Action
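As a rough illustration of how a greedy rate typically enters action selection, here is a minimal epsilon-greedy sketch. It assumes that `greedy_rate` is the probability of choosing a random action instead of the highest-scoring one; the reference above does not confirm that interpretation. The function `selectAction` and its `random` parameter are hypothetical names used only for this example and are not part of the documented agent.

```typescript
// Hypothetical helper, not part of the documented API.
// Assumption: greedyRate is the probability of exploring (picking a random action).
function selectAction(
  scores: number[],
  greedyRate: number,
  random: () => number = Math.random
): number {
  if (scores.length === 0) {
    throw new Error("scores must not be empty");
  }
  // Explore: with probability greedyRate, pick an action index uniformly at random.
  if (random() < greedyRate) {
    const index = Math.floor(random() * scores.length);
    // Clamp in case an injected random source returns exactly 1.
    return Math.min(index, scores.length - 1);
  }
  // Exploit: return the index of the highest score (first one on ties).
  let best = 0;
  for (let i = 1; i < scores.length; i++) {
    if (scores[i] > scores[best]) {
      best = i;
    }
  }
  return best;
}
```

With a greedy rate of 0 the sketch always returns the best-scoring action; with a rate of 1 it always samples at random.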
Returns a score.
Score values
Updates the model.
Action
Current states
Next states
Reward
Whether the episode has ended
Learning rate
Batch size
Loss value
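To make the roles of the update arguments more concrete, the sketch below computes the standard DQN temporal-difference target and squared error for a single transition. It is an illustration of the general technique, not this library's implementation: the discount factor `gamma`, the function names `tdTarget` and `squaredTdError`, and the use of a plain squared error as the loss are all assumptions that do not appear in the reference above.

```typescript
// Hypothetical helpers, not part of the documented API.
// Assumption: gamma is a discount factor; the reference does not list one.
function tdTarget(
  reward: number,
  nextScores: number[],
  done: boolean,
  gamma: number = 0.99
): number {
  // When the episode has ended there is no future value to bootstrap from.
  if (done || nextScores.length === 0) {
    return reward;
  }
  // Otherwise add the discounted best score of the next state.
  return reward + gamma * Math.max(...nextScores);
}

// Squared difference between the target and the score of the action taken.
function squaredTdError(
  scores: number[],
  action: number,
  target: number
): number {
  const diff = target - scores[action];
  return diff * diff;
}
```

In a batched update, a loss of this kind would usually be averaged over the sampled transitions before the optimizer step.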
Deep Q-Network agent