SUMOSIM training

Sumosim is a simulation for sumo robots. Two robots are placed on a circular field. Each of them tries to push the opponent out of the field. Winner is who stays longer in the field.

Sumosim is inspired by Robot-sumo

Use Q-learning to train sumo agents