Q1_TEST_1: First experiments with the stay-in-field controller as opponent

training.parallel.ParallelConfig.q1-test-1

First tests with the 'stay-in-field' opponent

               L0    L1    L2    L3
learning rate  0.8   0.8   0.8   0.8
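
To put the learning rate of 0.8 in context, here is a minimal sketch of how such a value acts in a standard tabular Q-learning update. This log does not state which update rule the experiment uses, so the rule itself is an assumption, as are the discount factor GAMMA, the function name q_update, and the state and action names; none of them come from the q1-test-1 configuration.

```python
# Hypothetical illustration of a standard tabular Q-learning update.
# Only ALPHA = 0.8 is taken from this log; everything else is assumed.
from collections import defaultdict

ALPHA = 0.8  # learning rate listed for L0 to L3
GAMMA = 0.9  # discount factor (assumed; not given in this log)


def q_update(q, state, action, reward, next_state, actions):
    """Apply one Q-learning step in place and return the new Q-value."""
    # Greedy estimate of the value of the next state.
    best_next = max(q[(next_state, a)] for a in actions)
    target = reward + GAMMA * best_next
    # Move the old estimate a fraction ALPHA of the way toward the target.
    q[(state, action)] += ALPHA * (target - q[(state, action)])
    return q[(state, action)]


q = defaultdict(float)           # unseen (state, action) pairs start at 0.0
actions = ["left", "right"]      # placeholder action names
new_value = q_update(q, "s0", "left", 1.0, "s1", actions)
```

With a learning rate this high, each update moves the stored value 80% of the way toward the new target, so recent experience dominates older estimates.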

L0

q-values
videos 0-10

L1

q-values
videos 0-10

L2

q-values
videos 0-10

L3

q-values
videos 0-10