Approach - SanderNugteren/data GitHub Wiki

This is a brief outline of the approach. Please feel free to comment and/or add stuff.

State space:

Which team possesses which control points
Do I have ammo? (maybe we should rephrase this to 'How much ammo do I have?'
How many foes do I observe?

Possible actions/subgoals:

Get ammo
Defend control point (which is already in our possession)
Capture a certain control point

Now we try to do Q-learning on these state-action pairs.