Approach - SanderNugteren/data GitHub Wiki
This is a brief outline of the approach. Please feel free to comment and/or add stuff.
State space:
- Which team possesses which control points
- Do I have ammo? (maybe we should rephrase this to 'How much ammo do I have?'
- How many foes do I observe?
Possible actions/subgoals:
- Get ammo
- Defend control point (which is already in our possession)
- Capture a certain control point
Now we try to do Q-learning on these state-action pairs.