Notes 8 27 2021 - opendigital/RL-collective-action GitHub Wiki

New states:

  • Agent only has access to last round
  • Agent has access to whole game

New reward functions:

  • Sum Bush-Mosteller agents' contributions only
  • Proportion of contributions by Bush-Mosteller agents that are > 0.5

Have 5 experiments:

  • 2 reward functions x 2 state
  • 2 baselines: 3 and 4 Bush-Mosteller agents

Results analysis:

  • Compare total contributions of ALL 4 agents (including DQN agent) in experiments to 4-Bush-Mosteller baseline
  • Compare Bush-Mosteller agent contributions in experiments to 3-Bush-Mosteller baseline
  • Use simple t-test to prove that DQN agent optimizes contributions