This improvement is based on the previous posts, corresponding to the first step: Step 1: Deal with information problem: using only closest neighborhood information.

Code is here

The main improvent is to set the observation field.

Current mean-field assumes all agents' information are available, but now we make that only agents in the observation field can be observable.

Observable Information

  1. if there is a wall
  2. belong to Team 1 or 2
  3. health point
  4. agent ID
  5. last action
  6. relative position

Next Post Previous Post