FlameF0X commited on
Commit
c171a3f
·
verified ·
1 Parent(s): 1c160d4

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -0
README.md CHANGED
@@ -27,6 +27,9 @@ It uses PPO (Proximal Policy Optimization) to learn 2v2 gameplay through self-pl
27
  - Ball velocity toward goal
28
  - Goal scoring reward
29
 
 
 
 
30
  ## Training Configuration (from `config.json`)
31
 
32
  - **Number of processes:** 4
 
27
  - Ball velocity toward goal
28
  - Goal scoring reward
29
 
30
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6615494716917dfdc645c44e/1v9m5G8WSuJACQOs0AdDp.png)
31
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6615494716917dfdc645c44e/WTXHjHXw1ZmmMvZEr_DI5.png)
32
+
33
  ## Training Configuration (from `config.json`)
34
 
35
  - **Number of processes:** 4