You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Fixes the following issues:
* Missing component reference in BananaRL environment.
* Neural Network for multiple visual observations was not properly generated.
* Episode time-out value estimate bootstrapping used incorrect observation as input.
Copy file name to clipboardExpand all lines: docs/Learning-Environment-Examples.md
+5-5Lines changed: 5 additions & 5 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -154,15 +154,15 @@ If you would like to contribute environments, please see our
154
154

155
155
156
156
* Set-up: A multi-agent environment where agents compete to collect bananas.
157
-
* Goal: The agents must learn to move to as many yellow bananas as possible while avoiding red bananas.
158
-
* Agents: The environment contains 10 agents linked to a single brain.
157
+
* Goal: The agents must learn to move to as many yellow bananas as possible while avoiding blue bananas.
158
+
* Agents: The environment contains 5 agents linked to a single brain.
159
159
* Agent Reward Function (independent):
160
160
* +1 for interaction with yellow banana
161
-
* -1 for interaction with red banana.
161
+
* -1 for interaction with blue banana.
162
162
* Brains: One brain with the following observation/action space.
163
-
* Vector Observation space: (Continuous) 51 corresponding to velocity of agent, plus ray-based perception of objects around agent's forward direction.
163
+
* Vector Observation space: (Continuous) 53 corresponding to velocity of agent (2), whether agent is frozen and/or shot its laser (2), plus ray-based perception of objects around agent's forward direction (49; 7 raycast angles with 7 measurements for each).
164
164
* Vector Action space: (Continuous) Size of 3, corresponding to forward movement, y-axis rotation, and whether to use laser to disable other agents.
165
-
* Visual Observations (Optional): First-person view for each agent.
165
+
* Visual Observations (Optional; None by default): First-person view for each agent.
0 commit comments