You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Can machine learning help us make better decisions about a changing planet?
23
26
In this paper, we illustrate and discuss the potential of a promising
@@ -55,6 +58,18 @@ editor_options:
55
58
---
56
59
57
60
61
+
a short running title: **Deep Reinforcement Learning**
62
+
63
+
the type of article: **Method**
64
+
65
+
the number of words: **4,814**
66
+
67
+
the number of references: **62**
68
+
69
+
the number of figures, tables, and text boxes: **6** (five figures, 1 table)
70
+
71
+
\pagebreak
72
+
58
73
# Introduction
59
74
60
75
<!--
@@ -88,7 +103,7 @@ Neural-network-based RL techniques, referred to as *deep RL*, have proven partic
88
103
In this paper, we draw on examples from fisheries management and ecological tipping points to illustrate how deep RL techniques can successfully discover optimal solutions to previously solved management scenarios and discover highly effective solutions to unsolved problems.
89
104
We demonstrate that RL-based approaches are capable but by no means a magic bullet: reasonable solutions require careful design of training environments, choice of RL algorithms, tuning and evaluation, as well as substantial computational power.
90
105
Our examples are intentionally simple, aiming to provide a clear template for understanding that could be easily extended to cover more realistic conditions.
91
-
We include an extensive appendix with carefully annotated code which should allow readers to both reproduce and extend this analysis.
106
+
We include an extensive appendices with carefully annotated code which should allow readers to both reproduce and extend this analysis.
92
107
93
108
94
109
# RL overview
@@ -225,12 +240,13 @@ $$
225
240
$$
226
241
227
242
Although there are RL-based methods for infinite horizon problems, i.e. when $H=\infty$, we will only present episodic or finite horizon POMDPs in this study.
228
-
In appendix A, we will discuss in greater detail how deep RL algorithms attempt to optimize the RL objective.
243
+
In Appendix A, we will discuss in greater detail how deep RL algorithms attempt to optimize the RL objective.
229
244
230
245
231
246
# Examples
232
247
233
248
We provide two examples that illustrate the application and potential of deep RL to ecological and conservation problems, highlighting both the potential and the inherent challenges.
249
+
Annotated code for these examples may be found in Appendix B and at <https://github.com/boettiger-lab/rl-intro>.
0 commit comments