- Massachussetts
-
07:56
(UTC -05:00) - awestover.github.io
Pinned Loading
-
filtering-for-misalignment
filtering-for-misalignment PublicWhat training data should developers filter to reduce risk from misaligned AI? I propose that AI labs filter information about safety measures and strategies for subverting them. This repository he…
Python 4
-
how-bad-can-ai-be
how-bad-can-ai-be PublicDiffuse control project: how powerful of an affordance is training in diffuse control?
Python
-
misalignment-by-default
misalignment-by-default PublicModel organisms research: do AI goals drift due to "catastrophic forgetting"? Does alignment drift?
-
DQN-maze-solver
DQN-maze-solver PublicInvestigating whether or not RL agents can acausally collaborate with other instances of themselves.
Python 1
-
transformer-shortest-paths
transformer-shortest-paths PublicExperimentally evaluating transformer's generalization on a synthetic task
HTML 1
If the problem persists, check the GitHub status page or contact support.



