Skip to content

v1.0.6

Compare
Choose a tag to compare
@tomwardio tomwardio released this 11 May 21:34
· 87 commits to master since this release
  • tensor_spec.bounds() no longer broadcasts scalar bounds.
  • Fixed bug where reward and discount were inadvertently included in the
    observations when using dm_env_adaptor, without explicitly requesting
    these as observations.