v1.3.0 Bug fixes + Require torch 2.0

@muellerzr released this 17 Jan 15:56

Torch 2.0

It has been ~2 years since torch 2.0 was first released, so Accelerate now requires it as the minimum supported version. transformers made the same change in its last release.
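For projects that want to fail early on an older install, a minimal sketch of a version guard follows. The helper names `parse_major_minor` and `meets_minimum` are hypothetical and not part of Accelerate; the sketch only assumes that the version string starts with `major.minor`, as `torch.__version__` does.

```python
import re
from typing import Tuple


def parse_major_minor(version: str) -> Tuple[int, int]:
    """Extract (major, minor) from a version string such as '2.1.0+cu118'."""
    match = re.match(r"(\d+)\.(\d+)", version)
    if match is None:
        raise ValueError(f"Unrecognized version string: {version!r}")
    return int(match.group(1)), int(match.group(2))


def meets_minimum(version: str, minimum: Tuple[int, int] = (2, 0)) -> bool:
    """Return True when `version` is at least `minimum` (hypothetical helper)."""
    return parse_major_minor(version) >= minimum
```

In a real environment this could be called as `meets_minimum(torch.__version__)` before importing Accelerate.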

Core

  • [docs] no hard-coding cuda by @faaany in #3270
  • fix load_state_dict for npu by @ji-huazhong in #3211
  • Add keep_torch_compile param to unwrap_model and extract_model_from_parallel for distributed compiled model. by @ggoggam in #3282
  • [tests] make cuda-only test case device-agnostic by @faaany in #3340
  • latest bnb no longer has optim_args attribute on optimizer by @winglian in #3311
  • add torchdata version check to avoid "in_order" error by @faaany in #3344
  • [docs] fix typo, change "backoff_filter" to "backoff_factor" by @suchot in #3296
  • dataloader: check that in_order is in kwargs before trying to drop it by @dvrogozh in #3346
  • feat(tpu): remove nprocs from xla.spawn by @tengomucho in #3324
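To illustrate what a `keep_torch_compile` flag means when unwrapping a compiled model that also sits inside a distributed wrapper, here is a toy sketch. It is not Accelerate's implementation: `FakeDistributedWrapper`, `FakeCompiledModule`, and `unwrap` are hypothetical stand-ins, and the only assumptions borrowed from PyTorch are that distributed wrappers expose the inner model as `.module` and that a compiled module exposes the original as `._orig_mod`.

```python
class FakeDistributedWrapper:
    """Hypothetical stand-in for a wrapper such as DistributedDataParallel."""

    def __init__(self, module):
        self.module = module


class FakeCompiledModule:
    """Hypothetical stand-in for the object returned by torch.compile."""

    def __init__(self, orig_mod):
        self._orig_mod = orig_mod


def unwrap(model, *, keep_torch_compile):
    """Strip distributed wrappers; optionally keep the compiled shell."""
    compiled = model if isinstance(model, FakeCompiledModule) else None
    if compiled is not None:
        # Look inside the compiled shell first.
        model = compiled._orig_mod
    while isinstance(model, FakeDistributedWrapper):
        # Peel off every distributed layer.
        model = model.module
    if keep_torch_compile and compiled is not None:
        # Re-attach the bare model to the compiled shell and return the shell.
        compiled._orig_mod = model
        return compiled
    return model
```

With Accelerate itself, the corresponding call would presumably look like `accelerator.unwrap_model(model, keep_torch_compile=False)` when the bare module is wanted, for example before saving weights.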

Big Modeling

Examples

  • Give example on how to handle gradient accumulation with cross-entropy by @ylacombe in #3193
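The issue behind this kind of example can be shown with plain arithmetic. When micro-batches contain different numbers of tokens, averaging each micro-batch's mean cross-entropy does not equal the mean over all tokens. The sketch below uses made-up per-token loss values and hypothetical function names; it is not the code from the linked example.

```python
from typing import List


def naive_accumulated_loss(micro_batches: List[List[float]]) -> float:
    """Average of per-micro-batch means (biased when batch sizes differ)."""
    per_batch_means = [sum(batch) / len(batch) for batch in micro_batches]
    return sum(per_batch_means) / len(per_batch_means)


def token_normalized_loss(micro_batches: List[List[float]]) -> float:
    """Sum every token loss, scaled by the total token count across all steps."""
    total_tokens = sum(len(batch) for batch in micro_batches)
    return sum(sum(batch) / total_tokens for batch in micro_batches)


# Made-up per-token losses: 2 tokens in the first step, 6 in the second.
steps = [[1.0, 1.0], [4.0] * 6]
print(naive_accumulated_loss(steps))  # 2.5
print(token_normalized_loss(steps))   # 3.25
```

The second function matches what a single large batch would report, which is why the total token count has to be known before the per-step losses are scaled.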

Full Changelog

Full Changelog: v1.2.1...v1.3.0