What's Changed
- [misc] feat: update tutorial for opensource version by @PeterSH6 in #4
- [misc] fix: vllm gpu executor issue when world_size is 1 and typo in doc by @PeterSH6 in #9
- [ci] feat: add test files for ray hybrid programming model by @PeterSH6 in #23
- [chore] remove unnecessary updating of
_worker_names
by @kevin85421 in #19 - [misc] feat: add gemma example for small scale debug and fix gradient checkpoint in critic by @PeterSH6 in #27
- [misc] fix issue in hf_weight_loader and fix typo in doc by @PeterSH6 in #30
- [ci] test lint ci and lint tests dir by @PeterSH6 in #28
- [example] fix: fix math circular dependency by @eric-haibin-lin in #31
- [example] fix: make wandb optional dependency. allow extra args in existing scripts by @eric-haibin-lin in #32
- [docs] feat: add related publications by @eric-haibin-lin in #35
- [tokenizer] feat: support tokenizers whose pad_token_id is none by @eric-haibin-lin in #36
- [rollout] feat: support vLLM v0.6.3 and fix hf rollout import issue by @PeterSH6 in #33
- [distro] feat: add docker support by @eric-haibin-lin in #41
- [example] add a split placement tutorial by @PeterSH6 in #43
- [doc] add a new quickstart section by @PeterSH6 in #44
- [BREAKING][core] move single_controller into verl directory by @PeterSH6 in #45
New Contributors
- @eric-haibin-lin made their first contribution in #31
Full Changelog: v0.1rc...v0.1