Actions: huggingface/trl

Build documentation

735 workflow runs

Standardize pushing to Hub in examples (#2126)
Build documentation #851: Commit 32d9d34 pushed by qgallouedec
September 26, 2024 08:00 3m 5s main
Remove max_length from RewardDataCollatorWithPadding (#2119)
Build documentation #850: Commit fb1b48f pushed by qgallouedec
September 26, 2024 07:59 3m 8s main
Update example_overview.md (#2125)
Build documentation #849: Commit b5e4bc5 pushed by lewtun
September 25, 2024 18:46 3m 26s main
Generalizes VSFT script to support REDACTED (#2120)
Build documentation #848: Commit 7a24565 pushed by kashif
September 25, 2024 17:54 3m 47s main
BCOTrainer conversational dataset support (#2107)
Build documentation #847: Commit 44a06fc pushed by qgallouedec
September 24, 2024 16:16 3m 34s main
Version 0.11.0-> 0.11.1
Build documentation #846: Commit 86ad7a7 pushed by qgallouedec
September 24, 2024 15:49 4m 3s v0.11-release
Fix packing test (#2111)
Build documentation #845: Commit a84fc5d pushed by qgallouedec
September 24, 2024 15:12 3m 6s main
[online-dpo] allow parse-args as list of floats (#2108)
Build documentation #844: Commit 80038a5 pushed by kashif
September 24, 2024 14:56 3m 33s main
fix formatting (#2109)
Build documentation #843: Commit cece86b pushed by kashif
September 24, 2024 14:05 3m 26s main
Fix documentation links (#2105)
Build documentation #842: Commit d005980 pushed by qgallouedec
September 24, 2024 13:35 3m 43s main
[RewardTrainer] Tokenize inputs within trainer (#2102)
Build documentation #841: Commit cc23b51 pushed by qgallouedec
September 24, 2024 11:03 3m 1s main
[CLI] trl env for printing system info (#2104)
Build documentation #840: Commit 2cad48d pushed by qgallouedec
September 24, 2024 07:57 3m 2s main
Fix PPO/RLOO examples (#2100)
Build documentation #839: Commit 6859e04 pushed by lewtun
September 23, 2024 09:49 2m 53s main
Clean up README and remove openrlbenchmark dependency (#2085)
Build documentation #838: Commit 92eea1f pushed by lewtun
September 23, 2024 07:21 3m 1s main
KTO: fix logits metric, add logits metric to BCOTrainer too (#2094)
Build documentation #837: Commit 663002f pushed by kashif
September 21, 2024 17:08 3m 37s main
Fix _process_tokens for empty prompts in KTOTrainer (#2093)
Build documentation #836: Commit 44d998b pushed by kashif
September 21, 2024 10:49 3m 26s main
fix: device could be in meta, transformers#33154 (#2089)
Build documentation #835: Commit 9b80f3d pushed by kashif
September 21, 2024 07:11 3m 18s main
Fix typo in orpo example. (#2092)
Build documentation #834: Commit 2038e52 pushed by kashif
September 21, 2024 07:11 3m 34s main
training_args for all TrainingArguments (#2082)
Build documentation #833: Commit 10c2f63 pushed by qgallouedec
September 19, 2024 13:03 3m 53s main
[SFT] fix neftune_noise_alpha in SFTTrainer (#1841)
Build documentation #832: Commit 9fb871f pushed by kashif
September 19, 2024 09:57 3m 10s main
Bump dev version
Build documentation #831: Commit 3cec013 pushed by lewtun
September 19, 2024 08:47 3m 29s main
Release v0.11.0
Build documentation #830: Commit e4935e1 pushed by lewtun
September 19, 2024 07:50 3m 16s v0.11-release
Fix DeepSpeed for PPOv2Trainer.save (#2080)
Build documentation #829: Commit cc80ac6 pushed by lewtun
September 19, 2024 07:46 3m 23s v0.11-release
Fix DeepSpeed for PPOv2Trainer.save (#2080)
Build documentation #828: Commit cc80ac6 pushed by qgallouedec
September 19, 2024 07:30 3m 26s main
Standardize dataset naming (#2081)
Build documentation #827: Commit 4c0c98d pushed by qgallouedec
September 19, 2024 06:59 3m 33s main