We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent 9920715 commit 20e26a6Copy full SHA for 20e26a6
tinker_cookbook/recipes/chat_sl/README.md
@@ -3,7 +3,7 @@
3
## SFT on NoRobots
4
5
```bash
6
-python -m tinker_cookbook.recipes.chat_sl.train
+python -m tinker_cookbook.recipes.chat_sl.train \
7
model_name=Qwen/Qwen3-8B-Base \
8
dataset=no_robots \
9
learning_rate=5e-4 \
@@ -19,7 +19,7 @@ After 140 steps of training, `test/nll` decreases to 1.788.
19
## SFT on Tulu3 dataset
20
21
22
23
24
dataset=tulu3 \
25
0 commit comments