Skip to content

Commit 42ac624

Browse files
authored
Fixing incorrect module reference in twenty_questions README (#62)
1 parent 15bedd0 commit 42ac624

File tree

1 file changed

+1
-1
lines changed
  • tinker_cookbook/recipes/multiplayer_rl/twenty_questions

1 file changed

+1
-1
lines changed

tinker_cookbook/recipes/multiplayer_rl/twenty_questions/README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -61,7 +61,7 @@ Our demo is simple, involving only one LLM other than the policy and a very simp
6161
### Next
6262

6363
In this example, we play against a static language model answerer, which does not update during training.
64-
In recipe.multiturn_rl.text_arena, we will demonstrate an example (tic-tac-toe), which updates the weights of both players in a game.
64+
In recipes.multiplayer_rl.text_arena, we will demonstrate an example (tic-tac-toe), which updates the weights of both players in a game.
6565

6666
[1] Checklists Are Better Than Reward Models For Aligning Language Models
6767
Viswanathan, V., Sun, Y., Ma, S., Kong, X., Cao, M., Neubig, G., & Wu, T. (2025). Checklists are better than reward models for aligning language models. arXiv. https://arxiv.org/abs/2507.18624

0 commit comments

Comments
 (0)