Skip to content

Commit

Permalink
Add citation
Browse files Browse the repository at this point in the history
  • Loading branch information
tlc4418 committed Mar 9, 2024
1 parent 911b2f8 commit f8a9ae6
Showing 1 changed file with 6 additions and 1 deletion.
7 changes: 6 additions & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -191,5 +191,10 @@ Here we set the config to already contain the 5 trained reward models under the
## Citation
If you use the models, data, or code in this repo, please consider citing our work.
```
@inproceedings{}
@article{coste2023reward,
title={Reward model ensembles help mitigate overoptimization},
author={Coste, Thomas and Anwar, Usman and Kirk, Robert and Krueger, David},
journal={arXiv preprint arXiv:2310.02743},
year={2023}
}
```

0 comments on commit f8a9ae6

Please sign in to comment.