[CGPO] Mixture of judges #2159

gaetanlop · 2024-10-03T03:41:26Z

What does this PR do?

This PR adds the Mixture of judges part of the CGPO paper (https://arxiv.org/pdf/2409.20370). The judges are described in section 4.1.4 and the mixture of judges simply labels a generation as “violated” (0) if it fails any one of the constraint judgments and “satisfied” (1) otherwise.

Related to #2156

Before submitting

Did you read the contributor guideline,
Pull Request section?
Was this discussed/approved via a GitHub issue? Please add a link
to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines.
Did you write any new necessary tests?

Who can review?

@kashif @lewtun

gaetanlop · 2024-10-03T04:50:40Z

~~Still a draft PR~~ Ready

trl/trainer/judges.py

qgallouedec · 2024-10-04T13:34:03Z

Thanks a lot @gaetanlop. Added some suggestion and open questions

qgallouedec · 2024-10-04T13:34:58Z

Also, please make sure to run the pre-commits (make precommit)

Co-authored-by: Quentin Gallouédec <[email protected]>

gaetanlop · 2024-10-07T02:37:04Z

Hey @qgallouedec, thanks for the review. I have added a gold_answers parameter to the judge function for both the base and MoJs class. Also, I have added a safety_judge and factuality_judge as described in the CGPO paper. They also have rule based judges, but in my opinion they are a little bit too tailored to specific tasks (coding/maths) to be added to the trl library. If you think the factuality and safety judges are also too specific to be in the lib I can remove them from the PR.

For the naming of the judges, let's do BaseBinaryJudge for the base class and AllTrueJudge for the moj following your suggestions?

trl/trainer/judges.py

qgallouedec · 2024-10-10T09:15:19Z

Hey @qgallouedec, thanks for the review. I have added a gold_answers parameter to the judge function for both the base and MoJs class. Also, I have added a safety_judge and factuality_judge as described in the CGPO paper. They also have rule based judges, but in my opinion they are a little bit too tailored to specific tasks (coding/maths) to be added to the trl library. If you think the factuality and safety judges are also too specific to be in the lib I can remove them from the PR.

Thanks a lot! Nice work!

For the naming of the judges, let's do BaseBinaryJudge for the base class and AllTrueJudge for the moj following your suggestions?

LGTM.

WDYT of having generic classes in trl.judges (AllTrueJudge, BinaryJudge etc.) and subclass them in trl.trainer.cgpo_trainer to get SafetyConstraintJudge and FacultyConstraintJudge? If in the future we need these classes elsewhere, we can still move them in trl.judges

gaetanlop · 2024-10-11T01:56:42Z

Ok, I have removed the FacultyConstraintJudge and the SafetyConstraintJudge from the PR and made the required renamings. Thanks @qgallouedec for the feedback.

trl/trainer/judges.py

qgallouedec · 2024-10-24T18:17:24Z

trl/trainer/judges.py

+
+    @abstractmethod
+    def judge(
+        self, prompts: List[str], completions: List[str], gold_answers: List[str] = None, shuffle_order: bool = True


maybe "gold_completion" or even "ref_completion" would be more suited, what do you think?

"gold_completions" seems good to me. I have done the update.

Co-authored-by: Quentin Gallouédec <[email protected]>

trl/trainer/judges.py

qgallouedec · 2024-10-25T15:26:43Z

trl/trainer/judges.py

+        completions: List[str],
+        gold_completions: Optional[List[str]] = None,
+        shuffle_order: bool = True,
+    ) -> List[bool]:


This should return a list of int to be consistent with the super class

trl/trainer/judges.py

Co-authored-by: Quentin Gallouédec <[email protected]>

gaetanlop added 6 commits October 2, 2024 23:05

base judge

013aae4

adding mixture of judges

0ea5a48

update doc

517cfb0

update doc

9e5ed12

formatting

3406e53

fix small typo in doc

568d2b9

gaetanlop mentioned this pull request Oct 3, 2024

[CGPO] Add support for Constrained Generative Policy Optimization #2156

Open

3 tasks

gaetanlop marked this pull request as draft October 3, 2024 04:49

fix randomcontraintjudge

466292e

gaetanlop marked this pull request as ready for review October 4, 2024 01:01

Merge branch 'main' into cgpo_mixture_of_judges

a3d90df

qgallouedec reviewed Oct 4, 2024

View reviewed changes

trl/trainer/judges.py Outdated Show resolved Hide resolved

qgallouedec reviewed Oct 4, 2024

View reviewed changes

trl/trainer/judges.py Outdated Show resolved Hide resolved

qgallouedec reviewed Oct 4, 2024

View reviewed changes

trl/trainer/judges.py Outdated Show resolved Hide resolved

qgallouedec reviewed Oct 4, 2024

View reviewed changes

trl/trainer/judges.py Outdated Show resolved Hide resolved

qgallouedec reviewed Oct 4, 2024

View reviewed changes

trl/trainer/judges.py Outdated Show resolved Hide resolved

qgallouedec added the 👨‍⚖️ judge Related to judges label Oct 4, 2024

gaetanlop and others added 5 commits October 4, 2024 16:54

replace arxiv by hf papers

3f0b8b0

Co-authored-by: Quentin Gallouédec <[email protected]>

formatting

8995ab4

Co-authored-by: Quentin Gallouédec <[email protected]>

Merge branch 'main' into cgpo_mixture_of_judges

896259e

fix naming in __init__

ef1feb0

run precommi

3da4a06

gaetanlop mentioned this pull request Oct 6, 2024

[CGPO] CGPO Trainer (single task single objective) #2190

Draft

10 tasks

gaetanlop added 2 commits October 6, 2024 20:51

adding gold answers to judges

765768b

cgpo llm judges

8aaaaa1

fix init

cfc84ed

gaetanlop and others added 5 commits October 6, 2024 22:43

Merge branch 'main' into cgpo_mixture_of_judges

a1e8eeb

output type

6898285

adjust booleans in test

f5639a1

adapt moj doc

289b855

Merge branch 'main' into cgpo_mixture_of_judges

308e743

qgallouedec reviewed Oct 10, 2024

View reviewed changes

trl/trainer/judges.py Outdated Show resolved Hide resolved

gaetanlop and others added 4 commits October 10, 2024 21:28

Merge branch 'main' into cgpo_mixture_of_judges

2c6de87

renaming and removing factuality and safety judges

dedc859

fix typo in import

ba0fffb

fix small typo in naming

226de82

gaetanlop and others added 3 commits October 14, 2024 18:16

Merge branch 'main' into cgpo_mixture_of_judges

5626cd4

formatting

567b798

Merge branch 'main' into cgpo_mixture_of_judges

1c33494

qgallouedec reviewed Oct 24, 2024

View reviewed changes

trl/trainer/judges.py Outdated Show resolved Hide resolved

qgallouedec reviewed Oct 24, 2024

View reviewed changes

gaetanlop and others added 5 commits October 24, 2024 19:51

Merge branch 'main' into cgpo_mixture_of_judges

64c9de8

Update trl/trainer/judges.py

559cd1b

Co-authored-by: Quentin Gallouédec <[email protected]>

update parameter name

2c29ef5

update tests

bd1bed8

update doc

21e3ccd

qgallouedec reviewed Oct 25, 2024

View reviewed changes

trl/trainer/judges.py Outdated Show resolved Hide resolved

qgallouedec reviewed Oct 25, 2024

View reviewed changes

trl/trainer/judges.py Outdated Show resolved Hide resolved

gaetanlop and others added 5 commits October 28, 2024 21:03

Merge branch 'main' into cgpo_mixture_of_judges

43d6cca

Update trl/trainer/judges.py

9eca0f8

Co-authored-by: Quentin Gallouédec <[email protected]>

Update doc

d5b32f0

Co-authored-by: Quentin Gallouédec <[email protected]>

fix alltruejudge type

ac88c63

Merge branch 'main' into cgpo_mixture_of_judges

999154b

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[CGPO] Mixture of judges #2159

[CGPO] Mixture of judges #2159

gaetanlop commented Oct 3, 2024

gaetanlop commented Oct 3, 2024 •

edited

Loading

qgallouedec commented Oct 4, 2024

qgallouedec commented Oct 4, 2024

gaetanlop commented Oct 7, 2024

qgallouedec commented Oct 10, 2024

gaetanlop commented Oct 11, 2024

qgallouedec Oct 24, 2024

gaetanlop Oct 25, 2024

qgallouedec Oct 25, 2024

gaetanlop Oct 29, 2024

[CGPO] Mixture of judges #2159

Are you sure you want to change the base?

[CGPO] Mixture of judges #2159

Conversation

gaetanlop commented Oct 3, 2024

What does this PR do?

Before submitting

Who can review?

gaetanlop commented Oct 3, 2024 • edited Loading

qgallouedec commented Oct 4, 2024

qgallouedec commented Oct 4, 2024

gaetanlop commented Oct 7, 2024

qgallouedec commented Oct 10, 2024

gaetanlop commented Oct 11, 2024

qgallouedec Oct 24, 2024

Choose a reason for hiding this comment

gaetanlop Oct 25, 2024

Choose a reason for hiding this comment

qgallouedec Oct 25, 2024

Choose a reason for hiding this comment

gaetanlop Oct 29, 2024

Choose a reason for hiding this comment

gaetanlop commented Oct 3, 2024 •

edited

Loading