docs: Add XPO (2405.21046) to Paper Index #5068

behroozazarkhalili · 2026-02-10T21:40:09Z

Summary

Adds the XPO (Exploratory Preference Optimization) paper entry to the paper index under the Online Direct Preference Optimization section.

Relates to #4407

Note on hyperparameters: The XPO paper (arXiv 2405.21046) defines α > 0 (optimism coefficient) and β > 0 (KL regularization) in Algorithm 1 but does not specify numerical values in the paper — the experimental details are not publicly accessible, and the paper authors did not release a standalone codebase. The configuration uses TRL defaults (alpha=1e-5, beta=0.1) and this is clearly noted in the entry.

- Add Exploratory Preference Optimization entry under Online DPO section - Note that paper defines α > 0 and β > 0 but does not specify numerical values - Config uses TRL defaults (alpha=1e-5, beta=0.1)

HuggingFaceDocBuilderDev · 2026-02-10T21:42:59Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

sergiopaniego

thanks!

docs: Add XPO (2405.21046) to Paper Index

c8b8b4d

- Add Exploratory Preference Optimization entry under Online DPO section - Note that paper defines α > 0 and β > 0 but does not specify numerical values - Config uses TRL defaults (alpha=1e-5, beta=0.1)

Merge branch 'main' into docs/add-paper-2405.21046-xpo

491b58d

sergiopaniego mentioned this pull request Feb 11, 2026

Complete paper index #4407

Open

55 tasks

sergiopaniego approved these changes Feb 11, 2026

View reviewed changes

sergiopaniego merged commit e46005c into main Feb 11, 2026
3 checks passed

sergiopaniego deleted the docs/add-paper-2405.21046-xpo branch February 11, 2026 10:22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs: Add XPO (2405.21046) to Paper Index #5068

docs: Add XPO (2405.21046) to Paper Index #5068

behroozazarkhalili commented Feb 10, 2026

Uh oh!

HuggingFaceDocBuilderDev commented Feb 10, 2026

Uh oh!

sergiopaniego left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

docs: Add XPO (2405.21046) to Paper Index #5068

docs: Add XPO (2405.21046) to Paper Index #5068

Conversation

behroozazarkhalili commented Feb 10, 2026

Summary

Uh oh!

HuggingFaceDocBuilderDev commented Feb 10, 2026

Uh oh!

sergiopaniego left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants