[Docs] Added documentation for benchmarks #3340

Tanuj-Taneja1 · 2025-10-29T16:09:29Z

Description

Fixes #3278

Checklist

Go over all the following points, and put an x in all the boxes that apply.

I have read the CONTRIBUTION guide (required)
I have linked this PR to an issue using the Development section on the right sidebar or by adding Fixes #issue-number in the PR description (required)
I have checked if any dependencies need to be added or updated in pyproject.toml and uv lock
I have updated the tests accordingly (required for a bug fix or a new feature)
I have updated the documentation if needed:
I have added examples if this is a new feature

If you are unsure about any of these, don't hesitate to ask. We are here to help!

coderabbitai · 2025-10-29T16:10:07Z

Important

Review skipped

Auto reviews are disabled on this repository.

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

✨ Finishing touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Post copyable unit tests in a comment
Commit unit tests in branch docs/benchmark

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

fengju0213 · 2025-10-30T06:24:53Z

@Tanuj-Taneja1 thanks for this pr！

fengju0213

@Tanuj-Taneja1 thanks for your contribution！

Tanuj-Taneja1 · 2025-11-01T16:44:27Z

Hi @fengju0213
Just a question do we need to run all the test for a docs pr. I think although github actions is free for open source projects we can still reduce the api usage.
Not sure how much a single set of test cost but if it is a considerable amount on scale we can maybe skip more tests that are not related to docs

Wendong-Fan · 2025-11-02T21:45:22Z

Hi @fengju0213 Just a question do we need to run all the test for a docs pr. I think although github actions is free for open source projects we can still reduce the api usage. Not sure how much a single set of test cost but if it is a considerable amount on scale we can maybe skip more tests that are not related to docs

thanks @Tanuj-Taneja1 , it's really a great suggestion, please feel free to open a new issue to isolate tests for changes unrelated to functional code (e.g., doc updates)

Wendong-Fan

i think we also need to add this new doc to index.rst file, also let's use benchmark.md instead of Benchmark.md to align with other docs' naming
cc @fengju0213

Wendong-Fan · 2025-11-02T21:52:25Z

docs/key_modules/Benchmark.md

+3. `run()`: Execute benchmark and populate `self._results`
+4. Optional: Override `train`, `valid`, `test` properties
+
+## References


seems we missing the reference for BrowseComp Benchmark cc @fengju0213

Added documentation for benchmarks

a0906d8

github-actions bot added the Review Required PR need to be reviewed label Oct 29, 2025

Merge branch 'master' into docs/benchmark

3215f42

fengju0213 added 2 commits November 1, 2025 17:26

Merge branch 'master' into docs/benchmark

9e6682d

Merge branch 'master' into docs/benchmark

6273f9e

fengju0213 approved these changes Nov 1, 2025

View reviewed changes

Wendong-Fan assigned Tanuj-Taneja1 Nov 2, 2025

Wendong-Fan added this to the Sprint 41 milestone Nov 2, 2025

Wendong-Fan added this to Project Camel Nov 2, 2025

Wendong-Fan reviewed Nov 2, 2025

View reviewed changes

Wendong-Fan added 2 commits November 3, 2025 05:58

enhance: Added documentation for benchmarks PR3340 (#3357)

0a08881

Merge branch 'master' into docs/benchmark

978e5f0

Wendong-Fan merged commit c8d5374 into master Nov 2, 2025
11 of 12 checks passed

Wendong-Fan deleted the docs/benchmark branch November 2, 2025 21:59

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Docs] Added documentation for benchmarks #3340

[Docs] Added documentation for benchmarks #3340

Tanuj-Taneja1 commented Oct 29, 2025

Uh oh!

coderabbitai bot commented Oct 29, 2025

Review skipped

Uh oh!

fengju0213 commented Oct 30, 2025

Uh oh!

fengju0213 left a comment

Uh oh!

Tanuj-Taneja1 commented Nov 1, 2025

Uh oh!

Wendong-Fan commented Nov 2, 2025

Uh oh!

Wendong-Fan left a comment •

edited

Loading

Uh oh!

Wendong-Fan Nov 2, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[Docs] Added documentation for benchmarks #3340

[Docs] Added documentation for benchmarks #3340

Conversation

Tanuj-Taneja1 commented Oct 29, 2025

Description

Checklist

Uh oh!

coderabbitai bot commented Oct 29, 2025

Review skipped

Uh oh!

fengju0213 commented Oct 30, 2025

Uh oh!

fengju0213 left a comment

Choose a reason for hiding this comment

Uh oh!

Tanuj-Taneja1 commented Nov 1, 2025

Uh oh!

Wendong-Fan commented Nov 2, 2025

Uh oh!

Wendong-Fan left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Wendong-Fan Nov 2, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Wendong-Fan left a comment •

edited

Loading