preliminary browser reflection implementation #3138

waleedalzarooni · 2025-09-15T12:54:10Z

Description

This is a preliminary implementation of a reflection mechanism to be included in the hybrid_browser-toolkit

Checklist

Go over all the following points, and put an x in all the boxes that apply.

[ X] I have read the CONTRIBUTION guide (required)
I have linked this PR to an issue using the Development section on the right sidebar or by adding Fixes #issue-number in the PR description (required)
[ X] I have checked if any dependencies need to be added or updated in pyproject.toml and uv lock
[ X] I have updated the tests accordingly (required for a bug fix or a new feature)
[X ] I have updated the documentation if needed:
[X ] I have added examples if this is a new feature

If you are unsure about any of these, don't hesitate to ask. We are here to help!

coderabbitai · 2025-09-15T12:54:18Z

Important

Review skipped

Auto reviews are disabled on this repository.

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

✨ Finishing touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Post copyable unit tests in a comment
Commit unit tests in branch browser-reflection-wrapper

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

waleedalzarooni · 2025-09-15T12:55:50Z

@nitpicker55555, here's my initial implementation. Let me know what you think of my approach, I also included an example file to show how it works, will deal with further refinement (exception handling, etc) in the next commit!

nitpicker55555 · 2025-09-15T12:57:47Z

Thanks for your contribution! Have you test it in wordle game or https://github.com/MinorJerry/WebVoyager/blob/main/data/WebVoyager_data.jsonl top 10 questions?

waleedalzarooni · 2025-09-15T12:59:39Z

Thanks for your contribution! Have you test it in wordle game or https://github.com/MinorJerry/WebVoyager/blob/main/data/WebVoyager_data.jsonl top 10 questions?

will do!

nitpicker55555 · 2025-09-19T14:11:26Z

I think you only want to push these commits?

aa5652f - metadata additions
48147d3 - nonagent planning model / tag for loop prevention
5c7b3f2 - preliminary reflection implementation
please clean other commits

waleedalzarooni · 2025-09-22T10:06:39Z

I think you only want to push these commits?

aa5652f - metadata additions

48147d3 - nonagent planning model / tag for loop prevention

5c7b3f2 - preliminary reflection implementation
please clean other commits

All done!

nitpicker55555 · 2025-09-23T18:51:17Z

examples/toolkits/browser_reflection.py

+# You may obtain a copy of the License at
+#
+#     http://www.apache.org/licenses/LICENSE-2.0
+#


Is this empty file?

Thanks for working on the reflection wrapper feature. I've reviewed the implementation and have some suggestions.

The current approach adds significant complexity by intercepting every browser action and asking an LLM whether to proceed or change the action. This doubles the API calls and introduces unpredictable behavior. Additionally,
clearing the agent's conversation history before each action (self.agent.reset()) removes valuable context that the agent needs to maintain state.

I'd suggest a simpler approach: instead of intercepting execution, we could add optional parameters like thinking and next_goal to the browser action methods. Here's how it could work:

def add_reasoning_params(func): """Add optional reasoning parameters without changing execution flow""" @wraps(func) async def wrapper(self, *args, thinking: Optional[str] = None, next_goal: Optional[str] = None, **kwargs): # Log reasoning if provided if thinking: logger.info(f"[{func.__name__}] Thinking: {thinking}") if next_goal: logger.info(f"[{func.__name__}] Next goal: {next_goal}") # Execute original function without interference return await func(self, *args, **kwargs) # Update docstring to include new parameters if func.__doc__: additional_docs = """ Additional Parameters: thinking (Optional[str]): Your reasoning for this action. next_goal (Optional[str]): What you plan to do after this action. """ wrapper.__doc__ = func.__doc__.rstrip() + "\n" + additional_docs return wrapper

This would allow agents to provide their reasoning when calling the toolkit:

await toolkit.browser_click( ref="submit_button", thinking="Form is complete, submitting to server", next_goal="Wait for confirmation page to load" )

waleedalzarooni · 2025-09-26T12:48:31Z

@nitpicker55555 new implementation uploaded run python WebVoyager_wrapper_ts.py --num-tasks 10 results should be promising!

waleedalzarooni · 2025-10-05T15:50:00Z

@nitpicker55555 Latest experiment setup, results were 4 incorrect out of 50 for reflection mechanism, 5 incorrect for non-reflective

waleedalzarooni requested a review from nitpicker55555 September 15, 2025 12:54

nitpicker55555 marked this pull request as draft September 15, 2025 12:59

nitpicker55555 assigned waleedalzarooni Sep 15, 2025

waleedalzarooni force-pushed the browser-reflection-wrapper branch from aa5652f to 0ba17cb Compare September 22, 2025 10:05

nitpicker55555 reviewed Sep 23, 2025

View reviewed changes

ts reflection implementation promising results

f8af6ab

waleedalzarooni force-pushed the browser-reflection-wrapper branch from 0ba17cb to f8af6ab Compare September 26, 2025 12:47

waleedalzarooni marked this pull request as ready for review October 1, 2025 14:44

setup for reflection experiments 3/10

e16d6da

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

preliminary browser reflection implementation #3138

preliminary browser reflection implementation #3138

Uh oh!

waleedalzarooni commented Sep 15, 2025

Uh oh!

coderabbitai bot commented Sep 15, 2025 •

edited

Loading

Review skipped

Uh oh!

waleedalzarooni commented Sep 15, 2025

Uh oh!

nitpicker55555 commented Sep 15, 2025

Uh oh!

waleedalzarooni commented Sep 15, 2025

Uh oh!

nitpicker55555 commented Sep 19, 2025

Uh oh!

waleedalzarooni commented Sep 22, 2025

Uh oh!

nitpicker55555 Sep 23, 2025

Uh oh!

nitpicker55555 Sep 23, 2025

Uh oh!

waleedalzarooni commented Sep 26, 2025

Uh oh!

waleedalzarooni commented Oct 5, 2025

Uh oh!

Uh oh!

preliminary browser reflection implementation #3138

Are you sure you want to change the base?

preliminary browser reflection implementation #3138

Uh oh!

Conversation

waleedalzarooni commented Sep 15, 2025

Description

Checklist

Uh oh!

coderabbitai bot commented Sep 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Review skipped

Uh oh!

waleedalzarooni commented Sep 15, 2025

Uh oh!

nitpicker55555 commented Sep 15, 2025

Uh oh!

waleedalzarooni commented Sep 15, 2025

Uh oh!

nitpicker55555 commented Sep 19, 2025

Uh oh!

waleedalzarooni commented Sep 22, 2025

Uh oh!

nitpicker55555 Sep 23, 2025

Choose a reason for hiding this comment

Uh oh!

nitpicker55555 Sep 23, 2025

Choose a reason for hiding this comment

Uh oh!

waleedalzarooni commented Sep 26, 2025

Uh oh!

waleedalzarooni commented Oct 5, 2025

Uh oh!

Uh oh!

coderabbitai bot commented Sep 15, 2025 •

edited

Loading