Fix constant error observations appearing in the logs #5352

tofarr · 2024-12-01T21:11:28Z

Now checking that .gitignore exists. before trying to filter based on its contents.

Include this change in the Release Notes. If checked, you must provide an end-user friendly description for your change below

I see the following constantly in the logs:

|21:13:47 - openhands:DEBUG: action_execution_server.py:166 - Running action:
    |FileReadAction(path='.gitignore', start=0, end=-1, thought='', action='read', security_risk=None)
    ...
    |21:13:47 - openhands:DEBUG: action_execution_server.py:168 - Action output:
    |**ErrorObservation**
    |File not found: /workspace/.gitignore. Your current working directory is /workspace.
    ...

The reason is that every time we call /list-files we try to filter based on the contents of .gitignore - so we get an error message every time we try to list files if there is no .gitignore.

This change checks that the .gitignore exists before trying to read it.

To run this PR locally, use the following command:

docker run -it --rm   -p 3000:3000   -v /var/run/docker.sock:/var/run/docker.sock   --add-host host.docker.internal:host-gateway   -e SANDBOX_RUNTIME_CONTAINER_IMAGE=docker.all-hands.dev/all-hands-ai/runtime:6e252fe-nikolaik   --name openhands-app-6e252fe   docker.all-hands.dev/all-hands-ai/openhands:6e252fe

enyst · 2024-12-02T00:19:36Z

For some reason, one of the image builds got stuck for over 4h. I restarted them.

enyst · 2024-12-02T01:13:42Z

openhands/server/routes/files.py

-    async def filter_for_gitignore(file_list, base_path):
-        gitignore_path = os.path.join(base_path, '.gitignore')
+    async def has_gitignore() -> bool:
+        file_list = await call_sync_from_async(runtime.list_files, '')


I wonder if this is a bit heavy, to do twice, for every click. (first at line 68)

Could we avoid it somehow? Or it doesn't matter?

Yeah I agree we shouldn't do a list_files first--we should just try to access the file, and use error handling for FileNotFoundError

If that's causing logspam let's remove the logs upstream

enyst · 2024-12-02T01:16:32Z

openhands/server/routes/files.py

        try:
-            read_action = FileReadAction(gitignore_path)
+            read_action = FileReadAction('.gitignore')
            observation = await call_sync_from_async(runtime.run_action, read_action)


Just a thought, if there is no addition to the stream on this execution path (and it shouldn't be), could we just get/read the ErrorObs and treat it here directly?

With this approach, the logs would still contain:

|**ErrorObservation** |File not found: /workspace/.gitignore. Your current working directory is /workspace

The only ways around this that I can see are:

Live with this oddity.

Do not filter files using .gitignore

Check that the .gitignore exists. (The approach I took)

Update the FileReadAction with an optional flag to permit missing files and not yield an error observation.

Update the runtime to allow access to the FileStore and use this directly.

Correct me if wrong, but isn't there also
6. Do not log stuff in the FileReadAction, but in its client code?

Or that sounds really bad? I don't think they are many... but I could be wrong.

Actually, this one isn't doing it, it's the run_action, and that is really useful...

How about we use runtime.read directly instead of run_action

Looks like it also does run_action? We could write one that doesn't, I guess, unless I'm missing something.

OK, but it's late and sorry, it seems I no longer understand this problem in the first place: weren't these all in the sandbox now, both the results of list_files and .gitignore, so why do we have runtime.list_files return all files?

enyst

Thanks for doing this, the log spam is annoying and everyone will be better off when it's gone.

We have a good discussion in the comments, warts and confusions and all. I think hashing out the potential issues in comments makes a PR better, and I would love to see the final form here.

So I'll put a Request changes just to make sure... that it's not merged too fast. 😅

(I noticed you merge PRs a little too fast quite often, before comments are addressed or even acknowledged. Sorry, but I think we can do this better, because sometimes that can break things and some of us really don't like to break things in main here.)

tofarr · 2024-12-05T19:15:27Z

I'm gonna punt on this one now as there does seem to be concerns about the additional overhead of a file existence check. Perhaps a more thorough refactor will allow a better solution

Fix constant error observations appearing in the logs

2744a09

tofarr marked this pull request as ready for review December 1, 2024 21:20

enyst reviewed Dec 2, 2024

View reviewed changes

tofarr added 2 commits December 2, 2024 07:48

Merge branch 'main' into fix-constant-error-observations

c2691b8

Merge branch 'main' into fix-constant-error-observations

6e252fe

enyst requested changes Dec 4, 2024

View reviewed changes

tofarr closed this Dec 5, 2024

tofarr deleted the fix-constant-error-observations branch December 6, 2024 19:20

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix constant error observations appearing in the logs #5352

Fix constant error observations appearing in the logs #5352

tofarr commented Dec 1, 2024 •

edited by github-actions bot

Loading

enyst commented Dec 2, 2024

enyst Dec 2, 2024

rbren Dec 2, 2024

enyst Dec 2, 2024

tofarr Dec 2, 2024

enyst Dec 2, 2024

enyst Dec 2, 2024 •

edited

Loading

rbren Dec 3, 2024

enyst Dec 4, 2024 •

edited

Loading

enyst left a comment

tofarr commented Dec 5, 2024

Fix constant error observations appearing in the logs #5352

Fix constant error observations appearing in the logs #5352

Conversation

tofarr commented Dec 1, 2024 • edited by github-actions bot Loading

enyst commented Dec 2, 2024

enyst Dec 2, 2024

Choose a reason for hiding this comment

rbren Dec 2, 2024

Choose a reason for hiding this comment

enyst Dec 2, 2024

Choose a reason for hiding this comment

tofarr Dec 2, 2024

Choose a reason for hiding this comment

enyst Dec 2, 2024

Choose a reason for hiding this comment

enyst Dec 2, 2024 • edited Loading

Choose a reason for hiding this comment

rbren Dec 3, 2024

Choose a reason for hiding this comment

enyst Dec 4, 2024 • edited Loading

Choose a reason for hiding this comment

enyst left a comment

Choose a reason for hiding this comment

tofarr commented Dec 5, 2024

tofarr commented Dec 1, 2024 •

edited by github-actions bot

Loading

enyst Dec 2, 2024 •

edited

Loading

enyst Dec 4, 2024 •

edited

Loading