-
Notifications
You must be signed in to change notification settings - Fork 57
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Tensorboard profiler not working well with data from gcs bucket? #372
Comments
I'm experiencing a similar issue. In my case, tensorboard is running in a k8s pod for profiling tfserving. Tensorboard is run with the following command:
After clicking "Capture" from the tensorboard UI and sending requests to the TFServer, the However, if I run tensorboard locally from my laptop pointing it to the gcs bucket, tensorboard locally does show the profile:
Tensorboard version is 2.8.0, but the same issue occurs with version 2.4.1. Any fix or debugging tips would be greatly appreciated. Thank you. |
Any input on this? We are setting up tensorboard in a large-scale k8s deployment (>1000 pods), and so being able to store event logs in GCS is crucial for enabling this. I can reproduce the issue locally in docker with latest serving and tensorflow images and Tensorboard 2.7.0, and am happy to send my docker files it if helps. A local docker container runs tensorboard specifying a log directory in GCS. Tried running tensorboard both with and without the Below is a list of files in GCS produced after a profiling run. Noticed a profile-empty file in the list:
The file sizes are: Here is the output of tensorboard inspect. Strangely, there are tags but no stats shown for each tag:
The docker container has access to the GCS bucket. I verified this by exec'ing into the container and using gsutil to list and read files in the bucket. Also, tensorboard inspect works in the bucket. If I start a separate tensorboard instance from the command line from my laptop pointing to the same gcs bucket as so:
the profile results appear [after a log reload, clicked in the upper right hand corner of the UI]. |
I found that the issue specifically occurs with tensorboard-plugin-profile==2.5.0 with tensorboard 2.4.1 and 2.7.0 (and possibly other versions), but does not occur with version tensorboard-plugin-profile==2.4.0. |
Anything on this please? Problem is still there |
@rivershah can you please provide the directory structure of the GCS bucket, the command that you're using to run Tensorboard, and the code that you're using to collect the profile? Also, version information for tensorflow, tensorboard, and tensorboard_plugin_profile? |
I'm running the keras profing notebook on colab and all works fine.
Then I add a cell for logging into gcloud
and amend logging path to a gcs path:
and most of the times it works fine, but a few time I've got the "No profile data was found." page when browsing into tensorboard, even after refreshing.
Then I launch a tensorboard session in my local machine with logdir the gcs path:
and I always get the "No profile data was found." page when browsing into tensorboard, even after refreshing.
Finally I download the logging data from gcs bucket into a directory in my local machine and I start tensorboard with logdir my local path and it always shows the profile data.
Similar to #330 , but not quite like.
tensorboard 2.7.0, tensorboard_profiler_plugin 2.5.0
The text was updated successfully, but these errors were encountered: