-
Notifications
You must be signed in to change notification settings - Fork 57
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Wrong X axis on profiler's Step-time Graph #266
Comments
Found the same with no need for a custom loop, just calling
The result in Tensorboard PROFILE should be a Step-time Graph with only one step, instead getting a chart with steps from 18 to 105. See screeshot below. I will see if I can reduce the code to a smaller sample that still reproduces the issue. [CORRECTION] In the meanwhile, I have downloaded and run locally the Colab sample, and it DID reproduce the issue https://www.tensorflow.org/tensorboard/tensorboard_profiling_keras Using TF 2.4.1 and TB 2.4.1 installed with pip under Ubuntu 20.04 with CUDA 11.0. |
The problem can be reproduced running the Colab example https://www.tensorflow.org/tensorboard/tensorboard_profiling_keras Steps to reproduce:
The difference between the two runs is in the Dataset pipeline. The first run, which shows the issue, builds the pipeline like:
I have found that, if I add a |
I'm seeing the same. Using a profile_batch of Those numbers match the profile steps (9) but the values are completely random and vary from run to run |
I can also see the same inconsistency in the step number on the overview page for my model code. It is custom training loop similar to @andreykramer |
I'm having the same issue. I'm also using tf.profiler.experimental to trace a script and the step numbers that appear seem really random. I tried to run the example in https://www.tensorflow.org/tensorboard/tensorboard_profiling_keras and had the same problem |
Based on the tensorflow guide "writing a training loop from scratch" I've created a reproducible example (see bottom) to show that I can't seem to get the step number on the Step-time graph right. I adapted this example on tf.profiler.experimental.Trace to trace steps [20,29] on my training loop. The trace is correct:
But on the step-time graph in the overview page, the range of the X axis is [1,8]:
It's even worse on the actual code I'm trying to get the profiler working on, where I try to trace in a similar way steps [20,29] but the resulting step-time graph looks like this:
Am I getting something wrong? Where does [1,8] range come from?
Thank you in advance.
Here's the code for the reproducible example:
The text was updated successfully, but these errors were encountered: