[Minor] Fix doc and MARL tests #2759

Merged
merged 2 commits into from
Feb 5, 2025

Conversation

vmoens (Contributor) commented Feb 5, 2025

Stack from ghstack (oldest at bottom):

[ghstack-poisoned]
vmoens added a commit that referenced this pull request Feb 5, 2025
ghstack-source-id: 2171f334cfbfb3b2fe33fbdff9b5023e581fa5c8
Pull Request resolved: #2759

pytorch-bot bot commented Feb 5, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/2759

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEV

There is 1 currently active SEV. If your PR is affected, please view it below:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

facebook-github-bot added the "CLA Signed" label Feb 5, 2025. (This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.)

github-actions bot commented Feb 5, 2025

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 149. Improved: $\large\color{#35bf28}14$. Worsened: $\large\color{#d91a1a}7$.

Expand to view detailed results

| Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
| --- | --- | --- | --- | --- | --- |
| test_simple | 0.5338s | 0.4457s | 2.2438 Ops/s | 2.2429 Ops/s | $\color{#35bf28}+0.04\%$ |
| test_transformed | 0.9767s | 0.8868s | 1.1276 Ops/s | 1.0955 Ops/s | $\color{#35bf28}+2.93\%$ |
| test_serial | 1.3363s | 1.3350s | 0.7491 Ops/s | 0.7315 Ops/s | $\color{#35bf28}+2.40\%$ |
| test_parallel | 1.1919s | 1.1802s | 0.8473 Ops/s | 0.8060 Ops/s | $\textbf{\color{#35bf28}+5.13\%}$ |
| test_step_mdp_speed[True-True-True-True-True] | 0.1829ms | 30.9377μs | 32.3230 KOps/s | 33.3582 KOps/s | $\color{#d91a1a}-3.10\%$ |
| test_step_mdp_speed[True-True-True-True-False] | 67.8080μs | 18.1240μs | 55.1754 KOps/s | 56.1288 KOps/s | $\color{#d91a1a}-1.70\%$ |
| test_step_mdp_speed[True-True-True-False-True] | 47.2990μs | 17.3738μs | 57.5579 KOps/s | 59.0544 KOps/s | $\color{#d91a1a}-2.53\%$ |
| test_step_mdp_speed[True-True-True-False-False] | 59.0000μs | 10.1562μs | 98.4616 KOps/s | 101.3189 KOps/s | $\color{#d91a1a}-2.82\%$ |
| test_step_mdp_speed[True-True-False-True-True] | 62.3060μs | 32.9393μs | 30.3589 KOps/s | 31.5681 KOps/s | $\color{#d91a1a}-3.83\%$ |
| test_step_mdp_speed[True-True-False-True-False] | 51.2460μs | 20.0601μs | 49.8501 KOps/s | 50.9901 KOps/s | $\color{#d91a1a}-2.24\%$ |
| test_step_mdp_speed[True-True-False-False-True] | 45.9470μs | 19.4234μs | 51.4843 KOps/s | 52.6538 KOps/s | $\color{#d91a1a}-2.22\%$ |
| test_step_mdp_speed[True-True-False-False-False] | 67.1960μs | 12.1141μs | 82.5482 KOps/s | 84.6950 KOps/s | $\color{#d91a1a}-2.53\%$ |
| test_step_mdp_speed[True-False-True-True-True] | 0.1044ms | 35.0827μs | 28.5041 KOps/s | 29.7455 KOps/s | $\color{#d91a1a}-4.17\%$ |
| test_step_mdp_speed[True-False-True-True-False] | 51.0360μs | 22.0325μs | 45.3876 KOps/s | 46.8072 KOps/s | $\color{#d91a1a}-3.03\%$ |
| test_step_mdp_speed[True-False-True-False-True] | 74.3700μs | 19.5301μs | 51.2031 KOps/s | 53.3878 KOps/s | $\color{#d91a1a}-4.09\%$ |
| test_step_mdp_speed[True-False-True-False-False] | 33.2220μs | 12.2061μs | 81.9265 KOps/s | 85.6534 KOps/s | $\color{#d91a1a}-4.35\%$ |
| test_step_mdp_speed[True-False-False-True-True] | 98.6650μs | 36.4723μs | 27.4181 KOps/s | 28.2652 KOps/s | $\color{#d91a1a}-3.00\%$ |
| test_step_mdp_speed[True-False-False-True-False] | 68.2920μs | 23.9538μs | 41.7470 KOps/s | 42.9071 KOps/s | $\color{#d91a1a}-2.70\%$ |
| test_step_mdp_speed[True-False-False-False-True] | 39.2640μs | 21.2119μs | 47.1434 KOps/s | 49.0912 KOps/s | $\color{#d91a1a}-3.97\%$ |
| test_step_mdp_speed[True-False-False-False-False] | 55.2030μs | 13.9373μs | 71.7499 KOps/s | 74.0143 KOps/s | $\color{#d91a1a}-3.06\%$ |
| test_step_mdp_speed[False-True-True-True-True] | 87.2340μs | 35.0094μs | 28.5638 KOps/s | 29.7128 KOps/s | $\color{#d91a1a}-3.87\%$ |
| test_step_mdp_speed[False-True-True-True-False] | 49.2430μs | 22.2108μs | 45.0231 KOps/s | 46.5965 KOps/s | $\color{#d91a1a}-3.38\%$ |
| test_step_mdp_speed[False-True-True-False-True] | 2.3206ms | 22.4153μs | 44.6124 KOps/s | 46.2017 KOps/s | $\color{#d91a1a}-3.44\%$ |
| test_step_mdp_speed[False-True-True-False-False] | 40.1450μs | 13.5554μs | 73.7711 KOps/s | 75.8444 KOps/s | $\color{#d91a1a}-2.73\%$ |
| test_step_mdp_speed[False-True-False-True-True] | 94.3170μs | 36.9542μs | 27.0605 KOps/s | 28.1949 KOps/s | $\color{#d91a1a}-4.02\%$ |
| test_step_mdp_speed[False-True-False-True-False] | 55.8050μs | 24.1319μs | 41.4389 KOps/s | 42.9467 KOps/s | $\color{#d91a1a}-3.51\%$ |
| test_step_mdp_speed[False-True-False-False-True] | 60.8030μs | 25.6006μs | 39.0616 KOps/s | 43.1236 KOps/s | $\textbf{\color{#d91a1a}-9.42\%}$ |
| test_step_mdp_speed[False-True-False-False-False] | 64.5610μs | 15.3975μs | 64.9457 KOps/s | 67.1966 KOps/s | $\color{#d91a1a}-3.35\%$ |
| test_step_mdp_speed[False-False-True-True-True] | 94.6880μs | 38.6315μs | 25.8856 KOps/s | 26.7999 KOps/s | $\color{#d91a1a}-3.41\%$ |
| test_step_mdp_speed[False-False-True-True-False] | 75.2580μs | 25.6028μs | 39.0582 KOps/s | 39.8938 KOps/s | $\color{#d91a1a}-2.09\%$ |
| test_step_mdp_speed[False-False-True-False-True] | 58.3490μs | 24.3988μs | 40.9856 KOps/s | 43.7822 KOps/s | $\textbf{\color{#d91a1a}-6.39\%}$ |
| test_step_mdp_speed[False-False-True-False-False] | 40.6460μs | 15.4499μs | 64.7253 KOps/s | 67.9473 KOps/s | $\color{#d91a1a}-4.74\%$ |
| test_step_mdp_speed[False-False-False-True-True] | 0.1185ms | 40.0354μs | 24.9779 KOps/s | 25.9919 KOps/s | $\color{#d91a1a}-3.90\%$ |
| test_step_mdp_speed[False-False-False-True-False] | 51.0160μs | 27.3759μs | 36.5285 KOps/s | 37.8296 KOps/s | $\color{#d91a1a}-3.44\%$ |
| test_step_mdp_speed[False-False-False-False-True] | 54.2120μs | 25.3211μs | 39.4928 KOps/s | 40.9291 KOps/s | $\color{#d91a1a}-3.51\%$ |
| test_step_mdp_speed[False-False-False-False-False] | 57.2780μs | 16.9368μs | 59.0432 KOps/s | 60.5093 KOps/s | $\color{#d91a1a}-2.42\%$ |
| test_values[generalized_advantage_estimate-True-True] | 12.0010ms | 9.7375ms | 102.6960 Ops/s | 101.9000 Ops/s | $\color{#35bf28}+0.78\%$ |
| test_values[vec_generalized_advantage_estimate-True-True] | 26.5117ms | 24.0447ms | 41.5893 Ops/s | 37.5017 Ops/s | $\textbf{\color{#35bf28}+10.90\%}$ |
| test_values[td0_return_estimate-False-False] | 0.2313ms | 0.1747ms | 5.7234 KOps/s | 5.5727 KOps/s | $\color{#35bf28}+2.70\%$ |
| test_values[td1_return_estimate-False-False] | 24.3369ms | 23.6975ms | 42.1986 Ops/s | 40.8607 Ops/s | $\color{#35bf28}+3.27\%$ |
| test_values[vec_td1_return_estimate-False-False] | 25.8576ms | 24.1739ms | 41.3670 Ops/s | 38.1299 Ops/s | $\textbf{\color{#35bf28}+8.49\%}$ |
| test_values[td_lambda_return_estimate-True-False] | 36.9197ms | 34.4203ms | 29.0526 Ops/s | 28.3532 Ops/s | $\color{#35bf28}+2.47\%$ |
| test_values[vec_td_lambda_return_estimate-True-False] | 26.5981ms | 24.1632ms | 41.3852 Ops/s | 37.8359 Ops/s | $\textbf{\color{#35bf28}+9.38\%}$ |
| test_gae_speed[generalized_advantage_estimate-False-1-512] | 11.5822ms | 8.6080ms | 116.1708 Ops/s | 117.6758 Ops/s | $\color{#d91a1a}-1.28\%$ |
| test_gae_speed[vec_generalized_advantage_estimate-True-1-512] | 2.2037ms | 1.9402ms | 515.3998 Ops/s | 509.9689 Ops/s | $\color{#35bf28}+1.06\%$ |
| test_gae_speed[vec_generalized_advantage_estimate-False-1-512] | 0.6307ms | 0.3760ms | 2.6597 KOps/s | 2.6933 KOps/s | $\color{#d91a1a}-1.25\%$ |
| test_gae_speed[vec_generalized_advantage_estimate-True-32-512] | 44.0450ms | 42.8221ms | 23.3524 Ops/s | 22.2141 Ops/s | $\textbf{\color{#35bf28}+5.12\%}$ |
| test_gae_speed[vec_generalized_advantage_estimate-False-32-512] | 4.3550ms | 3.4547ms | 289.4587 Ops/s | 289.2882 Ops/s | $\color{#35bf28}+0.06\%$ |
| test_dqn_speed[False-None] | 1.9796ms | 1.4072ms | 710.6112 Ops/s | 698.8068 Ops/s | $\color{#35bf28}+1.69\%$ |
| test_dqn_speed[False-backward] | 1.9841ms | 1.9133ms | 522.6549 Ops/s | 519.6819 Ops/s | $\color{#35bf28}+0.57\%$ |
| test_dqn_speed[True-None] | 0.6287ms | 0.4824ms | 2.0731 KOps/s | 2.0550 KOps/s | $\color{#35bf28}+0.88\%$ |
| test_dqn_speed[True-backward] | 0.9675ms | 0.9306ms | 1.0746 KOps/s | 1.0602 KOps/s | $\color{#35bf28}+1.36\%$ |
| test_dqn_speed[reduce-overhead-None] | 0.7773ms | 0.4838ms | 2.0668 KOps/s | 2.0412 KOps/s | $\color{#35bf28}+1.25\%$ |
| test_dqn_speed[reduce-overhead-backward] | 1.0063ms | 0.9389ms | 1.0651 KOps/s | 1.0732 KOps/s | $\color{#d91a1a}-0.75\%$ |
| test_ddpg_speed[False-None] | 0.1874s | 3.4563ms | 289.3282 Ops/s | 346.3236 Ops/s | $\textbf{\color{#d91a1a}-16.46\%}$ |
| test_ddpg_speed[False-backward] | 4.3723ms | 4.0566ms | 246.5099 Ops/s | 249.9149 Ops/s | $\color{#d91a1a}-1.36\%$ |
| test_ddpg_speed[True-None] | 1.6793ms | 1.2341ms | 810.3239 Ops/s | 802.3884 Ops/s | $\color{#35bf28}+0.99\%$ |
| test_ddpg_speed[True-backward] | 2.7847ms | 2.1706ms | 460.7074 Ops/s | 465.4129 Ops/s | $\color{#d91a1a}-1.01\%$ |
| test_ddpg_speed[reduce-overhead-None] | 1.6870ms | 1.2350ms | 809.7394 Ops/s | 808.0358 Ops/s | $\color{#35bf28}+0.21\%$ |
| test_ddpg_speed[reduce-overhead-backward] | 2.2641ms | 2.1378ms | 467.7725 Ops/s | 469.4502 Ops/s | $\color{#d91a1a}-0.36\%$ |
| test_sac_speed[False-None] | 13.2720ms | 8.3844ms | 119.2692 Ops/s | 124.5036 Ops/s | $\color{#d91a1a}-4.20\%$ |
| test_sac_speed[False-backward] | 11.7654ms | 10.6775ms | 93.6546 Ops/s | 92.0424 Ops/s | $\color{#35bf28}+1.75\%$ |
| test_sac_speed[True-None] | 5.7174ms | 2.0983ms | 476.5765 Ops/s | 472.8986 Ops/s | $\color{#35bf28}+0.78\%$ |
| test_sac_speed[True-backward] | 3.8530ms | 3.7482ms | 266.7934 Ops/s | 262.8057 Ops/s | $\color{#35bf28}+1.52\%$ |
| test_sac_speed[reduce-overhead-None] | 3.3029ms | 2.0965ms | 476.9872 Ops/s | 476.4180 Ops/s | $\color{#35bf28}+0.12\%$ |
| test_sac_speed[reduce-overhead-backward] | 3.8393ms | 3.7721ms | 265.1017 Ops/s | 262.1358 Ops/s | $\color{#35bf28}+1.13\%$ |
| test_redq_speed[False-None] | 14.8191ms | 13.1646ms | 75.9611 Ops/s | 76.8664 Ops/s | $\color{#d91a1a}-1.18\%$ |
| test_redq_speed[False-backward] | 24.6143ms | 22.6643ms | 44.1223 Ops/s | 43.5479 Ops/s | $\color{#35bf28}+1.32\%$ |
| test_redq_speed[True-None] | 6.5780ms | 5.4089ms | 184.8808 Ops/s | 201.6476 Ops/s | $\textbf{\color{#d91a1a}-8.31\%}$ |
| test_redq_speed[True-backward] | 13.6147ms | 12.1402ms | 82.3709 Ops/s | 73.7716 Ops/s | $\textbf{\color{#35bf28}+11.66\%}$ |
| test_redq_speed[reduce-overhead-None] | 6.4198ms | 5.0215ms | 199.1418 Ops/s | 177.1708 Ops/s | $\textbf{\color{#35bf28}+12.40\%}$ |
| test_redq_speed[reduce-overhead-backward] | 13.5461ms | 12.6351ms | 79.1449 Ops/s | 75.9177 Ops/s | $\color{#35bf28}+4.25\%$ |
| test_redq_deprec_speed[False-None] | 14.0261ms | 12.9336ms | 77.3181 Ops/s | 77.7942 Ops/s | $\color{#d91a1a}-0.61\%$ |
| test_redq_deprec_speed[False-backward] | 24.7745ms | 18.9371ms | 52.8063 Ops/s | 53.9179 Ops/s | $\color{#d91a1a}-2.06\%$ |
| test_redq_deprec_speed[True-None] | 4.4976ms | 3.7916ms | 263.7386 Ops/s | 255.2490 Ops/s | $\color{#35bf28}+3.33\%$ |
| test_redq_deprec_speed[True-backward] | 8.4259ms | 8.1264ms | 123.0554 Ops/s | 120.2239 Ops/s | $\color{#35bf28}+2.36\%$ |
| test_redq_deprec_speed[reduce-overhead-None] | 5.1752ms | 3.8236ms | 261.5309 Ops/s | 256.6960 Ops/s | $\color{#35bf28}+1.88\%$ |
| test_redq_deprec_speed[reduce-overhead-backward] | 8.9600ms | 8.1711ms | 122.3824 Ops/s | 119.2268 Ops/s | $\color{#35bf28}+2.65\%$ |
| test_td3_speed[False-None] | 8.2108ms | 7.9523ms | 125.7491 Ops/s | 113.4519 Ops/s | $\textbf{\color{#35bf28}+10.84\%}$ |
| test_td3_speed[False-backward] | 11.2422ms | 10.3483ms | 96.6340 Ops/s | 94.8315 Ops/s | $\color{#35bf28}+1.90\%$ |
| test_td3_speed[True-None] | 1.8747ms | 1.7797ms | 561.8918 Ops/s | 533.6020 Ops/s | $\textbf{\color{#35bf28}+5.30\%}$ |
| test_td3_speed[True-backward] | 4.6716ms | 3.5588ms | 280.9925 Ops/s | 285.2413 Ops/s | $\color{#d91a1a}-1.49\%$ |
| test_td3_speed[reduce-overhead-None] | 2.2094ms | 1.7891ms | 558.9494 Ops/s | 529.7331 Ops/s | $\textbf{\color{#35bf28}+5.52\%}$ |
| test_td3_speed[reduce-overhead-backward] | 3.4439ms | 3.3918ms | 294.8324 Ops/s | 285.8135 Ops/s | $\color{#35bf28}+3.16\%$ |
| test_cql_speed[False-None] | 40.8716ms | 37.0887ms | 26.9624 Ops/s | 27.0793 Ops/s | $\color{#d91a1a}-0.43\%$ |
| test_cql_speed[False-backward] | 50.0137ms | 46.6185ms | 21.4507 Ops/s | 20.4361 Ops/s | $\color{#35bf28}+4.97\%$ |
| test_cql_speed[True-None] | 17.4015ms | 15.9097ms | 62.8549 Ops/s | 62.6319 Ops/s | $\color{#35bf28}+0.36\%$ |
| test_cql_speed[True-backward] | 24.6702ms | 23.0470ms | 43.3896 Ops/s | 44.1965 Ops/s | $\color{#d91a1a}-1.83\%$ |
| test_cql_speed[reduce-overhead-None] | 17.2627ms | 16.2910ms | 61.3834 Ops/s | 61.3571 Ops/s | $\color{#35bf28}+0.04\%$ |
| test_cql_speed[reduce-overhead-backward] | 23.7951ms | 22.7798ms | 43.8985 Ops/s | 44.1398 Ops/s | $\color{#d91a1a}-0.55\%$ |
| test_a2c_speed[False-None] | 7.7438ms | 7.0885ms | 141.0740 Ops/s | 137.2912 Ops/s | $\color{#35bf28}+2.76\%$ |
| test_a2c_speed[False-backward] | 15.4946ms | 14.1465ms | 70.6886 Ops/s | 68.4759 Ops/s | $\color{#35bf28}+3.23\%$ |
| test_a2c_speed[True-None] | 4.9591ms | 3.6962ms | 270.5496 Ops/s | 269.1526 Ops/s | $\color{#35bf28}+0.52\%$ |
| test_a2c_speed[True-backward] | 10.4203ms | 10.0862ms | 99.1449 Ops/s | 98.1508 Ops/s | $\color{#35bf28}+1.01\%$ |
| test_a2c_speed[reduce-overhead-None] | 4.7602ms | 3.6794ms | 271.7844 Ops/s | 269.1985 Ops/s | $\color{#35bf28}+0.96\%$ |
| test_a2c_speed[reduce-overhead-backward] | 10.7004ms | 10.0873ms | 99.1342 Ops/s | 98.0379 Ops/s | $\color{#35bf28}+1.12\%$ |
| test_ppo_speed[False-None] | 7.9979ms | 7.3644ms | 135.7885 Ops/s | 133.4391 Ops/s | $\color{#35bf28}+1.76\%$ |
| test_ppo_speed[False-backward] | 15.5650ms | 14.6592ms | 68.2168 Ops/s | 67.4744 Ops/s | $\color{#35bf28}+1.10\%$ |
| test_ppo_speed[True-None] | 4.7933ms | 4.0420ms | 247.4024 Ops/s | 243.6299 Ops/s | $\color{#35bf28}+1.55\%$ |
| test_ppo_speed[True-backward] | 10.3213ms | 9.9381ms | 100.6227 Ops/s | 99.6517 Ops/s | $\color{#35bf28}+0.97\%$ |
| test_ppo_speed[reduce-overhead-None] | 4.4920ms | 4.0419ms | 247.4063 Ops/s | 243.1333 Ops/s | $\color{#35bf28}+1.76\%$ |
| test_ppo_speed[reduce-overhead-backward] | 10.5904ms | 9.9414ms | 100.5899 Ops/s | 97.0364 Ops/s | $\color{#35bf28}+3.66\%$ |
| test_reinforce_speed[False-None] | 7.6193ms | 6.4574ms | 154.8613 Ops/s | 148.4724 Ops/s | $\color{#35bf28}+4.30\%$ |
| test_reinforce_speed[False-backward] | 9.8846ms | 9.6441ms | 103.6902 Ops/s | 98.0584 Ops/s | $\textbf{\color{#35bf28}+5.74\%}$ |
| test_reinforce_speed[True-None] | 3.7475ms | 3.0122ms | 331.9807 Ops/s | 322.6108 Ops/s | $\color{#35bf28}+2.90\%$ |
| test_reinforce_speed[True-backward] | 9.8868ms | 8.9327ms | 111.9481 Ops/s | 109.0831 Ops/s | $\color{#35bf28}+2.63\%$ |
| test_reinforce_speed[reduce-overhead-None] | 3.7981ms | 3.0755ms | 325.1513 Ops/s | 312.1836 Ops/s | $\color{#35bf28}+4.15\%$ |
| test_reinforce_speed[reduce-overhead-backward] | 9.5455ms | 8.9798ms | 111.3606 Ops/s | 109.1305 Ops/s | $\color{#35bf28}+2.04\%$ |
| test_iql_speed[False-None] | 37.2070ms | 32.7006ms | 30.5805 Ops/s | 29.5965 Ops/s | $\color{#35bf28}+3.32\%$ |
| test_iql_speed[False-backward] | 46.6077ms | 45.4998ms | 21.9781 Ops/s | 21.8552 Ops/s | $\color{#35bf28}+0.56\%$ |
| test_iql_speed[True-None] | 12.4437ms | 11.4016ms | 87.7069 Ops/s | 86.5702 Ops/s | $\color{#35bf28}+1.31\%$ |
| test_iql_speed[True-backward] | 23.4029ms | 22.4134ms | 44.6162 Ops/s | 44.0779 Ops/s | $\color{#35bf28}+1.22\%$ |
| test_iql_speed[reduce-overhead-None] | 12.9518ms | 11.6944ms | 85.5107 Ops/s | 85.9708 Ops/s | $\color{#d91a1a}-0.54\%$ |
| test_iql_speed[reduce-overhead-backward] | 24.3199ms | 22.9623ms | 43.5496 Ops/s | 44.8727 Ops/s | $\color{#d91a1a}-2.95\%$ |
| test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] | 7.0522ms | 5.1016ms | 196.0181 Ops/s | 205.3560 Ops/s | $\color{#d91a1a}-4.55\%$ |
| test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] | 0.7754ms | 0.5415ms | 1.8466 KOps/s | 1.8302 KOps/s | $\color{#35bf28}+0.89\%$ |
| test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] | 0.8363ms | 0.5250ms | 1.9047 KOps/s | 1.9433 KOps/s | $\color{#d91a1a}-1.99\%$ |
| test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] | 5.2409ms | 4.8461ms | 206.3531 Ops/s | 217.0469 Ops/s | $\color{#d91a1a}-4.93\%$ |
| test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] | 1.3731ms | 0.5402ms | 1.8512 KOps/s | 1.8751 KOps/s | $\color{#d91a1a}-1.27\%$ |
| test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] | 0.7581ms | 0.5104ms | 1.9594 KOps/s | 1.9936 KOps/s | $\color{#d91a1a}-1.72\%$ |
| test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] | 2.4049ms | 1.7460ms | 572.7373 Ops/s | 567.9523 Ops/s | $\color{#35bf28}+0.84\%$ |
| test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] | 2.1123ms | 1.6633ms | 601.2108 Ops/s | 599.9788 Ops/s | $\color{#35bf28}+0.21\%$ |
| test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] | 7.7240ms | 4.9816ms | 200.7376 Ops/s | 209.2404 Ops/s | $\color{#d91a1a}-4.06\%$ |
| test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] | 1.1831ms | 0.6988ms | 1.4309 KOps/s | 1.4564 KOps/s | $\color{#d91a1a}-1.75\%$ |
| test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] | 1.1726ms | 0.6724ms | 1.4873 KOps/s | 1.5368 KOps/s | $\color{#d91a1a}-3.22\%$ |
| test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] | 4.9751ms | 4.8382ms | 206.6881 Ops/s | 216.6330 Ops/s | $\color{#d91a1a}-4.59\%$ |
| test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] | 1.0304ms | 0.5484ms | 1.8234 KOps/s | 1.8303 KOps/s | $\color{#d91a1a}-0.38\%$ |
| test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] | 0.7648ms | 0.5229ms | 1.9124 KOps/s | 1.9582 KOps/s | $\color{#d91a1a}-2.34\%$ |
| test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] | 5.4245ms | 4.8264ms | 207.1951 Ops/s | 217.0857 Ops/s | $\color{#d91a1a}-4.56\%$ |
| test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] | 0.8458ms | 0.5352ms | 1.8683 KOps/s | 1.8158 KOps/s | $\color{#35bf28}+2.89\%$ |
| test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] | 0.7604ms | 0.5123ms | 1.9519 KOps/s | 1.9379 KOps/s | $\color{#35bf28}+0.72\%$ |
| test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] | 5.3080ms | 4.9919ms | 200.3245 Ops/s | 212.3367 Ops/s | $\textbf{\color{#d91a1a}-5.66\%}$ |
| test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] | 3.7698ms | 0.6918ms | 1.4455 KOps/s | 1.4133 KOps/s | $\color{#35bf28}+2.28\%$ |
| test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] | 0.8641ms | 0.6570ms | 1.5221 KOps/s | 1.4928 KOps/s | $\color{#35bf28}+1.96\%$ |
| test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] | 6.5785ms | 4.3297ms | 230.9653 Ops/s | 245.8061 Ops/s | $\textbf{\color{#d91a1a}-6.04\%}$ |
| test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] | 5.6448ms | 2.4122ms | 414.5608 Ops/s | 420.8949 Ops/s | $\color{#d91a1a}-1.50\%$ |
| test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] | 7.9821ms | 1.5617ms | 640.3480 Ops/s | 733.1511 Ops/s | $\textbf{\color{#d91a1a}-12.66\%}$ |
| test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] | 0.4899s | 14.1018ms | 70.9132 Ops/s | 33.4444 Ops/s | $\textbf{\color{#35bf28}+112.03\%}$ |
| test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] | 8.6005ms | 2.4549ms | 407.3546 Ops/s | 413.5803 Ops/s | $\color{#d91a1a}-1.51\%$ |
| test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] | 6.9300ms | 1.4679ms | 681.2542 Ops/s | 703.1502 Ops/s | $\color{#d91a1a}-3.11\%$ |
| test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] | 5.8627ms | 4.4558ms | 224.4253 Ops/s | 219.1503 Ops/s | $\color{#35bf28}+2.41\%$ |
| test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] | 7.0120ms | 2.5674ms | 389.4917 Ops/s | 380.7005 Ops/s | $\color{#35bf28}+2.31\%$ |
| test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] | 6.1427ms | 1.5875ms | 629.9232 Ops/s | 565.5000 Ops/s | $\textbf{\color{#35bf28}+11.39\%}$ |
| test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-True] | 12.0041ms | 11.6593ms | 85.7685 Ops/s | 81.0497 Ops/s | $\textbf{\color{#35bf28}+5.82\%}$ |
| test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-False] | 15.5360ms | 14.4914ms | 69.0067 Ops/s | 68.8116 Ops/s | $\color{#35bf28}+0.28\%$ |
| test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-True] | 22.1914ms | 20.5814ms | 48.5876 Ops/s | 47.6679 Ops/s | $\color{#35bf28}+1.93\%$ |
| test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-False] | 17.0307ms | 14.8004ms | 67.5655 Ops/s | 67.2234 Ops/s | $\color{#35bf28}+0.51\%$ |
| test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-True] | 21.0155ms | 20.5731ms | 48.6072 Ops/s | 46.8813 Ops/s | $\color{#35bf28}+3.68\%$ |
| test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-False] | 18.0636ms | 15.8668ms | 63.0248 Ops/s | 62.0549 Ops/s | $\color{#35bf28}+1.56\%$ |
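
For readers checking the numbers: the Change column appears to be the relative difference between Ops and Ops on Repo HEAD, and the bolded rows appear to be the ones counted as Improved or Worsened (14 and 7 in the CPU table, which is consistent with a cut-off of roughly 5%). The sketch below recomputes three rows under those assumptions. The names `percent_change` and `classify` and the 5% threshold are illustrative and inferred from the table, not taken from the benchmark tooling.

```python
def percent_change(ops: float, ops_head: float) -> float:
    """Relative throughput change versus repo HEAD, in percent."""
    return (ops - ops_head) / ops_head * 100.0


def classify(change_pct: float, threshold_pct: float = 5.0) -> str:
    """Label a row. The 5% threshold is an assumption inferred from the bolded rows."""
    if change_pct > threshold_pct:
        return "improved"
    if change_pct < -threshold_pct:
        return "worsened"
    return "unchanged"


# Three rows copied from the CPU table: (name, Ops, Ops on Repo HEAD), all in Ops/s.
rows = [
    ("test_transformed", 1.1276, 1.0955),
    ("test_ddpg_speed[False-None]", 289.3282, 346.3236),
    ("test_td3_speed[False-None]", 125.7491, 113.4519),
]

results = {name: round(percent_change(ops, head), 2) for name, ops, head in rows}
labels = {name: classify(pct) for name, pct in results.items()}

for name, pct in results.items():
    print(f"{name}: {pct:+.2f}% ({labels[name]})")
```

Because the table shows rounded Ops values, a recomputed percentage can differ from the reported one in the last digit (for example, `test_parallel` recomputes to about +5.12% against a reported +5.13%).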


github-actions bot commented Feb 5, 2025

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 149. Improved: $\large\color{#35bf28}19$. Worsened: $\large\color{#d91a1a}5$.

Expand to view detailed results

| Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
| --- | --- | --- | --- | --- | --- |
| test_simple | 0.8271s | 0.7417s | 1.3482 Ops/s | 1.3272 Ops/s | $\color{#35bf28}+1.59\%$ |
| test_transformed | 1.2859s | 1.2835s | 0.7791 Ops/s | 0.7413 Ops/s | $\textbf{\color{#35bf28}+5.11\%}$ |
| test_serial | 2.1370s | 2.1298s | 0.4695 Ops/s | 0.4622 Ops/s | $\color{#35bf28}+1.59\%$ |
| test_parallel | 1.8922s | 1.8579s | 0.5382 Ops/s | 0.5402 Ops/s | $\color{#d91a1a}-0.36\%$ |
| test_step_mdp_speed[True-True-True-True-True] | 0.2013ms | 38.6387μs | 25.8808 KOps/s | 25.4668 KOps/s | $\color{#35bf28}+1.63\%$ |
| test_step_mdp_speed[True-True-True-True-False] | 56.3210μs | 22.8852μs | 43.6964 KOps/s | 42.6029 KOps/s | $\color{#35bf28}+2.57\%$ |
| test_step_mdp_speed[True-True-True-False-True] | 59.5310μs | 21.5913μs | 46.3150 KOps/s | 45.5156 KOps/s | $\color{#35bf28}+1.76\%$ |
| test_step_mdp_speed[True-True-True-False-False] | 44.4710μs | 12.4685μs | 80.2018 KOps/s | 77.8316 KOps/s | $\color{#35bf28}+3.05\%$ |
| test_step_mdp_speed[True-True-False-True-True] | 83.1710μs | 41.4195μs | 24.1432 KOps/s | 23.7619 KOps/s | $\color{#35bf28}+1.60\%$ |
| test_step_mdp_speed[True-True-False-True-False] | 61.1920μs | 24.8170μs | 40.2949 KOps/s | 39.2264 KOps/s | $\color{#35bf28}+2.72\%$ |
| test_step_mdp_speed[True-True-False-False-True] | 59.0310μs | 23.8736μs | 41.8874 KOps/s | 40.7867 KOps/s | $\color{#35bf28}+2.70\%$ |
| test_step_mdp_speed[True-True-False-False-False] | 66.8810μs | 14.5792μs | 68.5908 KOps/s | 66.5705 KOps/s | $\color{#35bf28}+3.03\%$ |
| test_step_mdp_speed[True-False-True-True-True] | 71.6410μs | 43.5042μs | 22.9863 KOps/s | 23.1070 KOps/s | $\color{#d91a1a}-0.52\%$ |
| test_step_mdp_speed[True-False-True-True-False] | 60.0710μs | 27.4944μs | 36.3711 KOps/s | 37.1118 KOps/s | $\color{#d91a1a}-2.00\%$ |
| test_step_mdp_speed[True-False-True-False-True] | 55.2310μs | 24.1677μs | 41.3775 KOps/s | 42.2332 KOps/s | $\color{#d91a1a}-2.03\%$ |
| test_step_mdp_speed[True-False-True-False-False] | 44.6210μs | 14.9604μs | 66.8430 KOps/s | 66.4433 KOps/s | $\color{#35bf28}+0.60\%$ |
| test_step_mdp_speed[True-False-False-True-True] | 80.9410μs | 46.9173μs | 21.3141 KOps/s | 21.9168 KOps/s | $\color{#d91a1a}-2.75\%$ |
| test_step_mdp_speed[True-False-False-True-False] | 67.9110μs | 29.7158μs | 33.6521 KOps/s | 34.5704 KOps/s | $\color{#d91a1a}-2.66\%$ |
| test_step_mdp_speed[True-False-False-False-True] | 59.6510μs | 26.3801μs | 37.9073 KOps/s | 37.8002 KOps/s | $\color{#35bf28}+0.28\%$ |
| test_step_mdp_speed[True-False-False-False-False] | 47.7100μs | 17.2986μs | 57.8080 KOps/s | 57.4535 KOps/s | $\color{#35bf28}+0.62\%$ |
| test_step_mdp_speed[False-True-True-True-True] | 76.9510μs | 43.8731μs | 22.7930 KOps/s | 22.8551 KOps/s | $\color{#d91a1a}-0.27\%$ |
| test_step_mdp_speed[False-True-True-True-False] | 63.5410μs | 27.3758μs | 36.5286 KOps/s | 36.3339 KOps/s | $\color{#35bf28}+0.54\%$ |
| test_step_mdp_speed[False-True-True-False-True] | 2.6088ms | 28.4802μs | 35.1121 KOps/s | 36.3423 KOps/s | $\color{#d91a1a}-3.38\%$ |
| test_step_mdp_speed[False-True-True-False-False] | 47.3910μs | 16.9575μs | 58.9710 KOps/s | 58.6922 KOps/s | $\color{#35bf28}+0.48\%$ |
| test_step_mdp_speed[False-True-False-True-True] | 0.1201ms | 46.5087μs | 21.5013 KOps/s | 21.7586 KOps/s | $\color{#d91a1a}-1.18\%$ |
| test_step_mdp_speed[False-True-False-True-False] | 60.0920μs | 30.0145μs | 33.3172 KOps/s | 33.0558 KOps/s | $\color{#35bf28}+0.79\%$ |
| test_step_mdp_speed[False-True-False-False-True] | 61.7410μs | 30.3746μs | 32.9223 KOps/s | 31.8715 KOps/s | $\color{#35bf28}+3.30\%$ |
| test_step_mdp_speed[False-True-False-False-False] | 58.7520μs | 19.3125μs | 51.7800 KOps/s | 52.3211 KOps/s | $\color{#d91a1a}-1.03\%$ |
| test_step_mdp_speed[False-False-True-True-True] | 94.3720μs | 48.8944μs | 20.4523 KOps/s | 20.2672 KOps/s | $\color{#35bf28}+0.91\%$ |
| test_step_mdp_speed[False-False-True-True-False] | 61.0810μs | 32.0237μs | 31.2268 KOps/s | 30.7809 KOps/s | $\color{#35bf28}+1.45\%$ |
| test_step_mdp_speed[False-False-True-False-True] | 61.4120μs | 29.7345μs | 33.6310 KOps/s | 33.7253 KOps/s | $\color{#d91a1a}-0.28\%$ |
| test_step_mdp_speed[False-False-True-False-False] | 57.5610μs | 18.9395μs | 52.7997 KOps/s | 53.6356 KOps/s | $\color{#d91a1a}-1.56\%$ |
| test_step_mdp_speed[False-False-False-True-True] | 77.9720μs | 50.3831μs | 19.8479 KOps/s | 20.0220 KOps/s | $\color{#d91a1a}-0.87\%$ |
| test_step_mdp_speed[False-False-False-True-False] | 63.7720μs | 34.2566μs | 29.1915 KOps/s | 29.2005 KOps/s | $\color{#d91a1a}-0.03\%$ |
| test_step_mdp_speed[False-False-False-False-True] | 70.5910μs | 31.9060μs | 31.3420 KOps/s | 31.6113 KOps/s | $\color{#d91a1a}-0.85\%$ |
| test_step_mdp_speed[False-False-False-False-False] | 46.5210μs | 21.0488μs | 47.5086 KOps/s | 47.3762 KOps/s | $\color{#35bf28}+0.28\%$ |
| test_values[generalized_advantage_estimate-True-True] | 25.4523ms | 25.0496ms | 39.9209 Ops/s | 40.1648 Ops/s | $\color{#d91a1a}-0.61\%$ |
| test_values[vec_generalized_advantage_estimate-True-True] | 0.1202s | 3.3083ms | 302.2703 Ops/s | 327.4288 Ops/s | $\textbf{\color{#d91a1a}-7.68\%}$ |
| test_values[td0_return_estimate-False-False] | 0.1062ms | 81.0041μs | 12.3451 KOps/s | 12.3905 KOps/s | $\color{#d91a1a}-0.37\%$ |
| test_values[td1_return_estimate-False-False] | 56.2718ms | 55.7896ms | 17.9245 Ops/s | 18.0116 Ops/s | $\color{#d91a1a}-0.48\%$ |
| test_values[vec_td1_return_estimate-False-False] | 1.3030ms | 1.0896ms | 917.7586 Ops/s | 921.1390 Ops/s | $\color{#d91a1a}-0.37\%$ |
| test_values[td_lambda_return_estimate-True-False] | 88.9378ms | 88.4202ms | 11.3096 Ops/s | 11.3620 Ops/s | $\color{#d91a1a}-0.46\%$ |
| test_values[vec_td_lambda_return_estimate-True-False] | 1.3863ms | 1.0895ms | 917.8161 Ops/s | 923.4861 Ops/s | $\color{#d91a1a}-0.61\%$ |
| test_gae_speed[generalized_advantage_estimate-False-1-512] | 25.0049ms | 24.8283ms | 40.2766 Ops/s | 40.3637 Ops/s | $\color{#d91a1a}-0.22\%$ |
| test_gae_speed[vec_generalized_advantage_estimate-True-1-512] | 1.0251ms | 0.7614ms | 1.3134 KOps/s | 1.3206 KOps/s | $\color{#d91a1a}-0.55\%$ |
| test_gae_speed[vec_generalized_advantage_estimate-False-1-512] | 0.7698ms | 0.6789ms | 1.4730 KOps/s | 1.4783 KOps/s | $\color{#d91a1a}-0.36\%$ |
| test_gae_speed[vec_generalized_advantage_estimate-True-32-512] | 1.5336ms | 1.4918ms | 670.3519 Ops/s | 674.0574 Ops/s | $\color{#d91a1a}-0.55\%$ |
| test_gae_speed[vec_generalized_advantage_estimate-False-32-512] | 0.7367ms | 0.6943ms | 1.4404 KOps/s | 1.4511 KOps/s | $\color{#d91a1a}-0.74\%$ |
| test_dqn_speed[False-None] | 7.0054ms | 1.5236ms | 656.3269 Ops/s | 649.0149 Ops/s | $\color{#35bf28}+1.13\%$ |
| test_dqn_speed[False-backward] | 2.1636ms | 2.1272ms | 470.0994 Ops/s | 460.1191 Ops/s | $\color{#35bf28}+2.17\%$ |
| test_dqn_speed[True-None] | 0.6313ms | 0.5542ms | 1.8045 KOps/s | 1.7843 KOps/s | $\color{#35bf28}+1.13\%$ |
| test_dqn_speed[True-backward] | 1.2256ms | 1.1225ms | 890.8795 Ops/s | 882.8810 Ops/s | $\color{#35bf28}+0.91\%$ |
| test_dqn_speed[reduce-overhead-None] | 0.9787ms | 0.5729ms | 1.7454 KOps/s | 1.7393 KOps/s | $\color{#35bf28}+0.35\%$ |
| test_dqn_speed[reduce-overhead-backward] | 1.0257ms | 0.9609ms | 1.0407 KOps/s | 1.0311 KOps/s | $\color{#35bf28}+0.93\%$ |
| test_ddpg_speed[False-None] | 3.2492ms | 2.8610ms | 349.5280 Ops/s | 342.3651 Ops/s | $\color{#35bf28}+2.09\%$ |
| test_ddpg_speed[False-backward] | 4.6411ms | 4.1413ms | 241.4697 Ops/s | 238.8273 Ops/s | $\color{#35bf28}+1.11\%$ |
| test_ddpg_speed[True-None] | 1.7402ms | 1.3317ms | 750.8920 Ops/s | 746.2974 Ops/s | $\color{#35bf28}+0.62\%$ |
| test_ddpg_speed[True-backward] | 2.4641ms | 2.4055ms | 415.7155 Ops/s | 410.7102 Ops/s | $\color{#35bf28}+1.22\%$ |
| test_ddpg_speed[reduce-overhead-None] | 1.7633ms | 1.3474ms | 742.1639 Ops/s | 736.6748 Ops/s | $\color{#35bf28}+0.75\%$ |
| test_ddpg_speed[reduce-overhead-backward] | 1.9591ms | 1.8852ms | 530.4481 Ops/s | 526.9800 Ops/s | $\color{#35bf28}+0.66\%$ |
| test_sac_speed[False-None] | 8.4053ms | 8.0243ms | 124.6210 Ops/s | 121.8615 Ops/s | $\color{#35bf28}+2.26\%$ |
| test_sac_speed[False-backward] | 11.4391ms | 10.9214ms | 91.5634 Ops/s | 89.3427 Ops/s | $\color{#35bf28}+2.49\%$ |
| test_sac_speed[True-None] | 1.9084ms | 1.8193ms | 549.6691 Ops/s | 541.9819 Ops/s | $\color{#35bf28}+1.42\%$ |
| test_sac_speed[True-backward] | 3.6025ms | 3.5207ms | 284.0348 Ops/s | 278.5832 Ops/s | $\color{#35bf28}+1.96\%$ |
| test_sac_speed[reduce-overhead-None] | 21.2444ms | 12.0279ms | 83.1398 Ops/s | 81.9650 Ops/s | $\color{#35bf28}+1.43\%$ |
| test_sac_speed[reduce-overhead-backward] | 1.6669ms | 1.6153ms | 619.0663 Ops/s | 604.1788 Ops/s | $\color{#35bf28}+2.46\%$ |
| test_redq_speed[False-None] | 7.9962ms | 7.4926ms | 133.4643 Ops/s | 129.8756 Ops/s | $\color{#35bf28}+2.76\%$ |
| test_redq_speed[False-backward] | 12.3459ms | 11.3184ms | 88.3518 Ops/s | 86.3583 Ops/s | $\color{#35bf28}+2.31\%$ |
| test_redq_speed[True-None] | 2.4984ms | 2.2902ms | 436.6456 Ops/s | 421.9798 Ops/s | $\color{#35bf28}+3.48\%$ |
| test_redq_speed[True-backward] | 4.4213ms | 3.9730ms | 251.6983 Ops/s | 236.7359 Ops/s | $\textbf{\color{#35bf28}+6.32\%}$ |
| test_redq_speed[reduce-overhead-None] | 2.6688ms | 2.3098ms | 432.9455 Ops/s | 422.3992 Ops/s | $\color{#35bf28}+2.50\%$ |
| test_redq_speed[reduce-overhead-backward] | 4.0279ms | 3.9690ms | 251.9542 Ops/s | 235.1060 Ops/s | $\textbf{\color{#35bf28}+7.17\%}$ |
| test_redq_deprec_speed[False-None] | 9.5553ms | 9.0370ms | 110.6567 Ops/s | 107.1763 Ops/s | $\color{#35bf28}+3.25\%$ |
| test_redq_deprec_speed[False-backward] | 12.6128ms | 12.0182ms | 83.2073 Ops/s | 79.7030 Ops/s | $\color{#35bf28}+4.40\%$ |
| test_redq_deprec_speed[True-None] | 2.7522ms | 2.6269ms | 380.6758 Ops/s | 373.0325 Ops/s | $\color{#35bf28}+2.05\%$ |
| test_redq_deprec_speed[True-backward] | 4.3284ms | 4.2732ms | 234.0149 Ops/s | 227.2211 Ops/s | $\color{#35bf28}+2.99\%$ |
| test_redq_deprec_speed[reduce-overhead-None] | 2.8108ms | 2.6090ms | 383.2927 Ops/s | 373.3900 Ops/s | $\color{#35bf28}+2.65\%$ |
| test_redq_deprec_speed[reduce-overhead-backward] | 4.7529ms | 4.2982ms | 232.6533 Ops/s | 227.4836 Ops/s | $\color{#35bf28}+2.27\%$ |
| test_td3_speed[False-None] | 8.1062ms | 7.9621ms | 125.5948 Ops/s | 122.5479 Ops/s | $\color{#35bf28}+2.49\%$ |
| test_td3_speed[False-backward] | 10.7573ms | 10.2845ms | 97.2334 Ops/s | 95.4468 Ops/s | $\color{#35bf28}+1.87\%$ |
| test_td3_speed[True-None] | 1.6445ms | 1.6231ms | 616.0899 Ops/s | 586.2364 Ops/s | $\textbf{\color{#35bf28}+5.09\%}$ |
| test_td3_speed[True-backward] | 3.1901ms | 3.1520ms | 317.2608 Ops/s | 311.9054 Ops/s | $\color{#35bf28}+1.72\%$ |
| test_td3_speed[reduce-overhead-None] | 54.1901ms | 25.9466ms | 38.5407 Ops/s | 36.5602 Ops/s | $\textbf{\color{#35bf28}+5.42\%}$ |
| test_td3_speed[reduce-overhead-backward] | 1.3801ms | 1.3420ms | 745.1561 Ops/s | 722.3655 Ops/s | $\color{#35bf28}+3.15\%$ |
| test_cql_speed[False-None] | 17.1992ms | 16.7606ms | 59.6637 Ops/s | 58.2919 Ops/s | $\color{#35bf28}+2.35\%$ |
| test_cql_speed[False-backward] | 22.3874ms | 21.8980ms | 45.6662 Ops/s | 44.7810 Ops/s | $\color{#35bf28}+1.98\%$ |
| test_cql_speed[True-None] | 3.3012ms | 3.2311ms | 309.4878 Ops/s | 302.9426 Ops/s | $\color{#35bf28}+2.16\%$ |
| test_cql_speed[True-backward] | 6.4178ms | 5.6499ms | 176.9938 Ops/s | 175.6689 Ops/s | $\color{#35bf28}+0.75\%$ |
| test_cql_speed[reduce-overhead-None] | 21.7066ms | 13.2428ms | 75.5127 Ops/s | 57.8488 Ops/s | $\textbf{\color{#35bf28}+30.53\%}$ |
| test_cql_speed[reduce-overhead-backward] | 2.1278ms | 1.9869ms | 503.3016 Ops/s | 495.5534 Ops/s | $\color{#35bf28}+1.56\%$ |
| test_a2c_speed[False-None] | 3.4172ms | 3.2103ms | 311.4927 Ops/s | 301.5938 Ops/s | $\color{#35bf28}+3.28\%$ |
| test_a2c_speed[False-backward] | 6.8905ms | 6.3698ms | 156.9907 Ops/s | 152.7383 Ops/s | $\color{#35bf28}+2.78\%$ |
| test_a2c_speed[True-None] | 1.4035ms | 1.3391ms | 746.7689 Ops/s | 738.6743 Ops/s | $\color{#35bf28}+1.10\%$ |
| test_a2c_speed[True-backward] | 3.1024ms | 3.0205ms | 331.0659 Ops/s | 320.8578 Ops/s | $\color{#35bf28}+3.18\%$ |
| test_a2c_speed[reduce-overhead-None] | 16.0872ms | 9.0361ms | 110.6675 Ops/s | 111.8898 Ops/s | $\color{#d91a1a}-1.09\%$ |
| test_a2c_speed[reduce-overhead-backward] | 1.7092ms | 1.6023ms | 624.1212 Ops/s | 618.6805 Ops/s | $\color{#35bf28}+0.88\%$ |
| test_ppo_speed[False-None] | 3.8048ms | 3.6935ms | 270.7431 Ops/s | 262.1050 Ops/s | $\color{#35bf28}+3.30\%$ |
| test_ppo_speed[False-backward] | 7.4938ms | 7.0941ms | 140.9623 Ops/s | 137.4660 Ops/s | $\color{#35bf28}+2.54\%$ |
| test_ppo_speed[True-None] | 1.4699ms | 1.3948ms | 716.9609 Ops/s | 700.3222 Ops/s | $\color{#35bf28}+2.38\%$ |
| test_ppo_speed[True-backward] | 3.2860ms | 3.1873ms | 313.7455 Ops/s | 304.0267 Ops/s | $\color{#35bf28}+3.20\%$ |
| test_ppo_speed[reduce-overhead-None] | 1.0596ms | 0.9549ms | 1.0472 KOps/s | 1.0341 KOps/s | $\color{#35bf28}+1.26\%$ |
| test_ppo_speed[reduce-overhead-backward] | 1.7398ms | 1.5464ms | 646.6750 Ops/s | 621.1755 Ops/s | $\color{#35bf28}+4.11\%$ |
| test_reinforce_speed[False-None] | 2.7062ms | 2.2701ms | 440.5175 Ops/s | 426.0587 Ops/s | $\color{#35bf28}+3.39\%$ |
| test_reinforce_speed[False-backward] | 3.7897ms | 3.3133ms | 301.8114 Ops/s | 285.9467 Ops/s | $\textbf{\color{#35bf28}+5.55\%}$ |
| test_reinforce_speed[True-None] | 1.4497ms | 1.2894ms | 775.5639 Ops/s | 759.4471 Ops/s | $\color{#35bf28}+2.12\%$ |
| test_reinforce_speed[True-backward] | 2.9555ms | 2.9164ms | 342.8935 Ops/s | 324.7299 Ops/s | $\textbf{\color{#35bf28}+5.59\%}$ |
| test_reinforce_speed[reduce-overhead-None] | 18.0971ms | 10.0471ms | 99.5316 Ops/s | 100.2395 Ops/s | $\color{#d91a1a}-0.71\%$ |
| test_reinforce_speed[reduce-overhead-backward] | 1.5598ms | 1.4696ms | 680.4432 Ops/s | 601.8540 Ops/s | $\textbf{\color{#35bf28}+13.06\%}$ |
| test_iql_speed[False-None] | 9.5976ms | 9.1563ms | 109.2146 Ops/s | 105.3235 Ops/s | $\color{#35bf28}+3.69\%$ |
| test_iql_speed[False-backward] | 13.3227ms | 12.8293ms | 77.9466 Ops/s | 73.8429 Ops/s | $\textbf{\color{#35bf28}+5.56\%}$ |
| test_iql_speed[True-None] | 2.6093ms | 2.2112ms | 452.2473 Ops/s | 437.7366 Ops/s | $\color{#35bf28}+3.31\%$ |
| test_iql_speed[True-backward] | 5.0782ms | 4.7442ms | 210.7842 Ops/s | 200.5108 Ops/s | $\textbf{\color{#35bf28}+5.12\%}$ |
| test_iql_speed[reduce-overhead-None] | 18.2206ms | 10.8847ms | 91.8719 Ops/s | 90.0024 Ops/s | $\color{#35bf28}+2.08\%$ |
| test_iql_speed[reduce-overhead-backward] | 2.0087ms | 1.8935ms | 528.1104 Ops/s | 464.4470 Ops/s | $\textbf{\color{#35bf28}+13.71\%}$ |
| test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] | 7.8176ms | 6.2052ms | 161.1548 Ops/s | 156.0724 Ops/s | $\color{#35bf28}+3.26\%$ |
| test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] | 0.4936ms | 0.2593ms | 3.8560 KOps/s | 3.2684 KOps/s | $\textbf{\color{#35bf28}+17.98\%}$ |
| test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] | 0.6659ms | 0.2363ms | 4.2322 KOps/s | 3.8129 KOps/s | $\textbf{\color{#35bf28}+11.00\%}$ |
| test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] | 6.5120ms | 6.0026ms | 166.5939 Ops/s | 164.1716 Ops/s | $\color{#35bf28}+1.48\%$ |
| test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] | 2.0542ms | 0.3059ms | 3.2690 KOps/s | 3.3624 KOps/s | $\color{#d91a1a}-2.78\%$ |
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.7034ms 0.2732ms 3.6609 KOps/s 4.1401 KOps/s $\textbf{\color{#d91a1a}-11.57\%}$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] 1.7954ms 1.3266ms 753.7901 Ops/s 781.9566 Ops/s $\color{#d91a1a}-3.60\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] 1.5750ms 1.2693ms 787.8443 Ops/s 853.1827 Ops/s $\textbf{\color{#d91a1a}-7.66\%}$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.3257ms 6.1501ms 162.5980 Ops/s 158.0995 Ops/s $\color{#35bf28}+2.85\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 0.8255ms 0.3975ms 2.5158 KOps/s 2.2900 KOps/s $\textbf{\color{#35bf28}+9.86\%}$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.8932ms 0.4631ms 2.1595 KOps/s 2.3673 KOps/s $\textbf{\color{#d91a1a}-8.78\%}$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 9.5042ms 6.0207ms 166.0935 Ops/s 162.7574 Ops/s $\color{#35bf28}+2.05\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 2.3980ms 0.3369ms 2.9684 KOps/s 3.1179 KOps/s $\color{#d91a1a}-4.80\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.7270ms 0.3062ms 3.2654 KOps/s 3.1289 KOps/s $\color{#35bf28}+4.36\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 6.3946ms 5.9291ms 168.6594 Ops/s 163.2646 Ops/s $\color{#35bf28}+3.30\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 1.6089ms 0.3204ms 3.1210 KOps/s 3.6796 KOps/s $\textbf{\color{#d91a1a}-15.18\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.5563ms 0.2830ms 3.5336 KOps/s 3.4441 KOps/s $\color{#35bf28}+2.60\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.5794ms 6.1401ms 162.8647 Ops/s 157.8212 Ops/s $\color{#35bf28}+3.20\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 2.1402ms 0.4808ms 2.0800 KOps/s 2.1340 KOps/s $\color{#d91a1a}-2.53\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.9384ms 0.4585ms 2.1812 KOps/s 2.1429 KOps/s $\color{#35bf28}+1.79\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 7.0603ms 5.4582ms 183.2108 Ops/s 179.3538 Ops/s $\color{#35bf28}+2.15\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 9.2417ms 2.0489ms 488.0641 Ops/s 439.3151 Ops/s $\textbf{\color{#35bf28}+11.10\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 7.3293ms 1.1920ms 838.9343 Ops/s 810.6455 Ops/s $\color{#35bf28}+3.49\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 7.2101ms 5.5803ms 179.2032 Ops/s 181.9762 Ops/s $\color{#d91a1a}-1.52\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 9.6043ms 2.0561ms 486.3558 Ops/s 432.1378 Ops/s $\textbf{\color{#35bf28}+12.55\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 6.8198ms 1.1326ms 882.9372 Ops/s 858.7324 Ops/s $\color{#35bf28}+2.82\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 0.4916s 15.4085ms 64.8993 Ops/s 31.3214 Ops/s $\textbf{\color{#35bf28}+107.20\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 6.9306ms 2.1816ms 458.3822 Ops/s 443.7977 Ops/s $\color{#35bf28}+3.29\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 9.2264ms 1.3850ms 722.0055 Ops/s 705.5021 Ops/s $\color{#35bf28}+2.34\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-True] 13.4745ms 12.9213ms 77.3916 Ops/s 73.5633 Ops/s $\textbf{\color{#35bf28}+5.20\%}$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-False] 18.6346ms 16.8977ms 59.1795 Ops/s 57.1314 Ops/s $\color{#35bf28}+3.59\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-True] 18.1887ms 17.5428ms 57.0034 Ops/s 55.3775 Ops/s $\color{#35bf28}+2.94\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-False] 18.8036ms 17.2669ms 57.9144 Ops/s 56.6374 Ops/s $\color{#35bf28}+2.25\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-True] 17.9156ms 17.4668ms 57.2516 Ops/s 55.9080 Ops/s $\color{#35bf28}+2.40\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-False] 19.8296ms 18.4848ms 54.0986 Ops/s 52.9233 Ops/s $\color{#35bf28}+2.22\%$
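
For readers checking the table, the Change column appears to be the relative difference between the Ops column and the Ops on Repo HEAD column, and Ops appears to be the reciprocal of Mean. The sketch below is not taken from the benchmark tooling; the function names are hypothetical and the formulas are inferred from the rows above. For the three sample rows it reproduces the reported Change values to two decimals.

```python
# Hypothetical helpers; the formulas are an assumption inferred from the table,
# not code from the benchmark workflow.

def ops_from_mean(mean_seconds: float) -> float:
    """Throughput in operations per second for a given mean run time."""
    return 1.0 / mean_seconds


def percent_change(ops: float, ops_head: float) -> float:
    """Relative change of this PR's throughput against repo HEAD, in percent."""
    return (ops / ops_head - 1.0) * 100.0


# (Ops, Ops on Repo HEAD) copied from three rows of the table above.
rows = {
    "test_cql_speed[reduce-overhead-None]": (75.5127, 57.8488),
    "test_a2c_speed[reduce-overhead-None]": (110.6675, 111.8898),
    "test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400]": (64.8993, 31.3214),
}

for name, (ops, ops_head) in rows.items():
    print(f"{name}: {percent_change(ops, ops_head):+.2f}%")
```

Rows with a large gap between Max and Mean, such as the `reduce-overhead-None` cases, suggest a few slow outlier runs, so their Change values are probably better read as noisy than as firm regressions or improvements.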

@vmoens vmoens added documentation Improvements or additions to documentation formatting Code formatting changes labels Feb 5, 2025
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Feb 5, 2025
ghstack-source-id: 9308be3ebc7fac30b5bde321792eb97069d55996
Pull Request resolved: #2759
@vmoens vmoens merged commit 429915b into gh/vmoens/85/base Feb 5, 2025
51 of 61 checks passed
@vmoens vmoens deleted the gh/vmoens/85/head branch February 5, 2025 10:48