Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[CI] Fix Cairo-2 Chess import error #2743

Merged
merged 8 commits into from
Feb 3, 2025
Merged

Conversation

vmoens
Copy link
Contributor

@vmoens vmoens commented Feb 3, 2025

[ghstack-poisoned]
Copy link

pytorch-bot bot commented Feb 3, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/2743

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure, 1 Cancelled Job, 26 Pending, 5 Unrelated Failures

As of commit ad0c606 with merge base ffa99b2 (image):

NEW FAILURE - The following job has failed:

CANCELLED JOB - The following job was cancelled. Please retry:

FLAKY - The following job failed but was likely due to flakiness present on trunk:

BROKEN TRUNK - The following jobs failed but was present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

vmoens added a commit that referenced this pull request Feb 3, 2025
ghstack-source-id: d46f4184ce9f4fa8c857a383d6d323aad47aa25d
Pull Request resolved: #2743
@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Feb 3, 2025
@vmoens vmoens added the Environments Adds or modifies an environment wrapper label Feb 3, 2025
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Feb 3, 2025
ghstack-source-id: f7608a89d650713cea743fdd9f39340e15997a6d
Pull Request resolved: #2743
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Feb 3, 2025
ghstack-source-id: 38e22a3fa8dcf9c081b4cb62e15197826456caf3
Pull Request resolved: #2743
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Feb 3, 2025
ghstack-source-id: 410fa391e8d6ae63f32d197d46a72df014c53472
Pull Request resolved: #2743
Copy link

github-actions bot commented Feb 3, 2025

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 149. Improved: $\large\color{#35bf28}18$. Worsened: $\large\color{#d91a1a}5$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_simple 0.5644s 0.4625s 2.1623 Ops/s 2.1237 Ops/s $\color{#35bf28}+1.82\%$
test_transformed 1.0310s 0.9405s 1.0633 Ops/s 1.0554 Ops/s $\color{#35bf28}+0.75\%$
test_serial 1.5129s 1.3973s 0.7156 Ops/s 0.7160 Ops/s $\color{#d91a1a}-0.04\%$
test_parallel 1.3437s 1.2271s 0.8150 Ops/s 0.7811 Ops/s $\color{#35bf28}+4.33\%$
test_step_mdp_speed[True-True-True-True-True] 0.2644ms 30.4312μs 32.8610 KOps/s 32.5538 KOps/s $\color{#35bf28}+0.94\%$
test_step_mdp_speed[True-True-True-True-False] 44.1220μs 17.6990μs 56.5003 KOps/s 56.3656 KOps/s $\color{#35bf28}+0.24\%$
test_step_mdp_speed[True-True-True-False-True] 65.6100μs 17.2574μs 57.9463 KOps/s 59.2771 KOps/s $\color{#d91a1a}-2.25\%$
test_step_mdp_speed[True-True-True-False-False] 46.2960μs 10.0704μs 99.3010 KOps/s 99.9340 KOps/s $\color{#d91a1a}-0.63\%$
test_step_mdp_speed[True-True-False-True-True] 83.9860μs 32.2377μs 31.0196 KOps/s 30.9365 KOps/s $\color{#35bf28}+0.27\%$
test_step_mdp_speed[True-True-False-True-False] 51.0540μs 19.6109μs 50.9920 KOps/s 51.1188 KOps/s $\color{#d91a1a}-0.25\%$
test_step_mdp_speed[True-True-False-False-True] 68.4870μs 19.1554μs 52.2045 KOps/s 52.4341 KOps/s $\color{#d91a1a}-0.44\%$
test_step_mdp_speed[True-True-False-False-False] 0.1330ms 11.8743μs 84.2157 KOps/s 84.3057 KOps/s $\color{#d91a1a}-0.11\%$
test_step_mdp_speed[True-False-True-True-True] 0.2958ms 35.4581μs 28.2023 KOps/s 29.2621 KOps/s $\color{#d91a1a}-3.62\%$
test_step_mdp_speed[True-False-True-True-False] 72.4450μs 21.2147μs 47.1371 KOps/s 46.9614 KOps/s $\color{#35bf28}+0.37\%$
test_step_mdp_speed[True-False-True-False-True] 0.6601ms 19.2257μs 52.0138 KOps/s 52.7033 KOps/s $\color{#d91a1a}-1.31\%$
test_step_mdp_speed[True-False-True-False-False] 71.9230μs 11.9946μs 83.3711 KOps/s 84.5710 KOps/s $\color{#d91a1a}-1.42\%$
test_step_mdp_speed[True-False-False-True-True] 84.3060μs 35.9179μs 27.8412 KOps/s 28.2266 KOps/s $\color{#d91a1a}-1.37\%$
test_step_mdp_speed[True-False-False-True-False] 0.2233ms 22.9337μs 43.6039 KOps/s 43.2845 KOps/s $\color{#35bf28}+0.74\%$
test_step_mdp_speed[True-False-False-False-True] 95.9680μs 20.7813μs 48.1202 KOps/s 49.1083 KOps/s $\color{#d91a1a}-2.01\%$
test_step_mdp_speed[True-False-False-False-False] 64.7200μs 13.4451μs 74.3763 KOps/s 71.8142 KOps/s $\color{#35bf28}+3.57\%$
test_step_mdp_speed[False-True-True-True-True] 98.4030μs 34.5267μs 28.9631 KOps/s 29.7129 KOps/s $\color{#d91a1a}-2.52\%$
test_step_mdp_speed[False-True-True-True-False] 61.6250μs 21.5096μs 46.4909 KOps/s 42.4210 KOps/s $\textbf{\color{#35bf28}+9.59\%}$
test_step_mdp_speed[False-True-True-False-True] 82.5630μs 22.5597μs 44.3269 KOps/s 46.1522 KOps/s $\color{#d91a1a}-3.95\%$
test_step_mdp_speed[False-True-True-False-False] 0.1235ms 13.4007μs 74.6230 KOps/s 75.9202 KOps/s $\color{#d91a1a}-1.71\%$
test_step_mdp_speed[False-True-False-True-True] 0.1009ms 35.9619μs 27.8072 KOps/s 27.9494 KOps/s $\color{#d91a1a}-0.51\%$
test_step_mdp_speed[False-True-False-True-False] 58.5680μs 23.0900μs 43.3088 KOps/s 43.2067 KOps/s $\color{#35bf28}+0.24\%$
test_step_mdp_speed[False-True-False-False-True] 2.8353ms 23.4401μs 42.6619 KOps/s 42.7145 KOps/s $\color{#d91a1a}-0.12\%$
test_step_mdp_speed[False-True-False-False-False] 77.8340μs 14.9667μs 66.8151 KOps/s 66.5285 KOps/s $\color{#35bf28}+0.43\%$
test_step_mdp_speed[False-False-True-True-True] 97.8610μs 37.7699μs 26.4761 KOps/s 26.5646 KOps/s $\color{#d91a1a}-0.33\%$
test_step_mdp_speed[False-False-True-True-False] 86.4500μs 25.0238μs 39.9619 KOps/s 40.1357 KOps/s $\color{#d91a1a}-0.43\%$
test_step_mdp_speed[False-False-True-False-True] 76.8920μs 23.6250μs 42.3280 KOps/s 42.8051 KOps/s $\color{#d91a1a}-1.11\%$
test_step_mdp_speed[False-False-True-False-False] 0.6085ms 15.0401μs 66.4887 KOps/s 67.1080 KOps/s $\color{#d91a1a}-0.92\%$
test_step_mdp_speed[False-False-False-True-True] 79.0470μs 39.2804μs 25.4580 KOps/s 25.7235 KOps/s $\color{#d91a1a}-1.03\%$
test_step_mdp_speed[False-False-False-True-False] 55.8340μs 27.0859μs 36.9196 KOps/s 37.8032 KOps/s $\color{#d91a1a}-2.34\%$
test_step_mdp_speed[False-False-False-False-True] 99.2860μs 25.5346μs 39.1625 KOps/s 40.0665 KOps/s $\color{#d91a1a}-2.26\%$
test_step_mdp_speed[False-False-False-False-False] 67.4030μs 16.5824μs 60.3048 KOps/s 60.8316 KOps/s $\color{#d91a1a}-0.87\%$
test_values[generalized_advantage_estimate-True-True] 10.3189ms 10.0240ms 99.7603 Ops/s 102.3371 Ops/s $\color{#d91a1a}-2.52\%$
test_values[vec_generalized_advantage_estimate-True-True] 29.7870ms 26.9648ms 37.0854 Ops/s 41.0083 Ops/s $\textbf{\color{#d91a1a}-9.57\%}$
test_values[td0_return_estimate-False-False] 0.2501ms 0.1958ms 5.1068 KOps/s 5.5791 KOps/s $\textbf{\color{#d91a1a}-8.47\%}$
test_values[td1_return_estimate-False-False] 28.0716ms 24.7841ms 40.3484 Ops/s 41.5217 Ops/s $\color{#d91a1a}-2.83\%$
test_values[vec_td1_return_estimate-False-False] 29.0127ms 26.9436ms 37.1146 Ops/s 41.0803 Ops/s $\textbf{\color{#d91a1a}-9.65\%}$
test_values[td_lambda_return_estimate-True-False] 37.7342ms 35.2066ms 28.4038 Ops/s 29.3517 Ops/s $\color{#d91a1a}-3.23\%$
test_values[vec_td_lambda_return_estimate-True-False] 28.7348ms 26.9153ms 37.1536 Ops/s 39.2686 Ops/s $\textbf{\color{#d91a1a}-5.39\%}$
test_gae_speed[generalized_advantage_estimate-False-1-512] 8.6220ms 8.4589ms 118.2183 Ops/s 119.6830 Ops/s $\color{#d91a1a}-1.22\%$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 2.5698ms 2.0133ms 496.6914 Ops/s 515.6821 Ops/s $\color{#d91a1a}-3.68\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 0.4644ms 0.3688ms 2.7115 KOps/s 2.5510 KOps/s $\textbf{\color{#35bf28}+6.29\%}$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 44.7828ms 43.4494ms 23.0153 Ops/s 23.4133 Ops/s $\color{#d91a1a}-1.70\%$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 4.5227ms 3.4595ms 289.0617 Ops/s 291.0275 Ops/s $\color{#d91a1a}-0.68\%$
test_dqn_speed[False-None] 2.1820ms 1.4251ms 701.6885 Ops/s 717.3707 Ops/s $\color{#d91a1a}-2.19\%$
test_dqn_speed[False-backward] 1.9862ms 1.9099ms 523.5960 Ops/s 536.9752 Ops/s $\color{#d91a1a}-2.49\%$
test_dqn_speed[True-None] 2.3451ms 0.4995ms 2.0022 KOps/s 2.0591 KOps/s $\color{#d91a1a}-2.77\%$
test_dqn_speed[True-backward] 1.0414ms 0.9147ms 1.0932 KOps/s 804.7674 Ops/s $\textbf{\color{#35bf28}+35.84\%}$
test_dqn_speed[reduce-overhead-None] 0.6496ms 0.4848ms 2.0626 KOps/s 2.0473 KOps/s $\color{#35bf28}+0.75\%$
test_dqn_speed[reduce-overhead-backward] 0.9498ms 0.9042ms 1.1059 KOps/s 1.0542 KOps/s $\color{#35bf28}+4.91\%$
test_ddpg_speed[False-None] 3.8101ms 2.9307ms 341.2191 Ops/s 344.2998 Ops/s $\color{#d91a1a}-0.89\%$
test_ddpg_speed[False-backward] 5.1393ms 4.1299ms 242.1340 Ops/s 245.7426 Ops/s $\color{#d91a1a}-1.47\%$
test_ddpg_speed[True-None] 4.2437ms 1.2561ms 796.1097 Ops/s 812.3016 Ops/s $\color{#d91a1a}-1.99\%$
test_ddpg_speed[True-backward] 2.1749ms 2.1313ms 469.2080 Ops/s 472.9752 Ops/s $\color{#d91a1a}-0.80\%$
test_ddpg_speed[reduce-overhead-None] 1.4242ms 1.2271ms 814.9602 Ops/s 821.4636 Ops/s $\color{#d91a1a}-0.79\%$
test_ddpg_speed[reduce-overhead-backward] 2.1910ms 2.1144ms 472.9391 Ops/s 468.4472 Ops/s $\color{#35bf28}+0.96\%$
test_sac_speed[False-None] 9.3925ms 8.0374ms 124.4189 Ops/s 124.8836 Ops/s $\color{#d91a1a}-0.37\%$
test_sac_speed[False-backward] 13.0307ms 10.8632ms 92.0539 Ops/s 92.9369 Ops/s $\color{#d91a1a}-0.95\%$
test_sac_speed[True-None] 2.6632ms 2.1007ms 476.0206 Ops/s 475.3692 Ops/s $\color{#35bf28}+0.14\%$
test_sac_speed[True-backward] 4.0811ms 3.7704ms 265.2273 Ops/s 265.4410 Ops/s $\color{#d91a1a}-0.08\%$
test_sac_speed[reduce-overhead-None] 2.3945ms 2.1086ms 474.2521 Ops/s 474.6097 Ops/s $\color{#d91a1a}-0.08\%$
test_sac_speed[reduce-overhead-backward] 4.0969ms 3.7848ms 264.2159 Ops/s 261.8788 Ops/s $\color{#35bf28}+0.89\%$
test_redq_speed[False-None] 14.8352ms 13.2873ms 75.2601 Ops/s 74.3077 Ops/s $\color{#35bf28}+1.28\%$
test_redq_speed[False-backward] 24.1075ms 22.7707ms 43.9161 Ops/s 44.2690 Ops/s $\color{#d91a1a}-0.80\%$
test_redq_speed[True-None] 6.2707ms 5.3685ms 186.2719 Ops/s 192.4165 Ops/s $\color{#d91a1a}-3.19\%$
test_redq_speed[True-backward] 14.7316ms 13.2132ms 75.6817 Ops/s 76.5171 Ops/s $\color{#d91a1a}-1.09\%$
test_redq_speed[reduce-overhead-None] 6.6270ms 5.1897ms 192.6893 Ops/s 187.2198 Ops/s $\color{#35bf28}+2.92\%$
test_redq_speed[reduce-overhead-backward] 14.5218ms 12.7634ms 78.3489 Ops/s 74.0942 Ops/s $\textbf{\color{#35bf28}+5.74\%}$
test_redq_deprec_speed[False-None] 15.0197ms 13.3896ms 74.6849 Ops/s 73.8392 Ops/s $\color{#35bf28}+1.15\%$
test_redq_deprec_speed[False-backward] 21.4045ms 19.0501ms 52.4932 Ops/s 49.8977 Ops/s $\textbf{\color{#35bf28}+5.20\%}$
test_redq_deprec_speed[True-None] 4.8606ms 4.0215ms 248.6647 Ops/s 241.0227 Ops/s $\color{#35bf28}+3.17\%$
test_redq_deprec_speed[True-backward] 9.5157ms 8.6416ms 115.7199 Ops/s 115.6395 Ops/s $\color{#35bf28}+0.07\%$
test_redq_deprec_speed[reduce-overhead-None] 4.8710ms 4.0100ms 249.3739 Ops/s 249.9779 Ops/s $\color{#d91a1a}-0.24\%$
test_redq_deprec_speed[reduce-overhead-backward] 9.8683ms 8.9242ms 112.0554 Ops/s 100.7068 Ops/s $\textbf{\color{#35bf28}+11.27\%}$
test_td3_speed[False-None] 8.4556ms 8.0902ms 123.6061 Ops/s 115.9056 Ops/s $\textbf{\color{#35bf28}+6.64\%}$
test_td3_speed[False-backward] 12.0909ms 10.8928ms 91.8038 Ops/s 89.4528 Ops/s $\color{#35bf28}+2.63\%$
test_td3_speed[True-None] 2.0082ms 1.7755ms 563.2253 Ops/s 539.1209 Ops/s $\color{#35bf28}+4.47\%$
test_td3_speed[True-backward] 3.9811ms 3.5687ms 280.2122 Ops/s 280.1196 Ops/s $\color{#35bf28}+0.03\%$
test_td3_speed[reduce-overhead-None] 1.9667ms 1.7801ms 561.7775 Ops/s 546.7577 Ops/s $\color{#35bf28}+2.75\%$
test_td3_speed[reduce-overhead-backward] 3.4524ms 3.3623ms 297.4192 Ops/s 285.9414 Ops/s $\color{#35bf28}+4.01\%$
test_cql_speed[False-None] 39.5540ms 36.6827ms 27.2608 Ops/s 26.9475 Ops/s $\color{#35bf28}+1.16\%$
test_cql_speed[False-backward] 52.6441ms 48.3306ms 20.6908 Ops/s 20.7906 Ops/s $\color{#d91a1a}-0.48\%$
test_cql_speed[True-None] 17.0599ms 16.4791ms 60.6831 Ops/s 60.2326 Ops/s $\color{#35bf28}+0.75\%$
test_cql_speed[True-backward] 25.5598ms 23.7033ms 42.1882 Ops/s 42.4526 Ops/s $\color{#d91a1a}-0.62\%$
test_cql_speed[reduce-overhead-None] 17.4505ms 16.3465ms 61.1753 Ops/s 60.0561 Ops/s $\color{#35bf28}+1.86\%$
test_cql_speed[reduce-overhead-backward] 24.1570ms 23.2611ms 42.9903 Ops/s 41.9852 Ops/s $\color{#35bf28}+2.39\%$
test_a2c_speed[False-None] 7.8625ms 7.2825ms 137.3160 Ops/s 134.0322 Ops/s $\color{#35bf28}+2.45\%$
test_a2c_speed[False-backward] 16.0546ms 14.6084ms 68.4538 Ops/s 66.5660 Ops/s $\color{#35bf28}+2.84\%$
test_a2c_speed[True-None] 4.5593ms 3.7109ms 269.4761 Ops/s 262.2009 Ops/s $\color{#35bf28}+2.77\%$
test_a2c_speed[True-backward] 11.3523ms 10.6951ms 93.5011 Ops/s 88.6725 Ops/s $\textbf{\color{#35bf28}+5.45\%}$
test_a2c_speed[reduce-overhead-None] 4.1036ms 3.7274ms 268.2848 Ops/s 242.7544 Ops/s $\textbf{\color{#35bf28}+10.52\%}$
test_a2c_speed[reduce-overhead-backward] 11.7591ms 10.6060ms 94.2859 Ops/s 87.2639 Ops/s $\textbf{\color{#35bf28}+8.05\%}$
test_ppo_speed[False-None] 8.3084ms 7.8242ms 127.8092 Ops/s 125.9522 Ops/s $\color{#35bf28}+1.47\%$
test_ppo_speed[False-backward] 15.9945ms 15.3894ms 64.9796 Ops/s 63.5966 Ops/s $\color{#35bf28}+2.17\%$
test_ppo_speed[True-None] 6.0854ms 4.1527ms 240.8055 Ops/s 224.2958 Ops/s $\textbf{\color{#35bf28}+7.36\%}$
test_ppo_speed[True-backward] 11.3195ms 10.3774ms 96.3635 Ops/s 93.4677 Ops/s $\color{#35bf28}+3.10\%$
test_ppo_speed[reduce-overhead-None] 4.8813ms 4.0910ms 244.4414 Ops/s 236.7202 Ops/s $\color{#35bf28}+3.26\%$
test_ppo_speed[reduce-overhead-backward] 10.6891ms 10.1765ms 98.2659 Ops/s 94.4431 Ops/s $\color{#35bf28}+4.05\%$
test_reinforce_speed[False-None] 7.3611ms 6.6013ms 151.4858 Ops/s 145.8393 Ops/s $\color{#35bf28}+3.87\%$
test_reinforce_speed[False-backward] 10.5696ms 9.8673ms 101.3452 Ops/s 96.1972 Ops/s $\textbf{\color{#35bf28}+5.35\%}$
test_reinforce_speed[True-None] 3.8547ms 3.0839ms 324.2683 Ops/s 316.5660 Ops/s $\color{#35bf28}+2.43\%$
test_reinforce_speed[True-backward] 9.8011ms 9.1370ms 109.4454 Ops/s 104.9093 Ops/s $\color{#35bf28}+4.32\%$
test_reinforce_speed[reduce-overhead-None] 3.8422ms 3.0796ms 324.7219 Ops/s 306.8718 Ops/s $\textbf{\color{#35bf28}+5.82\%}$
test_reinforce_speed[reduce-overhead-backward] 9.9777ms 9.1554ms 109.2254 Ops/s 105.8827 Ops/s $\color{#35bf28}+3.16\%$
test_iql_speed[False-None] 33.3988ms 32.4299ms 30.8357 Ops/s 29.5328 Ops/s $\color{#35bf28}+4.41\%$
test_iql_speed[False-backward] 47.1736ms 45.3533ms 22.0491 Ops/s 20.6235 Ops/s $\textbf{\color{#35bf28}+6.91\%}$
test_iql_speed[True-None] 12.8871ms 11.8046ms 84.7130 Ops/s 84.8634 Ops/s $\color{#d91a1a}-0.18\%$
test_iql_speed[True-backward] 24.1696ms 22.7603ms 43.9362 Ops/s 42.7580 Ops/s $\color{#35bf28}+2.76\%$
test_iql_speed[reduce-overhead-None] 12.1217ms 11.3592ms 88.0340 Ops/s 86.0254 Ops/s $\color{#35bf28}+2.33\%$
test_iql_speed[reduce-overhead-backward] 34.8879ms 23.3405ms 42.8440 Ops/s 42.9207 Ops/s $\color{#d91a1a}-0.18\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 6.8168ms 5.1390ms 194.5906 Ops/s 198.8978 Ops/s $\color{#d91a1a}-2.17\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.8398ms 0.5264ms 1.8995 KOps/s 1.9273 KOps/s $\color{#d91a1a}-1.44\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.8211ms 0.4963ms 2.0149 KOps/s 1.9884 KOps/s $\color{#35bf28}+1.33\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 7.2315ms 4.8785ms 204.9815 Ops/s 200.3509 Ops/s $\color{#35bf28}+2.31\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 3.1412ms 0.5094ms 1.9631 KOps/s 1.9434 KOps/s $\color{#35bf28}+1.02\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.8137ms 0.4890ms 2.0449 KOps/s 2.0618 KOps/s $\color{#d91a1a}-0.82\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] 4.4386ms 1.6961ms 589.5782 Ops/s 598.8395 Ops/s $\color{#d91a1a}-1.55\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] 2.2414ms 1.5813ms 632.3917 Ops/s 633.7987 Ops/s $\color{#d91a1a}-0.22\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 7.4609ms 4.9327ms 202.7295 Ops/s 205.4521 Ops/s $\color{#d91a1a}-1.33\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 1.0473ms 0.6567ms 1.5228 KOps/s 1.5123 KOps/s $\color{#35bf28}+0.69\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 1.0118ms 0.6322ms 1.5818 KOps/s 1.5764 KOps/s $\color{#35bf28}+0.34\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 5.8522ms 4.7811ms 209.1584 Ops/s 204.6223 Ops/s $\color{#35bf28}+2.22\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 1.3787ms 0.5219ms 1.9162 KOps/s 1.9025 KOps/s $\color{#35bf28}+0.72\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.8757ms 0.5043ms 1.9829 KOps/s 1.9476 KOps/s $\color{#35bf28}+1.81\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 5.4641ms 4.7060ms 212.4939 Ops/s 206.9217 Ops/s $\color{#35bf28}+2.69\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 1.9560ms 0.5067ms 1.9735 KOps/s 1.9298 KOps/s $\color{#35bf28}+2.26\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.8202ms 0.4888ms 2.0459 KOps/s 2.0458 KOps/s $+0.00\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 5.2990ms 4.7959ms 208.5115 Ops/s 206.4995 Ops/s $\color{#35bf28}+0.97\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 1.3541ms 0.6524ms 1.5328 KOps/s 1.5103 KOps/s $\color{#35bf28}+1.49\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 1.0824ms 0.6358ms 1.5727 KOps/s 1.5717 KOps/s $\color{#35bf28}+0.06\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 5.5404ms 4.1468ms 241.1497 Ops/s 247.1833 Ops/s $\color{#d91a1a}-2.44\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 7.5777ms 2.3074ms 433.3970 Ops/s 429.9875 Ops/s $\color{#35bf28}+0.79\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 5.9576ms 1.3613ms 734.5972 Ops/s 707.5204 Ops/s $\color{#35bf28}+3.83\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 5.5042ms 4.2342ms 236.1698 Ops/s 33.0949 Ops/s $\textbf{\color{#35bf28}+613.61\%}$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 7.9233ms 2.3610ms 423.5548 Ops/s 420.0150 Ops/s $\color{#35bf28}+0.84\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 5.0931ms 1.3350ms 749.0689 Ops/s 744.3940 Ops/s $\color{#35bf28}+0.63\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 0.4589s 13.4504ms 74.3474 Ops/s 224.1863 Ops/s $\textbf{\color{#d91a1a}-66.84\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 8.6768ms 2.5591ms 390.7660 Ops/s 397.2394 Ops/s $\color{#d91a1a}-1.63\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 2.1099ms 1.4238ms 702.3282 Ops/s 606.7661 Ops/s $\textbf{\color{#35bf28}+15.75\%}$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-True] 14.3679ms 11.6385ms 85.9218 Ops/s 77.5836 Ops/s $\textbf{\color{#35bf28}+10.75\%}$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-False] 15.3201ms 14.0644ms 71.1016 Ops/s 68.9461 Ops/s $\color{#35bf28}+3.13\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-True] 22.4489ms 20.4231ms 48.9642 Ops/s 46.3417 Ops/s $\textbf{\color{#35bf28}+5.66\%}$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-False] 15.3793ms 14.3092ms 69.8851 Ops/s 68.8417 Ops/s $\color{#35bf28}+1.52\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-True] 22.2810ms 20.4581ms 48.8804 Ops/s 47.2596 Ops/s $\color{#35bf28}+3.43\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-False] 17.4404ms 15.7725ms 63.4013 Ops/s 63.0708 Ops/s $\color{#35bf28}+0.52\%$

Copy link

github-actions bot commented Feb 3, 2025

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 149. Improved: $\large\color{#35bf28}25$. Worsened: $\large\color{#d91a1a}9$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_simple 0.8361s 0.7489s 1.3353 Ops/s 1.3718 Ops/s $\color{#d91a1a}-2.66\%$
test_transformed 1.4520s 1.3614s 0.7345 Ops/s 0.7537 Ops/s $\color{#d91a1a}-2.54\%$
test_serial 2.1669s 2.1646s 0.4620 Ops/s 0.4577 Ops/s $\color{#35bf28}+0.94\%$
test_parallel 1.8562s 1.8271s 0.5473 Ops/s 0.5312 Ops/s $\color{#35bf28}+3.03\%$
test_step_mdp_speed[True-True-True-True-True] 0.2331ms 39.8979μs 25.0640 KOps/s 25.2319 KOps/s $\color{#d91a1a}-0.67\%$
test_step_mdp_speed[True-True-True-True-False] 50.3310μs 23.4389μs 42.6642 KOps/s 42.4381 KOps/s $\color{#35bf28}+0.53\%$
test_step_mdp_speed[True-True-True-False-True] 51.6810μs 21.9322μs 45.5952 KOps/s 44.4338 KOps/s $\color{#35bf28}+2.61\%$
test_step_mdp_speed[True-True-True-False-False] 39.3100μs 13.0636μs 76.5486 KOps/s 75.7051 KOps/s $\color{#35bf28}+1.11\%$
test_step_mdp_speed[True-True-False-True-True] 71.8410μs 42.9588μs 23.2781 KOps/s 23.4148 KOps/s $\color{#d91a1a}-0.58\%$
test_step_mdp_speed[True-True-False-True-False] 0.1025ms 25.5947μs 39.0706 KOps/s 38.3947 KOps/s $\color{#35bf28}+1.76\%$
test_step_mdp_speed[True-True-False-False-True] 54.3810μs 25.0461μs 39.9263 KOps/s 40.2594 KOps/s $\color{#d91a1a}-0.83\%$
test_step_mdp_speed[True-True-False-False-False] 43.7200μs 15.4339μs 64.7924 KOps/s 64.4553 KOps/s $\color{#35bf28}+0.52\%$
test_step_mdp_speed[True-False-True-True-True] 75.8210μs 45.6540μs 21.9039 KOps/s 22.2030 KOps/s $\color{#d91a1a}-1.35\%$
test_step_mdp_speed[True-False-True-True-False] 55.0610μs 28.0877μs 35.6027 KOps/s 35.3687 KOps/s $\color{#35bf28}+0.66\%$
test_step_mdp_speed[True-False-True-False-True] 90.5320μs 24.7290μs 40.4384 KOps/s 40.1768 KOps/s $\color{#35bf28}+0.65\%$
test_step_mdp_speed[True-False-True-False-False] 48.7910μs 15.2874μs 65.4134 KOps/s 64.5356 KOps/s $\color{#35bf28}+1.36\%$
test_step_mdp_speed[True-False-False-True-True] 77.0210μs 47.2340μs 21.1712 KOps/s 21.1005 KOps/s $\color{#35bf28}+0.33\%$
test_step_mdp_speed[True-False-False-True-False] 60.7910μs 30.2285μs 33.0814 KOps/s 32.8838 KOps/s $\color{#35bf28}+0.60\%$
test_step_mdp_speed[True-False-False-False-True] 57.3810μs 26.7579μs 37.3722 KOps/s 38.1503 KOps/s $\color{#d91a1a}-2.04\%$
test_step_mdp_speed[True-False-False-False-False] 46.7710μs 17.4553μs 57.2893 KOps/s 56.1322 KOps/s $\color{#35bf28}+2.06\%$
test_step_mdp_speed[False-True-True-True-True] 0.1115ms 44.6779μs 22.3824 KOps/s 22.5209 KOps/s $\color{#d91a1a}-0.61\%$
test_step_mdp_speed[False-True-True-True-False] 59.4010μs 27.9607μs 35.7645 KOps/s 35.8879 KOps/s $\color{#d91a1a}-0.34\%$
test_step_mdp_speed[False-True-True-False-True] 2.6222ms 28.7783μs 34.7484 KOps/s 34.4647 KOps/s $\color{#35bf28}+0.82\%$
test_step_mdp_speed[False-True-True-False-False] 59.7310μs 17.2791μs 57.8734 KOps/s 57.8721 KOps/s $+0.00\%$
test_step_mdp_speed[False-True-False-True-True] 74.0210μs 47.8241μs 20.9100 KOps/s 21.2826 KOps/s $\color{#d91a1a}-1.75\%$
test_step_mdp_speed[False-True-False-True-False] 58.4510μs 30.4388μs 32.8528 KOps/s 33.0314 KOps/s $\color{#d91a1a}-0.54\%$
test_step_mdp_speed[False-True-False-False-True] 58.5110μs 31.3219μs 31.9266 KOps/s 31.6732 KOps/s $\color{#35bf28}+0.80\%$
test_step_mdp_speed[False-True-False-False-False] 45.0110μs 19.5151μs 51.2423 KOps/s 50.9017 KOps/s $\color{#35bf28}+0.67\%$
test_step_mdp_speed[False-False-True-True-True] 76.8110μs 50.2990μs 19.8811 KOps/s 20.2714 KOps/s $\color{#d91a1a}-1.93\%$
test_step_mdp_speed[False-False-True-True-False] 61.6610μs 32.8257μs 30.4640 KOps/s 30.3699 KOps/s $\color{#35bf28}+0.31\%$
test_step_mdp_speed[False-False-True-False-True] 70.3610μs 30.8080μs 32.4591 KOps/s 32.4872 KOps/s $\color{#d91a1a}-0.09\%$
test_step_mdp_speed[False-False-True-False-False] 49.0610μs 19.4233μs 51.4846 KOps/s 51.1946 KOps/s $\color{#35bf28}+0.57\%$
test_step_mdp_speed[False-False-False-True-True] 85.4010μs 51.0377μs 19.5934 KOps/s 19.5951 KOps/s $-0.01\%$
test_step_mdp_speed[False-False-False-True-False] 59.6910μs 35.2729μs 28.3504 KOps/s 28.7927 KOps/s $\color{#d91a1a}-1.54\%$
test_step_mdp_speed[False-False-False-False-True] 62.7410μs 32.7233μs 30.5593 KOps/s 30.4521 KOps/s $\color{#35bf28}+0.35\%$
test_step_mdp_speed[False-False-False-False-False] 52.2210μs 21.6093μs 46.2763 KOps/s 46.4168 KOps/s $\color{#d91a1a}-0.30\%$
test_values[generalized_advantage_estimate-True-True] 26.2624ms 25.2973ms 39.5299 Ops/s 38.9165 Ops/s $\color{#35bf28}+1.58\%$
test_values[vec_generalized_advantage_estimate-True-True] 98.5517ms 2.8710ms 348.3124 Ops/s 332.2703 Ops/s $\color{#35bf28}+4.83\%$
test_values[td0_return_estimate-False-False] 0.1060ms 79.6923μs 12.5483 KOps/s 12.1259 KOps/s $\color{#35bf28}+3.48\%$
test_values[td1_return_estimate-False-False] 59.0892ms 57.0551ms 17.5269 Ops/s 17.5016 Ops/s $\color{#35bf28}+0.14\%$
test_values[vec_td1_return_estimate-False-False] 1.2876ms 1.0843ms 922.2951 Ops/s 917.0060 Ops/s $\color{#35bf28}+0.58\%$
test_values[td_lambda_return_estimate-True-False] 93.4759ms 90.8512ms 11.0070 Ops/s 11.0932 Ops/s $\color{#d91a1a}-0.78\%$
test_values[vec_td_lambda_return_estimate-True-False] 1.3545ms 1.0857ms 921.0780 Ops/s 921.5810 Ops/s $\color{#d91a1a}-0.05\%$
test_gae_speed[generalized_advantage_estimate-False-1-512] 25.8427ms 25.3128ms 39.5056 Ops/s 39.3243 Ops/s $\color{#35bf28}+0.46\%$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 1.0570ms 0.7694ms 1.2998 KOps/s 1.3055 KOps/s $\color{#d91a1a}-0.44\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 0.7582ms 0.6701ms 1.4922 KOps/s 1.4810 KOps/s $\color{#35bf28}+0.76\%$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 1.6357ms 1.4933ms 669.6593 Ops/s 667.2395 Ops/s $\color{#35bf28}+0.36\%$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 0.8420ms 0.6864ms 1.4568 KOps/s 1.4461 KOps/s $\color{#35bf28}+0.74\%$
test_dqn_speed[False-None] 1.6908ms 1.5370ms 650.6110 Ops/s 650.4049 Ops/s $\color{#35bf28}+0.03\%$
test_dqn_speed[False-backward] 2.3892ms 2.1690ms 461.0406 Ops/s 466.3044 Ops/s $\color{#d91a1a}-1.13\%$
test_dqn_speed[True-None] 0.7331ms 0.5582ms 1.7914 KOps/s 1.7840 KOps/s $\color{#35bf28}+0.41\%$
test_dqn_speed[True-backward] 1.2567ms 1.1251ms 888.8378 Ops/s 805.9865 Ops/s $\textbf{\color{#35bf28}+10.28\%}$
test_dqn_speed[reduce-overhead-None] 0.7151ms 0.5756ms 1.7374 KOps/s 1.7588 KOps/s $\color{#d91a1a}-1.22\%$
test_dqn_speed[reduce-overhead-backward] 1.0417ms 0.9631ms 1.0383 KOps/s 922.9821 Ops/s $\textbf{\color{#35bf28}+12.49\%}$
test_ddpg_speed[False-None] 3.2263ms 2.9143ms 343.1407 Ops/s 347.3370 Ops/s $\color{#d91a1a}-1.21\%$
test_ddpg_speed[False-backward] 4.6186ms 4.1979ms 238.2167 Ops/s 234.8296 Ops/s $\color{#35bf28}+1.44\%$
test_ddpg_speed[True-None] 1.5681ms 1.3477ms 741.9986 Ops/s 743.2899 Ops/s $\color{#d91a1a}-0.17\%$
test_ddpg_speed[True-backward] 2.5895ms 2.4317ms 411.2358 Ops/s 385.1407 Ops/s $\textbf{\color{#35bf28}+6.78\%}$
test_ddpg_speed[reduce-overhead-None] 1.5647ms 1.3609ms 734.7979 Ops/s 732.3425 Ops/s $\color{#35bf28}+0.34\%$
test_ddpg_speed[reduce-overhead-backward] 1.9702ms 1.8954ms 527.5853 Ops/s 489.8369 Ops/s $\textbf{\color{#35bf28}+7.71\%}$
test_sac_speed[False-None] 8.9358ms 8.2745ms 120.8533 Ops/s 122.8912 Ops/s $\color{#d91a1a}-1.66\%$
test_sac_speed[False-backward] 11.6268ms 11.1553ms 89.6431 Ops/s 88.1356 Ops/s $\color{#35bf28}+1.71\%$
test_sac_speed[True-None] 2.0014ms 1.8352ms 544.8991 Ops/s 537.0169 Ops/s $\color{#35bf28}+1.47\%$
test_sac_speed[True-backward] 3.6755ms 3.5388ms 282.5813 Ops/s 266.5705 Ops/s $\textbf{\color{#35bf28}+6.01\%}$
test_sac_speed[reduce-overhead-None] 21.5908ms 12.0596ms 82.9215 Ops/s 81.3654 Ops/s $\color{#35bf28}+1.91\%$
test_sac_speed[reduce-overhead-backward] 1.6800ms 1.6152ms 619.1335 Ops/s 546.2234 Ops/s $\textbf{\color{#35bf28}+13.35\%}$
test_redq_speed[False-None] 7.9866ms 7.5797ms 131.9313 Ops/s 130.4439 Ops/s $\color{#35bf28}+1.14\%$
test_redq_speed[False-backward] 11.9150ms 11.4752ms 87.1447 Ops/s 84.3387 Ops/s $\color{#35bf28}+3.33\%$
test_redq_speed[True-None] 2.4636ms 2.3085ms 433.1880 Ops/s 427.6361 Ops/s $\color{#35bf28}+1.30\%$
test_redq_speed[True-backward] 4.1448ms 3.9703ms 251.8722 Ops/s 244.5882 Ops/s $\color{#35bf28}+2.98\%$
test_redq_speed[reduce-overhead-None] 2.4458ms 2.3059ms 433.6686 Ops/s 423.8925 Ops/s $\color{#35bf28}+2.31\%$
test_redq_speed[reduce-overhead-backward] 4.1549ms 3.9852ms 250.9260 Ops/s 244.7728 Ops/s $\color{#35bf28}+2.51\%$
test_redq_deprec_speed[False-None] 9.4872ms 9.1802ms 108.9297 Ops/s 108.0215 Ops/s $\color{#35bf28}+0.84\%$
test_redq_deprec_speed[False-backward] 12.6843ms 12.1410ms 82.3653 Ops/s 82.4261 Ops/s $\color{#d91a1a}-0.07\%$
test_redq_deprec_speed[True-None] 3.5781ms 2.6381ms 379.0585 Ops/s 375.6662 Ops/s $\color{#35bf28}+0.90\%$
test_redq_deprec_speed[True-backward] 4.7865ms 4.3598ms 229.3673 Ops/s 216.6007 Ops/s $\textbf{\color{#35bf28}+5.89\%}$
test_redq_deprec_speed[reduce-overhead-None] 2.6994ms 2.6264ms 380.7520 Ops/s 374.1754 Ops/s $\color{#35bf28}+1.76\%$
test_redq_deprec_speed[reduce-overhead-backward] 4.3844ms 4.2991ms 232.6093 Ops/s 218.7693 Ops/s $\textbf{\color{#35bf28}+6.33\%}$
test_td3_speed[False-None] 8.2160ms 8.0786ms 123.7834 Ops/s 123.6746 Ops/s $\color{#35bf28}+0.09\%$
test_td3_speed[False-backward] 10.9019ms 10.4283ms 95.8929 Ops/s 94.5038 Ops/s $\color{#35bf28}+1.47\%$
test_td3_speed[True-None] 1.6915ms 1.6512ms 605.6246 Ops/s 573.6396 Ops/s $\textbf{\color{#35bf28}+5.58\%}$
test_td3_speed[True-backward] 3.2343ms 3.1709ms 315.3670 Ops/s 296.1023 Ops/s $\textbf{\color{#35bf28}+6.51\%}$
test_td3_speed[reduce-overhead-None] 54.8412ms 26.3299ms 37.9796 Ops/s 36.4177 Ops/s $\color{#35bf28}+4.29\%$
test_td3_speed[reduce-overhead-backward] 1.3915ms 1.3493ms 741.1212 Ops/s 650.6718 Ops/s $\textbf{\color{#35bf28}+13.90\%}$
test_cql_speed[False-None] 17.3864ms 16.9828ms 58.8832 Ops/s 58.8969 Ops/s $\color{#d91a1a}-0.02\%$
test_cql_speed[False-backward] 22.6074ms 22.1436ms 45.1597 Ops/s 44.5159 Ops/s $\color{#35bf28}+1.45\%$
test_cql_speed[True-None] 3.8197ms 3.2510ms 307.6014 Ops/s 302.8617 Ops/s $\color{#35bf28}+1.56\%$
test_cql_speed[True-backward] 6.5146ms 5.6737ms 176.2518 Ops/s 171.2302 Ops/s $\color{#35bf28}+2.93\%$
test_cql_speed[reduce-overhead-None] 21.3917ms 13.2717ms 75.3484 Ops/s 58.0384 Ops/s $\textbf{\color{#35bf28}+29.83\%}$
test_cql_speed[reduce-overhead-backward] 2.1891ms 2.0128ms 496.8162 Ops/s 491.0841 Ops/s $\color{#35bf28}+1.17\%$
test_a2c_speed[False-None] 3.4122ms 3.2212ms 310.4438 Ops/s 309.0863 Ops/s $\color{#35bf28}+0.44\%$
test_a2c_speed[False-backward] 6.9735ms 6.3985ms 156.2867 Ops/s 155.9240 Ops/s $\color{#35bf28}+0.23\%$
test_a2c_speed[True-None] 1.5118ms 1.3510ms 740.1898 Ops/s 737.0920 Ops/s $\color{#35bf28}+0.42\%$
test_a2c_speed[True-backward] 3.0860ms 3.0281ms 330.2392 Ops/s 321.5504 Ops/s $\color{#35bf28}+2.70\%$
test_a2c_speed[reduce-overhead-None] 15.7650ms 8.9612ms 111.5917 Ops/s 111.3812 Ops/s $\color{#35bf28}+0.19\%$
test_a2c_speed[reduce-overhead-backward] 1.7681ms 1.6166ms 618.5812 Ops/s 613.6462 Ops/s $\color{#35bf28}+0.80\%$
test_ppo_speed[False-None] 3.9638ms 3.7576ms 266.1275 Ops/s 267.5875 Ops/s $\color{#d91a1a}-0.55\%$
test_ppo_speed[False-backward] 7.9073ms 7.1571ms 139.7220 Ops/s 141.5203 Ops/s $\color{#d91a1a}-1.27\%$
test_ppo_speed[True-None] 1.6870ms 1.4140ms 707.2041 Ops/s 701.3215 Ops/s $\color{#35bf28}+0.84\%$
test_ppo_speed[True-backward] 3.3137ms 3.1860ms 313.8694 Ops/s 305.1484 Ops/s $\color{#35bf28}+2.86\%$
test_ppo_speed[reduce-overhead-None] 1.3664ms 0.9691ms 1.0319 KOps/s 1.0266 KOps/s $\color{#35bf28}+0.51\%$
test_ppo_speed[reduce-overhead-backward] 1.5957ms 1.5016ms 665.9605 Ops/s 614.9464 Ops/s $\textbf{\color{#35bf28}+8.30\%}$
test_reinforce_speed[False-None] 2.7214ms 2.3249ms 430.1219 Ops/s 432.4201 Ops/s $\color{#d91a1a}-0.53\%$
test_reinforce_speed[False-backward] 3.5402ms 3.4468ms 290.1241 Ops/s 289.8506 Ops/s $\color{#35bf28}+0.09\%$
test_reinforce_speed[True-None] 1.6900ms 1.2971ms 770.9668 Ops/s 752.2168 Ops/s $\color{#35bf28}+2.49\%$
test_reinforce_speed[True-backward] 3.1534ms 3.0862ms 324.0198 Ops/s 324.3343 Ops/s $\color{#d91a1a}-0.10\%$
test_reinforce_speed[reduce-overhead-None] 18.1268ms 10.0310ms 99.6906 Ops/s 101.8148 Ops/s $\color{#d91a1a}-2.09\%$
test_reinforce_speed[reduce-overhead-backward] 1.6797ms 1.5664ms 638.3864 Ops/s 600.2323 Ops/s $\textbf{\color{#35bf28}+6.36\%}$
test_iql_speed[False-None] 9.7818ms 9.3583ms 106.8566 Ops/s 106.2985 Ops/s $\color{#35bf28}+0.53\%$
test_iql_speed[False-backward] 13.5378ms 12.9945ms 76.9557 Ops/s 74.6641 Ops/s $\color{#35bf28}+3.07\%$
test_iql_speed[True-None] 2.3912ms 2.2352ms 447.3916 Ops/s 428.9913 Ops/s $\color{#35bf28}+4.29\%$
test_iql_speed[True-backward] 5.1609ms 4.7753ms 209.4096 Ops/s 199.9692 Ops/s $\color{#35bf28}+4.72\%$
test_iql_speed[reduce-overhead-None] 19.0020ms 11.2381ms 88.9827 Ops/s 89.8799 Ops/s $\color{#d91a1a}-1.00\%$
test_iql_speed[reduce-overhead-backward] 2.0629ms 1.9339ms 517.0996 Ops/s 462.1147 Ops/s $\textbf{\color{#35bf28}+11.90\%}$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 8.0386ms 6.3632ms 157.1534 Ops/s 153.6576 Ops/s $\color{#35bf28}+2.28\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.5591ms 0.3153ms 3.1713 KOps/s 2.9696 KOps/s $\textbf{\color{#35bf28}+6.79\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.6654ms 0.2973ms 3.3636 KOps/s 3.1386 KOps/s $\textbf{\color{#35bf28}+7.17\%}$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 6.4173ms 6.0907ms 164.1839 Ops/s 161.5875 Ops/s $\color{#35bf28}+1.61\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 1.8591ms 0.3319ms 3.0130 KOps/s 3.8094 KOps/s $\textbf{\color{#d91a1a}-20.91\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.7490ms 0.2972ms 3.3650 KOps/s 3.4101 KOps/s $\color{#d91a1a}-1.32\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] 1.7606ms 1.4552ms 687.2049 Ops/s 776.8783 Ops/s $\textbf{\color{#d91a1a}-11.54\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] 1.6438ms 1.3307ms 751.4674 Ops/s 845.4939 Ops/s $\textbf{\color{#d91a1a}-11.12\%}$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.4966ms 6.2878ms 159.0376 Ops/s 155.7877 Ops/s $\color{#35bf28}+2.09\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 1.9792ms 0.4892ms 2.0440 KOps/s 2.1760 KOps/s $\textbf{\color{#d91a1a}-6.07\%}$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.9404ms 0.4518ms 2.2134 KOps/s 2.5555 KOps/s $\textbf{\color{#d91a1a}-13.39\%}$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 6.2012ms 6.0626ms 164.9444 Ops/s 160.5726 Ops/s $\color{#35bf28}+2.72\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.8294ms 0.2668ms 3.7478 KOps/s 3.2795 KOps/s $\textbf{\color{#35bf28}+14.28\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.4502ms 0.2451ms 4.0806 KOps/s 3.5977 KOps/s $\textbf{\color{#35bf28}+13.42\%}$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 6.2860ms 5.9934ms 166.8510 Ops/s 161.5159 Ops/s $\color{#35bf28}+3.30\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 0.8707ms 0.3320ms 3.0124 KOps/s 2.9078 KOps/s $\color{#35bf28}+3.60\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.4959ms 0.2832ms 3.5314 KOps/s 3.0253 KOps/s $\textbf{\color{#35bf28}+16.73\%}$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.3832ms 6.2148ms 160.9074 Ops/s 155.5730 Ops/s $\color{#35bf28}+3.43\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 1.9744ms 0.4576ms 2.1853 KOps/s 2.3691 KOps/s $\textbf{\color{#d91a1a}-7.76\%}$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.7596ms 0.4433ms 2.2557 KOps/s 2.5434 KOps/s $\textbf{\color{#d91a1a}-11.31\%}$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 7.1246ms 5.5080ms 181.5553 Ops/s 177.7796 Ops/s $\color{#35bf28}+2.12\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 9.5605ms 2.0677ms 483.6264 Ops/s 436.5748 Ops/s $\textbf{\color{#35bf28}+10.78\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 7.1778ms 1.2130ms 824.3897 Ops/s 826.6495 Ops/s $\color{#d91a1a}-0.27\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 7.1281ms 5.6115ms 178.2041 Ops/s 179.4208 Ops/s $\color{#d91a1a}-0.68\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 9.1677ms 2.0599ms 485.4703 Ops/s 450.5637 Ops/s $\textbf{\color{#35bf28}+7.75\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 6.9727ms 1.1849ms 843.9795 Ops/s 756.8892 Ops/s $\textbf{\color{#35bf28}+11.51\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 0.5007s 15.6461ms 63.9135 Ops/s 31.3774 Ops/s $\textbf{\color{#35bf28}+103.69\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 10.7485ms 2.3343ms 428.3990 Ops/s 470.1482 Ops/s $\textbf{\color{#d91a1a}-8.88\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 2.2702ms 1.2652ms 790.4050 Ops/s 842.3760 Ops/s $\textbf{\color{#d91a1a}-6.17\%}$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-True] 13.0640ms 12.8704ms 77.6974 Ops/s 73.4949 Ops/s $\textbf{\color{#35bf28}+5.72\%}$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-False] 18.2546ms 16.7555ms 59.6818 Ops/s 58.7749 Ops/s $\color{#35bf28}+1.54\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-True] 18.0518ms 17.6285ms 56.7262 Ops/s 55.5471 Ops/s $\color{#35bf28}+2.12\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-False] 19.0600ms 17.1116ms 58.4399 Ops/s 58.8066 Ops/s $\color{#d91a1a}-0.62\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-True] 18.0017ms 17.4594ms 57.2757 Ops/s 55.2820 Ops/s $\color{#35bf28}+3.61\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-False] 20.1767ms 18.8766ms 52.9756 Ops/s 53.8454 Ops/s $\color{#d91a1a}-1.62\%$

[ghstack-poisoned]
[ghstack-poisoned]
[ghstack-poisoned]
[ghstack-poisoned]
@vmoens vmoens merged commit ad0c606 into gh/vmoens/86/base Feb 3, 2025
57 of 69 checks passed
vmoens added a commit that referenced this pull request Feb 3, 2025
ghstack-source-id: c2bcbfc4522bd1b4f1fea3dbb006dc9552b09cb4
Pull Request resolved: #2743
@vmoens vmoens deleted the gh/vmoens/86/head branch February 3, 2025 17:32
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. Environments Adds or modifies an environment wrapper
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants