Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[CI] Longer timeout for windows #2765

Merged
merged 1 commit into from
Feb 5, 2025
Merged

Conversation

vmoens
Copy link
Contributor

@vmoens vmoens commented Feb 5, 2025

[ghstack-poisoned]
Copy link

pytorch-bot bot commented Feb 5, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/2765

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There are 1 currently active SEVs. If your PR is affected, please view them below:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

vmoens added a commit that referenced this pull request Feb 5, 2025
ghstack-source-id: 381e7e39d650e0178178a78076321a2210237b39
Pull Request resolved: #2765
@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Feb 5, 2025
@vmoens vmoens merged commit 3fa0a86 into gh/vmoens/86/base Feb 5, 2025
54 of 59 checks passed
vmoens added a commit that referenced this pull request Feb 5, 2025
ghstack-source-id: 381e7e39d650e0178178a78076321a2210237b39
Pull Request resolved: #2765
@vmoens vmoens deleted the gh/vmoens/86/head branch February 5, 2025 21:00
Copy link

github-actions bot commented Feb 5, 2025

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 149. Improved: $\large\color{#35bf28}11$. Worsened: $\large\color{#d91a1a}19$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_simple 0.5486s 0.4598s 2.1750 Ops/s 2.2134 Ops/s $\color{#d91a1a}-1.73\%$
test_transformed 0.9930s 0.9162s 1.0915 Ops/s 1.1182 Ops/s $\color{#d91a1a}-2.39\%$
test_serial 1.4712s 1.3928s 0.7180 Ops/s 0.7265 Ops/s $\color{#d91a1a}-1.17\%$
test_parallel 1.3009s 1.2187s 0.8205 Ops/s 0.8130 Ops/s $\color{#35bf28}+0.92\%$
test_step_mdp_speed[True-True-True-True-True] 0.1748ms 30.1310μs 33.1884 KOps/s 33.2176 KOps/s $\color{#d91a1a}-0.09\%$
test_step_mdp_speed[True-True-True-True-False] 50.6550μs 17.8238μs 56.1047 KOps/s 56.0466 KOps/s $\color{#35bf28}+0.10\%$
test_step_mdp_speed[True-True-True-False-True] 47.8900μs 16.9222μs 59.0941 KOps/s 58.4440 KOps/s $\color{#35bf28}+1.11\%$
test_step_mdp_speed[True-True-True-False-False] 43.5220μs 10.0903μs 99.1051 KOps/s 100.2782 KOps/s $\color{#d91a1a}-1.17\%$
test_step_mdp_speed[True-True-False-True-True] 76.3230μs 32.5304μs 30.7405 KOps/s 30.9953 KOps/s $\color{#d91a1a}-0.82\%$
test_step_mdp_speed[True-True-False-True-False] 59.4210μs 19.8388μs 50.4062 KOps/s 50.9898 KOps/s $\color{#d91a1a}-1.14\%$
test_step_mdp_speed[True-True-False-False-True] 0.7167ms 19.3544μs 51.6677 KOps/s 52.4813 KOps/s $\color{#d91a1a}-1.55\%$
test_step_mdp_speed[True-True-False-False-False] 42.2990μs 11.8623μs 84.3008 KOps/s 83.9402 KOps/s $\color{#35bf28}+0.43\%$
test_step_mdp_speed[True-False-True-True-True] 76.5130μs 34.1478μs 29.2845 KOps/s 29.2502 KOps/s $\color{#35bf28}+0.12\%$
test_step_mdp_speed[True-False-True-True-False] 56.3950μs 21.7426μs 45.9927 KOps/s 46.3165 KOps/s $\color{#d91a1a}-0.70\%$
test_step_mdp_speed[True-False-True-False-True] 48.3000μs 18.8489μs 53.0535 KOps/s 52.3263 KOps/s $\color{#35bf28}+1.39\%$
test_step_mdp_speed[True-False-True-False-False] 53.7810μs 11.8868μs 84.1271 KOps/s 84.4971 KOps/s $\color{#d91a1a}-0.44\%$
test_step_mdp_speed[True-False-False-True-True] 81.0110μs 35.8326μs 27.9076 KOps/s 28.1329 KOps/s $\color{#d91a1a}-0.80\%$
test_step_mdp_speed[True-False-False-True-False] 53.4800μs 23.3629μs 42.8028 KOps/s 43.1894 KOps/s $\color{#d91a1a}-0.89\%$
test_step_mdp_speed[True-False-False-False-True] 50.3740μs 20.7239μs 48.2534 KOps/s 47.8718 KOps/s $\color{#35bf28}+0.80\%$
test_step_mdp_speed[True-False-False-False-False] 56.4050μs 13.7559μs 72.6960 KOps/s 73.2538 KOps/s $\color{#d91a1a}-0.76\%$
test_step_mdp_speed[False-True-True-True-True] 65.3720μs 34.2702μs 29.1799 KOps/s 29.0105 KOps/s $\color{#35bf28}+0.58\%$
test_step_mdp_speed[False-True-True-True-False] 0.2680ms 22.1973μs 45.0506 KOps/s 46.4843 KOps/s $\color{#d91a1a}-3.08\%$
test_step_mdp_speed[False-True-True-False-True] 52.8090μs 22.1158μs 45.2164 KOps/s 46.2275 KOps/s $\color{#d91a1a}-2.19\%$
test_step_mdp_speed[False-True-True-False-False] 49.4930μs 13.2510μs 75.4660 KOps/s 75.9425 KOps/s $\color{#d91a1a}-0.63\%$
test_step_mdp_speed[False-True-False-True-True] 83.3960μs 35.8393μs 27.9023 KOps/s 26.5788 KOps/s $\color{#35bf28}+4.98\%$
test_step_mdp_speed[False-True-False-True-False] 60.4130μs 23.5355μs 42.4890 KOps/s 43.0948 KOps/s $\color{#d91a1a}-1.41\%$
test_step_mdp_speed[False-True-False-False-True] 2.5203ms 24.2250μs 41.2796 KOps/s 42.3690 KOps/s $\color{#d91a1a}-2.57\%$
test_step_mdp_speed[False-True-False-False-False] 0.2241ms 15.5154μs 64.4522 KOps/s 64.4129 KOps/s $\color{#35bf28}+0.06\%$
test_step_mdp_speed[False-False-True-True-True] 0.7402ms 38.0295μs 26.2954 KOps/s 25.7484 KOps/s $\color{#35bf28}+2.12\%$
test_step_mdp_speed[False-False-True-True-False] 97.9630μs 25.4338μs 39.3177 KOps/s 39.6632 KOps/s $\color{#d91a1a}-0.87\%$
test_step_mdp_speed[False-False-True-False-True] 57.8380μs 23.0583μs 43.3683 KOps/s 42.6570 KOps/s $\color{#35bf28}+1.67\%$
test_step_mdp_speed[False-False-True-False-False] 47.5990μs 15.0004μs 66.6649 KOps/s 66.6856 KOps/s $\color{#d91a1a}-0.03\%$
test_step_mdp_speed[False-False-False-True-True] 83.3550μs 39.2960μs 25.4479 KOps/s 25.6187 KOps/s $\color{#d91a1a}-0.67\%$
test_step_mdp_speed[False-False-False-True-False] 66.5550μs 26.8864μs 37.1935 KOps/s 37.5363 KOps/s $\color{#d91a1a}-0.91\%$
test_step_mdp_speed[False-False-False-False-True] 0.1391ms 25.0870μs 39.8613 KOps/s 39.8739 KOps/s $\color{#d91a1a}-0.03\%$
test_step_mdp_speed[False-False-False-False-False] 0.2847ms 16.7392μs 59.7400 KOps/s 60.5186 KOps/s $\color{#d91a1a}-1.29\%$
test_values[generalized_advantage_estimate-True-True] 10.4208ms 9.7192ms 102.8886 Ops/s 100.5272 Ops/s $\color{#35bf28}+2.35\%$
test_values[vec_generalized_advantage_estimate-True-True] 28.6047ms 26.8470ms 37.2481 Ops/s 41.1102 Ops/s $\textbf{\color{#d91a1a}-9.39\%}$
test_values[td0_return_estimate-False-False] 0.2297ms 0.1926ms 5.1924 KOps/s 5.2342 KOps/s $\color{#d91a1a}-0.80\%$
test_values[td1_return_estimate-False-False] 41.4924ms 24.6369ms 40.5895 Ops/s 40.3368 Ops/s $\color{#35bf28}+0.63\%$
test_values[vec_td1_return_estimate-False-False] 29.7956ms 26.8067ms 37.3041 Ops/s 40.4537 Ops/s $\textbf{\color{#d91a1a}-7.79\%}$
test_values[td_lambda_return_estimate-True-False] 36.9356ms 34.1233ms 29.3055 Ops/s 27.9246 Ops/s $\color{#35bf28}+4.95\%$
test_values[vec_td_lambda_return_estimate-True-False] 30.2751ms 26.8685ms 37.2183 Ops/s 39.4248 Ops/s $\textbf{\color{#d91a1a}-5.60\%}$
test_gae_speed[generalized_advantage_estimate-False-1-512] 8.6663ms 8.5068ms 117.5528 Ops/s 115.8715 Ops/s $\color{#35bf28}+1.45\%$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 3.2238ms 1.9635ms 509.2838 Ops/s 516.5014 Ops/s $\color{#d91a1a}-1.40\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 1.1707ms 0.3740ms 2.6738 KOps/s 2.7098 KOps/s $\color{#d91a1a}-1.33\%$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 61.2176ms 46.7682ms 21.3820 Ops/s 24.4876 Ops/s $\textbf{\color{#d91a1a}-12.68\%}$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 4.2383ms 3.4416ms 290.5620 Ops/s 289.3004 Ops/s $\color{#35bf28}+0.44\%$
test_dqn_speed[False-None] 6.1410ms 1.3986ms 715.0251 Ops/s 719.6504 Ops/s $\color{#d91a1a}-0.64\%$
test_dqn_speed[False-backward] 1.9462ms 1.8712ms 534.4119 Ops/s 536.1708 Ops/s $\color{#d91a1a}-0.33\%$
test_dqn_speed[True-None] 0.7515ms 0.4792ms 2.0866 KOps/s 2.0806 KOps/s $\color{#35bf28}+0.29\%$
test_dqn_speed[True-backward] 0.9552ms 0.9016ms 1.1091 KOps/s 776.6173 Ops/s $\textbf{\color{#35bf28}+42.81\%}$
test_dqn_speed[reduce-overhead-None] 0.6676ms 0.4815ms 2.0770 KOps/s 2.0656 KOps/s $\color{#35bf28}+0.55\%$
test_dqn_speed[reduce-overhead-backward] 1.2132ms 0.9128ms 1.0956 KOps/s 1.0882 KOps/s $\color{#35bf28}+0.68\%$
test_ddpg_speed[False-None] 3.4527ms 2.8685ms 348.6091 Ops/s 345.7311 Ops/s $\color{#35bf28}+0.83\%$
test_ddpg_speed[False-backward] 4.2677ms 4.0498ms 246.9233 Ops/s 248.8861 Ops/s $\color{#d91a1a}-0.79\%$
test_ddpg_speed[True-None] 1.4759ms 1.2175ms 821.3349 Ops/s 811.8731 Ops/s $\color{#35bf28}+1.17\%$
test_ddpg_speed[True-backward] 2.1979ms 2.1195ms 471.8080 Ops/s 443.5090 Ops/s $\textbf{\color{#35bf28}+6.38\%}$
test_ddpg_speed[reduce-overhead-None] 1.5679ms 1.2207ms 819.2303 Ops/s 792.5249 Ops/s $\color{#35bf28}+3.37\%$
test_ddpg_speed[reduce-overhead-backward] 2.2072ms 2.1393ms 467.4422 Ops/s 454.6410 Ops/s $\color{#35bf28}+2.82\%$
test_sac_speed[False-None] 8.9919ms 7.8911ms 126.7252 Ops/s 123.2324 Ops/s $\color{#35bf28}+2.83\%$
test_sac_speed[False-backward] 12.9780ms 10.5907ms 94.4228 Ops/s 91.4453 Ops/s $\color{#35bf28}+3.26\%$
test_sac_speed[True-None] 2.3808ms 2.1231ms 470.9989 Ops/s 472.5743 Ops/s $\color{#d91a1a}-0.33\%$
test_sac_speed[True-backward] 3.9642ms 3.7917ms 263.7319 Ops/s 261.3236 Ops/s $\color{#35bf28}+0.92\%$
test_sac_speed[reduce-overhead-None] 2.7322ms 2.1767ms 459.4075 Ops/s 469.5426 Ops/s $\color{#d91a1a}-2.16\%$
test_sac_speed[reduce-overhead-backward] 4.3581ms 4.0795ms 245.1288 Ops/s 249.7981 Ops/s $\color{#d91a1a}-1.87\%$
test_redq_speed[False-None] 13.8575ms 12.8668ms 77.7194 Ops/s 74.9922 Ops/s $\color{#35bf28}+3.64\%$
test_redq_speed[False-backward] 24.1356ms 22.2400ms 44.9640 Ops/s 43.1865 Ops/s $\color{#35bf28}+4.12\%$
test_redq_speed[True-None] 6.2554ms 5.4514ms 183.4393 Ops/s 167.4342 Ops/s $\textbf{\color{#35bf28}+9.56\%}$
test_redq_speed[True-backward] 14.2445ms 12.8575ms 77.7757 Ops/s 78.7333 Ops/s $\color{#d91a1a}-1.22\%$
test_redq_speed[reduce-overhead-None] 6.0268ms 4.9669ms 201.3346 Ops/s 200.6290 Ops/s $\color{#35bf28}+0.35\%$
test_redq_speed[reduce-overhead-backward] 13.0771ms 12.4683ms 80.2034 Ops/s 78.5315 Ops/s $\color{#35bf28}+2.13\%$
test_redq_deprec_speed[False-None] 13.6813ms 12.5988ms 79.3726 Ops/s 76.8959 Ops/s $\color{#35bf28}+3.22\%$
test_redq_deprec_speed[False-backward] 19.4959ms 18.7341ms 53.3785 Ops/s 53.5994 Ops/s $\color{#d91a1a}-0.41\%$
test_redq_deprec_speed[True-None] 4.6587ms 3.9490ms 253.2307 Ops/s 252.1765 Ops/s $\color{#35bf28}+0.42\%$
test_redq_deprec_speed[True-backward] 8.3996ms 8.2273ms 121.5463 Ops/s 107.8944 Ops/s $\textbf{\color{#35bf28}+12.65\%}$
test_redq_deprec_speed[reduce-overhead-None] 4.2996ms 3.9072ms 255.9382 Ops/s 241.6580 Ops/s $\textbf{\color{#35bf28}+5.91\%}$
test_redq_deprec_speed[reduce-overhead-backward] 8.5059ms 8.2687ms 120.9384 Ops/s 106.1962 Ops/s $\textbf{\color{#35bf28}+13.88\%}$
test_td3_speed[False-None] 9.4944ms 7.9603ms 125.6235 Ops/s 120.8004 Ops/s $\color{#35bf28}+3.99\%$
test_td3_speed[False-backward] 10.5965ms 10.1782ms 98.2491 Ops/s 91.5476 Ops/s $\textbf{\color{#35bf28}+7.32\%}$
test_td3_speed[True-None] 2.1117ms 1.8261ms 547.6110 Ops/s 539.0829 Ops/s $\color{#35bf28}+1.58\%$
test_td3_speed[True-backward] 3.5133ms 3.4556ms 289.3843 Ops/s 266.8854 Ops/s $\textbf{\color{#35bf28}+8.43\%}$
test_td3_speed[reduce-overhead-None] 2.4114ms 1.8445ms 542.1410 Ops/s 527.9879 Ops/s $\color{#35bf28}+2.68\%$
test_td3_speed[reduce-overhead-backward] 3.5125ms 3.4167ms 292.6839 Ops/s 287.1358 Ops/s $\color{#35bf28}+1.93\%$
test_cql_speed[False-None] 38.2375ms 36.0878ms 27.7102 Ops/s 27.1707 Ops/s $\color{#35bf28}+1.99\%$
test_cql_speed[False-backward] 56.4670ms 46.9243ms 21.3109 Ops/s 21.1212 Ops/s $\color{#35bf28}+0.90\%$
test_cql_speed[True-None] 19.0144ms 16.5968ms 60.2525 Ops/s 61.8935 Ops/s $\color{#d91a1a}-2.65\%$
test_cql_speed[True-backward] 23.9220ms 22.9921ms 43.4932 Ops/s 42.0258 Ops/s $\color{#35bf28}+3.49\%$
test_cql_speed[reduce-overhead-None] 17.8156ms 16.5494ms 60.4253 Ops/s 62.8780 Ops/s $\color{#d91a1a}-3.90\%$
test_cql_speed[reduce-overhead-backward] 24.1461ms 22.7035ms 44.0461 Ops/s 44.3484 Ops/s $\color{#d91a1a}-0.68\%$
test_a2c_speed[False-None] 7.8977ms 7.0685ms 141.4726 Ops/s 137.5529 Ops/s $\color{#35bf28}+2.85\%$
test_a2c_speed[False-backward] 15.1365ms 14.1802ms 70.5207 Ops/s 69.7058 Ops/s $\color{#35bf28}+1.17\%$
test_a2c_speed[True-None] 4.3270ms 3.7902ms 263.8385 Ops/s 266.5300 Ops/s $\color{#d91a1a}-1.01\%$
test_a2c_speed[True-backward] 11.5174ms 10.6045ms 94.2994 Ops/s 97.9295 Ops/s $\color{#d91a1a}-3.71\%$
test_a2c_speed[reduce-overhead-None] 4.8019ms 3.7760ms 264.8324 Ops/s 267.2050 Ops/s $\color{#d91a1a}-0.89\%$
test_a2c_speed[reduce-overhead-backward] 10.9808ms 10.2834ms 97.2439 Ops/s 98.5642 Ops/s $\color{#d91a1a}-1.34\%$
test_ppo_speed[False-None] 8.8496ms 7.4709ms 133.8530 Ops/s 131.3735 Ops/s $\color{#35bf28}+1.89\%$
test_ppo_speed[False-backward] 15.7721ms 14.8881ms 67.1678 Ops/s 66.6801 Ops/s $\color{#35bf28}+0.73\%$
test_ppo_speed[True-None] 4.4820ms 4.1675ms 239.9549 Ops/s 239.6183 Ops/s $\color{#35bf28}+0.14\%$
test_ppo_speed[True-backward] 11.0653ms 10.6113ms 94.2393 Ops/s 91.7197 Ops/s $\color{#35bf28}+2.75\%$
test_ppo_speed[reduce-overhead-None] 4.7519ms 4.2003ms 238.0811 Ops/s 227.4023 Ops/s $\color{#35bf28}+4.70\%$
test_ppo_speed[reduce-overhead-backward] 11.4257ms 10.6974ms 93.4810 Ops/s 93.7579 Ops/s $\color{#d91a1a}-0.30\%$
test_reinforce_speed[False-None] 7.6358ms 6.7871ms 147.3378 Ops/s 145.6181 Ops/s $\color{#35bf28}+1.18\%$
test_reinforce_speed[False-backward] 11.0547ms 10.1398ms 98.6209 Ops/s 97.6102 Ops/s $\color{#35bf28}+1.04\%$
test_reinforce_speed[True-None] 3.7531ms 3.3051ms 302.5618 Ops/s 320.0344 Ops/s $\textbf{\color{#d91a1a}-5.46\%}$
test_reinforce_speed[True-backward] 11.3852ms 9.1767ms 108.9712 Ops/s 106.0632 Ops/s $\color{#35bf28}+2.74\%$
test_reinforce_speed[reduce-overhead-None] 3.5236ms 3.1142ms 321.1079 Ops/s 274.8492 Ops/s $\textbf{\color{#35bf28}+16.83\%}$
test_reinforce_speed[reduce-overhead-backward] 9.4139ms 8.9353ms 111.9155 Ops/s 110.6281 Ops/s $\color{#35bf28}+1.16\%$
test_iql_speed[False-None] 33.0095ms 31.8729ms 31.3746 Ops/s 30.0169 Ops/s $\color{#35bf28}+4.52\%$
test_iql_speed[False-backward] 47.2171ms 45.3304ms 22.0602 Ops/s 21.6566 Ops/s $\color{#35bf28}+1.86\%$
test_iql_speed[True-None] 19.0316ms 11.9688ms 83.5505 Ops/s 88.6822 Ops/s $\textbf{\color{#d91a1a}-5.79\%}$
test_iql_speed[True-backward] 25.3164ms 23.6116ms 42.3521 Ops/s 45.4686 Ops/s $\textbf{\color{#d91a1a}-6.85\%}$
test_iql_speed[reduce-overhead-None] 12.9469ms 12.2153ms 81.8643 Ops/s 88.2108 Ops/s $\textbf{\color{#d91a1a}-7.19\%}$
test_iql_speed[reduce-overhead-backward] 24.3539ms 23.7605ms 42.0867 Ops/s 45.5380 Ops/s $\textbf{\color{#d91a1a}-7.58\%}$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 6.6445ms 5.1090ms 195.7328 Ops/s 206.0408 Ops/s $\textbf{\color{#d91a1a}-5.00\%}$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.7632ms 0.5362ms 1.8649 KOps/s 1.8563 KOps/s $\color{#35bf28}+0.47\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.8773ms 0.5205ms 1.9212 KOps/s 1.9523 KOps/s $\color{#d91a1a}-1.59\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 5.5626ms 5.0152ms 199.3947 Ops/s 219.7928 Ops/s $\textbf{\color{#d91a1a}-9.28\%}$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 2.6906ms 0.5355ms 1.8675 KOps/s 1.9300 KOps/s $\color{#d91a1a}-3.24\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.7702ms 0.5066ms 1.9740 KOps/s 1.9921 KOps/s $\color{#d91a1a}-0.91\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] 2.4278ms 1.6956ms 589.7633 Ops/s 583.3127 Ops/s $\color{#35bf28}+1.11\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] 3.1236ms 1.6762ms 596.5705 Ops/s 614.5524 Ops/s $\color{#d91a1a}-2.93\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 7.8450ms 5.2423ms 190.7572 Ops/s 210.9237 Ops/s $\textbf{\color{#d91a1a}-9.56\%}$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 1.6644ms 0.6891ms 1.4511 KOps/s 1.4911 KOps/s $\color{#d91a1a}-2.68\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.9329ms 0.6573ms 1.5214 KOps/s 1.5481 KOps/s $\color{#d91a1a}-1.72\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 5.3473ms 5.0009ms 199.9654 Ops/s 215.4381 Ops/s $\textbf{\color{#d91a1a}-7.18\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 2.7749ms 0.5533ms 1.8073 KOps/s 1.8835 KOps/s $\color{#d91a1a}-4.05\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.8630ms 0.5277ms 1.8951 KOps/s 1.9435 KOps/s $\color{#d91a1a}-2.49\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 7.5457ms 5.0654ms 197.4159 Ops/s 219.1743 Ops/s $\textbf{\color{#d91a1a}-9.93\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 2.4477ms 0.5350ms 1.8693 KOps/s 1.9070 KOps/s $\color{#d91a1a}-1.98\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.7771ms 0.5095ms 1.9626 KOps/s 2.0227 KOps/s $\color{#d91a1a}-2.97\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 7.6621ms 5.1498ms 194.1810 Ops/s 200.1341 Ops/s $\color{#d91a1a}-2.97\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 3.8303ms 0.6890ms 1.4514 KOps/s 1.4810 KOps/s $\color{#d91a1a}-2.00\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.9818ms 0.6557ms 1.5251 KOps/s 1.5614 KOps/s $\color{#d91a1a}-2.32\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 8.2349ms 4.4156ms 226.4675 Ops/s 236.6546 Ops/s $\color{#d91a1a}-4.30\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 3.5176ms 2.2562ms 443.2309 Ops/s 430.8108 Ops/s $\color{#35bf28}+2.88\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 7.6655ms 1.4637ms 683.2232 Ops/s 771.5922 Ops/s $\textbf{\color{#d91a1a}-11.45\%}$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 0.4757s 13.8692ms 72.1023 Ops/s 248.2604 Ops/s $\textbf{\color{#d91a1a}-70.96\%}$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 6.9521ms 2.3426ms 426.8767 Ops/s 452.5839 Ops/s $\textbf{\color{#d91a1a}-5.68\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 5.0539ms 1.3637ms 733.3072 Ops/s 795.9442 Ops/s $\textbf{\color{#d91a1a}-7.87\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 6.0869ms 4.5410ms 220.2145 Ops/s 33.7376 Ops/s $\textbf{\color{#35bf28}+552.73\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 10.0138ms 2.5873ms 386.5012 Ops/s 399.3754 Ops/s $\color{#d91a1a}-3.22\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 5.2344ms 1.5409ms 648.9717 Ops/s 725.0325 Ops/s $\textbf{\color{#d91a1a}-10.49\%}$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-True] 12.0967ms 11.6803ms 85.6146 Ops/s 80.8914 Ops/s $\textbf{\color{#35bf28}+5.84\%}$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-False] 15.3256ms 14.3423ms 69.7240 Ops/s 67.9475 Ops/s $\color{#35bf28}+2.61\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-True] 30.3123ms 21.0517ms 47.5021 Ops/s 46.0149 Ops/s $\color{#35bf28}+3.23\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-False] 16.3830ms 14.6303ms 68.3513 Ops/s 67.5071 Ops/s $\color{#35bf28}+1.25\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-True] 21.3575ms 20.3657ms 49.1023 Ops/s 47.8736 Ops/s $\color{#35bf28}+2.57\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-False] 16.9811ms 16.0039ms 62.4849 Ops/s 61.5648 Ops/s $\color{#35bf28}+1.49\%$

Copy link

github-actions bot commented Feb 5, 2025

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 149. Improved: $\large\color{#35bf28}15$. Worsened: $\large\color{#d91a1a}14$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_simple 0.8217s 0.7344s 1.3617 Ops/s 1.3484 Ops/s $\color{#35bf28}+0.98\%$
test_transformed 1.2800s 1.2765s 0.7834 Ops/s 0.7466 Ops/s $\color{#35bf28}+4.93\%$
test_serial 2.1195s 2.1161s 0.4726 Ops/s 0.4656 Ops/s $\color{#35bf28}+1.49\%$
test_parallel 1.8693s 1.8401s 0.5435 Ops/s 0.5445 Ops/s $\color{#d91a1a}-0.18\%$
test_step_mdp_speed[True-True-True-True-True] 0.1896ms 40.0522μs 24.9674 KOps/s 25.6906 KOps/s $\color{#d91a1a}-2.82\%$
test_step_mdp_speed[True-True-True-True-False] 54.9510μs 23.2153μs 43.0751 KOps/s 43.5234 KOps/s $\color{#d91a1a}-1.03\%$
test_step_mdp_speed[True-True-True-False-True] 47.9610μs 21.7461μs 45.9853 KOps/s 44.8354 KOps/s $\color{#35bf28}+2.56\%$
test_step_mdp_speed[True-True-True-False-False] 0.1060ms 12.7147μs 78.6490 KOps/s 77.9493 KOps/s $\color{#35bf28}+0.90\%$
test_step_mdp_speed[True-True-False-True-True] 74.4510μs 42.2984μs 23.6416 KOps/s 23.6222 KOps/s $\color{#35bf28}+0.08\%$
test_step_mdp_speed[True-True-False-True-False] 55.2010μs 25.2347μs 39.6279 KOps/s 38.9486 KOps/s $\color{#35bf28}+1.74\%$
test_step_mdp_speed[True-True-False-False-True] 54.4010μs 24.4965μs 40.8221 KOps/s 40.6561 KOps/s $\color{#35bf28}+0.41\%$
test_step_mdp_speed[True-True-False-False-False] 42.4010μs 15.1960μs 65.8066 KOps/s 64.6001 KOps/s $\color{#35bf28}+1.87\%$
test_step_mdp_speed[True-False-True-True-True] 0.1489ms 44.2785μs 22.5843 KOps/s 22.0028 KOps/s $\color{#35bf28}+2.64\%$
test_step_mdp_speed[True-False-True-True-False] 50.5110μs 27.9650μs 35.7589 KOps/s 35.9346 KOps/s $\color{#d91a1a}-0.49\%$
test_step_mdp_speed[True-False-True-False-True] 52.5010μs 24.4591μs 40.8845 KOps/s 41.0866 KOps/s $\color{#d91a1a}-0.49\%$
test_step_mdp_speed[True-False-True-False-False] 47.1300μs 15.3462μs 65.1626 KOps/s 65.4220 KOps/s $\color{#d91a1a}-0.40\%$
test_step_mdp_speed[True-False-False-True-True] 0.2126ms 46.2846μs 21.6055 KOps/s 21.2045 KOps/s $\color{#35bf28}+1.89\%$
test_step_mdp_speed[True-False-False-True-False] 58.6010μs 30.1570μs 33.1598 KOps/s 32.9700 KOps/s $\color{#35bf28}+0.58\%$
test_step_mdp_speed[True-False-False-False-True] 0.1144ms 26.5393μs 37.6800 KOps/s 37.4711 KOps/s $\color{#35bf28}+0.56\%$
test_step_mdp_speed[True-False-False-False-False] 43.2910μs 17.4354μs 57.3547 KOps/s 57.2770 KOps/s $\color{#35bf28}+0.14\%$
test_step_mdp_speed[False-True-True-True-True] 75.1410μs 44.4190μs 22.5129 KOps/s 22.4745 KOps/s $\color{#35bf28}+0.17\%$
test_step_mdp_speed[False-True-True-True-False] 59.4610μs 27.7386μs 36.0508 KOps/s 35.7123 KOps/s $\color{#35bf28}+0.95\%$
test_step_mdp_speed[False-True-True-False-True] 65.4810μs 28.2182μs 35.4381 KOps/s 35.7770 KOps/s $\color{#d91a1a}-0.95\%$
test_step_mdp_speed[False-True-True-False-False] 0.1244ms 17.0206μs 58.7524 KOps/s 59.0991 KOps/s $\color{#d91a1a}-0.59\%$
test_step_mdp_speed[False-True-False-True-True] 81.6820μs 47.1473μs 21.2101 KOps/s 20.9077 KOps/s $\color{#35bf28}+1.45\%$
test_step_mdp_speed[False-True-False-True-False] 60.4310μs 30.4513μs 32.8393 KOps/s 32.7700 KOps/s $\color{#35bf28}+0.21\%$
test_step_mdp_speed[False-True-False-False-True] 3.2494ms 31.0383μs 32.2182 KOps/s 31.7338 KOps/s $\color{#35bf28}+1.53\%$
test_step_mdp_speed[False-True-False-False-False] 50.4110μs 19.4416μs 51.4362 KOps/s 51.5283 KOps/s $\color{#d91a1a}-0.18\%$
test_step_mdp_speed[False-False-True-True-True] 83.8210μs 49.6114μs 20.1567 KOps/s 20.1761 KOps/s $\color{#d91a1a}-0.10\%$
test_step_mdp_speed[False-False-True-True-False] 0.1310ms 32.8062μs 30.4820 KOps/s 31.2577 KOps/s $\color{#d91a1a}-2.48\%$
test_step_mdp_speed[False-False-True-False-True] 59.6110μs 30.6582μs 32.6178 KOps/s 33.1884 KOps/s $\color{#d91a1a}-1.72\%$
test_step_mdp_speed[False-False-True-False-False] 47.0310μs 19.5235μs 51.2204 KOps/s 52.4702 KOps/s $\color{#d91a1a}-2.38\%$
test_step_mdp_speed[False-False-False-True-True] 80.5310μs 51.5780μs 19.3881 KOps/s 19.6079 KOps/s $\color{#d91a1a}-1.12\%$
test_step_mdp_speed[False-False-False-True-False] 71.3410μs 34.9544μs 28.6087 KOps/s 29.1756 KOps/s $\color{#d91a1a}-1.94\%$
test_step_mdp_speed[False-False-False-False-True] 66.1010μs 31.8422μs 31.4049 KOps/s 31.3626 KOps/s $\color{#35bf28}+0.13\%$
test_step_mdp_speed[False-False-False-False-False] 57.8310μs 21.4499μs 46.6202 KOps/s 47.8395 KOps/s $\color{#d91a1a}-2.55\%$
test_values[generalized_advantage_estimate-True-True] 24.7142ms 24.1658ms 41.3808 Ops/s 39.8549 Ops/s $\color{#35bf28}+3.83\%$
test_values[vec_generalized_advantage_estimate-True-True] 0.1027s 2.9396ms 340.1815 Ops/s 323.4308 Ops/s $\textbf{\color{#35bf28}+5.18\%}$
test_values[td0_return_estimate-False-False] 0.1013ms 77.5695μs 12.8917 KOps/s 12.1614 KOps/s $\textbf{\color{#35bf28}+6.01\%}$
test_values[td1_return_estimate-False-False] 54.6766ms 54.1818ms 18.4564 Ops/s 17.6689 Ops/s $\color{#35bf28}+4.46\%$
test_values[vec_td1_return_estimate-False-False] 1.2998ms 1.0676ms 936.7233 Ops/s 926.5109 Ops/s $\color{#35bf28}+1.10\%$
test_values[td_lambda_return_estimate-True-False] 86.0845ms 85.6362ms 11.6773 Ops/s 11.3931 Ops/s $\color{#35bf28}+2.49\%$
test_values[vec_td_lambda_return_estimate-True-False] 1.3844ms 1.0687ms 935.6996 Ops/s 939.4698 Ops/s $\color{#d91a1a}-0.40\%$
test_gae_speed[generalized_advantage_estimate-False-1-512] 24.2945ms 24.1087ms 41.4789 Ops/s 41.3331 Ops/s $\color{#35bf28}+0.35\%$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 1.0363ms 0.7367ms 1.3575 KOps/s 1.3610 KOps/s $\color{#d91a1a}-0.26\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 0.7714ms 0.6539ms 1.5294 KOps/s 1.5347 KOps/s $\color{#d91a1a}-0.35\%$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 1.5127ms 1.4725ms 679.1032 Ops/s 681.4160 Ops/s $\color{#d91a1a}-0.34\%$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 0.7117ms 0.6679ms 1.4973 KOps/s 1.5011 KOps/s $\color{#d91a1a}-0.26\%$
test_dqn_speed[False-None] 1.5808ms 1.4897ms 671.2760 Ops/s 660.4423 Ops/s $\color{#35bf28}+1.64\%$
test_dqn_speed[False-backward] 2.1613ms 2.1052ms 475.0091 Ops/s 444.6136 Ops/s $\textbf{\color{#35bf28}+6.84\%}$
test_dqn_speed[True-None] 0.6199ms 0.5450ms 1.8348 KOps/s 1.8123 KOps/s $\color{#35bf28}+1.24\%$
test_dqn_speed[True-backward] 1.2713ms 1.2055ms 829.5542 Ops/s 901.6204 Ops/s $\textbf{\color{#d91a1a}-7.99\%}$
test_dqn_speed[reduce-overhead-None] 0.6245ms 0.5625ms 1.7777 KOps/s 1.7688 KOps/s $\color{#35bf28}+0.50\%$
test_dqn_speed[reduce-overhead-backward] 1.1143ms 1.0526ms 950.0046 Ops/s 1.0479 KOps/s $\textbf{\color{#d91a1a}-9.34\%}$
test_ddpg_speed[False-None] 3.1453ms 2.8289ms 353.4940 Ops/s 345.1426 Ops/s $\color{#35bf28}+2.42\%$
test_ddpg_speed[False-backward] 4.6663ms 4.1975ms 238.2355 Ops/s 240.7731 Ops/s $\color{#d91a1a}-1.05\%$
test_ddpg_speed[True-None] 1.3811ms 1.3158ms 759.9848 Ops/s 752.0978 Ops/s $\color{#35bf28}+1.05\%$
test_ddpg_speed[True-backward] 2.6134ms 2.5369ms 394.1758 Ops/s 407.4037 Ops/s $\color{#d91a1a}-3.25\%$
test_ddpg_speed[reduce-overhead-None] 1.5928ms 1.3581ms 736.3069 Ops/s 741.8093 Ops/s $\color{#d91a1a}-0.74\%$
test_ddpg_speed[reduce-overhead-backward] 2.0616ms 2.0066ms 498.3571 Ops/s 530.0118 Ops/s $\textbf{\color{#d91a1a}-5.97\%}$
test_sac_speed[False-None] 8.2761ms 7.8693ms 127.0760 Ops/s 121.3068 Ops/s $\color{#35bf28}+4.76\%$
test_sac_speed[False-backward] 11.4724ms 10.9972ms 90.9322 Ops/s 89.2612 Ops/s $\color{#35bf28}+1.87\%$
test_sac_speed[True-None] 1.9867ms 1.8072ms 553.3447 Ops/s 550.1775 Ops/s $\color{#35bf28}+0.58\%$
test_sac_speed[True-backward] 3.7570ms 3.6552ms 273.5807 Ops/s 269.7800 Ops/s $\color{#35bf28}+1.41\%$
test_sac_speed[reduce-overhead-None] 21.2976ms 12.1306ms 82.4363 Ops/s 82.6258 Ops/s $\color{#d91a1a}-0.23\%$
test_sac_speed[reduce-overhead-backward] 1.8140ms 1.7605ms 568.0129 Ops/s 552.0663 Ops/s $\color{#35bf28}+2.89\%$
test_redq_speed[False-None] 7.8066ms 7.3810ms 135.4837 Ops/s 131.6032 Ops/s $\color{#35bf28}+2.95\%$
test_redq_speed[False-backward] 11.9880ms 11.5208ms 86.7992 Ops/s 84.5144 Ops/s $\color{#35bf28}+2.70\%$
test_redq_speed[True-None] 2.4415ms 2.2797ms 438.6575 Ops/s 421.1152 Ops/s $\color{#35bf28}+4.17\%$
test_redq_speed[True-backward] 4.5179ms 4.0867ms 244.6952 Ops/s 250.5681 Ops/s $\color{#d91a1a}-2.34\%$
test_redq_speed[reduce-overhead-None] 2.4245ms 2.2768ms 439.2051 Ops/s 433.2323 Ops/s $\color{#35bf28}+1.38\%$
test_redq_speed[reduce-overhead-backward] 4.2832ms 4.0978ms 244.0332 Ops/s 249.2586 Ops/s $\color{#d91a1a}-2.10\%$
test_redq_deprec_speed[False-None] 9.4608ms 9.0027ms 111.0775 Ops/s 109.3304 Ops/s $\color{#35bf28}+1.60\%$
test_redq_deprec_speed[False-backward] 12.7803ms 12.1487ms 82.3136 Ops/s 82.4126 Ops/s $\color{#d91a1a}-0.12\%$
test_redq_deprec_speed[True-None] 2.7328ms 2.5801ms 387.5832 Ops/s 381.9959 Ops/s $\color{#35bf28}+1.46\%$
test_redq_deprec_speed[True-backward] 4.7954ms 4.4039ms 227.0701 Ops/s 228.8964 Ops/s $\color{#d91a1a}-0.80\%$
test_redq_deprec_speed[reduce-overhead-None] 2.6975ms 2.5753ms 388.3046 Ops/s 379.1005 Ops/s $\color{#35bf28}+2.43\%$
test_redq_deprec_speed[reduce-overhead-backward] 4.8006ms 4.3967ms 227.4440 Ops/s 229.5353 Ops/s $\color{#d91a1a}-0.91\%$
test_td3_speed[False-None] 7.8673ms 7.8269ms 127.7643 Ops/s 124.4115 Ops/s $\color{#35bf28}+2.69\%$
test_td3_speed[False-backward] 11.1481ms 10.3463ms 96.6529 Ops/s 96.2296 Ops/s $\color{#35bf28}+0.44\%$
test_td3_speed[True-None] 1.6340ms 1.6127ms 620.0842 Ops/s 602.2106 Ops/s $\color{#35bf28}+2.97\%$
test_td3_speed[True-backward] 3.3342ms 3.2769ms 305.1647 Ops/s 316.4623 Ops/s $\color{#d91a1a}-3.57\%$
test_td3_speed[reduce-overhead-None] 50.2892ms 25.8283ms 38.7173 Ops/s 36.9634 Ops/s $\color{#35bf28}+4.75\%$
test_td3_speed[reduce-overhead-backward] 1.5249ms 1.4590ms 685.3884 Ops/s 719.8207 Ops/s $\color{#d91a1a}-4.78\%$
test_cql_speed[False-None] 16.9848ms 16.4752ms 60.6973 Ops/s 59.1474 Ops/s $\color{#35bf28}+2.62\%$
test_cql_speed[False-backward] 22.3244ms 21.8551ms 45.7559 Ops/s 45.6424 Ops/s $\color{#35bf28}+0.25\%$
test_cql_speed[True-None] 3.5678ms 3.2358ms 309.0408 Ops/s 308.3003 Ops/s $\color{#35bf28}+0.24\%$
test_cql_speed[True-backward] 6.1411ms 5.6233ms 177.8322 Ops/s 183.4110 Ops/s $\color{#d91a1a}-3.04\%$
test_cql_speed[reduce-overhead-None] 21.5559ms 13.1967ms 75.7763 Ops/s 75.7564 Ops/s $\color{#35bf28}+0.03\%$
test_cql_speed[reduce-overhead-backward] 2.1314ms 1.9870ms 503.2809 Ops/s 548.5413 Ops/s $\textbf{\color{#d91a1a}-8.25\%}$
test_a2c_speed[False-None] 3.2497ms 3.1411ms 318.3567 Ops/s 310.6199 Ops/s $\color{#35bf28}+2.49\%$
test_a2c_speed[False-backward] 6.8680ms 6.2373ms 160.3255 Ops/s 165.1970 Ops/s $\color{#d91a1a}-2.95\%$
test_a2c_speed[True-None] 1.3783ms 1.3275ms 753.2903 Ops/s 743.5919 Ops/s $\color{#35bf28}+1.30\%$
test_a2c_speed[True-backward] 3.1080ms 3.0079ms 332.4572 Ops/s 342.0719 Ops/s $\color{#d91a1a}-2.81\%$
test_a2c_speed[reduce-overhead-None] 15.9397ms 9.0740ms 110.2052 Ops/s 111.8408 Ops/s $\color{#d91a1a}-1.46\%$
test_a2c_speed[reduce-overhead-backward] 1.7194ms 1.5983ms 625.6575 Ops/s 680.7886 Ops/s $\textbf{\color{#d91a1a}-8.10\%}$
test_ppo_speed[False-None] 3.7617ms 3.6341ms 275.1728 Ops/s 268.6436 Ops/s $\color{#35bf28}+2.43\%$
test_ppo_speed[False-backward] 7.4197ms 6.9928ms 143.0034 Ops/s 146.8593 Ops/s $\color{#d91a1a}-2.63\%$
test_ppo_speed[True-None] 1.5989ms 1.3817ms 723.7705 Ops/s 708.8206 Ops/s $\color{#35bf28}+2.11\%$
test_ppo_speed[True-backward] 3.2474ms 3.1693ms 315.5225 Ops/s 323.4104 Ops/s $\color{#d91a1a}-2.44\%$
test_ppo_speed[reduce-overhead-None] 1.0308ms 0.9452ms 1.0580 KOps/s 1.0428 KOps/s $\color{#35bf28}+1.45\%$
test_ppo_speed[reduce-overhead-backward] 1.7447ms 1.5433ms 647.9649 Ops/s 693.6691 Ops/s $\textbf{\color{#d91a1a}-6.59\%}$
test_reinforce_speed[False-None] 2.4453ms 2.2560ms 443.2689 Ops/s 433.4913 Ops/s $\color{#35bf28}+2.26\%$
test_reinforce_speed[False-backward] 3.8381ms 3.3834ms 295.5569 Ops/s 301.8018 Ops/s $\color{#d91a1a}-2.07\%$
test_reinforce_speed[True-None] 1.4152ms 1.2805ms 780.9291 Ops/s 767.7134 Ops/s $\color{#35bf28}+1.72\%$
test_reinforce_speed[True-backward] 3.0595ms 3.0184ms 331.2996 Ops/s 342.6838 Ops/s $\color{#d91a1a}-3.32\%$
test_reinforce_speed[reduce-overhead-None] 18.1799ms 10.0117ms 99.8832 Ops/s 100.1087 Ops/s $\color{#d91a1a}-0.23\%$
test_reinforce_speed[reduce-overhead-backward] 1.6832ms 1.5946ms 627.1355 Ops/s 659.5828 Ops/s $\color{#d91a1a}-4.92\%$
test_iql_speed[False-None] 9.4326ms 9.0275ms 110.7729 Ops/s 106.9785 Ops/s $\color{#35bf28}+3.55\%$
test_iql_speed[False-backward] 13.3478ms 12.8518ms 77.8103 Ops/s 76.6317 Ops/s $\color{#35bf28}+1.54\%$
test_iql_speed[True-None] 2.3799ms 2.1896ms 456.6995 Ops/s 434.5576 Ops/s $\textbf{\color{#35bf28}+5.10\%}$
test_iql_speed[True-backward] 5.1529ms 4.7129ms 212.1817 Ops/s 201.2876 Ops/s $\textbf{\color{#35bf28}+5.41\%}$
test_iql_speed[reduce-overhead-None] 19.6466ms 11.1873ms 89.3870 Ops/s 90.3290 Ops/s $\color{#d91a1a}-1.04\%$
test_iql_speed[reduce-overhead-backward] 1.9579ms 1.8670ms 535.6267 Ops/s 468.0325 Ops/s $\textbf{\color{#35bf28}+14.44\%}$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 7.9532ms 6.3138ms 158.3821 Ops/s 156.8819 Ops/s $\color{#35bf28}+0.96\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.5708ms 0.3378ms 2.9603 KOps/s 3.3239 KOps/s $\textbf{\color{#d91a1a}-10.94\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.5831ms 0.3233ms 3.0935 KOps/s 3.4530 KOps/s $\textbf{\color{#d91a1a}-10.41\%}$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 6.2520ms 6.0035ms 166.5703 Ops/s 164.9997 Ops/s $\color{#35bf28}+0.95\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 1.1081ms 0.3157ms 3.1675 KOps/s 2.9909 KOps/s $\textbf{\color{#35bf28}+5.91\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.5529ms 0.2989ms 3.3460 KOps/s 3.5485 KOps/s $\textbf{\color{#d91a1a}-5.71\%}$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] 1.5056ms 1.3208ms 757.0887 Ops/s 790.5474 Ops/s $\color{#d91a1a}-4.23\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] 1.6173ms 1.3372ms 747.8457 Ops/s 833.3034 Ops/s $\textbf{\color{#d91a1a}-10.26\%}$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.3209ms 6.1952ms 161.4147 Ops/s 159.7187 Ops/s $\color{#35bf28}+1.06\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 2.3939ms 0.4664ms 2.1441 KOps/s 2.1814 KOps/s $\color{#d91a1a}-1.71\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.6722ms 0.4478ms 2.2331 KOps/s 2.4044 KOps/s $\textbf{\color{#d91a1a}-7.13\%}$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 6.2121ms 6.0533ms 165.1995 Ops/s 162.0679 Ops/s $\color{#35bf28}+1.93\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 1.0468ms 0.3458ms 2.8920 KOps/s 2.9376 KOps/s $\color{#d91a1a}-1.55\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 1.3562ms 0.3462ms 2.8885 KOps/s 3.4788 KOps/s $\textbf{\color{#d91a1a}-16.97\%}$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 8.7709ms 6.0119ms 166.3368 Ops/s 161.9272 Ops/s $\color{#35bf28}+2.72\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 0.5873ms 0.3145ms 3.1798 KOps/s 3.7234 KOps/s $\textbf{\color{#d91a1a}-14.60\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.5743ms 0.3005ms 3.3281 KOps/s 3.6050 KOps/s $\textbf{\color{#d91a1a}-7.68\%}$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.5243ms 6.1858ms 161.6605 Ops/s 157.9524 Ops/s $\color{#35bf28}+2.35\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 1.8046ms 0.4096ms 2.4412 KOps/s 2.2184 KOps/s $\textbf{\color{#35bf28}+10.05\%}$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.6062ms 0.3887ms 2.5725 KOps/s 2.1316 KOps/s $\textbf{\color{#35bf28}+20.68\%}$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 7.0325ms 5.4335ms 184.0426 Ops/s 181.8390 Ops/s $\color{#35bf28}+1.21\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 9.4778ms 2.0868ms 479.2038 Ops/s 431.1286 Ops/s $\textbf{\color{#35bf28}+11.15\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 7.4396ms 1.2278ms 814.4558 Ops/s 775.6981 Ops/s $\color{#35bf28}+5.00\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 7.0758ms 5.5607ms 179.8333 Ops/s 184.0021 Ops/s $\color{#d91a1a}-2.27\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 9.2492ms 2.0256ms 493.6831 Ops/s 427.3624 Ops/s $\textbf{\color{#35bf28}+15.52\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 6.9100ms 1.1676ms 856.4826 Ops/s 822.5844 Ops/s $\color{#35bf28}+4.12\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 0.4932s 15.4075ms 64.9034 Ops/s 31.4993 Ops/s $\textbf{\color{#35bf28}+106.05\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 10.4502ms 2.3042ms 433.9961 Ops/s 445.2292 Ops/s $\color{#d91a1a}-2.52\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 2.4087ms 1.2434ms 804.2416 Ops/s 725.8559 Ops/s $\textbf{\color{#35bf28}+10.80\%}$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-True] 13.7329ms 13.3091ms 75.1365 Ops/s 73.9226 Ops/s $\color{#35bf28}+1.64\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-False] 18.8460ms 17.2850ms 57.8535 Ops/s 58.2495 Ops/s $\color{#d91a1a}-0.68\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-True] 18.5426ms 18.0719ms 55.3346 Ops/s 54.3956 Ops/s $\color{#35bf28}+1.73\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-False] 18.6661ms 16.9068ms 59.1479 Ops/s 57.3434 Ops/s $\color{#35bf28}+3.15\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-True] 18.6007ms 17.5526ms 56.9718 Ops/s 53.3894 Ops/s $\textbf{\color{#35bf28}+6.71\%}$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-False] 19.4501ms 18.0009ms 55.5529 Ops/s 52.2701 Ops/s $\textbf{\color{#35bf28}+6.28\%}$

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants