Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[CI] Fix windows build #2760

Merged
merged 3 commits into from
Feb 5, 2025
Merged

[CI] Fix windows build #2760

merged 3 commits into from
Feb 5, 2025

Conversation

vmoens
Copy link
Contributor

@vmoens vmoens commented Feb 5, 2025

No description provided.

Copy link

pytorch-bot bot commented Feb 5, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/2760

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There are 1 currently active SEVs. If your PR is affected, please view them below:

⏳ 82 Pending, 1 Unrelated Failure

As of commit cd8079f with merge base ad7d2a1 (image):

FLAKY - The following job failed but was likely due to flakiness present on trunk:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Feb 5, 2025
@vmoens vmoens added ciflow/binaries/all Build all binaries and removed CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. labels Feb 5, 2025
Copy link

pytorch-bot bot commented Feb 5, 2025

No ciflow labels are configured for this repo.
For information on how to enable CIFlow bot see this wiki

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Feb 5, 2025
@vmoens vmoens force-pushed the fix-windows-wheels branch 2 times, most recently from ee7aded to b2e94c4 Compare February 5, 2025 16:02
Copy link

github-actions bot commented Feb 5, 2025

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 149. Improved: $\large\color{#35bf28}6$. Worsened: $\large\color{#d91a1a}6$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_simple 0.5368s 0.4530s 2.2075 Ops/s 2.2189 Ops/s $\color{#d91a1a}-0.51\%$
test_transformed 1.0132s 0.9290s 1.0765 Ops/s 1.0958 Ops/s $\color{#d91a1a}-1.76\%$
test_serial 1.4749s 1.3813s 0.7239 Ops/s 0.7211 Ops/s $\color{#35bf28}+0.39\%$
test_parallel 1.3036s 1.2161s 0.8223 Ops/s 0.8150 Ops/s $\color{#35bf28}+0.89\%$
test_step_mdp_speed[True-True-True-True-True] 0.1376ms 30.8175μs 32.4491 KOps/s 33.4888 KOps/s $\color{#d91a1a}-3.10\%$
test_step_mdp_speed[True-True-True-True-False] 53.1800μs 18.1657μs 55.0487 KOps/s 55.4250 KOps/s $\color{#d91a1a}-0.68\%$
test_step_mdp_speed[True-True-True-False-True] 57.8910μs 17.2123μs 58.0979 KOps/s 58.4917 KOps/s $\color{#d91a1a}-0.67\%$
test_step_mdp_speed[True-True-True-False-False] 40.9670μs 10.2149μs 97.8960 KOps/s 99.7441 KOps/s $\color{#d91a1a}-1.85\%$
test_step_mdp_speed[True-True-False-True-True] 80.7400μs 32.5986μs 30.6762 KOps/s 31.1214 KOps/s $\color{#d91a1a}-1.43\%$
test_step_mdp_speed[True-True-False-True-False] 46.3260μs 20.4962μs 48.7894 KOps/s 50.7728 KOps/s $\color{#d91a1a}-3.91\%$
test_step_mdp_speed[True-True-False-False-True] 69.3300μs 19.4288μs 51.4699 KOps/s 52.6891 KOps/s $\color{#d91a1a}-2.31\%$
test_step_mdp_speed[True-True-False-False-False] 40.0340μs 12.2420μs 81.6857 KOps/s 84.1916 KOps/s $\color{#d91a1a}-2.98\%$
test_step_mdp_speed[True-False-True-True-True] 0.6728ms 35.3069μs 28.3231 KOps/s 29.1744 KOps/s $\color{#d91a1a}-2.92\%$
test_step_mdp_speed[True-False-True-True-False] 74.6990μs 22.2826μs 44.8781 KOps/s 46.2834 KOps/s $\color{#d91a1a}-3.04\%$
test_step_mdp_speed[True-False-True-False-True] 43.4720μs 19.4524μs 51.4075 KOps/s 52.7753 KOps/s $\color{#d91a1a}-2.59\%$
test_step_mdp_speed[True-False-True-False-False] 53.8310μs 12.1331μs 82.4194 KOps/s 83.5897 KOps/s $\color{#d91a1a}-1.40\%$
test_step_mdp_speed[True-False-False-True-True] 79.7590μs 36.5881μs 27.3313 KOps/s 27.8981 KOps/s $\color{#d91a1a}-2.03\%$
test_step_mdp_speed[True-False-False-True-False] 68.6290μs 24.0928μs 41.5063 KOps/s 42.6484 KOps/s $\color{#d91a1a}-2.68\%$
test_step_mdp_speed[True-False-False-False-True] 70.2610μs 21.0942μs 47.4064 KOps/s 48.6928 KOps/s $\color{#d91a1a}-2.64\%$
test_step_mdp_speed[True-False-False-False-False] 52.5390μs 14.0878μs 70.9832 KOps/s 73.5391 KOps/s $\color{#d91a1a}-3.48\%$
test_step_mdp_speed[False-True-True-True-True] 71.7340μs 34.7981μs 28.7372 KOps/s 29.2536 KOps/s $\color{#d91a1a}-1.77\%$
test_step_mdp_speed[False-True-True-True-False] 72.6260μs 22.3050μs 44.8329 KOps/s 46.4338 KOps/s $\color{#d91a1a}-3.45\%$
test_step_mdp_speed[False-True-True-False-True] 51.9080μs 21.8278μs 45.8131 KOps/s 46.2670 KOps/s $\color{#d91a1a}-0.98\%$
test_step_mdp_speed[False-True-True-False-False] 63.2280μs 13.6530μs 73.2440 KOps/s 74.6873 KOps/s $\color{#d91a1a}-1.93\%$
test_step_mdp_speed[False-True-False-True-True] 74.8890μs 36.6113μs 27.3140 KOps/s 27.9116 KOps/s $\color{#d91a1a}-2.14\%$
test_step_mdp_speed[False-True-False-True-False] 69.9310μs 24.1310μs 41.4405 KOps/s 42.8487 KOps/s $\color{#d91a1a}-3.29\%$
test_step_mdp_speed[False-True-False-False-True] 2.5630ms 24.2399μs 41.2544 KOps/s 42.6560 KOps/s $\color{#d91a1a}-3.29\%$
test_step_mdp_speed[False-True-False-False-False] 59.5210μs 15.6714μs 63.8106 KOps/s 66.4090 KOps/s $\color{#d91a1a}-3.91\%$
test_step_mdp_speed[False-False-True-True-True] 97.0520μs 38.6935μs 25.8441 KOps/s 26.5869 KOps/s $\color{#d91a1a}-2.79\%$
test_step_mdp_speed[False-False-True-True-False] 0.6299ms 26.0681μs 38.3611 KOps/s 39.9956 KOps/s $\color{#d91a1a}-4.09\%$
test_step_mdp_speed[False-False-True-False-True] 78.8880μs 23.6523μs 42.2792 KOps/s 42.9927 KOps/s $\color{#d91a1a}-1.66\%$
test_step_mdp_speed[False-False-True-False-False] 48.1400μs 15.5635μs 64.2531 KOps/s 66.5899 KOps/s $\color{#d91a1a}-3.51\%$
test_step_mdp_speed[False-False-False-True-True] 0.1056ms 39.9156μs 25.0528 KOps/s 25.5500 KOps/s $\color{#d91a1a}-1.95\%$
test_step_mdp_speed[False-False-False-True-False] 85.7200μs 27.9032μs 35.8382 KOps/s 37.4929 KOps/s $\color{#d91a1a}-4.41\%$
test_step_mdp_speed[False-False-False-False-True] 80.8910μs 25.5216μs 39.1825 KOps/s 40.2183 KOps/s $\color{#d91a1a}-2.58\%$
test_step_mdp_speed[False-False-False-False-False] 67.3960μs 17.2999μs 57.8037 KOps/s 60.1933 KOps/s $\color{#d91a1a}-3.97\%$
test_values[generalized_advantage_estimate-True-True] 10.7992ms 9.9917ms 100.0827 Ops/s 102.0873 Ops/s $\color{#d91a1a}-1.96\%$
test_values[vec_generalized_advantage_estimate-True-True] 27.8423ms 25.8648ms 38.6626 Ops/s 41.3948 Ops/s $\textbf{\color{#d91a1a}-6.60\%}$
test_values[td0_return_estimate-False-False] 0.2335ms 0.1767ms 5.6601 KOps/s 5.6648 KOps/s $\color{#d91a1a}-0.08\%$
test_values[td1_return_estimate-False-False] 25.0023ms 24.4828ms 40.8450 Ops/s 40.5265 Ops/s $\color{#35bf28}+0.79\%$
test_values[vec_td1_return_estimate-False-False] 27.9973ms 26.1177ms 38.2882 Ops/s 41.2288 Ops/s $\textbf{\color{#d91a1a}-7.13\%}$
test_values[td_lambda_return_estimate-True-False] 38.7627ms 35.4247ms 28.2289 Ops/s 28.2060 Ops/s $\color{#35bf28}+0.08\%$
test_values[vec_td_lambda_return_estimate-True-False] 29.1118ms 26.1784ms 38.1995 Ops/s 41.1259 Ops/s $\textbf{\color{#d91a1a}-7.12\%}$
test_gae_speed[generalized_advantage_estimate-False-1-512] 9.0242ms 8.6378ms 115.7696 Ops/s 116.8815 Ops/s $\color{#d91a1a}-0.95\%$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 2.2279ms 1.9868ms 503.3312 Ops/s 491.3106 Ops/s $\color{#35bf28}+2.45\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 0.6144ms 0.3648ms 2.7410 KOps/s 2.6594 KOps/s $\color{#35bf28}+3.07\%$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 47.9794ms 45.0495ms 22.1978 Ops/s 23.8376 Ops/s $\textbf{\color{#d91a1a}-6.88\%}$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 4.3530ms 3.4350ms 291.1248 Ops/s 282.9921 Ops/s $\color{#35bf28}+2.87\%$
test_dqn_speed[False-None] 5.9468ms 1.3978ms 715.4238 Ops/s 700.5621 Ops/s $\color{#35bf28}+2.12\%$
test_dqn_speed[False-backward] 2.0287ms 1.8811ms 531.6125 Ops/s 415.0514 Ops/s $\textbf{\color{#35bf28}+28.08\%}$
test_dqn_speed[True-None] 0.6848ms 0.4878ms 2.0501 KOps/s 2.0354 KOps/s $\color{#35bf28}+0.72\%$
test_dqn_speed[True-backward] 0.9492ms 0.8980ms 1.1136 KOps/s 1.0890 KOps/s $\color{#35bf28}+2.26\%$
test_dqn_speed[reduce-overhead-None] 0.7282ms 0.4912ms 2.0357 KOps/s 2.0386 KOps/s $\color{#d91a1a}-0.14\%$
test_dqn_speed[reduce-overhead-backward] 0.9539ms 0.9051ms 1.1049 KOps/s 1.0818 KOps/s $\color{#35bf28}+2.14\%$
test_ddpg_speed[False-None] 3.6212ms 2.8729ms 348.0777 Ops/s 348.2642 Ops/s $\color{#d91a1a}-0.05\%$
test_ddpg_speed[False-backward] 4.1766ms 4.0296ms 248.1657 Ops/s 249.6081 Ops/s $\color{#d91a1a}-0.58\%$
test_ddpg_speed[True-None] 1.7084ms 1.2361ms 808.9696 Ops/s 808.0638 Ops/s $\color{#35bf28}+0.11\%$
test_ddpg_speed[True-backward] 2.6294ms 2.1282ms 469.8797 Ops/s 466.6866 Ops/s $\color{#35bf28}+0.68\%$
test_ddpg_speed[reduce-overhead-None] 1.8353ms 1.2293ms 813.4797 Ops/s 808.5009 Ops/s $\color{#35bf28}+0.62\%$
test_ddpg_speed[reduce-overhead-backward] 2.1743ms 2.1097ms 473.9941 Ops/s 469.2757 Ops/s $\color{#35bf28}+1.01\%$
test_sac_speed[False-None] 9.5169ms 7.9567ms 125.6809 Ops/s 125.3015 Ops/s $\color{#35bf28}+0.30\%$
test_sac_speed[False-backward] 10.9958ms 10.6409ms 93.9771 Ops/s 93.9855 Ops/s $-0.01\%$
test_sac_speed[True-None] 2.3901ms 2.1130ms 473.2580 Ops/s 474.3977 Ops/s $\color{#d91a1a}-0.24\%$
test_sac_speed[True-backward] 3.8236ms 3.7610ms 265.8843 Ops/s 242.3604 Ops/s $\textbf{\color{#35bf28}+9.71\%}$
test_sac_speed[reduce-overhead-None] 2.3771ms 2.1074ms 474.5150 Ops/s 469.1872 Ops/s $\color{#35bf28}+1.14\%$
test_sac_speed[reduce-overhead-backward] 3.9172ms 3.7818ms 264.4237 Ops/s 264.9310 Ops/s $\color{#d91a1a}-0.19\%$
test_redq_speed[False-None] 14.6501ms 12.8037ms 78.1023 Ops/s 78.4005 Ops/s $\color{#d91a1a}-0.38\%$
test_redq_speed[False-backward] 30.7791ms 24.0625ms 41.5585 Ops/s 45.2559 Ops/s $\textbf{\color{#d91a1a}-8.17\%}$
test_redq_speed[True-None] 6.7360ms 4.9898ms 200.4081 Ops/s 208.6092 Ops/s $\color{#d91a1a}-3.93\%$
test_redq_speed[True-backward] 13.7967ms 12.4454ms 80.3510 Ops/s 81.2898 Ops/s $\color{#d91a1a}-1.15\%$
test_redq_speed[reduce-overhead-None] 5.6103ms 4.9589ms 201.6594 Ops/s 203.1427 Ops/s $\color{#d91a1a}-0.73\%$
test_redq_speed[reduce-overhead-backward] 12.6399ms 12.4471ms 80.3400 Ops/s 81.4282 Ops/s $\color{#d91a1a}-1.34\%$
test_redq_deprec_speed[False-None] 15.0646ms 12.8006ms 78.1216 Ops/s 77.5697 Ops/s $\color{#35bf28}+0.71\%$
test_redq_deprec_speed[False-backward] 20.2673ms 18.5171ms 54.0043 Ops/s 53.7438 Ops/s $\color{#35bf28}+0.48\%$
test_redq_deprec_speed[True-None] 4.3095ms 3.9211ms 255.0286 Ops/s 260.7317 Ops/s $\color{#d91a1a}-2.19\%$
test_redq_deprec_speed[True-backward] 9.4980ms 8.6145ms 116.0838 Ops/s 121.2657 Ops/s $\color{#d91a1a}-4.27\%$
test_redq_deprec_speed[reduce-overhead-None] 4.6309ms 3.9379ms 253.9456 Ops/s 261.5513 Ops/s $\color{#d91a1a}-2.91\%$
test_redq_deprec_speed[reduce-overhead-backward] 9.2649ms 8.4179ms 118.7942 Ops/s 121.4855 Ops/s $\color{#d91a1a}-2.22\%$
test_td3_speed[False-None] 8.2603ms 7.9417ms 125.9178 Ops/s 122.9908 Ops/s $\color{#35bf28}+2.38\%$
test_td3_speed[False-backward] 10.8355ms 10.3311ms 96.7951 Ops/s 95.9860 Ops/s $\color{#35bf28}+0.84\%$
test_td3_speed[True-None] 2.0372ms 1.8347ms 545.0383 Ops/s 554.9101 Ops/s $\color{#d91a1a}-1.78\%$
test_td3_speed[True-backward] 3.4643ms 3.4213ms 292.2897 Ops/s 294.1747 Ops/s $\color{#d91a1a}-0.64\%$
test_td3_speed[reduce-overhead-None] 2.0055ms 1.8308ms 546.2018 Ops/s 552.8023 Ops/s $\color{#d91a1a}-1.19\%$
test_td3_speed[reduce-overhead-backward] 3.5548ms 3.4396ms 290.7289 Ops/s 292.6852 Ops/s $\color{#d91a1a}-0.67\%$
test_cql_speed[False-None] 37.4423ms 35.9771ms 27.7955 Ops/s 27.5025 Ops/s $\color{#35bf28}+1.07\%$
test_cql_speed[False-backward] 49.3688ms 46.4933ms 21.5085 Ops/s 20.9761 Ops/s $\color{#35bf28}+2.54\%$
test_cql_speed[True-None] 18.2208ms 16.4036ms 60.9620 Ops/s 62.8430 Ops/s $\color{#d91a1a}-2.99\%$
test_cql_speed[True-backward] 24.4189ms 23.0724ms 43.3418 Ops/s 43.6059 Ops/s $\color{#d91a1a}-0.61\%$
test_cql_speed[reduce-overhead-None] 17.0672ms 16.5172ms 60.5431 Ops/s 62.7311 Ops/s $\color{#d91a1a}-3.49\%$
test_cql_speed[reduce-overhead-backward] 24.5046ms 22.9192ms 43.6315 Ops/s 44.0127 Ops/s $\color{#d91a1a}-0.87\%$
test_a2c_speed[False-None] 7.8991ms 7.1711ms 139.4493 Ops/s 138.7700 Ops/s $\color{#35bf28}+0.49\%$
test_a2c_speed[False-backward] 14.4544ms 14.2034ms 70.4055 Ops/s 69.7941 Ops/s $\color{#35bf28}+0.88\%$
test_a2c_speed[True-None] 4.8083ms 3.7981ms 263.2889 Ops/s 267.3626 Ops/s $\color{#d91a1a}-1.52\%$
test_a2c_speed[True-backward] 10.4406ms 10.1893ms 98.1424 Ops/s 98.0704 Ops/s $\color{#35bf28}+0.07\%$
test_a2c_speed[reduce-overhead-None] 4.4703ms 3.7620ms 265.8140 Ops/s 268.0160 Ops/s $\color{#d91a1a}-0.82\%$
test_a2c_speed[reduce-overhead-backward] 10.4104ms 10.1748ms 98.2819 Ops/s 98.5861 Ops/s $\color{#d91a1a}-0.31\%$
test_ppo_speed[False-None] 9.0732ms 7.4537ms 134.1623 Ops/s 132.3944 Ops/s $\color{#35bf28}+1.34\%$
test_ppo_speed[False-backward] 15.2674ms 14.7363ms 67.8594 Ops/s 66.1437 Ops/s $\color{#35bf28}+2.59\%$
test_ppo_speed[True-None] 4.7069ms 4.1405ms 241.5142 Ops/s 245.2287 Ops/s $\color{#d91a1a}-1.51\%$
test_ppo_speed[True-backward] 10.8289ms 10.0524ms 99.4791 Ops/s 100.2909 Ops/s $\color{#d91a1a}-0.81\%$
test_ppo_speed[reduce-overhead-None] 4.8653ms 4.1308ms 242.0839 Ops/s 243.2537 Ops/s $\color{#d91a1a}-0.48\%$
test_ppo_speed[reduce-overhead-backward] 11.4207ms 10.0439ms 99.5631 Ops/s 100.0743 Ops/s $\color{#d91a1a}-0.51\%$
test_reinforce_speed[False-None] 7.2827ms 6.5431ms 152.8330 Ops/s 151.7660 Ops/s $\color{#35bf28}+0.70\%$
test_reinforce_speed[False-backward] 10.8298ms 9.8201ms 101.8321 Ops/s 100.7443 Ops/s $\color{#35bf28}+1.08\%$
test_reinforce_speed[True-None] 3.7356ms 3.0979ms 322.7962 Ops/s 326.8714 Ops/s $\color{#d91a1a}-1.25\%$
test_reinforce_speed[True-backward] 9.7134ms 9.0386ms 110.6368 Ops/s 109.5278 Ops/s $\color{#35bf28}+1.01\%$
test_reinforce_speed[reduce-overhead-None] 3.8084ms 3.1572ms 316.7390 Ops/s 274.4283 Ops/s $\textbf{\color{#35bf28}+15.42\%}$
test_reinforce_speed[reduce-overhead-backward] 9.8116ms 9.0409ms 110.6086 Ops/s 110.2288 Ops/s $\color{#35bf28}+0.34\%$
test_iql_speed[False-None] 33.8764ms 32.3246ms 30.9362 Ops/s 30.3113 Ops/s $\color{#35bf28}+2.06\%$
test_iql_speed[False-backward] 46.8284ms 45.2005ms 22.1236 Ops/s 21.7671 Ops/s $\color{#35bf28}+1.64\%$
test_iql_speed[True-None] 20.8285ms 11.5116ms 86.8689 Ops/s 89.4538 Ops/s $\color{#d91a1a}-2.89\%$
test_iql_speed[True-backward] 29.2999ms 22.7591ms 43.9385 Ops/s 44.9269 Ops/s $\color{#d91a1a}-2.20\%$
test_iql_speed[reduce-overhead-None] 12.3608ms 11.4492ms 87.3420 Ops/s 89.1906 Ops/s $\color{#d91a1a}-2.07\%$
test_iql_speed[reduce-overhead-backward] 23.0146ms 22.2402ms 44.9637 Ops/s 45.3259 Ops/s $\color{#d91a1a}-0.80\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 6.9103ms 4.7892ms 208.8046 Ops/s 206.9714 Ops/s $\color{#35bf28}+0.89\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.8130ms 0.5335ms 1.8746 KOps/s 1.8642 KOps/s $\color{#35bf28}+0.56\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.8931ms 0.5128ms 1.9500 KOps/s 1.9679 KOps/s $\color{#d91a1a}-0.91\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 7.1395ms 4.5928ms 217.7341 Ops/s 214.7948 Ops/s $\color{#35bf28}+1.37\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 1.0945ms 0.5205ms 1.9214 KOps/s 1.7742 KOps/s $\textbf{\color{#35bf28}+8.29\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.8595ms 0.5034ms 1.9864 KOps/s 2.0074 KOps/s $\color{#d91a1a}-1.04\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] 2.3020ms 1.7023ms 587.4568 Ops/s 583.6244 Ops/s $\color{#35bf28}+0.66\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] 2.3521ms 1.6190ms 617.6543 Ops/s 613.2658 Ops/s $\color{#35bf28}+0.72\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 5.0072ms 4.6954ms 212.9740 Ops/s 208.2738 Ops/s $\color{#35bf28}+2.26\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 1.4528ms 0.6718ms 1.4886 KOps/s 1.4933 KOps/s $\color{#d91a1a}-0.32\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.8447ms 0.6400ms 1.5624 KOps/s 1.5505 KOps/s $\color{#35bf28}+0.77\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 8.8127ms 4.5996ms 217.4124 Ops/s 216.3264 Ops/s $\color{#35bf28}+0.50\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 2.0656ms 0.5394ms 1.8541 KOps/s 1.8499 KOps/s $\color{#35bf28}+0.22\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.7064ms 0.5078ms 1.9693 KOps/s 1.9753 KOps/s $\color{#d91a1a}-0.30\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 4.9612ms 4.5381ms 220.3584 Ops/s 218.3740 Ops/s $\color{#35bf28}+0.91\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 1.6429ms 0.5240ms 1.9085 KOps/s 1.8890 KOps/s $\color{#35bf28}+1.03\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.9562ms 0.5067ms 1.9736 KOps/s 1.9733 KOps/s $\color{#35bf28}+0.01\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 5.3616ms 4.7134ms 212.1601 Ops/s 207.9898 Ops/s $\color{#35bf28}+2.01\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 1.2862ms 0.6764ms 1.4784 KOps/s 1.4924 KOps/s $\color{#d91a1a}-0.94\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.8565ms 0.6430ms 1.5552 KOps/s 1.5391 KOps/s $\color{#35bf28}+1.04\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 5.3291ms 4.1234ms 242.5162 Ops/s 245.4186 Ops/s $\color{#d91a1a}-1.18\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 7.5665ms 2.3682ms 422.2672 Ops/s 429.8908 Ops/s $\color{#d91a1a}-1.77\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 6.2645ms 1.4823ms 674.6370 Ops/s 708.9030 Ops/s $\color{#d91a1a}-4.83\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 0.4121s 12.3619ms 80.8934 Ops/s 244.5659 Ops/s $\textbf{\color{#d91a1a}-66.92\%}$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 8.6866ms 2.3050ms 433.8404 Ops/s 36.1748 Ops/s $\textbf{\color{#35bf28}+1099.29\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 4.8455ms 1.3507ms 740.3742 Ops/s 710.8277 Ops/s $\color{#35bf28}+4.16\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 5.5903ms 4.3911ms 227.7325 Ops/s 222.3027 Ops/s $\color{#35bf28}+2.44\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 8.7824ms 2.5386ms 393.9115 Ops/s 386.7023 Ops/s $\color{#35bf28}+1.86\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 2.9218ms 1.4895ms 671.3790 Ops/s 661.1152 Ops/s $\color{#35bf28}+1.55\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-True] 12.1894ms 11.4212ms 87.5561 Ops/s 81.2539 Ops/s $\textbf{\color{#35bf28}+7.76\%}$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-False] 15.6751ms 14.1284ms 70.7792 Ops/s 68.6520 Ops/s $\color{#35bf28}+3.10\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-True] 21.0126ms 20.2284ms 49.4354 Ops/s 47.6740 Ops/s $\color{#35bf28}+3.69\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-False] 16.5532ms 14.3344ms 69.7621 Ops/s 67.8438 Ops/s $\color{#35bf28}+2.83\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-True] 21.8211ms 20.1973ms 49.5115 Ops/s 47.8671 Ops/s $\color{#35bf28}+3.44\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-False] 16.4781ms 15.5998ms 64.1034 Ops/s 61.9252 Ops/s $\color{#35bf28}+3.52\%$

Copy link

github-actions bot commented Feb 5, 2025

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 149. Improved: $\large\color{#35bf28}22$. Worsened: $\large\color{#d91a1a}10$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_simple 0.8321s 0.7441s 1.3440 Ops/s 1.3047 Ops/s $\color{#35bf28}+3.01\%$
test_transformed 1.3006s 1.2983s 0.7703 Ops/s 0.7638 Ops/s $\color{#35bf28}+0.85\%$
test_serial 2.1434s 2.1418s 0.4669 Ops/s 0.4598 Ops/s $\color{#35bf28}+1.55\%$
test_parallel 1.8829s 1.8237s 0.5483 Ops/s 0.5422 Ops/s $\color{#35bf28}+1.13\%$
test_step_mdp_speed[True-True-True-True-True] 0.1394ms 39.4137μs 25.3719 KOps/s 24.9647 KOps/s $\color{#35bf28}+1.63\%$
test_step_mdp_speed[True-True-True-True-False] 58.5910μs 23.5667μs 42.4327 KOps/s 43.1113 KOps/s $\color{#d91a1a}-1.57\%$
test_step_mdp_speed[True-True-True-False-True] 51.8110μs 22.5359μs 44.3737 KOps/s 44.4523 KOps/s $\color{#d91a1a}-0.18\%$
test_step_mdp_speed[True-True-True-False-False] 47.4310μs 13.1984μs 75.7669 KOps/s 76.9629 KOps/s $\color{#d91a1a}-1.55\%$
test_step_mdp_speed[True-True-False-True-True] 86.6920μs 46.7557μs 21.3878 KOps/s 23.5858 KOps/s $\textbf{\color{#d91a1a}-9.32\%}$
test_step_mdp_speed[True-True-False-True-False] 66.2010μs 26.7110μs 37.4377 KOps/s 39.2020 KOps/s $\color{#d91a1a}-4.50\%$
test_step_mdp_speed[True-True-False-False-True] 78.8410μs 27.9002μs 35.8420 KOps/s 40.7960 KOps/s $\textbf{\color{#d91a1a}-12.14\%}$
test_step_mdp_speed[True-True-False-False-False] 0.1538ms 14.8033μs 67.5524 KOps/s 65.7055 KOps/s $\color{#35bf28}+2.81\%$
test_step_mdp_speed[True-False-True-True-True] 83.7410μs 44.8859μs 22.2787 KOps/s 22.2713 KOps/s $\color{#35bf28}+0.03\%$
test_step_mdp_speed[True-False-True-True-False] 64.5710μs 27.7651μs 36.0164 KOps/s 35.6286 KOps/s $\color{#35bf28}+1.09\%$
test_step_mdp_speed[True-False-True-False-True] 54.1510μs 24.6934μs 40.4966 KOps/s 40.1781 KOps/s $\color{#35bf28}+0.79\%$
test_step_mdp_speed[True-False-True-False-False] 68.4720μs 14.9519μs 66.8809 KOps/s 64.5315 KOps/s $\color{#35bf28}+3.64\%$
test_step_mdp_speed[True-False-False-True-True] 79.8920μs 47.3135μs 21.1356 KOps/s 21.1476 KOps/s $\color{#d91a1a}-0.06\%$
test_step_mdp_speed[True-False-False-True-False] 63.9220μs 30.4121μs 32.8816 KOps/s 32.6761 KOps/s $\color{#35bf28}+0.63\%$
test_step_mdp_speed[True-False-False-False-True] 53.8320μs 26.6984μs 37.4555 KOps/s 36.7198 KOps/s $\color{#35bf28}+2.00\%$
test_step_mdp_speed[True-False-False-False-False] 45.2910μs 17.6728μs 56.5843 KOps/s 55.7824 KOps/s $\color{#35bf28}+1.44\%$
test_step_mdp_speed[False-True-True-True-True] 78.4710μs 45.5067μs 21.9748 KOps/s 22.8289 KOps/s $\color{#d91a1a}-3.74\%$
test_step_mdp_speed[False-True-True-True-False] 51.9510μs 28.1223μs 35.5589 KOps/s 35.8254 KOps/s $\color{#d91a1a}-0.74\%$
test_step_mdp_speed[False-True-True-False-True] 56.7810μs 28.7868μs 34.7381 KOps/s 35.4500 KOps/s $\color{#d91a1a}-2.01\%$
test_step_mdp_speed[False-True-True-False-False] 93.8620μs 17.2415μs 57.9997 KOps/s 59.2188 KOps/s $\color{#d91a1a}-2.06\%$
test_step_mdp_speed[False-True-False-True-True] 74.1320μs 47.8778μs 20.8865 KOps/s 21.3496 KOps/s $\color{#d91a1a}-2.17\%$
test_step_mdp_speed[False-True-False-True-False] 60.7710μs 30.2537μs 33.0538 KOps/s 32.5058 KOps/s $\color{#35bf28}+1.69\%$
test_step_mdp_speed[False-True-False-False-True] 3.1407ms 31.6494μs 31.5961 KOps/s 32.5388 KOps/s $\color{#d91a1a}-2.90\%$
test_step_mdp_speed[False-True-False-False-False] 46.0610μs 19.6275μs 50.9490 KOps/s 51.3586 KOps/s $\color{#d91a1a}-0.80\%$
test_step_mdp_speed[False-False-True-True-True] 0.1214ms 49.6684μs 20.1335 KOps/s 20.2447 KOps/s $\color{#d91a1a}-0.55\%$
test_step_mdp_speed[False-False-True-True-False] 66.4810μs 32.7295μs 30.5535 KOps/s 30.7306 KOps/s $\color{#d91a1a}-0.58\%$
test_step_mdp_speed[False-False-True-False-True] 65.9420μs 30.9008μs 32.3616 KOps/s 32.7061 KOps/s $\color{#d91a1a}-1.05\%$
test_step_mdp_speed[False-False-True-False-False] 50.6310μs 19.5376μs 51.1833 KOps/s 51.6223 KOps/s $\color{#d91a1a}-0.85\%$
test_step_mdp_speed[False-False-False-True-True] 79.5710μs 52.0297μs 19.2198 KOps/s 19.5366 KOps/s $\color{#d91a1a}-1.62\%$
test_step_mdp_speed[False-False-False-True-False] 66.1510μs 35.1001μs 28.4900 KOps/s 28.6809 KOps/s $\color{#d91a1a}-0.67\%$
test_step_mdp_speed[False-False-False-False-True] 60.3310μs 32.6658μs 30.6130 KOps/s 31.0027 KOps/s $\color{#d91a1a}-1.26\%$
test_step_mdp_speed[False-False-False-False-False] 46.2910μs 21.5088μs 46.4926 KOps/s 47.1726 KOps/s $\color{#d91a1a}-1.44\%$
test_values[generalized_advantage_estimate-True-True] 25.1187ms 24.6521ms 40.5646 Ops/s 37.8899 Ops/s $\textbf{\color{#35bf28}+7.06\%}$
test_values[vec_generalized_advantage_estimate-True-True] 0.1058s 3.0161ms 331.5535 Ops/s 330.5810 Ops/s $\color{#35bf28}+0.29\%$
test_values[td0_return_estimate-False-False] 0.1051ms 79.5907μs 12.5643 KOps/s 11.7847 KOps/s $\textbf{\color{#35bf28}+6.62\%}$
test_values[td1_return_estimate-False-False] 56.2529ms 55.5035ms 18.0169 Ops/s 17.5712 Ops/s $\color{#35bf28}+2.54\%$
test_values[vec_td1_return_estimate-False-False] 1.3185ms 1.0835ms 922.9466 Ops/s 915.2172 Ops/s $\color{#35bf28}+0.84\%$
test_values[td_lambda_return_estimate-True-False] 88.4814ms 88.0649ms 11.3553 Ops/s 11.0542 Ops/s $\color{#35bf28}+2.72\%$
test_values[vec_td_lambda_return_estimate-True-False] 1.2335ms 1.0800ms 925.9375 Ops/s 914.8737 Ops/s $\color{#35bf28}+1.21\%$
test_gae_speed[generalized_advantage_estimate-False-1-512] 24.8499ms 24.6987ms 40.4880 Ops/s 40.0376 Ops/s $\color{#35bf28}+1.12\%$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 1.0210ms 0.7539ms 1.3264 KOps/s 1.3124 KOps/s $\color{#35bf28}+1.07\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 0.7478ms 0.6702ms 1.4921 KOps/s 1.4756 KOps/s $\color{#35bf28}+1.12\%$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 1.5298ms 1.4844ms 673.6910 Ops/s 666.9080 Ops/s $\color{#35bf28}+1.02\%$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 0.7243ms 0.6867ms 1.4561 KOps/s 1.4053 KOps/s $\color{#35bf28}+3.62\%$
test_dqn_speed[False-None] 1.6000ms 1.5185ms 658.5400 Ops/s 642.8334 Ops/s $\color{#35bf28}+2.44\%$
test_dqn_speed[False-backward] 2.2044ms 2.1265ms 470.2574 Ops/s 446.7271 Ops/s $\textbf{\color{#35bf28}+5.27\%}$
test_dqn_speed[True-None] 0.9618ms 0.5552ms 1.8012 KOps/s 1.7496 KOps/s $\color{#35bf28}+2.95\%$
test_dqn_speed[True-backward] 1.1875ms 1.1394ms 877.6263 Ops/s 790.5184 Ops/s $\textbf{\color{#35bf28}+11.02\%}$
test_dqn_speed[reduce-overhead-None] 0.6219ms 0.5721ms 1.7479 KOps/s 1.6785 KOps/s $\color{#35bf28}+4.13\%$
test_dqn_speed[reduce-overhead-backward] 1.0427ms 0.9857ms 1.0145 KOps/s 932.3259 Ops/s $\textbf{\color{#35bf28}+8.82\%}$
test_ddpg_speed[False-None] 3.2319ms 2.9291ms 341.4070 Ops/s 342.8416 Ops/s $\color{#d91a1a}-0.42\%$
test_ddpg_speed[False-backward] 4.6316ms 4.1730ms 239.6359 Ops/s 233.8481 Ops/s $\color{#35bf28}+2.47\%$
test_ddpg_speed[True-None] 1.5585ms 1.3364ms 748.2574 Ops/s 748.7438 Ops/s $\color{#d91a1a}-0.06\%$
test_ddpg_speed[True-backward] 2.7003ms 2.5922ms 385.7658 Ops/s 386.0830 Ops/s $\color{#d91a1a}-0.08\%$
test_ddpg_speed[reduce-overhead-None] 1.4060ms 1.3540ms 738.5294 Ops/s 735.6840 Ops/s $\color{#35bf28}+0.39\%$
test_ddpg_speed[reduce-overhead-backward] 2.0715ms 2.0289ms 492.8842 Ops/s 487.6348 Ops/s $\color{#35bf28}+1.08\%$
test_sac_speed[False-None] 8.4284ms 8.0050ms 124.9223 Ops/s 124.1180 Ops/s $\color{#35bf28}+0.65\%$
test_sac_speed[False-backward] 11.6676ms 11.1705ms 89.5212 Ops/s 88.8342 Ops/s $\color{#35bf28}+0.77\%$
test_sac_speed[True-None] 2.0201ms 1.8494ms 540.7252 Ops/s 543.6015 Ops/s $\color{#d91a1a}-0.53\%$
test_sac_speed[True-backward] 3.7792ms 3.6994ms 270.3130 Ops/s 279.0933 Ops/s $\color{#d91a1a}-3.15\%$
test_sac_speed[reduce-overhead-None] 22.5421ms 12.4809ms 80.1222 Ops/s 80.5953 Ops/s $\color{#d91a1a}-0.59\%$
test_sac_speed[reduce-overhead-backward] 1.8235ms 1.7818ms 561.2229 Ops/s 600.3681 Ops/s $\textbf{\color{#d91a1a}-6.52\%}$
test_redq_speed[False-None] 8.0909ms 7.5189ms 132.9983 Ops/s 131.9388 Ops/s $\color{#35bf28}+0.80\%$
test_redq_speed[False-backward] 12.0282ms 11.5381ms 86.6695 Ops/s 88.1080 Ops/s $\color{#d91a1a}-1.63\%$
test_redq_speed[True-None] 2.4844ms 2.3028ms 434.2575 Ops/s 430.2908 Ops/s $\color{#35bf28}+0.92\%$
test_redq_speed[True-backward] 4.0307ms 3.9651ms 252.2016 Ops/s 237.5531 Ops/s $\textbf{\color{#35bf28}+6.17\%}$
test_redq_speed[reduce-overhead-None] 2.7485ms 2.3181ms 431.3787 Ops/s 427.9822 Ops/s $\color{#35bf28}+0.79\%$
test_redq_speed[reduce-overhead-backward] 4.1350ms 3.9837ms 251.0221 Ops/s 245.4864 Ops/s $\color{#35bf28}+2.26\%$
test_redq_deprec_speed[False-None] 10.0810ms 9.0228ms 110.8307 Ops/s 109.8158 Ops/s $\color{#35bf28}+0.92\%$
test_redq_deprec_speed[False-backward] 12.5016ms 11.9064ms 83.9885 Ops/s 83.1604 Ops/s $\color{#35bf28}+1.00\%$
test_redq_deprec_speed[True-None] 3.0480ms 2.6277ms 380.5663 Ops/s 376.9725 Ops/s $\color{#35bf28}+0.95\%$
test_redq_deprec_speed[True-backward] 4.7406ms 4.2914ms 233.0221 Ops/s 221.4855 Ops/s $\textbf{\color{#35bf28}+5.21\%}$
test_redq_deprec_speed[reduce-overhead-None] 2.6886ms 2.6294ms 380.3084 Ops/s 374.3691 Ops/s $\color{#35bf28}+1.59\%$
test_redq_deprec_speed[reduce-overhead-backward] 4.5710ms 4.4837ms 223.0304 Ops/s 220.0919 Ops/s $\color{#35bf28}+1.34\%$
test_td3_speed[False-None] 8.1519ms 7.9268ms 126.1537 Ops/s 125.1330 Ops/s $\color{#35bf28}+0.82\%$
test_td3_speed[False-backward] 10.9919ms 10.4425ms 95.7629 Ops/s 95.1022 Ops/s $\color{#35bf28}+0.69\%$
test_td3_speed[True-None] 1.7634ms 1.6528ms 605.0273 Ops/s 558.1586 Ops/s $\textbf{\color{#35bf28}+8.40\%}$
test_td3_speed[True-backward] 3.3743ms 3.3200ms 301.2038 Ops/s 298.0741 Ops/s $\color{#35bf28}+1.05\%$
test_td3_speed[reduce-overhead-None] 53.6520ms 27.6012ms 36.2304 Ops/s 35.8656 Ops/s $\color{#35bf28}+1.02\%$
test_td3_speed[reduce-overhead-backward] 1.5472ms 1.4771ms 676.9990 Ops/s 718.5577 Ops/s $\textbf{\color{#d91a1a}-5.78\%}$
test_cql_speed[False-None] 17.3017ms 16.7377ms 59.7453 Ops/s 59.4939 Ops/s $\color{#35bf28}+0.42\%$
test_cql_speed[False-backward] 22.5887ms 22.0894ms 45.2705 Ops/s 45.7422 Ops/s $\color{#d91a1a}-1.03\%$
test_cql_speed[True-None] 3.4135ms 3.2457ms 308.1043 Ops/s 305.5464 Ops/s $\color{#35bf28}+0.84\%$
test_cql_speed[True-backward] 6.1038ms 5.5971ms 178.6627 Ops/s 181.1538 Ops/s $\color{#d91a1a}-1.38\%$
test_cql_speed[reduce-overhead-None] 22.9081ms 13.7600ms 72.6745 Ops/s 74.8665 Ops/s $\color{#d91a1a}-2.93\%$
test_cql_speed[reduce-overhead-backward] 1.9439ms 1.8310ms 546.1506 Ops/s 543.2720 Ops/s $\color{#35bf28}+0.53\%$
test_a2c_speed[False-None] 3.5745ms 3.1853ms 313.9448 Ops/s 312.2612 Ops/s $\color{#35bf28}+0.54\%$
test_a2c_speed[False-backward] 6.6967ms 6.0806ms 164.4569 Ops/s 164.0766 Ops/s $\color{#35bf28}+0.23\%$
test_a2c_speed[True-None] 1.7191ms 1.3430ms 744.6001 Ops/s 735.9459 Ops/s $\color{#35bf28}+1.18\%$
test_a2c_speed[True-backward] 2.9876ms 2.8947ms 345.4545 Ops/s 339.2919 Ops/s $\color{#35bf28}+1.82\%$
test_a2c_speed[reduce-overhead-None] 16.6165ms 9.3837ms 106.5676 Ops/s 108.4065 Ops/s $\color{#d91a1a}-1.70\%$
test_a2c_speed[reduce-overhead-backward] 1.5309ms 1.4588ms 685.4848 Ops/s 634.7775 Ops/s $\textbf{\color{#35bf28}+7.99\%}$
test_ppo_speed[False-None] 3.8488ms 3.6878ms 271.1646 Ops/s 262.4579 Ops/s $\color{#35bf28}+3.32\%$
test_ppo_speed[False-backward] 7.3259ms 6.7578ms 147.9772 Ops/s 140.3374 Ops/s $\textbf{\color{#35bf28}+5.44\%}$
test_ppo_speed[True-None] 1.5456ms 1.4019ms 713.3199 Ops/s 704.4004 Ops/s $\color{#35bf28}+1.27\%$
test_ppo_speed[True-backward] 3.0974ms 3.0455ms 328.3516 Ops/s 302.2910 Ops/s $\textbf{\color{#35bf28}+8.62\%}$
test_ppo_speed[reduce-overhead-None] 1.0605ms 0.9563ms 1.0457 KOps/s 1.0401 KOps/s $\color{#35bf28}+0.54\%$
test_ppo_speed[reduce-overhead-backward] 1.5239ms 1.4111ms 708.6643 Ops/s 624.1340 Ops/s $\textbf{\color{#35bf28}+13.54\%}$
test_reinforce_speed[False-None] 2.4233ms 2.2805ms 438.5021 Ops/s 436.7322 Ops/s $\color{#35bf28}+0.41\%$
test_reinforce_speed[False-backward] 3.3625ms 3.2680ms 305.9945 Ops/s 292.4882 Ops/s $\color{#35bf28}+4.62\%$
test_reinforce_speed[True-None] 1.4924ms 1.2916ms 774.2473 Ops/s 764.0689 Ops/s $\color{#35bf28}+1.33\%$
test_reinforce_speed[True-backward] 3.0419ms 2.9196ms 342.5106 Ops/s 326.6599 Ops/s $\color{#35bf28}+4.85\%$
test_reinforce_speed[reduce-overhead-None] 18.9329ms 10.4601ms 95.6013 Ops/s 98.0973 Ops/s $\color{#d91a1a}-2.54\%$
test_reinforce_speed[reduce-overhead-backward] 1.5290ms 1.4730ms 678.8781 Ops/s 600.2203 Ops/s $\textbf{\color{#35bf28}+13.10\%}$
test_iql_speed[False-None] 9.5881ms 9.1531ms 109.2526 Ops/s 106.8218 Ops/s $\color{#35bf28}+2.28\%$
test_iql_speed[False-backward] 13.0499ms 12.7635ms 78.3484 Ops/s 75.1041 Ops/s $\color{#35bf28}+4.32\%$
test_iql_speed[True-None] 2.6381ms 2.2207ms 450.3103 Ops/s 420.3848 Ops/s $\textbf{\color{#35bf28}+7.12\%}$
test_iql_speed[True-backward] 5.0438ms 4.7449ms 210.7524 Ops/s 202.9161 Ops/s $\color{#35bf28}+3.86\%$
test_iql_speed[reduce-overhead-None] 19.6088ms 11.4981ms 86.9707 Ops/s 88.6862 Ops/s $\color{#d91a1a}-1.93\%$
test_iql_speed[reduce-overhead-backward] 1.9275ms 1.8860ms 530.2259 Ops/s 499.4843 Ops/s $\textbf{\color{#35bf28}+6.15\%}$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 7.9791ms 6.3159ms 158.3295 Ops/s 157.5281 Ops/s $\color{#35bf28}+0.51\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.7199ms 0.3086ms 3.2405 KOps/s 3.7542 KOps/s $\textbf{\color{#d91a1a}-13.68\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.5469ms 0.2906ms 3.4415 KOps/s 4.0759 KOps/s $\textbf{\color{#d91a1a}-15.57\%}$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 6.5768ms 6.0420ms 165.5085 Ops/s 165.7351 Ops/s $\color{#d91a1a}-0.14\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 2.0366ms 0.2951ms 3.3891 KOps/s 3.4747 KOps/s $\color{#d91a1a}-2.46\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.5741ms 0.2516ms 3.9747 KOps/s 3.6879 KOps/s $\textbf{\color{#35bf28}+7.78\%}$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] 1.7224ms 1.3296ms 752.1259 Ops/s 793.9985 Ops/s $\textbf{\color{#d91a1a}-5.27\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] 1.6233ms 1.2353ms 809.5458 Ops/s 803.2551 Ops/s $\color{#35bf28}+0.78\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.6305ms 6.2218ms 160.7261 Ops/s 159.3551 Ops/s $\color{#35bf28}+0.86\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 1.9939ms 0.4503ms 2.2205 KOps/s 2.2158 KOps/s $\color{#35bf28}+0.21\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.6851ms 0.4659ms 2.1466 KOps/s 2.4265 KOps/s $\textbf{\color{#d91a1a}-11.53\%}$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 9.5866ms 6.1765ms 161.9033 Ops/s 164.1258 Ops/s $\color{#d91a1a}-1.35\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 2.1701ms 0.3689ms 2.7105 KOps/s 2.7916 KOps/s $\color{#d91a1a}-2.91\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.6464ms 0.3096ms 3.2303 KOps/s 3.1096 KOps/s $\color{#35bf28}+3.88\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 6.4756ms 5.9990ms 166.6947 Ops/s 164.5545 Ops/s $\color{#35bf28}+1.30\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 0.6730ms 0.2608ms 3.8336 KOps/s 3.2275 KOps/s $\textbf{\color{#35bf28}+18.78\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.6938ms 0.2510ms 3.9834 KOps/s 3.4709 KOps/s $\textbf{\color{#35bf28}+14.77\%}$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.3782ms 6.1652ms 162.2002 Ops/s 158.1215 Ops/s $\color{#35bf28}+2.58\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 1.7878ms 0.4030ms 2.4814 KOps/s 2.1851 KOps/s $\textbf{\color{#35bf28}+13.56\%}$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.6117ms 0.4366ms 2.2905 KOps/s 2.5474 KOps/s $\textbf{\color{#d91a1a}-10.09\%}$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 7.0019ms 5.4187ms 184.5464 Ops/s 182.0353 Ops/s $\color{#35bf28}+1.38\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 9.5032ms 2.0542ms 486.8180 Ops/s 440.6571 Ops/s $\textbf{\color{#35bf28}+10.48\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 7.2313ms 1.2117ms 825.2573 Ops/s 849.3776 Ops/s $\color{#d91a1a}-2.84\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 7.1920ms 5.5524ms 180.1033 Ops/s 181.8746 Ops/s $\color{#d91a1a}-0.97\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 9.2215ms 2.0566ms 486.2321 Ops/s 427.9702 Ops/s $\textbf{\color{#35bf28}+13.61\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 6.9185ms 1.1833ms 845.1267 Ops/s 828.9118 Ops/s $\color{#35bf28}+1.96\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 0.4867s 15.3175ms 65.2846 Ops/s 32.3000 Ops/s $\textbf{\color{#35bf28}+102.12\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 10.6202ms 2.3358ms 428.1254 Ops/s 500.1360 Ops/s $\textbf{\color{#d91a1a}-14.40\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 2.3053ms 1.2726ms 785.7635 Ops/s 810.1536 Ops/s $\color{#d91a1a}-3.01\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-True] 13.5067ms 13.2681ms 75.3688 Ops/s 73.2730 Ops/s $\color{#35bf28}+2.86\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-False] 19.0059ms 17.2867ms 57.8479 Ops/s 57.6550 Ops/s $\color{#35bf28}+0.33\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-True] 18.2662ms 17.9244ms 55.7899 Ops/s 55.1516 Ops/s $\color{#35bf28}+1.16\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-False] 18.3756ms 16.9083ms 59.1427 Ops/s 56.3291 Ops/s $\color{#35bf28}+4.99\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-True] 19.0400ms 17.8239ms 56.1044 Ops/s 54.7259 Ops/s $\color{#35bf28}+2.52\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-False] 19.8849ms 18.4607ms 54.1691 Ops/s 53.3111 Ops/s $\color{#35bf28}+1.61\%$

@vmoens vmoens force-pushed the fix-windows-wheels branch 2 times, most recently from 27296fc to 7e988a9 Compare February 5, 2025 16:51
@vmoens vmoens force-pushed the fix-windows-wheels branch from 7e988a9 to cd8079f Compare February 5, 2025 17:24
@vmoens vmoens merged commit 03f56ff into main Feb 5, 2025
125 of 131 checks passed
@vmoens vmoens deleted the fix-windows-wheels branch February 5, 2025 17:39
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
ciflow/binaries/all Build all binaries CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants