Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Doc] Fix tutorials #2768

Merged
merged 6 commits into from
Feb 7, 2025
Merged

[Doc] Fix tutorials #2768

merged 6 commits into from
Feb 7, 2025

Conversation

vmoens
Copy link
Contributor

@vmoens vmoens commented Feb 7, 2025

No description provided.

Copy link

pytorch-bot bot commented Feb 7, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/2768

Note: Links to docs will display an error until the docs builds have been completed.

❌ 3 New Failures, 13 Pending

As of commit 1bfc3f3 with merge base 3be85c6 (image):

NEW FAILURES - The following jobs have failed:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Feb 7, 2025
@vmoens vmoens added the documentation Improvements or additions to documentation label Feb 7, 2025
        from collections import Mapping
    ImportError: cannot import name 'Mapping' from 'collections'
Copy link

github-actions bot commented Feb 7, 2025

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 149. Improved: $\large\color{#35bf28}8$. Worsened: $\large\color{#d91a1a}2$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_simple 0.5399s 0.4508s 2.2181 Ops/s 2.2189 Ops/s $\color{#d91a1a}-0.04\%$
test_transformed 1.0264s 0.9400s 1.0638 Ops/s 1.0935 Ops/s $\color{#d91a1a}-2.71\%$
test_serial 1.3590s 1.3575s 0.7367 Ops/s 0.7257 Ops/s $\color{#35bf28}+1.51\%$
test_parallel 1.1915s 1.1883s 0.8416 Ops/s 0.8273 Ops/s $\color{#35bf28}+1.73\%$
test_step_mdp_speed[True-True-True-True-True] 0.2258ms 30.0770μs 33.2480 KOps/s 33.4267 KOps/s $\color{#d91a1a}-0.53\%$
test_step_mdp_speed[True-True-True-True-False] 43.7610μs 17.7351μs 56.3853 KOps/s 56.9959 KOps/s $\color{#d91a1a}-1.07\%$
test_step_mdp_speed[True-True-True-False-True] 45.4140μs 17.1163μs 58.4239 KOps/s 59.0489 KOps/s $\color{#d91a1a}-1.06\%$
test_step_mdp_speed[True-True-True-False-False] 40.9660μs 10.0031μs 99.9693 KOps/s 99.6934 KOps/s $\color{#35bf28}+0.28\%$
test_step_mdp_speed[True-True-False-True-True] 0.6736ms 32.5021μs 30.7673 KOps/s 31.2963 KOps/s $\color{#d91a1a}-1.69\%$
test_step_mdp_speed[True-True-False-True-False] 63.3880μs 19.6946μs 50.7755 KOps/s 51.4479 KOps/s $\color{#d91a1a}-1.31\%$
test_step_mdp_speed[True-True-False-False-True] 48.2700μs 18.7178μs 53.4251 KOps/s 52.3965 KOps/s $\color{#35bf28}+1.96\%$
test_step_mdp_speed[True-True-False-False-False] 40.4050μs 11.7387μs 85.1884 KOps/s 84.0708 KOps/s $\color{#35bf28}+1.33\%$
test_step_mdp_speed[True-False-True-True-True] 74.4490μs 33.8446μs 29.5468 KOps/s 29.5565 KOps/s $\color{#d91a1a}-0.03\%$
test_step_mdp_speed[True-False-True-True-False] 64.1190μs 21.2952μs 46.9589 KOps/s 46.8994 KOps/s $\color{#35bf28}+0.13\%$
test_step_mdp_speed[True-False-True-False-True] 60.8440μs 18.8134μs 53.1537 KOps/s 53.0037 KOps/s $\color{#35bf28}+0.28\%$
test_step_mdp_speed[True-False-True-False-False] 38.9020μs 11.7288μs 85.2599 KOps/s 84.6724 KOps/s $\color{#35bf28}+0.69\%$
test_step_mdp_speed[True-False-False-True-True] 82.3340μs 35.4785μs 28.1861 KOps/s 28.1872 KOps/s $-0.00\%$
test_step_mdp_speed[True-False-False-True-False] 74.8690μs 22.8780μs 43.7102 KOps/s 43.3191 KOps/s $\color{#35bf28}+0.90\%$
test_step_mdp_speed[True-False-False-False-True] 52.9890μs 20.4966μs 48.7886 KOps/s 48.6861 KOps/s $\color{#35bf28}+0.21\%$
test_step_mdp_speed[True-False-False-False-False] 41.8980μs 13.3040μs 75.1655 KOps/s 73.7280 KOps/s $\color{#35bf28}+1.95\%$
test_step_mdp_speed[False-True-True-True-True] 68.2680μs 33.7809μs 29.6025 KOps/s 29.4177 KOps/s $\color{#35bf28}+0.63\%$
test_step_mdp_speed[False-True-True-True-False] 63.9890μs 21.1980μs 47.1743 KOps/s 46.3443 KOps/s $\color{#35bf28}+1.79\%$
test_step_mdp_speed[False-True-True-False-True] 2.0767ms 21.6139μs 46.2664 KOps/s 46.6322 KOps/s $\color{#d91a1a}-0.78\%$
test_step_mdp_speed[False-True-True-False-False] 36.7390μs 13.0056μs 76.8899 KOps/s 75.8264 KOps/s $\color{#35bf28}+1.40\%$
test_step_mdp_speed[False-True-False-True-True] 72.1150μs 35.3808μs 28.2639 KOps/s 28.3859 KOps/s $\color{#d91a1a}-0.43\%$
test_step_mdp_speed[False-True-False-True-False] 55.1220μs 22.9350μs 43.6014 KOps/s 43.1794 KOps/s $\color{#35bf28}+0.98\%$
test_step_mdp_speed[False-True-False-False-True] 0.5881ms 23.1112μs 43.2691 KOps/s 42.6205 KOps/s $\color{#35bf28}+1.52\%$
test_step_mdp_speed[False-True-False-False-False] 37.4900μs 14.7215μs 67.9280 KOps/s 66.5287 KOps/s $\color{#35bf28}+2.10\%$
test_step_mdp_speed[False-False-True-True-True] 76.6930μs 37.2729μs 26.8292 KOps/s 26.5434 KOps/s $\color{#35bf28}+1.08\%$
test_step_mdp_speed[False-False-True-True-False] 69.2390μs 24.7887μs 40.3410 KOps/s 39.6889 KOps/s $\color{#35bf28}+1.64\%$
test_step_mdp_speed[False-False-True-False-True] 49.7130μs 22.9843μs 43.5080 KOps/s 43.3091 KOps/s $\color{#35bf28}+0.46\%$
test_step_mdp_speed[False-False-True-False-False] 46.2660μs 14.7931μs 67.5992 KOps/s 66.5951 KOps/s $\color{#35bf28}+1.51\%$
test_step_mdp_speed[False-False-False-True-True] 76.6830μs 39.0137μs 25.6320 KOps/s 25.4247 KOps/s $\color{#35bf28}+0.82\%$
test_step_mdp_speed[False-False-False-True-False] 54.9130μs 26.1823μs 38.1938 KOps/s 37.2856 KOps/s $\color{#35bf28}+2.44\%$
test_step_mdp_speed[False-False-False-False-True] 69.8200μs 24.6105μs 40.6330 KOps/s 40.0645 KOps/s $\color{#35bf28}+1.42\%$
test_step_mdp_speed[False-False-False-False-False] 47.5890μs 16.3811μs 61.0458 KOps/s 60.0207 KOps/s $\color{#35bf28}+1.71\%$
test_values[generalized_advantage_estimate-True-True] 12.4867ms 9.8403ms 101.6226 Ops/s 105.1051 Ops/s $\color{#d91a1a}-3.31\%$
test_values[vec_generalized_advantage_estimate-True-True] 26.1179ms 24.0716ms 41.5427 Ops/s 41.5789 Ops/s $\color{#d91a1a}-0.09\%$
test_values[td0_return_estimate-False-False] 0.2331ms 0.1784ms 5.6063 KOps/s 5.6666 KOps/s $\color{#d91a1a}-1.06\%$
test_values[td1_return_estimate-False-False] 35.8148ms 24.5964ms 40.6563 Ops/s 42.4956 Ops/s $\color{#d91a1a}-4.33\%$
test_values[vec_td1_return_estimate-False-False] 27.3763ms 24.1115ms 41.4739 Ops/s 41.3462 Ops/s $\color{#35bf28}+0.31\%$
test_values[td_lambda_return_estimate-True-False] 38.2110ms 34.9934ms 28.5769 Ops/s 29.7983 Ops/s $\color{#d91a1a}-4.10\%$
test_values[vec_td_lambda_return_estimate-True-False] 26.0230ms 23.9899ms 41.6843 Ops/s 41.5664 Ops/s $\color{#35bf28}+0.28\%$
test_gae_speed[generalized_advantage_estimate-False-1-512] 8.7169ms 8.5824ms 116.5171 Ops/s 120.8086 Ops/s $\color{#d91a1a}-3.55\%$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 2.2189ms 1.9807ms 504.8779 Ops/s 515.5162 Ops/s $\color{#d91a1a}-2.06\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 0.5957ms 0.3701ms 2.7021 KOps/s 2.7261 KOps/s $\color{#d91a1a}-0.88\%$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 43.8499ms 41.0750ms 24.3457 Ops/s 24.4562 Ops/s $\color{#d91a1a}-0.45\%$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 4.2968ms 3.4352ms 291.0999 Ops/s 292.9386 Ops/s $\color{#d91a1a}-0.63\%$
test_dqn_speed[False-None] 1.8644ms 1.4120ms 708.2131 Ops/s 709.6372 Ops/s $\color{#d91a1a}-0.20\%$
test_dqn_speed[False-backward] 2.1715ms 1.9164ms 521.8027 Ops/s 514.0916 Ops/s $\color{#35bf28}+1.50\%$
test_dqn_speed[True-None] 0.6123ms 0.4760ms 2.1008 KOps/s 2.0708 KOps/s $\color{#35bf28}+1.45\%$
test_dqn_speed[True-backward] 0.9340ms 0.8942ms 1.1183 KOps/s 964.8269 Ops/s $\textbf{\color{#35bf28}+15.90\%}$
test_dqn_speed[reduce-overhead-None] 0.6813ms 0.4790ms 2.0876 KOps/s 2.0478 KOps/s $\color{#35bf28}+1.94\%$
test_dqn_speed[reduce-overhead-backward] 0.9261ms 0.8896ms 1.1241 KOps/s 1.0840 KOps/s $\color{#35bf28}+3.70\%$
test_ddpg_speed[False-None] 4.8076ms 2.8770ms 347.5813 Ops/s 349.0537 Ops/s $\color{#d91a1a}-0.42\%$
test_ddpg_speed[False-backward] 4.5617ms 4.0160ms 249.0025 Ops/s 248.5812 Ops/s $\color{#35bf28}+0.17\%$
test_ddpg_speed[True-None] 1.6054ms 1.2095ms 826.8102 Ops/s 812.0327 Ops/s $\color{#35bf28}+1.82\%$
test_ddpg_speed[True-backward] 2.1696ms 2.0848ms 479.6737 Ops/s 416.6361 Ops/s $\textbf{\color{#35bf28}+15.13\%}$
test_ddpg_speed[reduce-overhead-None] 1.7249ms 1.2064ms 828.8948 Ops/s 814.0248 Ops/s $\color{#35bf28}+1.83\%$
test_ddpg_speed[reduce-overhead-backward] 2.1376ms 2.0870ms 479.1659 Ops/s 471.0851 Ops/s $\color{#35bf28}+1.72\%$
test_sac_speed[False-None] 8.3346ms 7.9653ms 125.5440 Ops/s 125.4128 Ops/s $\color{#35bf28}+0.10\%$
test_sac_speed[False-backward] 12.6197ms 10.7063ms 93.4027 Ops/s 93.5425 Ops/s $\color{#d91a1a}-0.15\%$
test_sac_speed[True-None] 2.7326ms 2.0744ms 482.0754 Ops/s 478.9503 Ops/s $\color{#35bf28}+0.65\%$
test_sac_speed[True-backward] 4.1281ms 3.7679ms 265.4020 Ops/s 265.4845 Ops/s $\color{#d91a1a}-0.03\%$
test_sac_speed[reduce-overhead-None] 2.6520ms 2.0713ms 482.7875 Ops/s 479.3341 Ops/s $\color{#35bf28}+0.72\%$
test_sac_speed[reduce-overhead-backward] 4.0010ms 3.7497ms 266.6888 Ops/s 259.9396 Ops/s $\color{#35bf28}+2.60\%$
test_redq_speed[False-None] 15.0623ms 12.7110ms 78.6721 Ops/s 77.1472 Ops/s $\color{#35bf28}+1.98\%$
test_redq_speed[False-backward] 23.2909ms 21.9865ms 45.4824 Ops/s 44.1034 Ops/s $\color{#35bf28}+3.13\%$
test_redq_speed[True-None] 5.3542ms 4.7529ms 210.3971 Ops/s 199.9282 Ops/s $\textbf{\color{#35bf28}+5.24\%}$
test_redq_speed[True-backward] 13.9646ms 12.3118ms 81.2231 Ops/s 78.3148 Ops/s $\color{#35bf28}+3.71\%$
test_redq_speed[reduce-overhead-None] 5.3046ms 4.7885ms 208.8318 Ops/s 201.1785 Ops/s $\color{#35bf28}+3.80\%$
test_redq_speed[reduce-overhead-backward] 14.2179ms 12.2827ms 81.4156 Ops/s 79.2769 Ops/s $\color{#35bf28}+2.70\%$
test_redq_deprec_speed[False-None] 13.5490ms 12.6826ms 78.8484 Ops/s 75.3738 Ops/s $\color{#35bf28}+4.61\%$
test_redq_deprec_speed[False-backward] 19.7618ms 18.4078ms 54.3249 Ops/s 53.8286 Ops/s $\color{#35bf28}+0.92\%$
test_redq_deprec_speed[True-None] 4.5126ms 3.8086ms 262.5628 Ops/s 259.7770 Ops/s $\color{#35bf28}+1.07\%$
test_redq_deprec_speed[True-backward] 8.5001ms 8.1967ms 122.0007 Ops/s 121.7357 Ops/s $\color{#35bf28}+0.22\%$
test_redq_deprec_speed[reduce-overhead-None] 4.2440ms 3.7905ms 263.8195 Ops/s 258.0938 Ops/s $\color{#35bf28}+2.22\%$
test_redq_deprec_speed[reduce-overhead-backward] 8.7740ms 8.1214ms 123.1311 Ops/s 121.3672 Ops/s $\color{#35bf28}+1.45\%$
test_td3_speed[False-None] 8.1754ms 7.9473ms 125.8297 Ops/s 124.0707 Ops/s $\color{#35bf28}+1.42\%$
test_td3_speed[False-backward] 10.6942ms 10.3680ms 96.4502 Ops/s 94.6753 Ops/s $\color{#35bf28}+1.87\%$
test_td3_speed[True-None] 1.9065ms 1.7632ms 567.1518 Ops/s 554.3992 Ops/s $\color{#35bf28}+2.30\%$
test_td3_speed[True-backward] 3.4172ms 3.3314ms 300.1752 Ops/s 289.1744 Ops/s $\color{#35bf28}+3.80\%$
test_td3_speed[reduce-overhead-None] 1.9565ms 1.7532ms 570.3871 Ops/s 551.4316 Ops/s $\color{#35bf28}+3.44\%$
test_td3_speed[reduce-overhead-backward] 4.4474ms 3.3747ms 296.3203 Ops/s 294.7049 Ops/s $\color{#35bf28}+0.55\%$
test_cql_speed[False-None] 38.2270ms 36.0905ms 27.7081 Ops/s 27.4878 Ops/s $\color{#35bf28}+0.80\%$
test_cql_speed[False-backward] 49.4989ms 46.2950ms 21.6006 Ops/s 21.4458 Ops/s $\color{#35bf28}+0.72\%$
test_cql_speed[True-None] 17.2368ms 15.8510ms 63.0875 Ops/s 62.6124 Ops/s $\color{#35bf28}+0.76\%$
test_cql_speed[True-backward] 24.2263ms 22.6412ms 44.1672 Ops/s 44.4328 Ops/s $\color{#d91a1a}-0.60\%$
test_cql_speed[reduce-overhead-None] 17.8785ms 16.0784ms 62.1954 Ops/s 62.4478 Ops/s $\color{#d91a1a}-0.40\%$
test_cql_speed[reduce-overhead-backward] 23.7029ms 22.6771ms 44.0973 Ops/s 43.8008 Ops/s $\color{#35bf28}+0.68\%$
test_a2c_speed[False-None] 9.0690ms 7.1109ms 140.6301 Ops/s 137.7883 Ops/s $\color{#35bf28}+2.06\%$
test_a2c_speed[False-backward] 15.9227ms 14.3446ms 69.7128 Ops/s 69.3239 Ops/s $\color{#35bf28}+0.56\%$
test_a2c_speed[True-None] 4.4061ms 3.6516ms 273.8538 Ops/s 267.1499 Ops/s $\color{#35bf28}+2.51\%$
test_a2c_speed[True-backward] 10.7419ms 10.1811ms 98.2215 Ops/s 98.5005 Ops/s $\color{#d91a1a}-0.28\%$
test_a2c_speed[reduce-overhead-None] 4.4507ms 3.6761ms 272.0302 Ops/s 268.0416 Ops/s $\color{#35bf28}+1.49\%$
test_a2c_speed[reduce-overhead-backward] 10.4456ms 10.0872ms 99.1356 Ops/s 98.4581 Ops/s $\color{#35bf28}+0.69\%$
test_ppo_speed[False-None] 8.9452ms 7.4888ms 133.5322 Ops/s 134.1582 Ops/s $\color{#d91a1a}-0.47\%$
test_ppo_speed[False-backward] 16.2305ms 15.0968ms 66.2394 Ops/s 68.3879 Ops/s $\color{#d91a1a}-3.14\%$
test_ppo_speed[True-None] 4.3827ms 4.0493ms 246.9550 Ops/s 243.5010 Ops/s $\color{#35bf28}+1.42\%$
test_ppo_speed[True-backward] 10.6085ms 9.9845ms 100.1551 Ops/s 99.4548 Ops/s $\color{#35bf28}+0.70\%$
test_ppo_speed[reduce-overhead-None] 4.3787ms 4.0576ms 246.4526 Ops/s 241.8856 Ops/s $\color{#35bf28}+1.89\%$
test_ppo_speed[reduce-overhead-backward] 21.4250ms 10.4007ms 96.1469 Ops/s 99.4955 Ops/s $\color{#d91a1a}-3.37\%$
test_reinforce_speed[False-None] 7.6232ms 6.5407ms 152.8893 Ops/s 150.7580 Ops/s $\color{#35bf28}+1.41\%$
test_reinforce_speed[False-backward] 10.1904ms 9.8058ms 101.9800 Ops/s 102.0578 Ops/s $\color{#d91a1a}-0.08\%$
test_reinforce_speed[True-None] 3.6183ms 3.0308ms 329.9450 Ops/s 323.5602 Ops/s $\color{#35bf28}+1.97\%$
test_reinforce_speed[True-backward] 9.6294ms 9.0138ms 110.9409 Ops/s 111.2174 Ops/s $\color{#d91a1a}-0.25\%$
test_reinforce_speed[reduce-overhead-None] 3.8484ms 3.0553ms 327.2948 Ops/s 326.7491 Ops/s $\color{#35bf28}+0.17\%$
test_reinforce_speed[reduce-overhead-backward] 9.6667ms 9.0192ms 110.8740 Ops/s 110.8522 Ops/s $\color{#35bf28}+0.02\%$
test_iql_speed[False-None] 34.1941ms 32.3968ms 30.8673 Ops/s 30.3836 Ops/s $\color{#35bf28}+1.59\%$
test_iql_speed[False-backward] 48.8920ms 45.3909ms 22.0309 Ops/s 22.1160 Ops/s $\color{#d91a1a}-0.38\%$
test_iql_speed[True-None] 12.3641ms 11.3472ms 88.1276 Ops/s 89.0598 Ops/s $\color{#d91a1a}-1.05\%$
test_iql_speed[True-backward] 22.9372ms 22.0014ms 45.4517 Ops/s 44.6225 Ops/s $\color{#35bf28}+1.86\%$
test_iql_speed[reduce-overhead-None] 14.6679ms 11.3145ms 88.3821 Ops/s 88.2352 Ops/s $\color{#35bf28}+0.17\%$
test_iql_speed[reduce-overhead-backward] 23.0765ms 21.9010ms 45.6599 Ops/s 44.7345 Ops/s $\color{#35bf28}+2.07\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 5.9503ms 4.7305ms 211.3958 Ops/s 202.4340 Ops/s $\color{#35bf28}+4.43\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.7652ms 0.5123ms 1.9521 KOps/s 1.9347 KOps/s $\color{#35bf28}+0.90\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.8174ms 0.4909ms 2.0371 KOps/s 2.0447 KOps/s $\color{#d91a1a}-0.37\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 5.1491ms 4.5375ms 220.3863 Ops/s 214.0506 Ops/s $\color{#35bf28}+2.96\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 3.2539ms 0.5064ms 1.9746 KOps/s 2.0113 KOps/s $\color{#d91a1a}-1.82\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.7687ms 0.4809ms 2.0796 KOps/s 2.0619 KOps/s $\color{#35bf28}+0.86\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] 2.1572ms 1.6285ms 614.0519 Ops/s 606.2814 Ops/s $\color{#35bf28}+1.28\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] 2.2507ms 1.5483ms 645.8784 Ops/s 647.7102 Ops/s $\color{#d91a1a}-0.28\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 5.7429ms 4.7039ms 212.5913 Ops/s 208.3259 Ops/s $\color{#35bf28}+2.05\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 1.1048ms 0.6494ms 1.5399 KOps/s 1.5453 KOps/s $\color{#d91a1a}-0.35\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.9970ms 0.6266ms 1.5960 KOps/s 1.6046 KOps/s $\color{#d91a1a}-0.54\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 7.4574ms 4.5754ms 218.5600 Ops/s 209.0170 Ops/s $\color{#35bf28}+4.57\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 1.5120ms 0.5175ms 1.9324 KOps/s 1.9570 KOps/s $\color{#d91a1a}-1.26\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.7905ms 0.5004ms 1.9985 KOps/s 2.0309 KOps/s $\color{#d91a1a}-1.59\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 4.8302ms 4.5074ms 221.8590 Ops/s 212.8297 Ops/s $\color{#35bf28}+4.24\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 2.2422ms 0.5105ms 1.9590 KOps/s 2.0167 KOps/s $\color{#d91a1a}-2.86\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.7144ms 0.4895ms 2.0430 KOps/s 2.0799 KOps/s $\color{#d91a1a}-1.77\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 5.1723ms 4.7036ms 212.6051 Ops/s 203.7806 Ops/s $\color{#35bf28}+4.33\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 1.3590ms 0.6521ms 1.5335 KOps/s 1.5158 KOps/s $\color{#35bf28}+1.17\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 1.2981ms 0.6770ms 1.4770 KOps/s 1.6134 KOps/s $\textbf{\color{#d91a1a}-8.45\%}$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 8.0891ms 4.2546ms 235.0422 Ops/s 230.0889 Ops/s $\color{#35bf28}+2.15\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 7.4893ms 2.3437ms 426.6798 Ops/s 438.9826 Ops/s $\color{#d91a1a}-2.80\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 4.1224ms 1.2933ms 773.2288 Ops/s 703.9349 Ops/s $\textbf{\color{#35bf28}+9.84\%}$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 5.7153ms 4.2102ms 237.5176 Ops/s 243.5859 Ops/s $\color{#d91a1a}-2.49\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 9.7741ms 2.3083ms 433.2240 Ops/s 433.6033 Ops/s $\color{#d91a1a}-0.09\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 1.8848ms 1.2450ms 803.2417 Ops/s 704.4069 Ops/s $\textbf{\color{#35bf28}+14.03\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 0.4467s 13.2443ms 75.5044 Ops/s 233.4336 Ops/s $\textbf{\color{#d91a1a}-67.65\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 7.0370ms 2.4545ms 407.4086 Ops/s 370.1409 Ops/s $\textbf{\color{#35bf28}+10.07\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 6.3085ms 1.4925ms 670.0107 Ops/s 693.8667 Ops/s $\color{#d91a1a}-3.44\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-True] 11.8002ms 11.5610ms 86.4974 Ops/s 81.9345 Ops/s $\textbf{\color{#35bf28}+5.57\%}$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-False] 15.3869ms 14.3013ms 69.9237 Ops/s 70.7956 Ops/s $\color{#d91a1a}-1.23\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-True] 21.6049ms 20.3381ms 49.1688 Ops/s 46.6364 Ops/s $\textbf{\color{#35bf28}+5.43\%}$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-False] 16.2428ms 14.5935ms 68.5239 Ops/s 67.3632 Ops/s $\color{#35bf28}+1.72\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-True] 20.6814ms 20.3135ms 49.2283 Ops/s 47.6510 Ops/s $\color{#35bf28}+3.31\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-False] 17.5292ms 15.8141ms 63.2346 Ops/s 62.5829 Ops/s $\color{#35bf28}+1.04\%$

Copy link

github-actions bot commented Feb 7, 2025

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 149. Improved: $\large\color{#35bf28}21$. Worsened: $\large\color{#d91a1a}8$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_simple 0.8381s 0.7553s 1.3239 Ops/s 1.3278 Ops/s $\color{#d91a1a}-0.29\%$
test_transformed 1.4146s 1.3270s 0.7536 Ops/s 0.7578 Ops/s $\color{#d91a1a}-0.56\%$
test_serial 2.1771s 2.1647s 0.4620 Ops/s 0.4585 Ops/s $\color{#35bf28}+0.74\%$
test_parallel 1.8536s 1.8263s 0.5476 Ops/s 0.5345 Ops/s $\color{#35bf28}+2.44\%$
test_step_mdp_speed[True-True-True-True-True] 0.2369ms 40.9835μs 24.4001 KOps/s 25.1431 KOps/s $\color{#d91a1a}-2.96\%$
test_step_mdp_speed[True-True-True-True-False] 82.6020μs 23.5066μs 42.5412 KOps/s 42.9529 KOps/s $\color{#d91a1a}-0.96\%$
test_step_mdp_speed[True-True-True-False-True] 50.9610μs 22.6069μs 44.2344 KOps/s 45.6500 KOps/s $\color{#d91a1a}-3.10\%$
test_step_mdp_speed[True-True-True-False-False] 76.0210μs 13.1180μs 76.2311 KOps/s 76.5184 KOps/s $\color{#d91a1a}-0.38\%$
test_step_mdp_speed[True-True-False-True-True] 68.1520μs 43.0965μs 23.2037 KOps/s 23.4905 KOps/s $\color{#d91a1a}-1.22\%$
test_step_mdp_speed[True-True-False-True-False] 74.7110μs 25.8068μs 38.7494 KOps/s 38.6941 KOps/s $\color{#35bf28}+0.14\%$
test_step_mdp_speed[True-True-False-False-True] 55.3610μs 24.7374μs 40.4246 KOps/s 40.2190 KOps/s $\color{#35bf28}+0.51\%$
test_step_mdp_speed[True-True-False-False-False] 50.0510μs 15.4345μs 64.7898 KOps/s 65.0144 KOps/s $\color{#d91a1a}-0.35\%$
test_step_mdp_speed[True-False-True-True-True] 0.1504ms 45.0610μs 22.1921 KOps/s 22.1072 KOps/s $\color{#35bf28}+0.38\%$
test_step_mdp_speed[True-False-True-True-False] 97.4310μs 28.2190μs 35.4371 KOps/s 36.6532 KOps/s $\color{#d91a1a}-3.32\%$
test_step_mdp_speed[True-False-True-False-True] 52.2610μs 24.7534μs 40.3985 KOps/s 40.5999 KOps/s $\color{#d91a1a}-0.50\%$
test_step_mdp_speed[True-False-True-False-False] 0.2056ms 15.5182μs 64.4404 KOps/s 65.2159 KOps/s $\color{#d91a1a}-1.19\%$
test_step_mdp_speed[True-False-False-True-True] 0.2384ms 47.5290μs 21.0398 KOps/s 21.3056 KOps/s $\color{#d91a1a}-1.25\%$
test_step_mdp_speed[True-False-False-True-False] 67.5410μs 30.4947μs 32.7926 KOps/s 33.0241 KOps/s $\color{#d91a1a}-0.70\%$
test_step_mdp_speed[True-False-False-False-True] 59.4810μs 26.9346μs 37.1269 KOps/s 36.9365 KOps/s $\color{#35bf28}+0.52\%$
test_step_mdp_speed[True-False-False-False-False] 0.1940ms 17.7740μs 56.2621 KOps/s 58.2699 KOps/s $\color{#d91a1a}-3.45\%$
test_step_mdp_speed[False-True-True-True-True] 74.1810μs 45.3758μs 22.0382 KOps/s 22.5141 KOps/s $\color{#d91a1a}-2.11\%$
test_step_mdp_speed[False-True-True-True-False] 55.6410μs 28.2634μs 35.3814 KOps/s 36.0231 KOps/s $\color{#d91a1a}-1.78\%$
test_step_mdp_speed[False-True-True-False-True] 2.5828ms 29.0491μs 34.4245 KOps/s 35.1311 KOps/s $\color{#d91a1a}-2.01\%$
test_step_mdp_speed[False-True-True-False-False] 74.1510μs 17.0382μs 58.6917 KOps/s 59.4815 KOps/s $\color{#d91a1a}-1.33\%$
test_step_mdp_speed[False-True-False-True-True] 72.6110μs 46.9026μs 21.3208 KOps/s 21.3779 KOps/s $\color{#d91a1a}-0.27\%$
test_step_mdp_speed[False-True-False-True-False] 54.4010μs 30.4125μs 32.8813 KOps/s 32.9206 KOps/s $\color{#d91a1a}-0.12\%$
test_step_mdp_speed[False-True-False-False-True] 59.0420μs 30.6126μs 32.6663 KOps/s 31.9432 KOps/s $\color{#35bf28}+2.26\%$
test_step_mdp_speed[False-True-False-False-False] 73.1110μs 19.4264μs 51.4763 KOps/s 51.8095 KOps/s $\color{#d91a1a}-0.64\%$
test_step_mdp_speed[False-False-True-True-True] 90.4910μs 48.8906μs 20.4538 KOps/s 20.2387 KOps/s $\color{#35bf28}+1.06\%$
test_step_mdp_speed[False-False-True-True-False] 62.6210μs 33.1186μs 30.1945 KOps/s 30.8956 KOps/s $\color{#d91a1a}-2.27\%$
test_step_mdp_speed[False-False-True-False-True] 62.2720μs 30.8551μs 32.4096 KOps/s 33.1607 KOps/s $\color{#d91a1a}-2.27\%$
test_step_mdp_speed[False-False-True-False-False] 48.6510μs 19.3738μs 51.6161 KOps/s 51.9905 KOps/s $\color{#d91a1a}-0.72\%$
test_step_mdp_speed[False-False-False-True-True] 81.7610μs 51.7876μs 19.3096 KOps/s 19.5767 KOps/s $\color{#d91a1a}-1.36\%$
test_step_mdp_speed[False-False-False-True-False] 61.3810μs 34.9747μs 28.5921 KOps/s 29.0643 KOps/s $\color{#d91a1a}-1.62\%$
test_step_mdp_speed[False-False-False-False-True] 60.2510μs 32.2885μs 30.9708 KOps/s 31.0260 KOps/s $\color{#d91a1a}-0.18\%$
test_step_mdp_speed[False-False-False-False-False] 49.7510μs 21.7891μs 45.8946 KOps/s 47.1651 KOps/s $\color{#d91a1a}-2.69\%$
test_values[generalized_advantage_estimate-True-True] 26.0394ms 25.6493ms 38.9874 Ops/s 38.8617 Ops/s $\color{#35bf28}+0.32\%$
test_values[vec_generalized_advantage_estimate-True-True] 0.1058s 3.0188ms 331.2626 Ops/s 319.2596 Ops/s $\color{#35bf28}+3.76\%$
test_values[td0_return_estimate-False-False] 0.1097ms 83.2448μs 12.0128 KOps/s 12.1114 KOps/s $\color{#d91a1a}-0.81\%$
test_values[td1_return_estimate-False-False] 57.5466ms 57.1132ms 17.5091 Ops/s 17.4735 Ops/s $\color{#35bf28}+0.20\%$
test_values[vec_td1_return_estimate-False-False] 1.2932ms 1.0932ms 914.7349 Ops/s 908.7640 Ops/s $\color{#35bf28}+0.66\%$
test_values[td_lambda_return_estimate-True-False] 91.2369ms 90.6472ms 11.0318 Ops/s 11.0903 Ops/s $\color{#d91a1a}-0.53\%$
test_values[vec_td_lambda_return_estimate-True-False] 1.3804ms 1.0986ms 910.2521 Ops/s 915.5156 Ops/s $\color{#d91a1a}-0.57\%$
test_gae_speed[generalized_advantage_estimate-False-1-512] 25.7560ms 25.6025ms 39.0587 Ops/s 38.7557 Ops/s $\color{#35bf28}+0.78\%$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 1.0467ms 0.7726ms 1.2943 KOps/s 1.2984 KOps/s $\color{#d91a1a}-0.31\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 0.8168ms 0.6815ms 1.4673 KOps/s 1.4678 KOps/s $\color{#d91a1a}-0.03\%$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 1.6587ms 1.4983ms 667.4336 Ops/s 667.9362 Ops/s $\color{#d91a1a}-0.08\%$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 0.8707ms 0.6966ms 1.4354 KOps/s 1.4317 KOps/s $\color{#35bf28}+0.26\%$
test_dqn_speed[False-None] 6.9068ms 1.5446ms 647.4091 Ops/s 655.1339 Ops/s $\color{#d91a1a}-1.18\%$
test_dqn_speed[False-backward] 2.2967ms 2.1374ms 467.8505 Ops/s 462.0823 Ops/s $\color{#35bf28}+1.25\%$
test_dqn_speed[True-None] 0.7276ms 0.5530ms 1.8084 KOps/s 1.7076 KOps/s $\textbf{\color{#35bf28}+5.90\%}$
test_dqn_speed[True-backward] 1.2932ms 1.2307ms 812.5259 Ops/s 856.7447 Ops/s $\textbf{\color{#d91a1a}-5.16\%}$
test_dqn_speed[reduce-overhead-None] 0.7498ms 0.5709ms 1.7515 KOps/s 1.6827 KOps/s $\color{#35bf28}+4.09\%$
test_dqn_speed[reduce-overhead-backward] 1.2200ms 1.0733ms 931.7105 Ops/s 1.0256 KOps/s $\textbf{\color{#d91a1a}-9.15\%}$
test_ddpg_speed[False-None] 3.2174ms 2.8891ms 346.1345 Ops/s 344.1966 Ops/s $\color{#35bf28}+0.56\%$
test_ddpg_speed[False-backward] 4.7182ms 4.2869ms 233.2679 Ops/s 238.3009 Ops/s $\color{#d91a1a}-2.11\%$
test_ddpg_speed[True-None] 1.5222ms 1.3475ms 742.0906 Ops/s 737.4483 Ops/s $\color{#35bf28}+0.63\%$
test_ddpg_speed[True-backward] 2.6993ms 2.5817ms 387.3447 Ops/s 404.5600 Ops/s $\color{#d91a1a}-4.26\%$
test_ddpg_speed[reduce-overhead-None] 1.5520ms 1.3582ms 736.2790 Ops/s 722.7944 Ops/s $\color{#35bf28}+1.87\%$
test_ddpg_speed[reduce-overhead-backward] 2.2200ms 2.0420ms 489.7211 Ops/s 518.2983 Ops/s $\textbf{\color{#d91a1a}-5.51\%}$
test_sac_speed[False-None] 8.4899ms 8.1031ms 123.4097 Ops/s 122.0076 Ops/s $\color{#35bf28}+1.15\%$
test_sac_speed[False-backward] 11.9305ms 11.3130ms 88.3940 Ops/s 89.9252 Ops/s $\color{#d91a1a}-1.70\%$
test_sac_speed[True-None] 2.0088ms 1.8448ms 542.0577 Ops/s 533.2998 Ops/s $\color{#35bf28}+1.64\%$
test_sac_speed[True-backward] 3.8698ms 3.7168ms 269.0475 Ops/s 275.3000 Ops/s $\color{#d91a1a}-2.27\%$
test_sac_speed[reduce-overhead-None] 21.0375ms 11.9012ms 84.0248 Ops/s 83.4191 Ops/s $\color{#35bf28}+0.73\%$
test_sac_speed[reduce-overhead-backward] 1.9505ms 1.7910ms 558.3417 Ops/s 594.4868 Ops/s $\textbf{\color{#d91a1a}-6.08\%}$
test_redq_speed[False-None] 8.0230ms 7.5307ms 132.7907 Ops/s 130.2206 Ops/s $\color{#35bf28}+1.97\%$
test_redq_speed[False-backward] 12.5237ms 11.8041ms 84.7164 Ops/s 86.1014 Ops/s $\color{#d91a1a}-1.61\%$
test_redq_speed[True-None] 2.5776ms 2.2938ms 435.9557 Ops/s 430.4739 Ops/s $\color{#35bf28}+1.27\%$
test_redq_speed[True-backward] 4.3797ms 4.1749ms 239.5254 Ops/s 244.3068 Ops/s $\color{#d91a1a}-1.96\%$
test_redq_speed[reduce-overhead-None] 2.4886ms 2.3027ms 434.2795 Ops/s 424.3167 Ops/s $\color{#35bf28}+2.35\%$
test_redq_speed[reduce-overhead-backward] 4.3067ms 4.1616ms 240.2910 Ops/s 233.7297 Ops/s $\color{#35bf28}+2.81\%$
test_redq_deprec_speed[False-None] 9.5737ms 9.1204ms 109.6440 Ops/s 108.2504 Ops/s $\color{#35bf28}+1.29\%$
test_redq_deprec_speed[False-backward] 12.7713ms 12.3301ms 81.1027 Ops/s 80.2497 Ops/s $\color{#35bf28}+1.06\%$
test_redq_deprec_speed[True-None] 2.7934ms 2.6288ms 380.4028 Ops/s 367.1740 Ops/s $\color{#35bf28}+3.60\%$
test_redq_deprec_speed[True-backward] 4.7277ms 4.4877ms 222.8309 Ops/s 219.1613 Ops/s $\color{#35bf28}+1.67\%$
test_redq_deprec_speed[reduce-overhead-None] 2.8007ms 2.6327ms 379.8327 Ops/s 373.2660 Ops/s $\color{#35bf28}+1.76\%$
test_redq_deprec_speed[reduce-overhead-backward] 4.6439ms 4.4744ms 223.4947 Ops/s 216.1022 Ops/s $\color{#35bf28}+3.42\%$
test_td3_speed[False-None] 8.0290ms 7.9934ms 125.1025 Ops/s 123.0775 Ops/s $\color{#35bf28}+1.65\%$
test_td3_speed[False-backward] 10.9776ms 10.5541ms 94.7498 Ops/s 92.7339 Ops/s $\color{#35bf28}+2.17\%$
test_td3_speed[True-None] 1.6867ms 1.6538ms 604.6842 Ops/s 593.3413 Ops/s $\color{#35bf28}+1.91\%$
test_td3_speed[True-backward] 3.4217ms 3.3564ms 297.9404 Ops/s 287.8901 Ops/s $\color{#35bf28}+3.49\%$
test_td3_speed[reduce-overhead-None] 50.1940ms 25.8932ms 38.6202 Ops/s 38.4100 Ops/s $\color{#35bf28}+0.55\%$
test_td3_speed[reduce-overhead-backward] 1.5450ms 1.5046ms 664.6502 Ops/s 656.4163 Ops/s $\color{#35bf28}+1.25\%$
test_cql_speed[False-None] 17.2515ms 16.8489ms 59.3510 Ops/s 58.6376 Ops/s $\color{#35bf28}+1.22\%$
test_cql_speed[False-backward] 23.3425ms 22.4252ms 44.5928 Ops/s 44.2224 Ops/s $\color{#35bf28}+0.84\%$
test_cql_speed[True-None] 3.4340ms 3.2506ms 307.6374 Ops/s 299.0773 Ops/s $\color{#35bf28}+2.86\%$
test_cql_speed[True-backward] 6.2567ms 5.6968ms 175.5381 Ops/s 178.2997 Ops/s $\color{#d91a1a}-1.55\%$
test_cql_speed[reduce-overhead-None] 21.6712ms 13.0819ms 76.4417 Ops/s 76.7747 Ops/s $\color{#d91a1a}-0.43\%$
test_cql_speed[reduce-overhead-backward] 2.1673ms 2.0096ms 497.6084 Ops/s 540.2830 Ops/s $\textbf{\color{#d91a1a}-7.90\%}$
test_a2c_speed[False-None] 3.4772ms 3.2412ms 308.5300 Ops/s 306.4279 Ops/s $\color{#35bf28}+0.69\%$
test_a2c_speed[False-backward] 6.9206ms 6.3425ms 157.6670 Ops/s 162.0587 Ops/s $\color{#d91a1a}-2.71\%$
test_a2c_speed[True-None] 1.5312ms 1.3378ms 747.4936 Ops/s 741.1895 Ops/s $\color{#35bf28}+0.85\%$
test_a2c_speed[True-backward] 3.1801ms 3.0438ms 328.5330 Ops/s 322.0457 Ops/s $\color{#35bf28}+2.01\%$
test_a2c_speed[reduce-overhead-None] 16.3078ms 9.0042ms 111.0594 Ops/s 111.6383 Ops/s $\color{#d91a1a}-0.52\%$
test_a2c_speed[reduce-overhead-backward] 1.7316ms 1.6105ms 620.9087 Ops/s 612.9712 Ops/s $\color{#35bf28}+1.29\%$
test_ppo_speed[False-None] 3.9341ms 3.7621ms 265.8059 Ops/s 261.7868 Ops/s $\color{#35bf28}+1.54\%$
test_ppo_speed[False-backward] 7.9714ms 7.1542ms 139.7781 Ops/s 136.6742 Ops/s $\color{#35bf28}+2.27\%$
test_ppo_speed[True-None] 1.5926ms 1.3947ms 716.9993 Ops/s 706.5781 Ops/s $\color{#35bf28}+1.47\%$
test_ppo_speed[True-backward] 3.2181ms 3.0402ms 328.9298 Ops/s 304.3302 Ops/s $\textbf{\color{#35bf28}+8.08\%}$
test_ppo_speed[reduce-overhead-None] 1.0816ms 0.9414ms 1.0622 KOps/s 1.0366 KOps/s $\color{#35bf28}+2.47\%$
test_ppo_speed[reduce-overhead-backward] 1.4863ms 1.3938ms 717.4572 Ops/s 622.0808 Ops/s $\textbf{\color{#35bf28}+15.33\%}$
test_reinforce_speed[False-None] 2.4977ms 2.2957ms 435.6050 Ops/s 427.2573 Ops/s $\color{#35bf28}+1.95\%$
test_reinforce_speed[False-backward] 3.4186ms 3.2873ms 304.2012 Ops/s 287.0314 Ops/s $\textbf{\color{#35bf28}+5.98\%}$
test_reinforce_speed[True-None] 1.4398ms 1.2761ms 783.6566 Ops/s 767.1025 Ops/s $\color{#35bf28}+2.16\%$
test_reinforce_speed[True-backward] 3.0939ms 2.9268ms 341.6649 Ops/s 324.1993 Ops/s $\textbf{\color{#35bf28}+5.39\%}$
test_reinforce_speed[reduce-overhead-None] 18.7483ms 10.1020ms 98.9901 Ops/s 101.4954 Ops/s $\color{#d91a1a}-2.47\%$
test_reinforce_speed[reduce-overhead-backward] 1.5216ms 1.4584ms 685.6675 Ops/s 599.2075 Ops/s $\textbf{\color{#35bf28}+14.43\%}$
test_iql_speed[False-None] 9.9445ms 9.2355ms 108.2782 Ops/s 105.0217 Ops/s $\color{#35bf28}+3.10\%$
test_iql_speed[False-backward] 13.3622ms 12.8100ms 78.0641 Ops/s 73.7372 Ops/s $\textbf{\color{#35bf28}+5.87\%}$
test_iql_speed[True-None] 2.4693ms 2.2215ms 450.1533 Ops/s 428.9606 Ops/s $\color{#35bf28}+4.94\%$
test_iql_speed[True-backward] 5.1493ms 4.9361ms 202.5884 Ops/s 199.2703 Ops/s $\color{#35bf28}+1.67\%$
test_iql_speed[reduce-overhead-None] 19.9892ms 11.1277ms 89.8659 Ops/s 90.1257 Ops/s $\color{#d91a1a}-0.29\%$
test_iql_speed[reduce-overhead-backward] 2.1854ms 2.0429ms 489.4914 Ops/s 464.2431 Ops/s $\textbf{\color{#35bf28}+5.44\%}$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 7.8375ms 6.3283ms 158.0204 Ops/s 155.2534 Ops/s $\color{#35bf28}+1.78\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.5194ms 0.3145ms 3.1798 KOps/s 2.8442 KOps/s $\textbf{\color{#35bf28}+11.80\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.5097ms 0.2700ms 3.7031 KOps/s 3.2749 KOps/s $\textbf{\color{#35bf28}+13.07\%}$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 6.4030ms 6.0699ms 164.7463 Ops/s 163.4839 Ops/s $\color{#35bf28}+0.77\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 1.8438ms 0.3209ms 3.1162 KOps/s 3.2012 KOps/s $\color{#d91a1a}-2.66\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.4995ms 0.2909ms 3.4371 KOps/s 3.3915 KOps/s $\color{#35bf28}+1.35\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] 1.7269ms 1.4467ms 691.2162 Ops/s 724.5325 Ops/s $\color{#d91a1a}-4.60\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] 1.6961ms 1.3846ms 722.2392 Ops/s 767.3670 Ops/s $\textbf{\color{#d91a1a}-5.88\%}$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.3953ms 6.2285ms 160.5536 Ops/s 158.7843 Ops/s $\color{#35bf28}+1.11\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 2.2640ms 0.4323ms 2.3132 KOps/s 2.0969 KOps/s $\textbf{\color{#35bf28}+10.31\%}$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.6899ms 0.4171ms 2.3973 KOps/s 2.2236 KOps/s $\textbf{\color{#35bf28}+7.81\%}$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 6.4788ms 6.1113ms 163.6323 Ops/s 162.6789 Ops/s $\color{#35bf28}+0.59\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.8041ms 0.2740ms 3.6493 KOps/s 3.6630 KOps/s $\color{#d91a1a}-0.37\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.5787ms 0.2989ms 3.3451 KOps/s 3.9341 KOps/s $\textbf{\color{#d91a1a}-14.97\%}$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 6.4097ms 6.0511ms 165.2594 Ops/s 163.5496 Ops/s $\color{#35bf28}+1.05\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 0.6261ms 0.2681ms 3.7299 KOps/s 2.9448 KOps/s $\textbf{\color{#35bf28}+26.66\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.4945ms 0.2717ms 3.6801 KOps/s 3.9978 KOps/s $\textbf{\color{#d91a1a}-7.95\%}$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.3473ms 6.1610ms 162.3103 Ops/s 158.4395 Ops/s $\color{#35bf28}+2.44\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 1.7685ms 0.4139ms 2.4161 KOps/s 2.1936 KOps/s $\textbf{\color{#35bf28}+10.15\%}$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.6263ms 0.4040ms 2.4752 KOps/s 2.1827 KOps/s $\textbf{\color{#35bf28}+13.40\%}$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 7.1998ms 5.5012ms 181.7796 Ops/s 180.8781 Ops/s $\color{#35bf28}+0.50\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 9.5339ms 2.0745ms 482.0457 Ops/s 435.7646 Ops/s $\textbf{\color{#35bf28}+10.62\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 6.8666ms 1.2220ms 818.3470 Ops/s 798.3652 Ops/s $\color{#35bf28}+2.50\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 7.0150ms 5.5712ms 179.4948 Ops/s 182.8388 Ops/s $\color{#d91a1a}-1.83\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 10.0393ms 2.0927ms 477.8446 Ops/s 431.8724 Ops/s $\textbf{\color{#35bf28}+10.64\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 3.4319ms 1.1772ms 849.4915 Ops/s 872.5925 Ops/s $\color{#d91a1a}-2.65\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 0.5066s 15.7869ms 63.3436 Ops/s 31.2872 Ops/s $\textbf{\color{#35bf28}+102.46\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 3.9995ms 1.8498ms 540.5891 Ops/s 443.9868 Ops/s $\textbf{\color{#35bf28}+21.76\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 2.1668ms 1.2006ms 832.8912 Ops/s 723.4995 Ops/s $\textbf{\color{#35bf28}+15.12\%}$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-True] 12.8867ms 12.7527ms 78.4148 Ops/s 73.2337 Ops/s $\textbf{\color{#35bf28}+7.07\%}$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-False] 18.1725ms 16.5076ms 60.5782 Ops/s 57.8276 Ops/s $\color{#35bf28}+4.76\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-True] 17.7346ms 17.4289ms 57.3760 Ops/s 55.6561 Ops/s $\color{#35bf28}+3.09\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-False] 18.1602ms 16.7814ms 59.5898 Ops/s 58.7916 Ops/s $\color{#35bf28}+1.36\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-True] 17.6561ms 17.3571ms 57.6135 Ops/s 55.6544 Ops/s $\color{#35bf28}+3.52\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-False] 18.9748ms 17.9463ms 55.7219 Ops/s 54.6879 Ops/s $\color{#35bf28}+1.89\%$

@vmoens vmoens added the Suitable for minor Suitable to be integrated in minor release (no new feature) label Feb 7, 2025
@vmoens vmoens merged commit 75f113f into main Feb 7, 2025
63 of 77 checks passed
vmoens added a commit that referenced this pull request Feb 7, 2025
(cherry picked from commit 75f113f)
vmoens added a commit that referenced this pull request Feb 7, 2025
(cherry picked from commit 75f113f)
vmoens added a commit that referenced this pull request Feb 7, 2025
(cherry picked from commit 75f113f)
vmoens added a commit that referenced this pull request Feb 10, 2025
(cherry picked from commit 75f113f)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. documentation Improvements or additions to documentation Suitable for minor Suitable to be integrated in minor release (no new feature)
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants