Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BugFix] Fix tests failing because of https://github.com/pytorch/pytorch/pull/137602 #2558

Merged
merged 1 commit into from
Nov 13, 2024

Conversation

vmoens
Copy link
Contributor

@vmoens vmoens commented Nov 13, 2024

[ghstack-poisoned]
vmoens added a commit that referenced this pull request Nov 13, 2024
cc mikaylagawarecki albanD

ghstack-source-id: 6fc7434a259f92b0fca8875b20ac22624ecf1a03
Pull Request resolved: #2558
Copy link

pytorch-bot bot commented Nov 13, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/2558

Note: Links to docs will display an error until the docs builds have been completed.

❌ 3 New Failures, 5 Unrelated Failures

As of commit 470b831 with merge base 50a35f6 (image):

NEW FAILURES - The following jobs have failed:

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Nov 13, 2024
Copy link

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 149. Improved: $\large\color{#35bf28}27$. Worsened: $\large\color{#d91a1a}3$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_simple 0.4362s 0.4325s 2.3119 Ops/s 2.1890 Ops/s $\textbf{\color{#35bf28}+5.62\%}$
test_transformed 0.6076s 0.6066s 1.6486 Ops/s 1.5877 Ops/s $\color{#35bf28}+3.84\%$
test_serial 1.3666s 1.3628s 0.7338 Ops/s 0.7228 Ops/s $\color{#35bf28}+1.52\%$
test_parallel 1.2988s 1.2891s 0.7757 Ops/s 0.7486 Ops/s $\color{#35bf28}+3.62\%$
test_step_mdp_speed[True-True-True-True-True] 0.2097ms 27.1807μs 36.7908 KOps/s 36.2310 KOps/s $\color{#35bf28}+1.55\%$
test_step_mdp_speed[True-True-True-True-False] 48.9430μs 16.0129μs 62.4495 KOps/s 61.7283 KOps/s $\color{#35bf28}+1.17\%$
test_step_mdp_speed[True-True-True-False-True] 47.0590μs 15.5675μs 64.2363 KOps/s 63.1516 KOps/s $\color{#35bf28}+1.72\%$
test_step_mdp_speed[True-True-True-False-False] 41.0970μs 9.1307μs 109.5207 KOps/s 107.5201 KOps/s $\color{#35bf28}+1.86\%$
test_step_mdp_speed[True-True-False-True-True] 88.5960μs 29.3650μs 34.0541 KOps/s 33.7547 KOps/s $\color{#35bf28}+0.89\%$
test_step_mdp_speed[True-True-False-True-False] 45.4350μs 17.8543μs 56.0089 KOps/s 56.1916 KOps/s $\color{#d91a1a}-0.33\%$
test_step_mdp_speed[True-True-False-False-True] 57.6390μs 17.3182μs 57.7426 KOps/s 56.7001 KOps/s $\color{#35bf28}+1.84\%$
test_step_mdp_speed[True-True-False-False-False] 37.6710μs 10.8371μs 92.2758 KOps/s 90.9219 KOps/s $\color{#35bf28}+1.49\%$
test_step_mdp_speed[True-False-True-True-True] 72.4760μs 31.1185μs 32.1352 KOps/s 32.1484 KOps/s $\color{#d91a1a}-0.04\%$
test_step_mdp_speed[True-False-True-True-False] 46.8880μs 19.3910μs 51.5704 KOps/s 50.9995 KOps/s $\color{#35bf28}+1.12\%$
test_step_mdp_speed[True-False-True-False-True] 55.0830μs 17.2713μs 57.8994 KOps/s 57.2144 KOps/s $\color{#35bf28}+1.20\%$
test_step_mdp_speed[True-False-True-False-False] 39.8450μs 10.8503μs 92.1633 KOps/s 92.0886 KOps/s $\color{#35bf28}+0.08\%$
test_step_mdp_speed[True-False-False-True-True] 0.1018ms 33.0099μs 30.2939 KOps/s 30.3455 KOps/s $\color{#d91a1a}-0.17\%$
test_step_mdp_speed[True-False-False-True-False] 65.2720μs 20.8902μs 47.8692 KOps/s 47.6636 KOps/s $\color{#35bf28}+0.43\%$
test_step_mdp_speed[True-False-False-False-True] 51.4370μs 18.7914μs 53.2158 KOps/s 52.7170 KOps/s $\color{#35bf28}+0.95\%$
test_step_mdp_speed[True-False-False-False-False] 31.6590μs 12.3469μs 80.9922 KOps/s 79.4734 KOps/s $\color{#35bf28}+1.91\%$
test_step_mdp_speed[False-True-True-True-True] 71.6650μs 31.1344μs 32.1189 KOps/s 31.9070 KOps/s $\color{#35bf28}+0.66\%$
test_step_mdp_speed[False-True-True-True-False] 56.4360μs 19.4478μs 51.4196 KOps/s 51.1238 KOps/s $\color{#35bf28}+0.58\%$
test_step_mdp_speed[False-True-True-False-True] 48.1710μs 19.7163μs 50.7194 KOps/s 49.2854 KOps/s $\color{#35bf28}+2.91\%$
test_step_mdp_speed[False-True-True-False-False] 46.0160μs 12.0261μs 83.1523 KOps/s 80.6751 KOps/s $\color{#35bf28}+3.07\%$
test_step_mdp_speed[False-True-False-True-True] 62.4680μs 32.7733μs 30.5126 KOps/s 30.5526 KOps/s $\color{#d91a1a}-0.13\%$
test_step_mdp_speed[False-True-False-True-False] 52.6790μs 21.0399μs 47.5287 KOps/s 47.2191 KOps/s $\color{#35bf28}+0.66\%$
test_step_mdp_speed[False-True-False-False-True] 2.9617ms 21.5653μs 46.3707 KOps/s 46.2817 KOps/s $\color{#35bf28}+0.19\%$
test_step_mdp_speed[False-True-False-False-False] 42.3190μs 13.5596μs 73.7484 KOps/s 71.7800 KOps/s $\color{#35bf28}+2.74\%$
test_step_mdp_speed[False-False-True-True-True] 67.9180μs 34.1274μs 29.3020 KOps/s 28.9572 KOps/s $\color{#35bf28}+1.19\%$
test_step_mdp_speed[False-False-True-True-False] 51.6970μs 22.6328μs 44.1837 KOps/s 43.9273 KOps/s $\color{#35bf28}+0.58\%$
test_step_mdp_speed[False-False-True-False-True] 49.3130μs 21.3832μs 46.7657 KOps/s 46.3937 KOps/s $\color{#35bf28}+0.80\%$
test_step_mdp_speed[False-False-True-False-False] 60.3040μs 13.6226μs 73.4073 KOps/s 71.3092 KOps/s $\color{#35bf28}+2.94\%$
test_step_mdp_speed[False-False-False-True-True] 74.2490μs 35.7900μs 27.9408 KOps/s 27.9189 KOps/s $\color{#35bf28}+0.08\%$
test_step_mdp_speed[False-False-False-True-False] 65.4630μs 24.1259μs 41.4493 KOps/s 41.7349 KOps/s $\color{#d91a1a}-0.68\%$
test_step_mdp_speed[False-False-False-False-True] 56.1760μs 22.6627μs 44.1254 KOps/s 44.0316 KOps/s $\color{#35bf28}+0.21\%$
test_step_mdp_speed[False-False-False-False-False] 39.2730μs 15.1653μs 65.9401 KOps/s 64.5476 KOps/s $\color{#35bf28}+2.16\%$
test_values[generalized_advantage_estimate-True-True] 10.1785ms 9.7512ms 102.5514 Ops/s 102.8277 Ops/s $\color{#d91a1a}-0.27\%$
test_values[vec_generalized_advantage_estimate-True-True] 35.8552ms 33.4536ms 29.8922 Ops/s 29.6837 Ops/s $\color{#35bf28}+0.70\%$
test_values[td0_return_estimate-False-False] 0.2166ms 0.1715ms 5.8310 KOps/s 5.4605 KOps/s $\textbf{\color{#35bf28}+6.78\%}$
test_values[td1_return_estimate-False-False] 26.2440ms 24.3494ms 41.0688 Ops/s 40.6962 Ops/s $\color{#35bf28}+0.92\%$
test_values[vec_td1_return_estimate-False-False] 35.5663ms 33.4872ms 29.8621 Ops/s 29.6059 Ops/s $\color{#35bf28}+0.87\%$
test_values[td_lambda_return_estimate-True-False] 35.6207ms 35.0222ms 28.5533 Ops/s 28.3092 Ops/s $\color{#35bf28}+0.86\%$
test_values[vec_td_lambda_return_estimate-True-False] 36.0475ms 33.6110ms 29.7522 Ops/s 29.7037 Ops/s $\color{#35bf28}+0.16\%$
test_gae_speed[generalized_advantage_estimate-False-1-512] 8.8786ms 8.5469ms 117.0021 Ops/s 119.9312 Ops/s $\color{#d91a1a}-2.44\%$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 2.3932ms 2.0404ms 490.1094 Ops/s 491.0853 Ops/s $\color{#d91a1a}-0.20\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 0.5451ms 0.3589ms 2.7862 KOps/s 2.7627 KOps/s $\color{#35bf28}+0.85\%$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 45.9767ms 43.1318ms 23.1847 Ops/s 22.4382 Ops/s $\color{#35bf28}+3.33\%$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 4.0131ms 3.0604ms 326.7559 Ops/s 324.6880 Ops/s $\color{#35bf28}+0.64\%$
test_dqn_speed[False-None] 6.2346ms 1.3500ms 740.7560 Ops/s 745.2745 Ops/s $\color{#d91a1a}-0.61\%$
test_dqn_speed[False-backward] 1.8546ms 1.8067ms 553.5057 Ops/s 543.9258 Ops/s $\color{#35bf28}+1.76\%$
test_dqn_speed[True-None] 1.5031ms 0.4655ms 2.1481 KOps/s 2.1245 KOps/s $\color{#35bf28}+1.11\%$
test_dqn_speed[True-backward] 1.0801ms 0.8962ms 1.1158 KOps/s 857.2370 Ops/s $\textbf{\color{#35bf28}+30.16\%}$
test_dqn_speed[reduce-overhead-None] 0.6102ms 0.4701ms 2.1270 KOps/s 2.1053 KOps/s $\color{#35bf28}+1.03\%$
test_dqn_speed[reduce-overhead-backward] 1.0724ms 0.9222ms 1.0843 KOps/s 1.0943 KOps/s $\color{#d91a1a}-0.91\%$
test_ddpg_speed[False-None] 3.4762ms 2.8083ms 356.0810 Ops/s 356.0578 Ops/s $+0.01\%$
test_ddpg_speed[False-backward] 4.1418ms 3.9453ms 253.4684 Ops/s 251.3947 Ops/s $\color{#35bf28}+0.82\%$
test_ddpg_speed[True-None] 1.3975ms 1.0013ms 998.7473 Ops/s 976.4540 Ops/s $\color{#35bf28}+2.28\%$
test_ddpg_speed[True-backward] 2.0660ms 1.9359ms 516.5506 Ops/s 416.4807 Ops/s $\textbf{\color{#35bf28}+24.03\%}$
test_ddpg_speed[reduce-overhead-None] 1.3679ms 1.0078ms 992.2486 Ops/s 973.3868 Ops/s $\color{#35bf28}+1.94\%$
test_ddpg_speed[reduce-overhead-backward] 2.1114ms 1.9304ms 518.0179 Ops/s 508.8875 Ops/s $\color{#35bf28}+1.79\%$
test_sac_speed[False-None] 10.1187ms 8.1534ms 122.6487 Ops/s 119.5588 Ops/s $\color{#35bf28}+2.58\%$
test_sac_speed[False-backward] 12.2029ms 10.9503ms 91.3213 Ops/s 87.8882 Ops/s $\color{#35bf28}+3.91\%$
test_sac_speed[True-None] 2.3568ms 1.8397ms 543.5759 Ops/s 497.6372 Ops/s $\textbf{\color{#35bf28}+9.23\%}$
test_sac_speed[True-backward] 3.9477ms 3.7466ms 266.9072 Ops/s 271.9418 Ops/s $\color{#d91a1a}-1.85\%$
test_sac_speed[reduce-overhead-None] 2.4149ms 1.8625ms 536.9248 Ops/s 526.9415 Ops/s $\color{#35bf28}+1.89\%$
test_sac_speed[reduce-overhead-backward] 3.9412ms 3.5331ms 283.0373 Ops/s 260.4819 Ops/s $\textbf{\color{#35bf28}+8.66\%}$
test_redq_speed[False-None] 14.7035ms 12.8499ms 77.8217 Ops/s 74.6932 Ops/s $\color{#35bf28}+4.19\%$
test_redq_speed[False-backward] 23.6624ms 21.9019ms 45.6581 Ops/s 44.0312 Ops/s $\color{#35bf28}+3.69\%$
test_redq_speed[True-None] 5.8774ms 5.1328ms 194.8268 Ops/s 184.4642 Ops/s $\textbf{\color{#35bf28}+5.62\%}$
test_redq_speed[True-backward] 13.6381ms 12.3808ms 80.7704 Ops/s 75.4139 Ops/s $\textbf{\color{#35bf28}+7.10\%}$
test_redq_speed[reduce-overhead-None] 5.4221ms 4.6728ms 214.0034 Ops/s 180.6584 Ops/s $\textbf{\color{#35bf28}+18.46\%}$
test_redq_speed[reduce-overhead-backward] 13.0460ms 12.4219ms 80.5029 Ops/s 76.2680 Ops/s $\textbf{\color{#35bf28}+5.55\%}$
test_redq_deprec_speed[False-None] 15.1425ms 13.0522ms 76.6156 Ops/s 69.2927 Ops/s $\textbf{\color{#35bf28}+10.57\%}$
test_redq_deprec_speed[False-backward] 20.4064ms 18.7400ms 53.3619 Ops/s 48.4285 Ops/s $\textbf{\color{#35bf28}+10.19\%}$
test_redq_deprec_speed[True-None] 4.3710ms 3.8157ms 262.0756 Ops/s 222.7450 Ops/s $\textbf{\color{#35bf28}+17.66\%}$
test_redq_deprec_speed[True-backward] 8.9796ms 8.3848ms 119.2635 Ops/s 106.1149 Ops/s $\textbf{\color{#35bf28}+12.39\%}$
test_redq_deprec_speed[reduce-overhead-None] 4.8615ms 3.8059ms 262.7488 Ops/s 228.8428 Ops/s $\textbf{\color{#35bf28}+14.82\%}$
test_redq_deprec_speed[reduce-overhead-backward] 9.3802ms 8.6605ms 115.4666 Ops/s 110.0569 Ops/s $\color{#35bf28}+4.92\%$
test_td3_speed[False-None] 10.5375ms 8.0297ms 124.5377 Ops/s 119.0402 Ops/s $\color{#35bf28}+4.62\%$
test_td3_speed[False-backward] 0.2570s 15.5668ms 64.2395 Ops/s 88.8761 Ops/s $\textbf{\color{#d91a1a}-27.72\%}$
test_td3_speed[True-None] 1.8240ms 1.7042ms 586.7828 Ops/s 556.4023 Ops/s $\textbf{\color{#35bf28}+5.46\%}$
test_td3_speed[True-backward] 3.7050ms 3.4380ms 290.8698 Ops/s 244.4604 Ops/s $\textbf{\color{#35bf28}+18.98\%}$
test_td3_speed[reduce-overhead-None] 1.8775ms 1.6933ms 590.5546 Ops/s 476.2822 Ops/s $\textbf{\color{#35bf28}+23.99\%}$
test_td3_speed[reduce-overhead-backward] 3.4401ms 3.3169ms 301.4877 Ops/s 257.2812 Ops/s $\textbf{\color{#35bf28}+17.18\%}$
test_cql_speed[False-None] 38.6306ms 36.3322ms 27.5238 Ops/s 26.7536 Ops/s $\color{#35bf28}+2.88\%$
test_cql_speed[False-backward] 47.5141ms 45.7391ms 21.8631 Ops/s 20.5264 Ops/s $\textbf{\color{#35bf28}+6.51\%}$
test_cql_speed[True-None] 17.0432ms 15.6041ms 64.0858 Ops/s 62.2475 Ops/s $\color{#35bf28}+2.95\%$
test_cql_speed[True-backward] 23.4340ms 22.7312ms 43.9924 Ops/s 42.8907 Ops/s $\color{#35bf28}+2.57\%$
test_cql_speed[reduce-overhead-None] 17.0083ms 15.6754ms 63.7943 Ops/s 61.4637 Ops/s $\color{#35bf28}+3.79\%$
test_cql_speed[reduce-overhead-backward] 23.1385ms 22.3098ms 44.8233 Ops/s 43.0149 Ops/s $\color{#35bf28}+4.20\%$
test_a2c_speed[False-None] 7.9237ms 7.1677ms 139.5141 Ops/s 133.7650 Ops/s $\color{#35bf28}+4.30\%$
test_a2c_speed[False-backward] 15.4031ms 14.5625ms 68.6696 Ops/s 67.0282 Ops/s $\color{#35bf28}+2.45\%$
test_a2c_speed[True-None] 3.9028ms 3.3057ms 302.5053 Ops/s 298.8031 Ops/s $\color{#35bf28}+1.24\%$
test_a2c_speed[True-backward] 10.8089ms 9.9202ms 100.8040 Ops/s 101.0548 Ops/s $\color{#d91a1a}-0.25\%$
test_a2c_speed[reduce-overhead-None] 4.0179ms 3.2826ms 304.6390 Ops/s 299.3818 Ops/s $\color{#35bf28}+1.76\%$
test_a2c_speed[reduce-overhead-backward] 10.4214ms 9.9591ms 100.4108 Ops/s 97.2990 Ops/s $\color{#35bf28}+3.20\%$
test_ppo_speed[False-None] 8.4863ms 7.6653ms 130.4575 Ops/s 125.4223 Ops/s $\color{#35bf28}+4.01\%$
test_ppo_speed[False-backward] 16.0751ms 15.2673ms 65.4995 Ops/s 64.1507 Ops/s $\color{#35bf28}+2.10\%$
test_ppo_speed[True-None] 4.0555ms 3.6961ms 270.5562 Ops/s 259.7263 Ops/s $\color{#35bf28}+4.17\%$
test_ppo_speed[True-backward] 10.0601ms 9.6836ms 103.2669 Ops/s 98.2790 Ops/s $\textbf{\color{#35bf28}+5.08\%}$
test_ppo_speed[reduce-overhead-None] 4.6362ms 3.7171ms 269.0253 Ops/s 260.8595 Ops/s $\color{#35bf28}+3.13\%$
test_ppo_speed[reduce-overhead-backward] 10.2867ms 9.7337ms 102.7354 Ops/s 102.0169 Ops/s $\color{#35bf28}+0.70\%$
test_reinforce_speed[False-None] 8.3782ms 6.4622ms 154.7468 Ops/s 150.3576 Ops/s $\color{#35bf28}+2.92\%$
test_reinforce_speed[False-backward] 11.0380ms 9.7576ms 102.4843 Ops/s 98.1064 Ops/s $\color{#35bf28}+4.46\%$
test_reinforce_speed[True-None] 3.0728ms 2.6473ms 377.7487 Ops/s 366.5843 Ops/s $\color{#35bf28}+3.05\%$
test_reinforce_speed[True-backward] 9.4149ms 8.7234ms 114.6338 Ops/s 114.6321 Ops/s $+0.00\%$
test_reinforce_speed[reduce-overhead-None] 3.2766ms 2.6455ms 378.0030 Ops/s 370.2621 Ops/s $\color{#35bf28}+2.09\%$
test_reinforce_speed[reduce-overhead-backward] 9.5627ms 8.7653ms 114.0857 Ops/s 110.0021 Ops/s $\color{#35bf28}+3.71\%$
test_iql_speed[False-None] 33.9526ms 32.2216ms 31.0351 Ops/s 30.3580 Ops/s $\color{#35bf28}+2.23\%$
test_iql_speed[False-backward] 46.3640ms 44.8948ms 22.2743 Ops/s 21.4946 Ops/s $\color{#35bf28}+3.63\%$
test_iql_speed[True-None] 11.6470ms 10.7596ms 92.9402 Ops/s 91.2967 Ops/s $\color{#35bf28}+1.80\%$
test_iql_speed[True-backward] 23.8401ms 22.0474ms 45.3569 Ops/s 44.5388 Ops/s $\color{#35bf28}+1.84\%$
test_iql_speed[reduce-overhead-None] 12.0656ms 10.7429ms 93.0849 Ops/s 92.1233 Ops/s $\color{#35bf28}+1.04\%$
test_iql_speed[reduce-overhead-backward] 22.5659ms 21.8871ms 45.6890 Ops/s 45.1608 Ops/s $\color{#35bf28}+1.17\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 6.9272ms 4.9282ms 202.9128 Ops/s 194.1511 Ops/s $\color{#35bf28}+4.51\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 3.1712ms 0.5195ms 1.9251 KOps/s 1.8926 KOps/s $\color{#35bf28}+1.71\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.8812ms 0.4916ms 2.0340 KOps/s 2.0256 KOps/s $\color{#35bf28}+0.41\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 5.6602ms 4.8264ms 207.1956 Ops/s 206.0851 Ops/s $\color{#35bf28}+0.54\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 0.8819ms 0.5015ms 1.9940 KOps/s 1.9742 KOps/s $\color{#35bf28}+1.01\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.7783ms 0.4781ms 2.0915 KOps/s 2.0603 KOps/s $\color{#35bf28}+1.52\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] 2.5261ms 1.6290ms 613.8914 Ops/s 603.5405 Ops/s $\color{#35bf28}+1.72\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] 1.7956ms 1.5688ms 637.4331 Ops/s 623.2785 Ops/s $\color{#35bf28}+2.27\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 5.1969ms 4.8628ms 205.6416 Ops/s 191.3758 Ops/s $\textbf{\color{#35bf28}+7.45\%}$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 1.6179ms 0.6547ms 1.5274 KOps/s 1.4794 KOps/s $\color{#35bf28}+3.25\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.9200ms 0.6198ms 1.6134 KOps/s 1.5455 KOps/s $\color{#35bf28}+4.39\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 7.3177ms 4.7626ms 209.9690 Ops/s 201.3113 Ops/s $\color{#35bf28}+4.30\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 2.9517ms 0.5166ms 1.9359 KOps/s 1.9020 KOps/s $\color{#35bf28}+1.78\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.8269ms 0.5046ms 1.9816 KOps/s 1.9774 KOps/s $\color{#35bf28}+0.21\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 7.7653ms 4.7703ms 209.6285 Ops/s 202.3338 Ops/s $\color{#35bf28}+3.61\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 2.7887ms 0.4996ms 2.0016 KOps/s 1.9295 KOps/s $\color{#35bf28}+3.74\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.7784ms 0.4771ms 2.0960 KOps/s 2.0285 KOps/s $\color{#35bf28}+3.32\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 5.9196ms 4.8279ms 207.1313 Ops/s 200.0494 Ops/s $\color{#35bf28}+3.54\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 1.1961ms 0.6508ms 1.5365 KOps/s 1.4945 KOps/s $\color{#35bf28}+2.81\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 7.8209ms 0.6303ms 1.5865 KOps/s 1.6000 KOps/s $\color{#d91a1a}-0.85\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 0.4056s 12.2057ms 81.9292 Ops/s 244.2428 Ops/s $\textbf{\color{#d91a1a}-66.46\%}$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 8.2461ms 2.2883ms 436.9998 Ops/s 426.6192 Ops/s $\color{#35bf28}+2.43\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 5.4520ms 1.2906ms 774.8487 Ops/s 802.6905 Ops/s $\color{#d91a1a}-3.47\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 5.3651ms 4.1522ms 240.8362 Ops/s 35.8386 Ops/s $\textbf{\color{#35bf28}+572.00\%}$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 0.3603s 9.4326ms 106.0154 Ops/s 473.7702 Ops/s $\textbf{\color{#d91a1a}-77.62\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 4.9569ms 1.3229ms 755.9248 Ops/s 787.1367 Ops/s $\color{#d91a1a}-3.97\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 5.9676ms 4.3802ms 228.2994 Ops/s 208.7498 Ops/s $\textbf{\color{#35bf28}+9.37\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 9.0115ms 2.5155ms 397.5302 Ops/s 389.1505 Ops/s $\color{#35bf28}+2.15\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 2.8022ms 1.4437ms 692.6539 Ops/s 610.6557 Ops/s $\textbf{\color{#35bf28}+13.43\%}$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-True] 11.1523ms 10.8790ms 91.9206 Ops/s 85.5716 Ops/s $\textbf{\color{#35bf28}+7.42\%}$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-False] 17.2979ms 14.2084ms 70.3809 Ops/s 67.3477 Ops/s $\color{#35bf28}+4.50\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-True] 20.7338ms 19.6350ms 50.9295 Ops/s 49.0372 Ops/s $\color{#35bf28}+3.86\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-False] 14.5943ms 14.2988ms 69.9360 Ops/s 66.4509 Ops/s $\textbf{\color{#35bf28}+5.24\%}$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-True] 20.7521ms 19.8515ms 50.3741 Ops/s 49.1482 Ops/s $\color{#35bf28}+2.49\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-False] 16.6106ms 15.5802ms 64.1838 Ops/s 62.9572 Ops/s $\color{#35bf28}+1.95\%$

Copy link

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 149. Improved: $\large\color{#35bf28}13$. Worsened: $\large\color{#d91a1a}8$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_simple 0.7584s 0.7382s 1.3546 Ops/s 1.3194 Ops/s $\color{#35bf28}+2.66\%$
test_transformed 1.0830s 1.0212s 0.9793 Ops/s 1.0300 Ops/s $\color{#d91a1a}-4.92\%$
test_serial 2.2823s 2.1823s 0.4582 Ops/s 0.4772 Ops/s $\color{#d91a1a}-3.98\%$
test_parallel 2.1043s 1.9977s 0.5006 Ops/s 0.5129 Ops/s $\color{#d91a1a}-2.40\%$
test_step_mdp_speed[True-True-True-True-True] 0.1972ms 36.5638μs 27.3494 KOps/s 27.4319 KOps/s $\color{#d91a1a}-0.30\%$
test_step_mdp_speed[True-True-True-True-False] 48.3910μs 20.5787μs 48.5941 KOps/s 48.1713 KOps/s $\color{#35bf28}+0.88\%$
test_step_mdp_speed[True-True-True-False-True] 46.9310μs 20.3145μs 49.2259 KOps/s 50.0618 KOps/s $\color{#d91a1a}-1.67\%$
test_step_mdp_speed[True-True-True-False-False] 41.4610μs 11.5635μs 86.4793 KOps/s 84.9100 KOps/s $\color{#35bf28}+1.85\%$
test_step_mdp_speed[True-True-False-True-True] 72.2120μs 38.9673μs 25.6625 KOps/s 25.6905 KOps/s $\color{#d91a1a}-0.11\%$
test_step_mdp_speed[True-True-False-True-False] 54.4010μs 22.6938μs 44.0649 KOps/s 44.6397 KOps/s $\color{#d91a1a}-1.29\%$
test_step_mdp_speed[True-True-False-False-True] 51.8400μs 22.3689μs 44.7048 KOps/s 45.6008 KOps/s $\color{#d91a1a}-1.96\%$
test_step_mdp_speed[True-True-False-False-False] 41.1510μs 13.7543μs 72.7047 KOps/s 74.6150 KOps/s $\color{#d91a1a}-2.56\%$
test_step_mdp_speed[True-False-True-True-True] 71.7610μs 41.0447μs 24.3637 KOps/s 24.6292 KOps/s $\color{#d91a1a}-1.08\%$
test_step_mdp_speed[True-False-True-True-False] 51.9610μs 24.8704μs 40.2085 KOps/s 39.7950 KOps/s $\color{#35bf28}+1.04\%$
test_step_mdp_speed[True-False-True-False-True] 57.9610μs 22.4364μs 44.5704 KOps/s 44.8736 KOps/s $\color{#d91a1a}-0.68\%$
test_step_mdp_speed[True-False-True-False-False] 38.7300μs 13.8237μs 72.3394 KOps/s 72.9628 KOps/s $\color{#d91a1a}-0.85\%$
test_step_mdp_speed[True-False-False-True-True] 0.1166ms 42.3940μs 23.5883 KOps/s 23.6807 KOps/s $\color{#d91a1a}-0.39\%$
test_step_mdp_speed[True-False-False-True-False] 54.7710μs 26.6100μs 37.5798 KOps/s 37.7757 KOps/s $\color{#d91a1a}-0.52\%$
test_step_mdp_speed[True-False-False-False-True] 60.0310μs 24.3050μs 41.1439 KOps/s 41.1044 KOps/s $\color{#35bf28}+0.10\%$
test_step_mdp_speed[True-False-False-False-False] 46.1010μs 15.5045μs 64.4974 KOps/s 63.5024 KOps/s $\color{#35bf28}+1.57\%$
test_step_mdp_speed[False-True-True-True-True] 74.7320μs 40.3483μs 24.7842 KOps/s 24.3847 KOps/s $\color{#35bf28}+1.64\%$
test_step_mdp_speed[False-True-True-True-False] 52.1010μs 24.6854μs 40.5098 KOps/s 40.5655 KOps/s $\color{#d91a1a}-0.14\%$
test_step_mdp_speed[False-True-True-False-True] 61.8710μs 25.8334μs 38.7095 KOps/s 38.3770 KOps/s $\color{#35bf28}+0.87\%$
test_step_mdp_speed[False-True-True-False-False] 41.0600μs 15.4167μs 64.8646 KOps/s 64.8723 KOps/s $\color{#d91a1a}-0.01\%$
test_step_mdp_speed[False-True-False-True-True] 71.7310μs 42.5645μs 23.4938 KOps/s 23.5578 KOps/s $\color{#d91a1a}-0.27\%$
test_step_mdp_speed[False-True-False-True-False] 54.6110μs 26.5618μs 37.6480 KOps/s 37.4326 KOps/s $\color{#35bf28}+0.58\%$
test_step_mdp_speed[False-True-False-False-True] 3.3673ms 27.8527μs 35.9032 KOps/s 36.0909 KOps/s $\color{#d91a1a}-0.52\%$
test_step_mdp_speed[False-True-False-False-False] 46.3010μs 17.1050μs 58.4625 KOps/s 56.5824 KOps/s $\color{#35bf28}+3.32\%$
test_step_mdp_speed[False-False-True-True-True] 75.3910μs 44.5937μs 22.4247 KOps/s 22.1286 KOps/s $\color{#35bf28}+1.34\%$
test_step_mdp_speed[False-False-True-True-False] 64.0710μs 28.8897μs 34.6144 KOps/s 34.5599 KOps/s $\color{#35bf28}+0.16\%$
test_step_mdp_speed[False-False-True-False-True] 53.3410μs 27.4575μs 36.4200 KOps/s 35.8441 KOps/s $\color{#35bf28}+1.61\%$
test_step_mdp_speed[False-False-True-False-False] 43.6610μs 17.0189μs 58.7582 KOps/s 57.6556 KOps/s $\color{#35bf28}+1.91\%$
test_step_mdp_speed[False-False-False-True-True] 78.7210μs 46.2094μs 21.6406 KOps/s 21.7194 KOps/s $\color{#d91a1a}-0.36\%$
test_step_mdp_speed[False-False-False-True-False] 97.1710μs 30.4130μs 32.8807 KOps/s 32.4664 KOps/s $\color{#35bf28}+1.28\%$
test_step_mdp_speed[False-False-False-False-True] 65.8410μs 29.2330μs 34.2079 KOps/s 34.3507 KOps/s $\color{#d91a1a}-0.42\%$
test_step_mdp_speed[False-False-False-False-False] 49.7110μs 19.0078μs 52.6100 KOps/s 52.6434 KOps/s $\color{#d91a1a}-0.06\%$
test_values[generalized_advantage_estimate-True-True] 24.1169ms 23.7704ms 42.0691 Ops/s 40.5203 Ops/s $\color{#35bf28}+3.82\%$
test_values[vec_generalized_advantage_estimate-True-True] 0.1151s 3.1753ms 314.9310 Ops/s 314.0496 Ops/s $\color{#35bf28}+0.28\%$
test_values[td0_return_estimate-False-False] 86.1020μs 63.6684μs 15.7064 KOps/s 15.3551 KOps/s $\color{#35bf28}+2.29\%$
test_values[td1_return_estimate-False-False] 53.4266ms 52.9161ms 18.8979 Ops/s 18.2978 Ops/s $\color{#35bf28}+3.28\%$
test_values[vec_td1_return_estimate-False-False] 1.2568ms 1.0529ms 949.7482 Ops/s 945.1997 Ops/s $\color{#35bf28}+0.48\%$
test_values[td_lambda_return_estimate-True-False] 85.1952ms 84.7357ms 11.8014 Ops/s 11.4482 Ops/s $\color{#35bf28}+3.08\%$
test_values[vec_td_lambda_return_estimate-True-False] 1.2211ms 1.0491ms 953.2029 Ops/s 940.7847 Ops/s $\color{#35bf28}+1.32\%$
test_gae_speed[generalized_advantage_estimate-False-1-512] 23.8325ms 23.6120ms 42.3513 Ops/s 40.9112 Ops/s $\color{#35bf28}+3.52\%$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 1.0098ms 0.7222ms 1.3847 KOps/s 1.3689 KOps/s $\color{#35bf28}+1.15\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 0.7623ms 0.6496ms 1.5395 KOps/s 1.5515 KOps/s $\color{#d91a1a}-0.77\%$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 1.5252ms 1.4541ms 687.6873 Ops/s 684.9714 Ops/s $\color{#35bf28}+0.40\%$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 0.7215ms 0.6622ms 1.5100 KOps/s 1.5012 KOps/s $\color{#35bf28}+0.59\%$
test_dqn_speed[False-None] 6.6148ms 1.2912ms 774.4573 Ops/s 785.8958 Ops/s $\color{#d91a1a}-1.46\%$
test_dqn_speed[False-backward] 1.8638ms 1.7803ms 561.7000 Ops/s 553.1466 Ops/s $\color{#35bf28}+1.55\%$
test_dqn_speed[True-None] 0.6887ms 0.5411ms 1.8480 KOps/s 1.7905 KOps/s $\color{#35bf28}+3.21\%$
test_dqn_speed[True-backward] 1.0273ms 0.9879ms 1.0122 KOps/s 831.1986 Ops/s $\textbf{\color{#35bf28}+21.78\%}$
test_dqn_speed[reduce-overhead-None] 0.6592ms 0.5605ms 1.7840 KOps/s 1.7836 KOps/s $\color{#35bf28}+0.02\%$
test_dqn_speed[reduce-overhead-backward] 1.1371ms 0.9843ms 1.0159 KOps/s 1.0063 KOps/s $\color{#35bf28}+0.96\%$
test_ddpg_speed[False-None] 3.4115ms 2.6272ms 380.6374 Ops/s 383.4559 Ops/s $\color{#d91a1a}-0.74\%$
test_ddpg_speed[False-backward] 3.9607ms 3.8030ms 262.9474 Ops/s 265.8526 Ops/s $\color{#d91a1a}-1.09\%$
test_ddpg_speed[True-None] 1.4104ms 1.2131ms 824.3251 Ops/s 814.1487 Ops/s $\color{#35bf28}+1.25\%$
test_ddpg_speed[True-backward] 2.1957ms 2.1562ms 463.7727 Ops/s 414.4781 Ops/s $\textbf{\color{#35bf28}+11.89\%}$
test_ddpg_speed[reduce-overhead-None] 1.2827ms 1.2133ms 824.1911 Ops/s 787.4661 Ops/s $\color{#35bf28}+4.66\%$
test_ddpg_speed[reduce-overhead-backward] 2.2914ms 2.1836ms 457.9522 Ops/s 448.2593 Ops/s $\color{#35bf28}+2.16\%$
test_sac_speed[False-None] 8.2571ms 7.3235ms 136.5460 Ops/s 137.9746 Ops/s $\color{#d91a1a}-1.04\%$
test_sac_speed[False-backward] 11.0684ms 10.4402ms 95.7832 Ops/s 96.5084 Ops/s $\color{#d91a1a}-0.75\%$
test_sac_speed[True-None] 2.0706ms 1.9657ms 508.7280 Ops/s 505.6084 Ops/s $\color{#35bf28}+0.62\%$
test_sac_speed[True-backward] 4.0126ms 3.8483ms 259.8567 Ops/s 257.8899 Ops/s $\color{#35bf28}+0.76\%$
test_sac_speed[reduce-overhead-None] 2.1580ms 1.9804ms 504.9393 Ops/s 507.7536 Ops/s $\color{#d91a1a}-0.55\%$
test_sac_speed[reduce-overhead-backward] 3.9886ms 3.8487ms 259.8281 Ops/s 260.4401 Ops/s $\color{#d91a1a}-0.24\%$
test_redq_speed[False-None] 11.5417ms 9.7494ms 102.5706 Ops/s 102.9162 Ops/s $\color{#d91a1a}-0.34\%$
test_redq_speed[False-backward] 17.3526ms 16.5670ms 60.3608 Ops/s 59.7467 Ops/s $\color{#35bf28}+1.03\%$
test_redq_speed[True-None] 3.6491ms 3.4524ms 289.6524 Ops/s 280.7702 Ops/s $\color{#35bf28}+3.16\%$
test_redq_speed[True-backward] 8.6256ms 8.3817ms 119.3070 Ops/s 111.5563 Ops/s $\textbf{\color{#35bf28}+6.95\%}$
test_redq_speed[reduce-overhead-None] 3.8095ms 3.4322ms 291.3617 Ops/s 285.5132 Ops/s $\color{#35bf28}+2.05\%$
test_redq_speed[reduce-overhead-backward] 8.5721ms 8.3687ms 119.4934 Ops/s 117.5489 Ops/s $\color{#35bf28}+1.65\%$
test_redq_deprec_speed[False-None] 10.9096ms 10.2854ms 97.2251 Ops/s 97.0216 Ops/s $\color{#35bf28}+0.21\%$
test_redq_deprec_speed[False-backward] 15.2994ms 14.8438ms 67.3682 Ops/s 66.4653 Ops/s $\color{#35bf28}+1.36\%$
test_redq_deprec_speed[True-None] 3.4285ms 3.1369ms 318.7835 Ops/s 318.3400 Ops/s $\color{#35bf28}+0.14\%$
test_redq_deprec_speed[True-backward] 7.1866ms 6.8981ms 144.9675 Ops/s 138.2463 Ops/s $\color{#35bf28}+4.86\%$
test_redq_deprec_speed[reduce-overhead-None] 3.4567ms 3.1125ms 321.2881 Ops/s 319.8456 Ops/s $\color{#35bf28}+0.45\%$
test_redq_deprec_speed[reduce-overhead-backward] 7.1782ms 6.9061ms 144.7992 Ops/s 147.3031 Ops/s $\color{#d91a1a}-1.70\%$
test_td3_speed[False-None] 7.4666ms 7.2429ms 138.0662 Ops/s 137.0950 Ops/s $\color{#35bf28}+0.71\%$
test_td3_speed[False-backward] 10.2648ms 9.9934ms 100.0656 Ops/s 99.0390 Ops/s $\color{#35bf28}+1.04\%$
test_td3_speed[True-None] 1.8751ms 1.8464ms 541.6058 Ops/s 526.8937 Ops/s $\color{#35bf28}+2.79\%$
test_td3_speed[True-backward] 3.7081ms 3.5651ms 280.5009 Ops/s 253.4498 Ops/s $\textbf{\color{#35bf28}+10.67\%}$
test_td3_speed[reduce-overhead-None] 1.8751ms 1.8380ms 544.0595 Ops/s 532.8369 Ops/s $\color{#35bf28}+2.11\%$
test_td3_speed[reduce-overhead-backward] 3.6925ms 3.6101ms 276.9971 Ops/s 282.6284 Ops/s $\color{#d91a1a}-1.99\%$
test_cql_speed[False-None] 27.3388ms 24.1775ms 41.3608 Ops/s 41.2380 Ops/s $\color{#35bf28}+0.30\%$
test_cql_speed[False-backward] 36.8430ms 33.6591ms 29.7096 Ops/s 29.9339 Ops/s $\color{#d91a1a}-0.75\%$
test_cql_speed[True-None] 11.0259ms 10.6539ms 93.8620 Ops/s 94.8740 Ops/s $\color{#d91a1a}-1.07\%$
test_cql_speed[True-backward] 18.5992ms 16.2597ms 61.5019 Ops/s 63.2591 Ops/s $\color{#d91a1a}-2.78\%$
test_cql_speed[reduce-overhead-None] 10.9082ms 10.5865ms 94.4603 Ops/s 96.6621 Ops/s $\color{#d91a1a}-2.28\%$
test_cql_speed[reduce-overhead-backward] 16.3866ms 16.1164ms 62.0488 Ops/s 63.3349 Ops/s $\color{#d91a1a}-2.03\%$
test_a2c_speed[False-None] 5.4356ms 5.1961ms 192.4520 Ops/s 190.7842 Ops/s $\color{#35bf28}+0.87\%$
test_a2c_speed[False-backward] 11.7950ms 11.3583ms 88.0415 Ops/s 86.5894 Ops/s $\color{#35bf28}+1.68\%$
test_a2c_speed[True-None] 3.2038ms 2.9803ms 335.5384 Ops/s 334.4719 Ops/s $\color{#35bf28}+0.32\%$
test_a2c_speed[True-backward] 8.4541ms 8.1773ms 122.2894 Ops/s 122.5534 Ops/s $\color{#d91a1a}-0.22\%$
test_a2c_speed[reduce-overhead-None] 3.3512ms 2.9494ms 339.0538 Ops/s 333.2699 Ops/s $\color{#35bf28}+1.74\%$
test_a2c_speed[reduce-overhead-backward] 8.3001ms 8.1461ms 122.7575 Ops/s 121.3498 Ops/s $\color{#35bf28}+1.16\%$
test_ppo_speed[False-None] 5.7564ms 5.5210ms 181.1277 Ops/s 183.6581 Ops/s $\color{#d91a1a}-1.38\%$
test_ppo_speed[False-backward] 13.2857ms 11.8275ms 84.5489 Ops/s 85.4859 Ops/s $\color{#d91a1a}-1.10\%$
test_ppo_speed[True-None] 3.7022ms 3.3618ms 297.4558 Ops/s 292.6838 Ops/s $\color{#35bf28}+1.63\%$
test_ppo_speed[True-backward] 8.1019ms 7.8972ms 126.6266 Ops/s 115.4983 Ops/s $\textbf{\color{#35bf28}+9.64\%}$
test_ppo_speed[reduce-overhead-None] 3.7320ms 3.3561ms 297.9672 Ops/s 297.1330 Ops/s $\color{#35bf28}+0.28\%$
test_ppo_speed[reduce-overhead-backward] 8.2274ms 7.9901ms 125.1553 Ops/s 126.3133 Ops/s $\color{#d91a1a}-0.92\%$
test_reinforce_speed[False-None] 5.9853ms 4.2299ms 236.4098 Ops/s 230.2620 Ops/s $\color{#35bf28}+2.67\%$
test_reinforce_speed[False-backward] 7.2015ms 6.9799ms 143.2681 Ops/s 142.0933 Ops/s $\color{#35bf28}+0.83\%$
test_reinforce_speed[True-None] 2.2713ms 2.1369ms 467.9719 Ops/s 367.3980 Ops/s $\textbf{\color{#35bf28}+27.37\%}$
test_reinforce_speed[True-backward] 7.1165ms 6.8913ms 145.1101 Ops/s 143.0506 Ops/s $\color{#35bf28}+1.44\%$
test_reinforce_speed[reduce-overhead-None] 2.4162ms 2.1585ms 463.2853 Ops/s 457.8121 Ops/s $\color{#35bf28}+1.20\%$
test_reinforce_speed[reduce-overhead-backward] 7.0863ms 6.8335ms 146.3383 Ops/s 144.3445 Ops/s $\color{#35bf28}+1.38\%$
test_iql_speed[False-None] 20.2068ms 19.0311ms 52.5455 Ops/s 51.0189 Ops/s $\color{#35bf28}+2.99\%$
test_iql_speed[False-backward] 30.4759ms 29.3352ms 34.0887 Ops/s 34.3940 Ops/s $\color{#d91a1a}-0.89\%$
test_iql_speed[True-None] 7.0553ms 6.5295ms 153.1523 Ops/s 149.1331 Ops/s $\color{#35bf28}+2.70\%$
test_iql_speed[True-backward] 15.9578ms 14.9683ms 66.8080 Ops/s 67.3820 Ops/s $\color{#d91a1a}-0.85\%$
test_iql_speed[reduce-overhead-None] 6.9753ms 6.5179ms 153.4242 Ops/s 149.9517 Ops/s $\color{#35bf28}+2.32\%$
test_iql_speed[reduce-overhead-backward] 15.9172ms 14.9058ms 67.0881 Ops/s 67.9143 Ops/s $\color{#d91a1a}-1.22\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 6.3866ms 6.2517ms 159.9555 Ops/s 156.4655 Ops/s $\color{#35bf28}+2.23\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.7407ms 0.3412ms 2.9309 KOps/s 2.9998 KOps/s $\color{#d91a1a}-2.30\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.5377ms 0.3147ms 3.1775 KOps/s 3.6108 KOps/s $\textbf{\color{#d91a1a}-12.00\%}$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 6.2509ms 5.9679ms 167.5624 Ops/s 162.2782 Ops/s $\color{#35bf28}+3.26\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 2.1294ms 0.2677ms 3.7354 KOps/s 3.3927 KOps/s $\textbf{\color{#35bf28}+10.10\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.6344ms 0.2889ms 3.4608 KOps/s 3.3360 KOps/s $\color{#35bf28}+3.74\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] 1.6073ms 1.2766ms 783.3496 Ops/s 723.6780 Ops/s $\textbf{\color{#35bf28}+8.25\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] 1.6190ms 1.3021ms 768.0121 Ops/s 734.5345 Ops/s $\color{#35bf28}+4.56\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.3756ms 6.2251ms 160.6395 Ops/s 158.2121 Ops/s $\color{#35bf28}+1.53\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 2.0659ms 0.4765ms 2.0985 KOps/s 2.0293 KOps/s $\color{#35bf28}+3.41\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.6811ms 0.3921ms 2.5502 KOps/s 2.1636 KOps/s $\textbf{\color{#35bf28}+17.87\%}$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 6.4047ms 6.0256ms 165.9593 Ops/s 162.5020 Ops/s $\color{#35bf28}+2.13\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 1.8780ms 0.3600ms 2.7779 KOps/s 3.3602 KOps/s $\textbf{\color{#d91a1a}-17.33\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.5561ms 0.2966ms 3.3720 KOps/s 4.0845 KOps/s $\textbf{\color{#d91a1a}-17.44\%}$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 6.5502ms 6.0309ms 165.8117 Ops/s 165.0040 Ops/s $\color{#35bf28}+0.49\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 1.0052ms 0.3152ms 3.1722 KOps/s 3.3837 KOps/s $\textbf{\color{#d91a1a}-6.25\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.5307ms 0.3084ms 3.2425 KOps/s 3.5893 KOps/s $\textbf{\color{#d91a1a}-9.66\%}$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.5890ms 6.1645ms 162.2189 Ops/s 160.6201 Ops/s $\color{#35bf28}+1.00\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 2.1007ms 0.4560ms 2.1931 KOps/s 2.3000 KOps/s $\color{#d91a1a}-4.65\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.7042ms 0.4335ms 2.3068 KOps/s 2.2563 KOps/s $\color{#35bf28}+2.24\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 0.4136s 13.4041ms 74.6039 Ops/s 193.0492 Ops/s $\textbf{\color{#d91a1a}-61.35\%}$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 8.0330ms 1.9658ms 508.7088 Ops/s 470.2113 Ops/s $\textbf{\color{#35bf28}+8.19\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 8.4722ms 1.2923ms 773.8323 Ops/s 873.1952 Ops/s $\textbf{\color{#d91a1a}-11.38\%}$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 7.3519ms 5.2675ms 189.8447 Ops/s 193.1631 Ops/s $\color{#d91a1a}-1.72\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 8.7458ms 2.0591ms 485.6594 Ops/s 444.2191 Ops/s $\textbf{\color{#35bf28}+9.33\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 7.2589ms 1.2384ms 807.5229 Ops/s 875.2476 Ops/s $\textbf{\color{#d91a1a}-7.74\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 0.3682s 12.7567ms 78.3904 Ops/s 36.3286 Ops/s $\textbf{\color{#35bf28}+115.78\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 11.9328ms 2.0746ms 482.0297 Ops/s 475.6474 Ops/s $\color{#35bf28}+1.34\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 7.1131ms 1.3670ms 731.5326 Ops/s 745.1487 Ops/s $\color{#d91a1a}-1.83\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-True] 13.6978ms 12.8364ms 77.9034 Ops/s 76.7075 Ops/s $\color{#35bf28}+1.56\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-False] 17.6479ms 16.3254ms 61.2541 Ops/s 58.9666 Ops/s $\color{#35bf28}+3.88\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-True] 17.8227ms 17.2949ms 57.8206 Ops/s 55.6166 Ops/s $\color{#35bf28}+3.96\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-False] 17.1300ms 16.5705ms 60.3483 Ops/s 59.2597 Ops/s $\color{#35bf28}+1.84\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-True] 18.0654ms 17.1564ms 58.2872 Ops/s 54.9595 Ops/s $\textbf{\color{#35bf28}+6.05\%}$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-False] 17.6799ms 17.2334ms 58.0269 Ops/s 55.7876 Ops/s $\color{#35bf28}+4.01\%$

@vmoens vmoens merged commit 470b831 into gh/vmoens/41/base Nov 13, 2024
66 of 71 checks passed
vmoens added a commit that referenced this pull request Nov 13, 2024
cc mikaylagawarecki albanD

ghstack-source-id: 6fc7434a259f92b0fca8875b20ac22624ecf1a03
Pull Request resolved: #2558
@vmoens vmoens deleted the gh/vmoens/41/head branch November 13, 2024 15:23
vmoens added a commit that referenced this pull request Nov 14, 2024
cc mikaylagawarecki albanD

ghstack-source-id: 6fc7434a259f92b0fca8875b20ac22624ecf1a03
Pull Request resolved: #2558

(cherry picked from commit 165163a)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants