Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Doc] Add missing classes to doc #1203

Merged
merged 1 commit into from
Feb 3, 2025
Merged

Conversation

vmoens
Copy link
Contributor

@vmoens vmoens commented Feb 3, 2025

Stack from ghstack (oldest at bottom):

[ghstack-poisoned]
vmoens added a commit that referenced this pull request Feb 3, 2025
ghstack-source-id: 2e174577aa33cc8d69c0f423c90ea2e5ee0fdef6
Pull Request resolved: #1203
@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Feb 3, 2025
@vmoens vmoens merged commit 05f8a61 into gh/vmoens/47/base Feb 3, 2025
8 of 23 checks passed
vmoens added a commit that referenced this pull request Feb 3, 2025
ghstack-source-id: 2e174577aa33cc8d69c0f423c90ea2e5ee0fdef6
Pull Request resolved: #1203
@vmoens vmoens deleted the gh/vmoens/47/head branch February 3, 2025 17:07
Copy link

github-actions bot commented Feb 3, 2025

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 217. Improved: $\large\color{#35bf28}5$. Worsened: $\large\color{#d91a1a}18$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 67.4360μs 21.2185μs 47.1286 KOps/s 48.6519 KOps/s $\color{#d91a1a}-3.13\%$
test_plain_set_stack_nested 56.0350μs 21.3564μs 46.8243 KOps/s 47.7523 KOps/s $\color{#d91a1a}-1.94\%$
test_plain_set_nested_inplace 0.1064ms 23.3333μs 42.8572 KOps/s 44.5441 KOps/s $\color{#d91a1a}-3.79\%$
test_plain_set_stack_nested_inplace 59.3410μs 23.2162μs 43.0733 KOps/s 44.5772 KOps/s $\color{#d91a1a}-3.37\%$
test_items 41.5380μs 4.1647μs 240.1158 KOps/s 244.0307 KOps/s $\color{#d91a1a}-1.60\%$
test_items_nested 0.7465ms 0.4106ms 2.4353 KOps/s 2.4483 KOps/s $\color{#d91a1a}-0.53\%$
test_items_nested_locked 0.8650ms 0.4113ms 2.4313 KOps/s 2.4520 KOps/s $\color{#d91a1a}-0.85\%$
test_items_nested_leaf 0.1571ms 76.8573μs 13.0111 KOps/s 12.9647 KOps/s $\color{#35bf28}+0.36\%$
test_items_stack_nested 0.6140ms 0.4109ms 2.4339 KOps/s 2.3473 KOps/s $\color{#35bf28}+3.69\%$
test_items_stack_nested_leaf 0.1570ms 78.2684μs 12.7765 KOps/s 12.6667 KOps/s $\color{#35bf28}+0.87\%$
test_items_stack_nested_locked 0.5762ms 0.4140ms 2.4152 KOps/s 2.4260 KOps/s $\color{#d91a1a}-0.45\%$
test_keys 31.4290μs 3.7320μs 267.9516 KOps/s 281.1827 KOps/s $\color{#d91a1a}-4.71\%$
test_keys_nested 0.2853ms 0.1647ms 6.0731 KOps/s 6.0308 KOps/s $\color{#35bf28}+0.70\%$
test_keys_nested_locked 1.7808ms 0.1716ms 5.8265 KOps/s 5.8217 KOps/s $\color{#35bf28}+0.08\%$
test_keys_nested_leaf 0.2609ms 0.1440ms 6.9425 KOps/s 6.9411 KOps/s $\color{#35bf28}+0.02\%$
test_keys_stack_nested 0.3060ms 0.1654ms 6.0443 KOps/s 6.0791 KOps/s $\color{#d91a1a}-0.57\%$
test_keys_stack_nested_leaf 0.2258ms 0.1427ms 7.0065 KOps/s 6.9797 KOps/s $\color{#35bf28}+0.38\%$
test_keys_stack_nested_locked 0.2330ms 0.1720ms 5.8144 KOps/s 5.8349 KOps/s $\color{#d91a1a}-0.35\%$
test_values 14.0862μs 1.0307μs 970.1922 KOps/s 887.8213 KOps/s $\textbf{\color{#35bf28}+9.28\%}$
test_values_nested 0.1200ms 62.5467μs 15.9881 KOps/s 16.2377 KOps/s $\color{#d91a1a}-1.54\%$
test_values_nested_locked 0.1185ms 62.5316μs 15.9919 KOps/s 16.3723 KOps/s $\color{#d91a1a}-2.32\%$
test_values_nested_leaf 0.1303ms 71.7894μs 13.9296 KOps/s 14.1682 KOps/s $\color{#d91a1a}-1.68\%$
test_values_stack_nested 0.1102ms 63.2012μs 15.8225 KOps/s 15.3680 KOps/s $\color{#35bf28}+2.96\%$
test_values_stack_nested_leaf 0.1335ms 71.4954μs 13.9869 KOps/s 14.2818 KOps/s $\color{#d91a1a}-2.06\%$
test_values_stack_nested_locked 0.1498ms 63.8239μs 15.6681 KOps/s 16.1445 KOps/s $\color{#d91a1a}-2.95\%$
test_membership 28.4940μs 0.8850μs 1.1300 MOps/s 1.1530 MOps/s $\color{#d91a1a}-1.99\%$
test_membership_nested 38.0410μs 2.8823μs 346.9402 KOps/s 347.0271 KOps/s $\color{#d91a1a}-0.03\%$
test_membership_nested_leaf 51.3660μs 2.8687μs 348.5874 KOps/s 349.5149 KOps/s $\color{#d91a1a}-0.27\%$
test_membership_stacked_nested 34.2840μs 2.8837μs 346.7762 KOps/s 350.6693 KOps/s $\color{#d91a1a}-1.11\%$
test_membership_stacked_nested_leaf 57.6780μs 2.8641μs 349.1478 KOps/s 344.5475 KOps/s $\color{#35bf28}+1.34\%$
test_membership_nested_last 27.4410μs 4.3374μs 230.5508 KOps/s 231.7429 KOps/s $\color{#d91a1a}-0.51\%$
test_membership_nested_leaf_last 50.3940μs 4.3097μs 232.0342 KOps/s 230.8129 KOps/s $\color{#35bf28}+0.53\%$
test_membership_stacked_nested_last 36.2480μs 4.3118μs 231.9192 KOps/s 224.7012 KOps/s $\color{#35bf28}+3.21\%$
test_membership_stacked_nested_leaf_last 24.0650μs 4.3192μs 231.5255 KOps/s 231.7342 KOps/s $\color{#d91a1a}-0.09\%$
test_nested_getleaf 57.8680μs 10.7513μs 93.0124 KOps/s 95.5994 KOps/s $\color{#d91a1a}-2.71\%$
test_nested_get 59.4610μs 10.2048μs 97.9936 KOps/s 99.6889 KOps/s $\color{#d91a1a}-1.70\%$
test_stacked_getleaf 31.7090μs 10.6609μs 93.8008 KOps/s 95.6049 KOps/s $\color{#d91a1a}-1.89\%$
test_stacked_get 62.3770μs 10.1373μs 98.6460 KOps/s 99.5982 KOps/s $\color{#d91a1a}-0.96\%$
test_nested_getitemleaf 75.5510μs 11.2736μs 88.7029 KOps/s 90.4234 KOps/s $\color{#d91a1a}-1.90\%$
test_nested_getitem 41.0470μs 10.6974μs 93.4805 KOps/s 94.5818 KOps/s $\color{#d91a1a}-1.16\%$
test_stacked_getitemleaf 68.9590μs 11.3653μs 87.9868 KOps/s 89.3894 KOps/s $\color{#d91a1a}-1.57\%$
test_stacked_getitem 46.8880μs 10.7805μs 92.7598 KOps/s 95.0249 KOps/s $\color{#d91a1a}-2.38\%$
test_lock_nested 0.5836ms 0.4186ms 2.3887 KOps/s 2.4267 KOps/s $\color{#d91a1a}-1.57\%$
test_lock_stack_nested 0.7270ms 0.4282ms 2.3352 KOps/s 2.3625 KOps/s $\color{#d91a1a}-1.16\%$
test_unlock_nested 0.7106ms 0.3417ms 2.9267 KOps/s 2.9539 KOps/s $\color{#d91a1a}-0.92\%$
test_unlock_stack_nested 0.5069ms 0.3434ms 2.9118 KOps/s 2.9169 KOps/s $\color{#d91a1a}-0.18\%$
test_flatten_speed 0.1818ms 0.1009ms 9.9069 KOps/s 10.1416 KOps/s $\color{#d91a1a}-2.31\%$
test_unflatten_speed 0.6470ms 0.5311ms 1.8830 KOps/s 1.9340 KOps/s $\color{#d91a1a}-2.64\%$
test_common_ops 4.4996ms 0.8548ms 1.1698 KOps/s 1.2367 KOps/s $\textbf{\color{#d91a1a}-5.41\%}$
test_creation 66.3440μs 2.4806μs 403.1217 KOps/s 408.4446 KOps/s $\color{#d91a1a}-1.30\%$
test_creation_empty 46.8880μs 12.9864μs 77.0038 KOps/s 86.1836 KOps/s $\textbf{\color{#d91a1a}-10.65\%}$
test_creation_nested_1 44.9750μs 16.1413μs 61.9530 KOps/s 68.3311 KOps/s $\textbf{\color{#d91a1a}-9.33\%}$
test_creation_nested_2 51.5160μs 20.5414μs 48.6821 KOps/s 52.8490 KOps/s $\textbf{\color{#d91a1a}-7.88\%}$
test_clone 73.5370μs 13.4783μs 74.1936 KOps/s 75.0471 KOps/s $\color{#d91a1a}-1.14\%$
test_getitem[int] 0.8094ms 12.8208μs 77.9985 KOps/s 76.4474 KOps/s $\color{#35bf28}+2.03\%$
test_getitem[slice_int] 0.1429ms 23.9744μs 41.7112 KOps/s 41.4521 KOps/s $\color{#35bf28}+0.63\%$
test_getitem[range] 0.2071ms 50.5488μs 19.7829 KOps/s 20.2363 KOps/s $\color{#d91a1a}-2.24\%$
test_getitem[tuple] 0.1276ms 19.9784μs 50.0541 KOps/s 50.3144 KOps/s $\color{#d91a1a}-0.52\%$
test_getitem[list] 0.2176ms 46.5444μs 21.4849 KOps/s 21.9073 KOps/s $\color{#d91a1a}-1.93\%$
test_setitem_dim[int] 60.8540μs 26.2365μs 38.1149 KOps/s 38.4120 KOps/s $\color{#d91a1a}-0.77\%$
test_setitem_dim[slice_int] 97.0010μs 51.7186μs 19.3354 KOps/s 19.0132 KOps/s $\color{#35bf28}+1.69\%$
test_setitem_dim[range] 0.1236ms 76.5111μs 13.0700 KOps/s 13.0251 KOps/s $\color{#35bf28}+0.35\%$
test_setitem_dim[tuple] 77.3440μs 40.5831μs 24.6408 KOps/s 24.7806 KOps/s $\color{#d91a1a}-0.56\%$
test_setitem 0.1246ms 21.3761μs 46.7813 KOps/s 48.6576 KOps/s $\color{#d91a1a}-3.86\%$
test_set 99.8070μs 20.8180μs 48.0354 KOps/s 49.7222 KOps/s $\color{#d91a1a}-3.39\%$
test_set_shared 0.4556ms 0.1828ms 5.4707 KOps/s 5.5259 KOps/s $\color{#d91a1a}-1.00\%$
test_update 0.2182ms 24.4956μs 40.8237 KOps/s 44.0656 KOps/s $\textbf{\color{#d91a1a}-7.36\%}$
test_update_nested 0.1289ms 34.8272μs 28.7132 KOps/s 30.5965 KOps/s $\textbf{\color{#d91a1a}-6.16\%}$
test_update__nested 0.4521ms 34.0683μs 29.3528 KOps/s 29.7687 KOps/s $\color{#d91a1a}-1.40\%$
test_set_nested 0.1041ms 23.3700μs 42.7899 KOps/s 45.3060 KOps/s $\textbf{\color{#d91a1a}-5.55\%}$
test_set_nested_new 0.1189ms 28.1582μs 35.5137 KOps/s 37.5749 KOps/s $\textbf{\color{#d91a1a}-5.49\%}$
test_select 0.1272ms 44.8550μs 22.2941 KOps/s 23.1171 KOps/s $\color{#d91a1a}-3.56\%$
test_select_nested 0.1487ms 63.1034μs 15.8470 KOps/s 15.8927 KOps/s $\color{#d91a1a}-0.29\%$
test_exclude_nested 0.1525ms 80.8127μs 12.3743 KOps/s 12.2587 KOps/s $\color{#35bf28}+0.94\%$
test_empty[True] 0.5793ms 0.4119ms 2.4275 KOps/s 2.4626 KOps/s $\color{#d91a1a}-1.42\%$
test_empty[False] 13.0343μs 1.3879μs 720.5083 KOps/s 728.1549 KOps/s $\color{#d91a1a}-1.05\%$
test_unbind_speed 0.4827ms 0.2707ms 3.6947 KOps/s 3.6706 KOps/s $\color{#35bf28}+0.66\%$
test_unbind_speed_stack0 0.4108ms 0.2713ms 3.6858 KOps/s 3.7078 KOps/s $\color{#d91a1a}-0.59\%$
test_unbind_speed_stack1 0.1082s 0.7533ms 1.3275 KOps/s 1.2312 KOps/s $\textbf{\color{#35bf28}+7.83\%}$
test_split 0.1115s 1.7716ms 564.4536 Ops/s 570.5434 Ops/s $\color{#d91a1a}-1.07\%$
test_chunk 0.1137s 1.7731ms 563.9827 Ops/s 630.0014 Ops/s $\textbf{\color{#d91a1a}-10.48\%}$
test_consolidate_njt[False-None] 8.6753ms 8.2821ms 120.7427 Ops/s 111.1239 Ops/s $\textbf{\color{#35bf28}+8.66\%}$
test_creation[device0] 0.2715ms 93.9764μs 10.6410 KOps/s 10.9895 KOps/s $\color{#d91a1a}-3.17\%$
test_creation_from_tensor 3.8526ms 98.0890μs 10.1948 KOps/s 10.3412 KOps/s $\color{#d91a1a}-1.42\%$
test_add_one[memmap_tensor0] 0.2126ms 4.7223μs 211.7622 KOps/s 201.3600 KOps/s $\textbf{\color{#35bf28}+5.17\%}$
test_contiguous[memmap_tensor0] 19.4370μs 0.5174μs 1.9327 MOps/s 1.9020 MOps/s $\color{#35bf28}+1.62\%$
test_stack[memmap_tensor0] 35.1360μs 3.3440μs 299.0474 KOps/s 288.5775 KOps/s $\color{#35bf28}+3.63\%$
test_memmaptd_index 0.3347ms 0.2269ms 4.4073 KOps/s 4.2660 KOps/s $\color{#35bf28}+3.31\%$
test_memmaptd_index_astensor 1.0505ms 0.3170ms 3.1547 KOps/s 3.1378 KOps/s $\color{#35bf28}+0.54\%$
test_memmaptd_index_op 0.8305ms 0.6019ms 1.6613 KOps/s 1.7278 KOps/s $\color{#d91a1a}-3.85\%$
test_serialize_model 0.2307s 0.1359s 7.3607 Ops/s 8.7710 Ops/s $\textbf{\color{#d91a1a}-16.08\%}$
test_serialize_model_pickle 0.4669s 0.3965s 2.5218 Ops/s 2.5798 Ops/s $\color{#d91a1a}-2.25\%$
test_serialize_weights 0.1201s 0.1160s 8.6226 Ops/s 9.0035 Ops/s $\color{#d91a1a}-4.23\%$
test_serialize_weights_returnearly 0.2063s 0.1692s 5.9088 Ops/s 6.5400 Ops/s $\textbf{\color{#d91a1a}-9.65\%}$
test_serialize_weights_pickle 1.0048s 0.7426s 1.3467 Ops/s 2.5762 Ops/s $\textbf{\color{#d91a1a}-47.73\%}$
test_serialize_weights_filesystem 0.1515s 0.1407s 7.1095 Ops/s 6.8488 Ops/s $\color{#35bf28}+3.81\%$
test_serialize_model_filesystem 0.2544s 0.1628s 6.1419 Ops/s 6.4755 Ops/s $\textbf{\color{#d91a1a}-5.15\%}$
test_reshape_pytree 73.4780μs 26.5161μs 37.7129 KOps/s 37.6229 KOps/s $\color{#35bf28}+0.24\%$
test_reshape_td 0.1089ms 33.2962μs 30.0334 KOps/s 30.9705 KOps/s $\color{#d91a1a}-3.03\%$
test_view_pytree 93.8120μs 26.4657μs 37.7848 KOps/s 38.0596 KOps/s $\color{#d91a1a}-0.72\%$
test_view_td 89.6780μs 38.8996μs 25.7072 KOps/s 26.2201 KOps/s $\color{#d91a1a}-1.96\%$
test_unbind_pytree 98.9190μs 30.1253μs 33.1947 KOps/s 34.2877 KOps/s $\color{#d91a1a}-3.19\%$
test_unbind_td 0.3598ms 40.1640μs 24.8979 KOps/s 25.2412 KOps/s $\color{#d91a1a}-1.36\%$
test_split_pytree 72.0050μs 29.9569μs 33.3813 KOps/s 34.5501 KOps/s $\color{#d91a1a}-3.38\%$
test_split_td 0.5125ms 44.4959μs 22.4740 KOps/s 21.9944 KOps/s $\color{#35bf28}+2.18\%$
test_add_pytree 74.3090μs 35.6857μs 28.0225 KOps/s 27.7988 KOps/s $\color{#35bf28}+0.80\%$
test_add_td 0.1650ms 59.8030μs 16.7216 KOps/s 18.1608 KOps/s $\textbf{\color{#d91a1a}-7.92\%}$
test_compile_add_one_nested[tensordict-compile] 0.1712ms 67.7058μs 14.7698 KOps/s 15.3178 KOps/s $\color{#d91a1a}-3.58\%$
test_compile_add_one_nested[tensordict-eager] 0.4167ms 0.1761ms 5.6781 KOps/s 5.8407 KOps/s $\color{#d91a1a}-2.78\%$
test_compile_add_one_nested[pytree-compile] 0.1039ms 45.8763μs 21.7977 KOps/s 22.0388 KOps/s $\color{#d91a1a}-1.09\%$
test_compile_add_one_nested[pytree-eager] 0.2688ms 0.1187ms 8.4219 KOps/s 8.5317 KOps/s $\color{#d91a1a}-1.29\%$
test_compile_copy_nested[tensordict-compile] 67.5370μs 29.1975μs 34.2495 KOps/s 35.0789 KOps/s $\color{#d91a1a}-2.36\%$
test_compile_copy_nested[tensordict-eager] 0.1551ms 59.0562μs 16.9330 KOps/s 17.0912 KOps/s $\color{#d91a1a}-0.93\%$
test_compile_copy_nested[pytree-compile] 0.1769ms 79.9373μs 12.5098 KOps/s 12.4876 KOps/s $\color{#35bf28}+0.18\%$
test_compile_copy_nested[pytree-eager] 0.1562ms 67.4909μs 14.8168 KOps/s 14.9376 KOps/s $\color{#d91a1a}-0.81\%$
test_compile_add_one_flat[tensordict-compile] 0.2376ms 0.1070ms 9.3419 KOps/s 9.5140 KOps/s $\color{#d91a1a}-1.81\%$
test_compile_add_one_flat[tensordict-eager] 0.3695ms 0.2223ms 4.4987 KOps/s 4.6393 KOps/s $\color{#d91a1a}-3.03\%$
test_compile_add_one_flat[tensorclass-compile] 0.1454ms 47.6670μs 20.9789 KOps/s 21.5194 KOps/s $\color{#d91a1a}-2.51\%$
test_compile_add_one_flat[tensorclass-eager] 0.1921ms 67.7807μs 14.7535 KOps/s 14.8952 KOps/s $\color{#d91a1a}-0.95\%$
test_compile_add_one_flat[pytree-compile] 0.2469ms 0.1004ms 9.9616 KOps/s 10.1019 KOps/s $\color{#d91a1a}-1.39\%$
test_compile_add_one_flat[pytree-eager] 0.2960ms 0.2021ms 4.9482 KOps/s 5.0183 KOps/s $\color{#d91a1a}-1.40\%$
test_compile_add_self_flat[tensordict-eager] 0.3835ms 0.2383ms 4.1968 KOps/s 4.2517 KOps/s $\color{#d91a1a}-1.29\%$
test_compile_add_self_flat[tensordict-compile] 0.1900ms 0.1068ms 9.3601 KOps/s 9.3363 KOps/s $\color{#35bf28}+0.25\%$
test_compile_add_self_flat[tensorclass-eager] 0.1463ms 62.5431μs 15.9890 KOps/s 15.7196 KOps/s $\color{#35bf28}+1.71\%$
test_compile_add_self_flat[tensorclass-compile] 0.1266ms 47.9809μs 20.8416 KOps/s 20.6355 KOps/s $\color{#35bf28}+1.00\%$
test_compile_add_self_flat[pytree-eager] 0.2843ms 0.1582ms 6.3224 KOps/s 6.4072 KOps/s $\color{#d91a1a}-1.32\%$
test_compile_add_self_flat[pytree-compile] 0.1858ms 0.1000ms 9.9958 KOps/s 9.8514 KOps/s $\color{#35bf28}+1.47\%$
test_compile_copy_flat[tensordict-compile] 84.1670μs 21.9930μs 45.4690 KOps/s 45.4162 KOps/s $\color{#35bf28}+0.12\%$
test_compile_copy_flat[tensordict-eager] 0.1265ms 67.8311μs 14.7425 KOps/s 14.9911 KOps/s $\color{#d91a1a}-1.66\%$
test_compile_copy_flat[pytree-compile] 0.1628ms 82.2031μs 12.1650 KOps/s 12.2820 KOps/s $\color{#d91a1a}-0.95\%$
test_compile_copy_flat[pytree-eager] 0.1338ms 68.6807μs 14.5601 KOps/s 14.6224 KOps/s $\color{#d91a1a}-0.43\%$
test_compile_assign_and_add[tensordict-compile] 0.4388ms 0.2174ms 4.6003 KOps/s 4.6915 KOps/s $\color{#d91a1a}-1.94\%$
test_compile_assign_and_add[tensordict-eager] 3.1100ms 1.4029ms 712.8083 Ops/s 708.3098 Ops/s $\color{#35bf28}+0.64\%$
test_compile_assign_and_add[pytree-compile] 0.4059ms 0.2088ms 4.7896 KOps/s 4.7650 KOps/s $\color{#35bf28}+0.52\%$
test_compile_assign_and_add[pytree-eager] 1.0017ms 0.8147ms 1.2275 KOps/s 1.2042 KOps/s $\color{#35bf28}+1.93\%$
test_compile_assign_and_add_stack[compile] 0.9339ms 0.4536ms 2.2046 KOps/s 2.2229 KOps/s $\color{#d91a1a}-0.83\%$
test_compile_assign_and_add_stack[eager] 5.8105ms 2.8296ms 353.4125 Ops/s 366.3345 Ops/s $\color{#d91a1a}-3.53\%$
test_compile_indexing[tensor-tensordict-compile] 0.1013ms 38.9851μs 25.6508 KOps/s 25.7857 KOps/s $\color{#d91a1a}-0.52\%$
test_compile_indexing[tensor-tensordict-eager] 0.5791ms 32.2219μs 31.0348 KOps/s 30.0594 KOps/s $\color{#35bf28}+3.24\%$
test_compile_indexing[tensor-tensorclass-compile] 0.1014ms 31.9556μs 31.2935 KOps/s 31.8478 KOps/s $\color{#d91a1a}-1.74\%$
test_compile_indexing[tensor-tensorclass-eager] 92.6630μs 24.6557μs 40.5586 KOps/s 42.5611 KOps/s $\color{#d91a1a}-4.71\%$
test_compile_indexing[tensor-pytree-compile] 0.1062ms 32.3315μs 30.9296 KOps/s 31.2424 KOps/s $\color{#d91a1a}-1.00\%$
test_compile_indexing[tensor-pytree-eager] 72.0040μs 23.0728μs 43.3411 KOps/s 43.6160 KOps/s $\color{#d91a1a}-0.63\%$
test_compile_indexing[slice-tensordict-compile] 0.1323ms 53.2760μs 18.7702 KOps/s 18.8844 KOps/s $\color{#d91a1a}-0.61\%$
test_compile_indexing[slice-tensordict-eager] 0.8051ms 19.2828μs 51.8598 KOps/s 48.1745 KOps/s $\textbf{\color{#35bf28}+7.65\%}$
test_compile_indexing[slice-tensorclass-compile] 0.1233ms 46.3516μs 21.5742 KOps/s 21.6298 KOps/s $\color{#d91a1a}-0.26\%$
test_compile_indexing[slice-tensorclass-eager] 75.0300μs 18.7279μs 53.3962 KOps/s 54.1310 KOps/s $\color{#d91a1a}-1.36\%$
test_compile_indexing[slice-pytree-compile] 0.1273ms 46.8540μs 21.3429 KOps/s 21.1771 KOps/s $\color{#35bf28}+0.78\%$
test_compile_indexing[slice-pytree-eager] 77.5550μs 18.4050μs 54.3332 KOps/s 53.8289 KOps/s $\color{#35bf28}+0.94\%$
test_compile_indexing[int-tensordict-compile] 0.1475ms 54.7452μs 18.2664 KOps/s 18.3569 KOps/s $\color{#d91a1a}-0.49\%$
test_compile_indexing[int-tensordict-eager] 0.9915ms 19.5674μs 51.1055 KOps/s 48.9521 KOps/s $\color{#35bf28}+4.40\%$
test_compile_indexing[int-tensorclass-compile] 0.1101ms 47.1285μs 21.2186 KOps/s 21.3417 KOps/s $\color{#d91a1a}-0.58\%$
test_compile_indexing[int-tensorclass-eager] 91.0210μs 18.6655μs 53.5749 KOps/s 54.4916 KOps/s $\color{#d91a1a}-1.68\%$
test_compile_indexing[int-pytree-compile] 0.1171ms 47.7954μs 20.9225 KOps/s 21.2940 KOps/s $\color{#d91a1a}-1.74\%$
test_compile_indexing[int-pytree-eager] 80.2700μs 18.4371μs 54.2384 KOps/s 53.9799 KOps/s $\color{#35bf28}+0.48\%$
test_mod_add[eager] 94.6770μs 35.4856μs 28.1804 KOps/s 27.0570 KOps/s $\color{#35bf28}+4.15\%$
test_mod_add[compile] 0.1544ms 64.3241μs 15.5463 KOps/s 15.4251 KOps/s $\color{#35bf28}+0.79\%$
test_mod_add[compile-overhead] 0.1446ms 64.3472μs 15.5407 KOps/s 15.6112 KOps/s $\color{#d91a1a}-0.45\%$
test_mod_wrap[eager] 0.3765ms 0.2265ms 4.4144 KOps/s 4.3111 KOps/s $\color{#35bf28}+2.40\%$
test_mod_wrap[compile] 1.9487ms 0.2303ms 4.3427 KOps/s 4.1947 KOps/s $\color{#35bf28}+3.53\%$
test_mod_wrap[compile-overhead] 0.5075ms 0.2278ms 4.3893 KOps/s 4.3120 KOps/s $\color{#35bf28}+1.79\%$
test_mod_wrap_and_backward[eager] 36.6641ms 14.6770ms 68.1336 Ops/s 88.3693 Ops/s $\textbf{\color{#d91a1a}-22.90\%}$
test_mod_wrap_and_backward[compile] 17.0391ms 11.8608ms 84.3113 Ops/s 89.8701 Ops/s $\textbf{\color{#d91a1a}-6.19\%}$
test_mod_wrap_and_backward[compile-overhead] 12.0311ms 11.2500ms 88.8888 Ops/s 87.8183 Ops/s $\color{#35bf28}+1.22\%$
test_seq_add[eager] 0.2514ms 0.1176ms 8.5048 KOps/s 8.2567 KOps/s $\color{#35bf28}+3.00\%$
test_seq_add[compile] 0.1577ms 78.4595μs 12.7454 KOps/s 13.1381 KOps/s $\color{#d91a1a}-2.99\%$
test_seq_add[compile-overhead] 0.1901ms 76.4119μs 13.0870 KOps/s 13.2377 KOps/s $\color{#d91a1a}-1.14\%$
test_seq_wrap[eager] 0.7222ms 0.4533ms 2.2059 KOps/s 2.1905 KOps/s $\color{#35bf28}+0.70\%$
test_seq_wrap[compile] 0.3813ms 0.2460ms 4.0658 KOps/s 3.9882 KOps/s $\color{#35bf28}+1.95\%$
test_seq_wrap[compile-overhead] 0.4611ms 0.2459ms 4.0661 KOps/s 4.0168 KOps/s $\color{#35bf28}+1.23\%$
test_func_call_runtime[False-eager] 0.7387ms 0.5544ms 1.8037 KOps/s 1.7451 KOps/s $\color{#35bf28}+3.35\%$
test_func_call_runtime[False-compile] 0.5605ms 0.4456ms 2.2440 KOps/s 2.1714 KOps/s $\color{#35bf28}+3.34\%$
test_func_call_runtime[False-compile-overhead] 0.5903ms 0.4478ms 2.2331 KOps/s 2.1576 KOps/s $\color{#35bf28}+3.50\%$
test_func_call_runtime[True-eager] 1.0415ms 0.7717ms 1.2958 KOps/s 1.2784 KOps/s $\color{#35bf28}+1.36\%$
test_func_call_runtime[True-compile] 0.7839ms 0.4685ms 2.1345 KOps/s 2.0588 KOps/s $\color{#35bf28}+3.68\%$
test_func_call_runtime[True-compile-overhead] 0.8718ms 0.4718ms 2.1196 KOps/s 2.0830 KOps/s $\color{#35bf28}+1.76\%$
test_func_call_cm_runtime[False-eager] 1.3101ms 0.5660ms 1.7668 KOps/s 1.7867 KOps/s $\color{#d91a1a}-1.11\%$
test_func_call_cm_runtime[False-compile] 1.7357ms 0.4493ms 2.2255 KOps/s 2.1591 KOps/s $\color{#35bf28}+3.07\%$
test_func_call_cm_runtime[False-compile-overhead] 0.6205ms 0.4482ms 2.2312 KOps/s 2.1941 KOps/s $\color{#35bf28}+1.69\%$
test_func_call_cm_runtime[True-eager] 1.4763ms 0.9141ms 1.0940 KOps/s 1.0815 KOps/s $\color{#35bf28}+1.16\%$
test_func_call_cm_runtime[True-compile] 1.2509ms 0.8125ms 1.2308 KOps/s 1.1994 KOps/s $\color{#35bf28}+2.62\%$
test_func_call_cm_runtime[True-compile-overhead] 1.2265ms 0.8159ms 1.2257 KOps/s 1.2070 KOps/s $\color{#35bf28}+1.55\%$
test_vmap_func_call_cm_runtime[eager] 2.7430ms 1.9421ms 514.8972 Ops/s 501.3248 Ops/s $\color{#35bf28}+2.71\%$
test_vmap_func_call_cm_runtime[compile] 1.0938ms 0.5420ms 1.8449 KOps/s 1.7816 KOps/s $\color{#35bf28}+3.55\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.8886ms 0.5419ms 1.8453 KOps/s 1.8134 KOps/s $\color{#35bf28}+1.76\%$
test_distributed 0.4531ms 0.1255ms 7.9674 KOps/s 7.6611 KOps/s $\color{#35bf28}+4.00\%$
test_tdmodule 0.1188ms 27.1940μs 36.7728 KOps/s 36.8466 KOps/s $\color{#d91a1a}-0.20\%$
test_tdmodule_dispatch 93.4250μs 49.6220μs 20.1523 KOps/s 20.4397 KOps/s $\color{#d91a1a}-1.41\%$
test_tdseq 69.3100μs 30.1790μs 33.1357 KOps/s 33.7868 KOps/s $\color{#d91a1a}-1.93\%$
test_tdseq_dispatch 94.0760μs 56.0739μs 17.8336 KOps/s 18.0889 KOps/s $\color{#d91a1a}-1.41\%$
test_instantiation_functorch 1.7698ms 1.5333ms 652.1672 Ops/s 639.5068 Ops/s $\color{#35bf28}+1.98\%$
test_exec_functorch 0.2912ms 0.1805ms 5.5414 KOps/s 5.5486 KOps/s $\color{#d91a1a}-0.13\%$
test_exec_functional_call 0.3298ms 0.1726ms 5.7924 KOps/s 5.8387 KOps/s $\color{#d91a1a}-0.79\%$
test_exec_td_decorator 0.5455ms 0.2324ms 4.3020 KOps/s 4.2458 KOps/s $\color{#35bf28}+1.32\%$
test_vmap_mlp_speed_decorator[True-True] 0.8726ms 0.6696ms 1.4933 KOps/s 1.4814 KOps/s $\color{#35bf28}+0.81\%$
test_vmap_mlp_speed_decorator[True-False] 1.3254ms 0.6836ms 1.4629 KOps/s 1.4588 KOps/s $\color{#35bf28}+0.28\%$
test_vmap_mlp_speed_decorator[False-True] 0.7364ms 0.5412ms 1.8477 KOps/s 1.8327 KOps/s $\color{#35bf28}+0.82\%$
test_vmap_mlp_speed_decorator[False-False] 0.8099ms 0.5429ms 1.8420 KOps/s 1.8172 KOps/s $\color{#35bf28}+1.37\%$
test_to_module_speed[True] 2.0167ms 1.3591ms 735.8046 Ops/s 743.6681 Ops/s $\color{#d91a1a}-1.06\%$
test_to_module_speed[False] 1.8273ms 1.3131ms 761.5482 Ops/s 752.2260 Ops/s $\color{#35bf28}+1.24\%$
test_tc_init 95.7400μs 49.2575μs 20.3015 KOps/s 20.9211 KOps/s $\color{#d91a1a}-2.96\%$
test_tc_init_nested 0.2005ms 97.6750μs 10.2380 KOps/s 10.6874 KOps/s $\color{#d91a1a}-4.20\%$
test_tc_first_layer_tensor 23.4140μs 1.6139μs 619.6065 KOps/s 636.4004 KOps/s $\color{#d91a1a}-2.64\%$
test_tc_first_layer_nontensor 45.6250μs 4.7803μs 209.1898 KOps/s 212.6301 KOps/s $\color{#d91a1a}-1.62\%$
test_tc_second_layer_tensor 27.2610μs 2.9804μs 335.5275 KOps/s 342.7123 KOps/s $\color{#d91a1a}-2.10\%$
test_tc_second_layer_nontensor 30.7580μs 6.1444μs 162.7507 KOps/s 164.3015 KOps/s $\color{#d91a1a}-0.94\%$
test_unbind 0.2512s 14.2587ms 70.1327 Ops/s 74.3232 Ops/s $\textbf{\color{#d91a1a}-5.64\%}$
test_full_like 10.0138ms 8.6963ms 114.9914 Ops/s 109.8619 Ops/s $\color{#35bf28}+4.67\%$
test_zeros_like 8.6451ms 4.8116ms 207.8311 Ops/s 315.1094 Ops/s $\textbf{\color{#d91a1a}-34.04\%}$
test_ones_like 5.2104ms 3.9040ms 256.1467 Ops/s 269.2782 Ops/s $\color{#d91a1a}-4.88\%$
test_clone 6.7007ms 5.7041ms 175.3128 Ops/s 178.5024 Ops/s $\color{#d91a1a}-1.79\%$
test_squeeze 0.1083ms 12.3872μs 80.7282 KOps/s 78.4823 KOps/s $\color{#35bf28}+2.86\%$
test_unsqueeze 0.1736ms 92.6865μs 10.7891 KOps/s 10.4051 KOps/s $\color{#35bf28}+3.69\%$
test_split 0.3722ms 0.1969ms 5.0794 KOps/s 5.0275 KOps/s $\color{#35bf28}+1.03\%$
test_permute 0.3572ms 0.2054ms 4.8687 KOps/s 4.7458 KOps/s $\color{#35bf28}+2.59\%$
test_stack 30.2638ms 26.7301ms 37.4110 Ops/s 37.6349 Ops/s $\color{#d91a1a}-0.59\%$
test_cat 30.2103ms 26.2914ms 38.0352 Ops/s 38.1020 Ops/s $\color{#d91a1a}-0.18\%$

Copy link

github-actions bot commented Feb 3, 2025

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 229. Improved: $\large\color{#35bf28}34$. Worsened: $\large\color{#d91a1a}18$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 26.5610μs 11.6595μs 85.7667 KOps/s 76.1461 KOps/s $\textbf{\color{#35bf28}+12.63\%}$
test_plain_set_stack_nested 0.1924ms 11.8112μs 84.6655 KOps/s 74.9771 KOps/s $\textbf{\color{#35bf28}+12.92\%}$
test_plain_set_nested_inplace 36.5210μs 12.6585μs 78.9981 KOps/s 70.5476 KOps/s $\textbf{\color{#35bf28}+11.98\%}$
test_plain_set_stack_nested_inplace 0.2023ms 12.7162μs 78.6398 KOps/s 70.7106 KOps/s $\textbf{\color{#35bf28}+11.21\%}$
test_items 0.1660ms 2.8645μs 349.0988 KOps/s 348.7121 KOps/s $\color{#35bf28}+0.11\%$
test_items_nested 0.4200ms 0.3677ms 2.7197 KOps/s 2.7512 KOps/s $\color{#d91a1a}-1.14\%$
test_items_nested_locked 0.4193ms 0.3661ms 2.7317 KOps/s 2.7450 KOps/s $\color{#d91a1a}-0.48\%$
test_items_nested_leaf 0.1621ms 59.1683μs 16.9009 KOps/s 17.2142 KOps/s $\color{#d91a1a}-1.82\%$
test_items_stack_nested 0.4013ms 0.3674ms 2.7218 KOps/s 2.7717 KOps/s $\color{#d91a1a}-1.80\%$
test_items_stack_nested_leaf 90.2210μs 59.1692μs 16.9007 KOps/s 16.9869 KOps/s $\color{#d91a1a}-0.51\%$
test_items_stack_nested_locked 0.5221ms 0.3669ms 2.7255 KOps/s 2.7561 KOps/s $\color{#d91a1a}-1.11\%$
test_keys 29.6000μs 3.4617μs 288.8770 KOps/s 289.1305 KOps/s $\color{#d91a1a}-0.09\%$
test_keys_nested 0.2348ms 87.4397μs 11.4365 KOps/s 11.4401 KOps/s $\color{#d91a1a}-0.03\%$
test_keys_nested_locked 0.7046ms 93.9984μs 10.6385 KOps/s 10.8544 KOps/s $\color{#d91a1a}-1.99\%$
test_keys_nested_leaf 0.1111ms 78.2400μs 12.7812 KOps/s 12.7972 KOps/s $\color{#d91a1a}-0.13\%$
test_keys_stack_nested 0.1267ms 87.8946μs 11.3773 KOps/s 11.4029 KOps/s $\color{#d91a1a}-0.22\%$
test_keys_stack_nested_leaf 0.1202ms 80.1749μs 12.4727 KOps/s 12.6875 KOps/s $\color{#d91a1a}-1.69\%$
test_keys_stack_nested_locked 0.1206ms 93.4760μs 10.6979 KOps/s 10.6973 KOps/s $+0.01\%$
test_values 6.1735μs 0.8519μs 1.1738 MOps/s 1.1773 MOps/s $\color{#d91a1a}-0.30\%$
test_values_nested 88.5310μs 37.4051μs 26.7343 KOps/s 26.8456 KOps/s $\color{#d91a1a}-0.41\%$
test_values_nested_locked 63.5910μs 38.7891μs 25.7804 KOps/s 25.6073 KOps/s $\color{#35bf28}+0.68\%$
test_values_nested_leaf 82.9110μs 42.5438μs 23.5052 KOps/s 24.1999 KOps/s $\color{#d91a1a}-2.87\%$
test_values_stack_nested 0.1393ms 37.3135μs 26.7999 KOps/s 26.3544 KOps/s $\color{#35bf28}+1.69\%$
test_values_stack_nested_leaf 0.2029ms 42.7162μs 23.4103 KOps/s 23.8894 KOps/s $\color{#d91a1a}-2.01\%$
test_values_stack_nested_locked 84.3520μs 39.3817μs 25.3925 KOps/s 25.4445 KOps/s $\color{#d91a1a}-0.20\%$
test_membership 10.2942μs 0.5086μs 1.9660 MOps/s 1.9489 MOps/s $\color{#35bf28}+0.88\%$
test_membership_nested 47.5110μs 2.0982μs 476.6001 KOps/s 508.1170 KOps/s $\textbf{\color{#d91a1a}-6.20\%}$
test_membership_nested_leaf 19.1155μs 2.0330μs 491.8911 KOps/s 498.5894 KOps/s $\color{#d91a1a}-1.34\%$
test_membership_stacked_nested 0.1169ms 2.1327μs 468.8916 KOps/s 485.8798 KOps/s $\color{#d91a1a}-3.50\%$
test_membership_stacked_nested_leaf 34.8710μs 2.0896μs 478.5626 KOps/s 480.9205 KOps/s $\color{#d91a1a}-0.49\%$
test_membership_nested_last 35.3410μs 3.0842μs 324.2355 KOps/s 330.3923 KOps/s $\color{#d91a1a}-1.86\%$
test_membership_nested_leaf_last 42.9110μs 3.0903μs 323.5927 KOps/s 331.9305 KOps/s $\color{#d91a1a}-2.51\%$
test_membership_stacked_nested_last 44.9310μs 8.1979μs 121.9820 KOps/s 122.4256 KOps/s $\color{#d91a1a}-0.36\%$
test_membership_stacked_nested_leaf_last 50.1900μs 8.1506μs 122.6909 KOps/s 122.9178 KOps/s $\color{#d91a1a}-0.18\%$
test_nested_getleaf 43.8910μs 6.2662μs 159.5863 KOps/s 161.8736 KOps/s $\color{#d91a1a}-1.41\%$
test_nested_get 33.8910μs 5.8440μs 171.1153 KOps/s 170.3131 KOps/s $\color{#35bf28}+0.47\%$
test_stacked_getleaf 31.6100μs 6.1484μs 162.6434 KOps/s 162.2188 KOps/s $\color{#35bf28}+0.26\%$
test_stacked_get 32.0500μs 5.8219μs 171.7642 KOps/s 169.8979 KOps/s $\color{#35bf28}+1.10\%$
test_nested_getitemleaf 41.9800μs 6.5278μs 153.1906 KOps/s 152.8297 KOps/s $\color{#35bf28}+0.24\%$
test_nested_getitem 32.4400μs 6.1698μs 162.0794 KOps/s 162.2102 KOps/s $\color{#d91a1a}-0.08\%$
test_stacked_getitemleaf 47.5110μs 6.4342μs 155.4203 KOps/s 155.4189 KOps/s $+0.00\%$
test_stacked_getitem 25.7210μs 6.0812μs 164.4404 KOps/s 163.5958 KOps/s $\color{#35bf28}+0.52\%$
test_lock_nested 10.6896ms 0.3475ms 2.8776 KOps/s 2.8774 KOps/s $+0.01\%$
test_lock_stack_nested 0.4164ms 0.3337ms 2.9963 KOps/s 2.9593 KOps/s $\color{#35bf28}+1.25\%$
test_unlock_nested 0.3902ms 0.2808ms 3.5617 KOps/s 3.5789 KOps/s $\color{#d91a1a}-0.48\%$
test_unlock_stack_nested 0.4212ms 0.2732ms 3.6597 KOps/s 3.6323 KOps/s $\color{#35bf28}+0.75\%$
test_flatten_speed 0.2716ms 75.2246μs 13.2935 KOps/s 13.3286 KOps/s $\color{#d91a1a}-0.26\%$
test_unflatten_speed 0.3712ms 0.3243ms 3.0832 KOps/s 3.1023 KOps/s $\color{#d91a1a}-0.62\%$
test_common_ops 0.8129ms 0.5800ms 1.7242 KOps/s 1.5795 KOps/s $\textbf{\color{#35bf28}+9.16\%}$
test_creation 0.1324ms 1.7441μs 573.3749 KOps/s 561.6233 KOps/s $\color{#35bf28}+2.09\%$
test_creation_empty 29.8200μs 7.0072μs 142.7098 KOps/s 97.7240 KOps/s $\textbf{\color{#35bf28}+46.03\%}$
test_creation_nested_1 32.6510μs 8.6650μs 115.4065 KOps/s 84.1102 KOps/s $\textbf{\color{#35bf28}+37.21\%}$
test_creation_nested_2 44.6010μs 11.4788μs 87.1173 KOps/s 67.7708 KOps/s $\textbf{\color{#35bf28}+28.55\%}$
test_clone 48.1110μs 9.6487μs 103.6409 KOps/s 102.8347 KOps/s $\color{#35bf28}+0.78\%$
test_getitem[int] 1.6457ms 10.8567μs 92.1089 KOps/s 93.1885 KOps/s $\color{#d91a1a}-1.16\%$
test_getitem[slice_int] 0.1036ms 20.8226μs 48.0247 KOps/s 48.2381 KOps/s $\color{#d91a1a}-0.44\%$
test_getitem[range] 0.1456ms 37.8228μs 26.4391 KOps/s 28.2028 KOps/s $\textbf{\color{#d91a1a}-6.25\%}$
test_getitem[tuple] 0.1174ms 18.2122μs 54.9083 KOps/s 55.0781 KOps/s $\color{#d91a1a}-0.31\%$
test_getitem[list] 0.1619ms 32.3594μs 30.9030 KOps/s 32.0153 KOps/s $\color{#d91a1a}-3.47\%$
test_setitem_dim[int] 49.2210μs 18.7800μs 53.2480 KOps/s 55.8792 KOps/s $\color{#d91a1a}-4.71\%$
test_setitem_dim[slice_int] 0.1591ms 39.0516μs 25.6071 KOps/s 27.7329 KOps/s $\textbf{\color{#d91a1a}-7.66\%}$
test_setitem_dim[range] 0.1520ms 54.2238μs 18.4421 KOps/s 19.6354 KOps/s $\textbf{\color{#d91a1a}-6.08\%}$
test_setitem_dim[tuple] 51.3210μs 31.0332μs 32.2236 KOps/s 32.4277 KOps/s $\color{#d91a1a}-0.63\%$
test_setitem 0.1027ms 13.3201μs 75.0744 KOps/s 66.2126 KOps/s $\textbf{\color{#35bf28}+13.38\%}$
test_set 48.4110μs 12.9229μs 77.3820 KOps/s 67.8104 KOps/s $\textbf{\color{#35bf28}+14.12\%}$
test_set_shared 0.5184ms 0.1631ms 6.1317 KOps/s 6.2300 KOps/s $\color{#d91a1a}-1.58\%$
test_update 0.4082ms 15.6748μs 63.7967 KOps/s 53.3912 KOps/s $\textbf{\color{#35bf28}+19.49\%}$
test_update_nested 82.9110μs 20.9069μs 47.8310 KOps/s 41.8968 KOps/s $\textbf{\color{#35bf28}+14.16\%}$
test_update__nested 0.5977ms 24.8209μs 40.2886 KOps/s 42.1511 KOps/s $\color{#d91a1a}-4.42\%$
test_set_nested 0.1393ms 14.2748μs 70.0533 KOps/s 64.0563 KOps/s $\textbf{\color{#35bf28}+9.36\%}$
test_set_nested_new 59.6810μs 16.4607μs 60.7509 KOps/s 55.8377 KOps/s $\textbf{\color{#35bf28}+8.80\%}$
test_select 0.2068ms 28.7931μs 34.7306 KOps/s 33.4720 KOps/s $\color{#35bf28}+3.76\%$
test_select_nested 0.1354ms 43.6036μs 22.9339 KOps/s 22.8750 KOps/s $\color{#35bf28}+0.26\%$
test_exclude_nested 0.1010ms 61.9920μs 16.1311 KOps/s 15.7729 KOps/s $\color{#35bf28}+2.27\%$
test_empty[True] 0.3946ms 0.2938ms 3.4042 KOps/s 3.3700 KOps/s $\color{#35bf28}+1.01\%$
test_empty[False] 3.5950μs 0.8281μs 1.2077 MOps/s 1.1889 MOps/s $\color{#35bf28}+1.57\%$
test_to 85.3510μs 54.3566μs 18.3970 KOps/s 17.3075 KOps/s $\textbf{\color{#35bf28}+6.29\%}$
test_to_nonblocking 0.1965ms 46.4467μs 21.5301 KOps/s 21.9680 KOps/s $\color{#d91a1a}-1.99\%$
test_unbind_speed 0.2699ms 0.2431ms 4.1129 KOps/s 4.1974 KOps/s $\color{#d91a1a}-2.01\%$
test_unbind_speed_stack0 0.3789ms 0.2380ms 4.2008 KOps/s 4.2683 KOps/s $\color{#d91a1a}-1.58\%$
test_unbind_speed_stack1 0.1040s 0.7278ms 1.3740 KOps/s 1.3685 KOps/s $\color{#35bf28}+0.40\%$
test_split 1.6023ms 1.4738ms 678.5128 Ops/s 628.8453 Ops/s $\textbf{\color{#35bf28}+7.90\%}$
test_chunk 0.1038s 1.8041ms 554.2990 Ops/s 623.6865 Ops/s $\textbf{\color{#d91a1a}-11.13\%}$
test_consolidate[False-None] 3.1765ms 2.6816ms 372.9133 Ops/s 371.2272 Ops/s $\color{#35bf28}+0.45\%$
test_consolidate[default-None] 1.8554ms 1.7264ms 579.2408 Ops/s 586.7769 Ops/s $\color{#d91a1a}-1.28\%$
test_consolidate[reduce-overhead-None] 1.9076ms 1.7495ms 571.5841 Ops/s 572.9154 Ops/s $\color{#d91a1a}-0.23\%$
test_consolidate_njt[False-None] 7.0733ms 6.5374ms 152.9650 Ops/s 106.8957 Ops/s $\textbf{\color{#35bf28}+43.10\%}$
test_to[False-False-None] 0.3283s 2.2053ms 453.4439 Ops/s 602.7278 Ops/s $\textbf{\color{#d91a1a}-24.77\%}$
test_to[True-False-None] 1.5004ms 1.3128ms 761.7025 Ops/s 748.2362 Ops/s $\color{#35bf28}+1.80\%$
test_to[within-False-None] 4.2741ms 4.0820ms 244.9761 Ops/s 237.9109 Ops/s $\color{#35bf28}+2.97\%$
test_to[True-default-None] 5.4376ms 5.1816ms 192.9920 Ops/s 186.9728 Ops/s $\color{#35bf28}+3.22\%$
test_to_njt[False-False-None] 7.0994ms 6.7885ms 147.3085 Ops/s 137.6355 Ops/s $\textbf{\color{#35bf28}+7.03\%}$
test_to_njt[True-False-None] 5.8592ms 5.5482ms 180.2389 Ops/s 170.1395 Ops/s $\textbf{\color{#35bf28}+5.94\%}$
test_to_njt[within-False-None] 12.7214ms 12.4262ms 80.4752 Ops/s 80.6149 Ops/s $\color{#d91a1a}-0.17\%$
test_creation[device0] 0.4604ms 81.0372μs 12.3400 KOps/s 12.4446 KOps/s $\color{#d91a1a}-0.84\%$
test_creation_from_tensor 0.5300ms 84.0601μs 11.8962 KOps/s 11.9669 KOps/s $\color{#d91a1a}-0.59\%$
test_add_one[memmap_tensor0] 0.2265ms 6.2665μs 159.5795 KOps/s 161.8172 KOps/s $\color{#d91a1a}-1.38\%$
test_contiguous[memmap_tensor0] 2.0941μs 0.4654μs 2.1487 MOps/s 2.3351 MOps/s $\textbf{\color{#d91a1a}-7.98\%}$
test_stack[memmap_tensor0] 0.1503ms 4.5838μs 218.1595 KOps/s 226.9568 KOps/s $\color{#d91a1a}-3.88\%$
test_memmaptd_index 1.9222ms 0.2436ms 4.1048 KOps/s 4.1517 KOps/s $\color{#d91a1a}-1.13\%$
test_memmaptd_index_astensor 0.4398ms 0.3015ms 3.3172 KOps/s 3.3349 KOps/s $\color{#d91a1a}-0.53\%$
test_memmaptd_index_op 0.6641ms 0.5294ms 1.8888 KOps/s 1.6970 KOps/s $\textbf{\color{#35bf28}+11.30\%}$
test_serialize_model 0.1325s 0.1308s 7.6424 Ops/s 7.6459 Ops/s $\color{#d91a1a}-0.05\%$
test_serialize_model_pickle 1.3470s 1.1909s 0.8397 Ops/s 0.8238 Ops/s $\color{#35bf28}+1.93\%$
test_serialize_weights 0.1321s 0.1302s 7.6793 Ops/s 7.6829 Ops/s $\color{#d91a1a}-0.05\%$
test_serialize_weights_returnearly 0.4004s 64.3888ms 15.5306 Ops/s 14.1959 Ops/s $\textbf{\color{#35bf28}+9.40\%}$
test_serialize_weights_pickle 1.3768s 1.2238s 0.8171 Ops/s 0.8204 Ops/s $\color{#d91a1a}-0.40\%$
test_reshape_pytree 85.8810μs 22.4938μs 44.4568 KOps/s 44.8777 KOps/s $\color{#d91a1a}-0.94\%$
test_reshape_td 0.1981ms 27.2913μs 36.6417 KOps/s 36.7647 KOps/s $\color{#d91a1a}-0.33\%$
test_view_pytree 0.1823ms 22.1382μs 45.1709 KOps/s 45.7035 KOps/s $\color{#d91a1a}-1.17\%$
test_view_td 0.1761ms 30.4162μs 32.8772 KOps/s 30.8426 KOps/s $\textbf{\color{#35bf28}+6.60\%}$
test_unbind_pytree 0.1851ms 27.2728μs 36.6666 KOps/s 35.9689 KOps/s $\color{#35bf28}+1.94\%$
test_unbind_td 0.5642ms 36.4439μs 27.4394 KOps/s 27.6511 KOps/s $\color{#d91a1a}-0.77\%$
test_split_pytree 0.1533ms 30.2141μs 33.0971 KOps/s 33.4747 KOps/s $\color{#d91a1a}-1.13\%$
test_split_td 0.7565ms 39.2163μs 25.4996 KOps/s 25.5891 KOps/s $\color{#d91a1a}-0.35\%$
test_add_pytree 0.2331ms 31.6634μs 31.5822 KOps/s 30.3381 KOps/s $\color{#35bf28}+4.10\%$
test_add_td 0.2104ms 41.8778μs 23.8790 KOps/s 19.7922 KOps/s $\textbf{\color{#35bf28}+20.65\%}$
test_compile_add_one_nested[tensordict-compile] 0.2744ms 0.1210ms 8.2662 KOps/s 7.8169 KOps/s $\textbf{\color{#35bf28}+5.75\%}$
test_compile_add_one_nested[tensordict-eager] 0.2983ms 0.1300ms 7.6951 KOps/s 7.6435 KOps/s $\color{#35bf28}+0.68\%$
test_compile_add_one_nested[pytree-compile] 0.2461ms 96.2670μs 10.3878 KOps/s 10.2420 KOps/s $\color{#35bf28}+1.42\%$
test_compile_add_one_nested[pytree-eager] 1.0725ms 0.1443ms 6.9303 KOps/s 6.8979 KOps/s $\color{#35bf28}+0.47\%$
test_compile_copy_nested[tensordict-compile] 0.1623ms 24.2608μs 41.2188 KOps/s 43.5089 KOps/s $\textbf{\color{#d91a1a}-5.26\%}$
test_compile_copy_nested[tensordict-eager] 0.1729ms 29.9826μs 33.3527 KOps/s 33.6286 KOps/s $\color{#d91a1a}-0.82\%$
test_compile_copy_nested[pytree-compile] 0.3830ms 64.4850μs 15.5075 KOps/s 15.3648 KOps/s $\color{#35bf28}+0.93\%$
test_compile_copy_nested[pytree-eager] 0.1503ms 49.1111μs 20.3620 KOps/s 20.2680 KOps/s $\color{#35bf28}+0.46\%$
test_compile_add_one_flat[tensordict-compile] 0.3035ms 0.1431ms 6.9897 KOps/s 7.0858 KOps/s $\color{#d91a1a}-1.36\%$
test_compile_add_one_flat[tensordict-eager] 0.4028ms 0.2164ms 4.6202 KOps/s 4.6552 KOps/s $\color{#d91a1a}-0.75\%$
test_compile_add_one_flat[tensorclass-compile] 0.2619ms 0.1057ms 9.4583 KOps/s 10.3373 KOps/s $\textbf{\color{#d91a1a}-8.50\%}$
test_compile_add_one_flat[tensorclass-eager] 0.2166ms 57.3166μs 17.4470 KOps/s 17.4561 KOps/s $\color{#d91a1a}-0.05\%$
test_compile_add_one_flat[pytree-compile] 0.2865ms 0.1415ms 7.0654 KOps/s 7.4503 KOps/s $\textbf{\color{#d91a1a}-5.17\%}$
test_compile_add_one_flat[pytree-eager] 0.6742ms 0.4705ms 2.1252 KOps/s 2.1524 KOps/s $\color{#d91a1a}-1.27\%$
test_compile_add_self_flat[tensordict-eager] 0.4274ms 0.2658ms 3.7629 KOps/s 3.8891 KOps/s $\color{#d91a1a}-3.24\%$
test_compile_add_self_flat[tensordict-compile] 0.3532ms 0.1479ms 6.7599 KOps/s 7.1256 KOps/s $\textbf{\color{#d91a1a}-5.13\%}$
test_compile_add_self_flat[tensorclass-eager] 0.2850ms 69.4916μs 14.3902 KOps/s 14.7459 KOps/s $\color{#d91a1a}-2.41\%$
test_compile_add_self_flat[tensorclass-compile] 0.2922ms 0.1040ms 9.6118 KOps/s 10.2553 KOps/s $\textbf{\color{#d91a1a}-6.28\%}$
test_compile_add_self_flat[pytree-eager] 0.5949ms 0.3980ms 2.5127 KOps/s 2.5731 KOps/s $\color{#d91a1a}-2.35\%$
test_compile_add_self_flat[pytree-compile] 0.2925ms 0.1419ms 7.0471 KOps/s 7.5981 KOps/s $\textbf{\color{#d91a1a}-7.25\%}$
test_compile_copy_flat[tensordict-compile] 0.1707ms 18.4188μs 54.2925 KOps/s 56.3553 KOps/s $\color{#d91a1a}-3.66\%$
test_compile_copy_flat[tensordict-eager] 75.9320μs 31.2321μs 32.0183 KOps/s 31.9595 KOps/s $\color{#35bf28}+0.18\%$
test_compile_copy_flat[pytree-compile] 0.2154ms 70.2447μs 14.2360 KOps/s 14.0035 KOps/s $\color{#35bf28}+1.66\%$
test_compile_copy_flat[pytree-eager] 0.1444ms 51.3154μs 19.4873 KOps/s 19.1807 KOps/s $\color{#35bf28}+1.60\%$
test_compile_assign_and_add[tensordict-compile] 1.6708ms 0.3981ms 2.5122 KOps/s 2.1964 KOps/s $\textbf{\color{#35bf28}+14.38\%}$
test_compile_assign_and_add[tensordict-eager] 2.9946ms 2.5407ms 393.5899 Ops/s 398.9089 Ops/s $\color{#d91a1a}-1.33\%$
test_compile_assign_and_add[pytree-compile] 1.6467ms 0.4469ms 2.2376 KOps/s 2.2890 KOps/s $\color{#d91a1a}-2.25\%$
test_compile_assign_and_add[pytree-eager] 2.9260ms 2.5348ms 394.5100 Ops/s 394.5627 Ops/s $\color{#d91a1a}-0.01\%$
test_compile_indexing[tensor-tensordict-compile] 0.5539ms 0.1100ms 9.0879 KOps/s 8.6686 KOps/s $\color{#35bf28}+4.84\%$
test_compile_indexing[tensor-tensordict-eager] 0.5630ms 76.4597μs 13.0788 KOps/s 12.3022 KOps/s $\textbf{\color{#35bf28}+6.31\%}$
test_compile_indexing[tensor-tensorclass-compile] 0.7775ms 0.1037ms 9.6420 KOps/s 9.2876 KOps/s $\color{#35bf28}+3.82\%$
test_compile_indexing[tensor-tensorclass-eager] 0.4638ms 64.7747μs 15.4381 KOps/s 14.4648 KOps/s $\textbf{\color{#35bf28}+6.73\%}$
test_compile_indexing[tensor-pytree-compile] 0.5308ms 0.1067ms 9.3698 KOps/s 9.1714 KOps/s $\color{#35bf28}+2.16\%$
test_compile_indexing[tensor-pytree-eager] 0.4903ms 67.4838μs 14.8184 KOps/s 14.4521 KOps/s $\color{#35bf28}+2.53\%$
test_compile_indexing[slice-tensordict-compile] 0.5252ms 0.1029ms 9.7176 KOps/s 9.9295 KOps/s $\color{#d91a1a}-2.13\%$
test_compile_indexing[slice-tensordict-eager] 0.4161ms 17.3015μs 57.7985 KOps/s 58.1223 KOps/s $\color{#d91a1a}-0.56\%$
test_compile_indexing[slice-tensorclass-compile] 0.2591ms 95.2835μs 10.4950 KOps/s 10.3779 KOps/s $\color{#35bf28}+1.13\%$
test_compile_indexing[slice-tensorclass-eager] 0.4337ms 15.5707μs 64.2231 KOps/s 63.4361 KOps/s $\color{#35bf28}+1.24\%$
test_compile_indexing[slice-pytree-compile] 0.5021ms 97.4876μs 10.2577 KOps/s 10.2916 KOps/s $\color{#d91a1a}-0.33\%$
test_compile_indexing[slice-pytree-eager] 0.4099ms 15.6393μs 63.9417 KOps/s 63.6234 KOps/s $\color{#35bf28}+0.50\%$
test_compile_indexing[int-tensordict-compile] 0.5074ms 0.1054ms 9.4915 KOps/s 9.8981 KOps/s $\color{#d91a1a}-4.11\%$
test_compile_indexing[int-tensordict-eager] 0.5921ms 16.8020μs 59.5166 KOps/s 58.9018 KOps/s $\color{#35bf28}+1.04\%$
test_compile_indexing[int-tensorclass-compile] 0.5246ms 99.1672μs 10.0840 KOps/s 10.3321 KOps/s $\color{#d91a1a}-2.40\%$
test_compile_indexing[int-tensorclass-eager] 0.4174ms 15.5666μs 64.2402 KOps/s 64.5744 KOps/s $\color{#d91a1a}-0.52\%$
test_compile_indexing[int-pytree-compile] 0.5028ms 97.2839μs 10.2792 KOps/s 10.3556 KOps/s $\color{#d91a1a}-0.74\%$
test_compile_indexing[int-pytree-eager] 0.4328ms 18.4452μs 54.2148 KOps/s 63.9858 KOps/s $\textbf{\color{#d91a1a}-15.27\%}$
test_mod_add[eager] 0.4440ms 36.4856μs 27.4080 KOps/s 25.1952 KOps/s $\textbf{\color{#35bf28}+8.78\%}$
test_mod_add[compile] 0.4860ms 80.2764μs 12.4570 KOps/s 12.3893 KOps/s $\color{#35bf28}+0.55\%$
test_mod_add[compile-overhead] 0.3287ms 0.1915ms 5.2220 KOps/s 5.5423 KOps/s $\textbf{\color{#d91a1a}-5.78\%}$
test_mod_wrap[eager] 0.6496ms 0.2409ms 4.1511 KOps/s 4.0432 KOps/s $\color{#35bf28}+2.67\%$
test_mod_wrap[compile] 0.4501ms 0.2813ms 3.5543 KOps/s 3.5152 KOps/s $\color{#35bf28}+1.11\%$
test_mod_wrap[compile-overhead] 7.0198ms 3.7499ms 266.6731 Ops/s 270.5030 Ops/s $\color{#d91a1a}-1.42\%$
test_mod_wrap_and_backward[eager] 1.6327ms 1.4379ms 695.4624 Ops/s 706.4862 Ops/s $\color{#d91a1a}-1.56\%$
test_mod_wrap_and_backward[compile] 1.5319ms 1.3565ms 737.2017 Ops/s 742.4789 Ops/s $\color{#d91a1a}-0.71\%$
test_mod_wrap_and_backward[compile-overhead] 1.4844ms 0.9385ms 1.0655 KOps/s 1.0686 KOps/s $\color{#d91a1a}-0.29\%$
test_seq_add[eager] 0.3180ms 0.1130ms 8.8502 KOps/s 8.5873 KOps/s $\color{#35bf28}+3.06\%$
test_seq_add[compile] 0.2609ms 88.5577μs 11.2921 KOps/s 11.2690 KOps/s $\color{#35bf28}+0.20\%$
test_seq_add[compile-overhead] 0.3041ms 0.1284ms 7.7856 KOps/s 7.6898 KOps/s $\color{#35bf28}+1.25\%$
test_seq_wrap[eager] 0.5919ms 0.4071ms 2.4567 KOps/s 2.3617 KOps/s $\color{#35bf28}+4.02\%$
test_seq_wrap[compile] 0.4646ms 0.2961ms 3.3768 KOps/s 3.2204 KOps/s $\color{#35bf28}+4.86\%$
test_seq_wrap[compile-overhead] 0.3925ms 0.2247ms 4.4494 KOps/s 4.4173 KOps/s $\color{#35bf28}+0.73\%$
test_func_call_runtime[False-eager] 0.8810ms 0.7038ms 1.4209 KOps/s 1.3960 KOps/s $\color{#35bf28}+1.79\%$
test_func_call_runtime[False-compile] 1.0019ms 0.7478ms 1.3372 KOps/s 1.3446 KOps/s $\color{#d91a1a}-0.55\%$
test_func_call_runtime[False-compile-overhead] 0.5060ms 0.3644ms 2.7441 KOps/s 2.7290 KOps/s $\color{#35bf28}+0.55\%$
test_func_call_runtime[True-eager] 1.0863ms 0.8837ms 1.1316 KOps/s 1.1477 KOps/s $\color{#d91a1a}-1.41\%$
test_func_call_runtime[True-compile] 0.9706ms 0.7661ms 1.3053 KOps/s 1.3131 KOps/s $\color{#d91a1a}-0.59\%$
test_func_call_runtime[True-compile-overhead] 0.5126ms 0.3854ms 2.5945 KOps/s 2.5981 KOps/s $\color{#d91a1a}-0.14\%$
test_func_call_cm_runtime[False-eager] 0.8899ms 0.7052ms 1.4181 KOps/s 1.3280 KOps/s $\textbf{\color{#35bf28}+6.78\%}$
test_func_call_cm_runtime[False-compile] 0.9231ms 0.7476ms 1.3375 KOps/s 1.3520 KOps/s $\color{#d91a1a}-1.07\%$
test_func_call_cm_runtime[False-compile-overhead] 0.5531ms 0.3682ms 2.7158 KOps/s 2.7261 KOps/s $\color{#d91a1a}-0.38\%$
test_func_call_cm_runtime[True-eager] 1.1878ms 0.9899ms 1.0102 KOps/s 1.0297 KOps/s $\color{#d91a1a}-1.90\%$
test_func_call_cm_runtime[True-compile] 1.1395ms 0.9617ms 1.0398 KOps/s 1.0400 KOps/s $\color{#d91a1a}-0.01\%$
test_func_call_cm_runtime[True-compile-overhead] 1.1150ms 0.9558ms 1.0462 KOps/s 1.0410 KOps/s $\color{#35bf28}+0.50\%$
test_vmap_func_call_cm_runtime[eager] 2.4441ms 2.0229ms 494.3329 Ops/s 495.5169 Ops/s $\color{#d91a1a}-0.24\%$
test_vmap_func_call_cm_runtime[compile] 1.0046ms 0.8176ms 1.2231 KOps/s 1.2121 KOps/s $\color{#35bf28}+0.90\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.5671ms 0.4181ms 2.3915 KOps/s 2.3660 KOps/s $\color{#35bf28}+1.08\%$
test_distributed 5.4416ms 0.2222ms 4.5013 KOps/s 7.4311 KOps/s $\textbf{\color{#d91a1a}-39.43\%}$
test_tdmodule 55.6810μs 18.3737μs 54.4256 KOps/s 45.5289 KOps/s $\textbf{\color{#35bf28}+19.54\%}$
test_tdmodule_dispatch 0.1622ms 33.2196μs 30.1027 KOps/s 26.3342 KOps/s $\textbf{\color{#35bf28}+14.31\%}$
test_tdseq 40.7810μs 19.8491μs 50.3800 KOps/s 45.7360 KOps/s $\textbf{\color{#35bf28}+10.15\%}$
test_tdseq_dispatch 0.1211ms 37.0455μs 26.9938 KOps/s 24.2692 KOps/s $\textbf{\color{#35bf28}+11.23\%}$
test_instantiation_functorch 1.7174ms 1.5418ms 648.5952 Ops/s 647.7303 Ops/s $\color{#35bf28}+0.13\%$
test_exec_functorch 0.3464ms 0.1407ms 7.1088 KOps/s 7.0309 KOps/s $\color{#35bf28}+1.11\%$
test_exec_functional_call 0.3146ms 0.1295ms 7.7239 KOps/s 7.6060 KOps/s $\color{#35bf28}+1.55\%$
test_exec_td_decorator 0.3869ms 0.1812ms 5.5186 KOps/s 5.4296 KOps/s $\color{#35bf28}+1.64\%$
test_vmap_mlp_speed_decorator[True-True] 0.8277ms 0.6587ms 1.5182 KOps/s 1.5051 KOps/s $\color{#35bf28}+0.87\%$
test_vmap_mlp_speed_decorator[True-False] 0.8298ms 0.6608ms 1.5134 KOps/s 1.4932 KOps/s $\color{#35bf28}+1.35\%$
test_vmap_mlp_speed_decorator[False-True] 0.7384ms 0.5743ms 1.7411 KOps/s 1.7403 KOps/s $\color{#35bf28}+0.05\%$
test_vmap_mlp_speed_decorator[False-False] 0.7688ms 0.5742ms 1.7417 KOps/s 1.7491 KOps/s $\color{#d91a1a}-0.42\%$
test_vmap_transformer_speed_decorator[True-True] 18.7637ms 18.4827ms 54.1045 Ops/s 54.0444 Ops/s $\color{#35bf28}+0.11\%$
test_vmap_transformer_speed_decorator[True-False] 19.5438ms 18.6291ms 53.6794 Ops/s 54.3141 Ops/s $\color{#d91a1a}-1.17\%$
test_vmap_transformer_speed_decorator[False-True] 19.3319ms 18.4292ms 54.2618 Ops/s 54.8432 Ops/s $\color{#d91a1a}-1.06\%$
test_vmap_transformer_speed_decorator[False-False] 19.4312ms 18.5507ms 53.9062 Ops/s 54.6281 Ops/s $\color{#d91a1a}-1.32\%$
test_to_module_speed[True] 1.0843ms 0.9614ms 1.0401 KOps/s 1.0470 KOps/s $\color{#d91a1a}-0.66\%$
test_to_module_speed[False] 1.4265ms 0.9535ms 1.0487 KOps/s 1.0584 KOps/s $\color{#d91a1a}-0.91\%$
test_tc_init 0.1089ms 35.3577μs 28.2824 KOps/s 27.1504 KOps/s $\color{#35bf28}+4.17\%$
test_tc_init_nested 99.9220μs 69.7618μs 14.3345 KOps/s 13.5579 KOps/s $\textbf{\color{#35bf28}+5.73\%}$
test_tc_first_layer_tensor 29.5200μs 0.8299μs 1.2049 MOps/s 1.2122 MOps/s $\color{#d91a1a}-0.60\%$
test_tc_first_layer_nontensor 25.0710μs 2.2700μs 440.5212 KOps/s 436.9573 KOps/s $\color{#35bf28}+0.82\%$
test_tc_second_layer_tensor 10.5303μs 1.4324μs 698.1272 KOps/s 698.2298 KOps/s $\color{#d91a1a}-0.01\%$
test_tc_second_layer_nontensor 26.0300μs 3.0084μs 332.4081 KOps/s 332.8712 KOps/s $\color{#d91a1a}-0.14\%$
test_unbind 0.2465s 10.7915ms 92.6657 Ops/s 142.4595 Ops/s $\textbf{\color{#d91a1a}-34.95\%}$
test_full_like 11.9676ms 10.4247ms 95.9262 Ops/s 94.0270 Ops/s $\color{#35bf28}+2.02\%$
test_zeros_like 5.1306ms 4.6014ms 217.3268 Ops/s 215.9141 Ops/s $\color{#35bf28}+0.65\%$
test_ones_like 5.5506ms 4.7100ms 212.3121 Ops/s 213.5197 Ops/s $\color{#d91a1a}-0.57\%$
test_clone 13.1741ms 10.0183ms 99.8172 Ops/s 131.2426 Ops/s $\textbf{\color{#d91a1a}-23.94\%}$
test_squeeze 0.1003ms 9.6537μs 103.5875 KOps/s 105.9075 KOps/s $\color{#d91a1a}-2.19\%$
test_unsqueeze 0.2196ms 72.3016μs 13.8309 KOps/s 14.1280 KOps/s $\color{#d91a1a}-2.10\%$
test_split 0.2713ms 0.1573ms 6.3577 KOps/s 6.4115 KOps/s $\color{#d91a1a}-0.84\%$
test_permute 0.3021ms 0.1733ms 5.7700 KOps/s 5.7062 KOps/s $\color{#35bf28}+1.12\%$
test_stack 53.9251ms 51.7830ms 19.3113 Ops/s 18.6483 Ops/s $\color{#35bf28}+3.56\%$
test_cat 54.5988ms 52.4317ms 19.0724 Ops/s 19.1286 Ops/s $\color{#d91a1a}-0.29\%$

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants