Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature] str2td #953

Merged
merged 2 commits into from
Sep 10, 2024
Merged

[Feature] str2td #953

merged 2 commits into from
Sep 10, 2024

Conversation

vmoens
Copy link
Contributor

@vmoens vmoens commented Aug 9, 2024

Stack from ghstack (oldest at bottom):

[ghstack-poisoned]
vmoens added a commit that referenced this pull request Aug 9, 2024
ghstack-source-id: af2afacb11521683b977659bc14212853af17a56
Pull Request resolved: #953
@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Aug 9, 2024
Copy link

github-actions bot commented Aug 9, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 222. Improved: $\large\color{#35bf28}22$. Worsened: $\large\color{#d91a1a}11$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 54.2510μs 19.7582μs 50.6118 KOps/s 50.3875 KOps/s $\color{#35bf28}+0.45\%$
test_plain_set_stack_nested 47.2690μs 20.1838μs 49.5448 KOps/s 49.4162 KOps/s $\color{#35bf28}+0.26\%$
test_plain_set_nested_inplace 55.2040μs 21.4807μs 46.5535 KOps/s 46.3358 KOps/s $\color{#35bf28}+0.47\%$
test_plain_set_stack_nested_inplace 67.4870μs 21.4250μs 46.6745 KOps/s 43.6517 KOps/s $\textbf{\color{#35bf28}+6.92\%}$
test_items 22.6320μs 4.1907μs 238.6215 KOps/s 242.8176 KOps/s $\color{#d91a1a}-1.73\%$
test_items_nested 0.7146ms 0.3375ms 2.9631 KOps/s 3.0730 KOps/s $\color{#d91a1a}-3.58\%$
test_items_nested_locked 0.5257ms 0.3375ms 2.9630 KOps/s 3.0792 KOps/s $\color{#d91a1a}-3.77\%$
test_items_nested_leaf 0.2016ms 85.8746μs 11.6449 KOps/s 11.7256 KOps/s $\color{#d91a1a}-0.69\%$
test_items_stack_nested 0.5181ms 0.3378ms 2.9607 KOps/s 3.0509 KOps/s $\color{#d91a1a}-2.96\%$
test_items_stack_nested_leaf 0.1608ms 83.8456μs 11.9267 KOps/s 11.8423 KOps/s $\color{#35bf28}+0.71\%$
test_items_stack_nested_locked 0.4327ms 0.3368ms 2.9689 KOps/s 3.0575 KOps/s $\color{#d91a1a}-2.90\%$
test_keys 36.3480μs 3.5281μs 283.4426 KOps/s 281.9610 KOps/s $\color{#35bf28}+0.53\%$
test_keys_nested 0.1830ms 96.8303μs 10.3273 KOps/s 10.2215 KOps/s $\color{#35bf28}+1.04\%$
test_keys_nested_locked 1.8026ms 0.1033ms 9.6839 KOps/s 9.8528 KOps/s $\color{#d91a1a}-1.71\%$
test_keys_nested_leaf 0.1835ms 81.9226μs 12.2066 KOps/s 11.9699 KOps/s $\color{#35bf28}+1.98\%$
test_keys_stack_nested 0.1646ms 95.2988μs 10.4933 KOps/s 10.4800 KOps/s $\color{#35bf28}+0.13\%$
test_keys_stack_nested_leaf 0.1763ms 79.8440μs 12.5244 KOps/s 12.5806 KOps/s $\color{#d91a1a}-0.45\%$
test_keys_stack_nested_locked 0.1950ms 99.8303μs 10.0170 KOps/s 10.0667 KOps/s $\color{#d91a1a}-0.49\%$
test_values 10.3118μs 1.0635μs 940.3354 KOps/s 927.7392 KOps/s $\color{#35bf28}+1.36\%$
test_values_nested 90.3690μs 48.0000μs 20.8333 KOps/s 20.2710 KOps/s $\color{#35bf28}+2.77\%$
test_values_nested_locked 84.0570μs 47.9436μs 20.8578 KOps/s 20.7636 KOps/s $\color{#35bf28}+0.45\%$
test_values_nested_leaf 0.1223ms 42.6563μs 23.4432 KOps/s 23.7696 KOps/s $\color{#d91a1a}-1.37\%$
test_values_stack_nested 87.8950μs 48.1655μs 20.7618 KOps/s 20.7810 KOps/s $\color{#d91a1a}-0.09\%$
test_values_stack_nested_leaf 89.6070μs 41.4550μs 24.1226 KOps/s 24.5493 KOps/s $\color{#d91a1a}-1.74\%$
test_values_stack_nested_locked 92.8730μs 48.3440μs 20.6851 KOps/s 20.8164 KOps/s $\color{#d91a1a}-0.63\%$
test_membership 4.8934μs 0.6946μs 1.4397 MOps/s 1.1675 MOps/s $\textbf{\color{#35bf28}+23.32\%}$
test_membership_nested 19.9070μs 2.5975μs 384.9801 KOps/s 385.0532 KOps/s $\color{#d91a1a}-0.02\%$
test_membership_nested_leaf 30.1760μs 2.5992μs 384.7379 KOps/s 390.4880 KOps/s $\color{#d91a1a}-1.47\%$
test_membership_stacked_nested 29.5960μs 2.6109μs 383.0093 KOps/s 388.8942 KOps/s $\color{#d91a1a}-1.51\%$
test_membership_stacked_nested_leaf 31.6990μs 2.6699μs 374.5459 KOps/s 383.7570 KOps/s $\color{#d91a1a}-2.40\%$
test_membership_nested_last 31.8000μs 3.8150μs 262.1244 KOps/s 267.5132 KOps/s $\color{#d91a1a}-2.01\%$
test_membership_nested_leaf_last 37.0890μs 3.8052μs 262.7995 KOps/s 264.6012 KOps/s $\color{#d91a1a}-0.68\%$
test_membership_stacked_nested_last 32.0100μs 4.8205μs 207.4469 KOps/s 235.1893 KOps/s $\textbf{\color{#d91a1a}-11.80\%}$
test_membership_stacked_nested_leaf_last 29.4250μs 4.8454μs 206.3823 KOps/s 233.8531 KOps/s $\textbf{\color{#d91a1a}-11.75\%}$
test_nested_getleaf 30.5970μs 10.5913μs 94.4173 KOps/s 92.6239 KOps/s $\color{#35bf28}+1.94\%$
test_nested_get 46.3570μs 9.9968μs 100.0316 KOps/s 98.2999 KOps/s $\color{#35bf28}+1.76\%$
test_stacked_getleaf 37.4300μs 10.6161μs 94.1964 KOps/s 93.9600 KOps/s $\color{#35bf28}+0.25\%$
test_stacked_get 48.3900μs 9.9846μs 100.1538 KOps/s 98.5857 KOps/s $\color{#35bf28}+1.59\%$
test_nested_getitemleaf 39.2530μs 10.9713μs 91.1465 KOps/s 90.7708 KOps/s $\color{#35bf28}+0.41\%$
test_nested_getitem 49.5120μs 10.2791μs 97.2849 KOps/s 96.6023 KOps/s $\color{#35bf28}+0.71\%$
test_stacked_getitemleaf 38.7520μs 10.8718μs 91.9810 KOps/s 91.4686 KOps/s $\color{#35bf28}+0.56\%$
test_stacked_getitem 39.8140μs 10.2275μs 97.7753 KOps/s 98.3970 KOps/s $\color{#d91a1a}-0.63\%$
test_lock_nested 89.0178ms 0.5719ms 1.7486 KOps/s 2.0881 KOps/s $\textbf{\color{#d91a1a}-16.26\%}$
test_lock_stack_nested 0.7954ms 0.4486ms 2.2290 KOps/s 2.2463 KOps/s $\color{#d91a1a}-0.77\%$
test_unlock_nested 91.0712ms 0.4974ms 2.0104 KOps/s 2.4880 KOps/s $\textbf{\color{#d91a1a}-19.19\%}$
test_unlock_stack_nested 0.4663ms 0.3627ms 2.7568 KOps/s 2.7346 KOps/s $\color{#35bf28}+0.81\%$
test_flatten_speed 0.2121ms 0.1042ms 9.5994 KOps/s 9.4464 KOps/s $\color{#35bf28}+1.62\%$
test_unflatten_speed 0.5479ms 0.4585ms 2.1809 KOps/s 2.2039 KOps/s $\color{#d91a1a}-1.05\%$
test_common_ops 4.7204ms 1.0877ms 919.3928 Ops/s 893.1228 Ops/s $\color{#35bf28}+2.94\%$
test_creation 84.0770μs 2.1312μs 469.2158 KOps/s 470.4667 KOps/s $\color{#d91a1a}-0.27\%$
test_creation_empty 45.4340μs 16.4765μs 60.6925 KOps/s 56.8370 KOps/s $\textbf{\color{#35bf28}+6.78\%}$
test_creation_nested_1 49.2220μs 19.4350μs 51.4534 KOps/s 48.5301 KOps/s $\textbf{\color{#35bf28}+6.02\%}$
test_creation_nested_2 51.9470μs 23.9940μs 41.6771 KOps/s 40.2020 KOps/s $\color{#35bf28}+3.67\%$
test_clone 0.1829ms 17.0409μs 58.6825 KOps/s 56.4214 KOps/s $\color{#35bf28}+4.01\%$
test_getitem[int] 0.8981ms 16.9210μs 59.0981 KOps/s 60.3314 KOps/s $\color{#d91a1a}-2.04\%$
test_getitem[slice_int] 0.1478ms 31.0716μs 32.1838 KOps/s 31.3587 KOps/s $\color{#35bf28}+2.63\%$
test_getitem[range] 0.2848ms 58.6529μs 17.0494 KOps/s 16.9522 KOps/s $\color{#35bf28}+0.57\%$
test_getitem[tuple] 0.1627ms 25.7219μs 38.8773 KOps/s 39.7771 KOps/s $\color{#d91a1a}-2.26\%$
test_getitem[list] 0.2508ms 52.7425μs 18.9600 KOps/s 18.2714 KOps/s $\color{#35bf28}+3.77\%$
test_setitem_dim[int] 55.4040μs 37.7643μs 26.4800 KOps/s 25.1791 KOps/s $\textbf{\color{#35bf28}+5.17\%}$
test_setitem_dim[slice_int] 0.1033ms 66.9104μs 14.9454 KOps/s 14.3978 KOps/s $\color{#35bf28}+3.80\%$
test_setitem_dim[range] 0.1812ms 92.2885μs 10.8356 KOps/s 10.7824 KOps/s $\color{#35bf28}+0.49\%$
test_setitem_dim[tuple] 0.1015ms 55.5250μs 18.0099 KOps/s 17.8822 KOps/s $\color{#35bf28}+0.71\%$
test_setitem 0.1132ms 29.4427μs 33.9642 KOps/s 33.1172 KOps/s $\color{#35bf28}+2.56\%$
test_set 94.9670μs 29.1112μs 34.3510 KOps/s 33.7323 KOps/s $\color{#35bf28}+1.83\%$
test_set_shared 1.1779ms 0.2114ms 4.7304 KOps/s 4.6508 KOps/s $\color{#35bf28}+1.71\%$
test_update 0.1515ms 35.1994μs 28.4095 KOps/s 27.8572 KOps/s $\color{#35bf28}+1.98\%$
test_update_nested 0.2050ms 44.8585μs 22.2923 KOps/s 21.5270 KOps/s $\color{#35bf28}+3.56\%$
test_update__nested 0.1139ms 34.9556μs 28.6077 KOps/s 28.0498 KOps/s $\color{#35bf28}+1.99\%$
test_set_nested 90.7090μs 30.9450μs 32.3154 KOps/s 31.2196 KOps/s $\color{#35bf28}+3.51\%$
test_set_nested_new 0.1244ms 36.1416μs 27.6689 KOps/s 26.7400 KOps/s $\color{#35bf28}+3.47\%$
test_select 0.1859ms 55.1884μs 18.1198 KOps/s 18.3203 KOps/s $\color{#d91a1a}-1.09\%$
test_select_nested 0.1513ms 59.6532μs 16.7636 KOps/s 16.7950 KOps/s $\color{#d91a1a}-0.19\%$
test_exclude_nested 0.1416ms 75.1848μs 13.3006 KOps/s 13.3519 KOps/s $\color{#d91a1a}-0.38\%$
test_empty[True] 0.5184ms 0.3147ms 3.1774 KOps/s 3.1661 KOps/s $\color{#35bf28}+0.36\%$
test_empty[False] 7.3076μs 1.1684μs 855.8460 KOps/s 806.6329 KOps/s $\textbf{\color{#35bf28}+6.10\%}$
test_unbind_speed 0.4799ms 0.3076ms 3.2505 KOps/s 3.2537 KOps/s $\color{#d91a1a}-0.10\%$
test_unbind_speed_stack0 0.5134ms 0.2937ms 3.4043 KOps/s 3.4512 KOps/s $\color{#d91a1a}-1.36\%$
test_unbind_speed_stack1 99.6902ms 0.8107ms 1.2335 KOps/s 1.3829 KOps/s $\textbf{\color{#d91a1a}-10.80\%}$
test_split 93.9126ms 2.2113ms 452.2271 Ops/s 459.8287 Ops/s $\color{#d91a1a}-1.65\%$
test_chunk 3.2903ms 2.0247ms 493.9019 Ops/s 463.0873 Ops/s $\textbf{\color{#35bf28}+6.65\%}$
test_creation[device0] 4.2086ms 0.1202ms 8.3224 KOps/s 8.5212 KOps/s $\color{#d91a1a}-2.33\%$
test_creation_from_tensor 0.2442ms 0.1185ms 8.4412 KOps/s 8.4520 KOps/s $\color{#d91a1a}-0.13\%$
test_add_one[memmap_tensor0] 0.1649ms 7.7185μs 129.5586 KOps/s 131.6916 KOps/s $\color{#d91a1a}-1.62\%$
test_contiguous[memmap_tensor0] 18.7850μs 1.9813μs 504.7129 KOps/s 523.2414 KOps/s $\color{#d91a1a}-3.54\%$
test_stack[memmap_tensor0] 49.2720μs 5.9927μs 166.8694 KOps/s 170.2663 KOps/s $\color{#d91a1a}-2.00\%$
test_memmaptd_index 1.1580ms 0.4011ms 2.4929 KOps/s 2.4904 KOps/s $\color{#35bf28}+0.10\%$
test_memmaptd_index_astensor 0.7868ms 0.4765ms 2.0985 KOps/s 2.0610 KOps/s $\color{#35bf28}+1.82\%$
test_memmaptd_index_op 2.4675ms 0.9897ms 1.0104 KOps/s 962.5372 Ops/s $\color{#35bf28}+4.97\%$
test_serialize_model 0.2224s 0.1309s 7.6393 Ops/s 8.4288 Ops/s $\textbf{\color{#d91a1a}-9.37\%}$
test_serialize_model_pickle 0.4743s 0.3976s 2.5151 Ops/s 2.5174 Ops/s $\color{#d91a1a}-0.09\%$
test_serialize_weights 0.1216s 0.1172s 8.5304 Ops/s 7.7280 Ops/s $\textbf{\color{#35bf28}+10.38\%}$
test_serialize_weights_returnearly 0.2440s 0.1709s 5.8499 Ops/s 6.2873 Ops/s $\textbf{\color{#d91a1a}-6.96\%}$
test_serialize_weights_pickle 0.6552s 0.4519s 2.2129 Ops/s 1.1831 Ops/s $\textbf{\color{#35bf28}+87.04\%}$
test_serialize_weights_filesystem 0.1507s 0.1411s 7.0850 Ops/s 7.0371 Ops/s $\color{#35bf28}+0.68\%$
test_serialize_model_filesystem 0.1497s 0.1436s 6.9623 Ops/s 6.4835 Ops/s $\textbf{\color{#35bf28}+7.39\%}$
test_reshape_pytree 86.3310μs 38.6706μs 25.8594 KOps/s 25.7619 KOps/s $\color{#35bf28}+0.38\%$
test_reshape_td 0.1037ms 45.8130μs 21.8279 KOps/s 21.0762 KOps/s $\color{#35bf28}+3.57\%$
test_view_pytree 0.1040ms 39.4188μs 25.3686 KOps/s 25.7704 KOps/s $\color{#d91a1a}-1.56\%$
test_view_td 0.1334ms 52.0600μs 19.2086 KOps/s 18.8265 KOps/s $\color{#35bf28}+2.03\%$
test_unbind_pytree 0.1076ms 36.8332μs 27.1495 KOps/s 27.4717 KOps/s $\color{#d91a1a}-1.17\%$
test_unbind_td 0.3416ms 44.9734μs 22.2354 KOps/s 21.6201 KOps/s $\color{#35bf28}+2.85\%$
test_split_pytree 96.1800μs 38.5283μs 25.9549 KOps/s 25.8173 KOps/s $\color{#35bf28}+0.53\%$
test_split_td 0.5026ms 58.5245μs 17.0869 KOps/s 17.2799 KOps/s $\color{#d91a1a}-1.12\%$
test_add_pytree 0.1128ms 45.1001μs 22.1729 KOps/s 21.6743 KOps/s $\color{#35bf28}+2.30\%$
test_add_td 0.1517ms 78.1791μs 12.7911 KOps/s 12.2287 KOps/s $\color{#35bf28}+4.60\%$
test_compile_add_one_nested[tensordict-compile] 0.1298ms 58.6194μs 17.0592 KOps/s 17.3803 KOps/s $\color{#d91a1a}-1.85\%$
test_compile_add_one_nested[tensordict-eager] 0.3372ms 0.1870ms 5.3473 KOps/s 5.3208 KOps/s $\color{#35bf28}+0.50\%$
test_compile_add_one_nested[pytree-compile] 0.1355ms 57.3402μs 17.4398 KOps/s 17.8565 KOps/s $\color{#d91a1a}-2.33\%$
test_compile_add_one_nested[pytree-eager] 0.2772ms 0.1434ms 6.9730 KOps/s 6.9255 KOps/s $\color{#35bf28}+0.69\%$
test_compile_copy_nested[tensordict-compile] 73.9380μs 20.9325μs 47.7725 KOps/s 48.2755 KOps/s $\color{#d91a1a}-1.04\%$
test_compile_copy_nested[tensordict-eager] 0.1379ms 66.7807μs 14.9744 KOps/s 14.9747 KOps/s $-0.00\%$
test_compile_copy_nested[pytree-compile] 0.1524ms 75.4920μs 13.2464 KOps/s 13.2888 KOps/s $\color{#d91a1a}-0.32\%$
test_compile_copy_nested[pytree-eager] 0.1464ms 68.0274μs 14.6999 KOps/s 14.8574 KOps/s $\color{#d91a1a}-1.06\%$
test_compile_add_one_flat[tensordict-compile] 0.3631ms 0.1768ms 5.6576 KOps/s 5.7662 KOps/s $\color{#d91a1a}-1.88\%$
test_compile_add_one_flat[tensordict-eager] 0.3686ms 0.1874ms 5.3370 KOps/s 5.2173 KOps/s $\color{#35bf28}+2.30\%$
test_compile_add_one_flat[tensorclass-compile] 0.1127ms 48.0682μs 20.8038 KOps/s 21.3354 KOps/s $\color{#d91a1a}-2.49\%$
test_compile_add_one_flat[tensorclass-eager] 0.5302ms 68.1759μs 14.6679 KOps/s 14.3295 KOps/s $\color{#35bf28}+2.36\%$
test_compile_add_one_flat[pytree-compile] 0.3807ms 0.1785ms 5.6035 KOps/s 5.7385 KOps/s $\color{#d91a1a}-2.35\%$
test_compile_add_one_flat[pytree-eager] 0.4838ms 0.2957ms 3.3820 KOps/s 3.3258 KOps/s $\color{#35bf28}+1.69\%$
test_compile_add_self_flat[tensordict-eager] 0.4679ms 0.2077ms 4.8153 KOps/s 4.8936 KOps/s $\color{#d91a1a}-1.60\%$
test_compile_add_self_flat[tensordict-compile] 0.3899ms 0.1783ms 5.6070 KOps/s 5.7896 KOps/s $\color{#d91a1a}-3.15\%$
test_compile_add_self_flat[tensorclass-eager] 0.1235ms 62.7999μs 15.9236 KOps/s 15.9881 KOps/s $\color{#d91a1a}-0.40\%$
test_compile_add_self_flat[tensorclass-compile] 0.1136ms 48.5809μs 20.5842 KOps/s 20.4383 KOps/s $\color{#35bf28}+0.71\%$
test_compile_add_self_flat[pytree-eager] 0.3468ms 0.2396ms 4.1733 KOps/s 4.0896 KOps/s $\color{#35bf28}+2.04\%$
test_compile_add_self_flat[pytree-compile] 0.3403ms 0.1811ms 5.5226 KOps/s 5.6928 KOps/s $\color{#d91a1a}-2.99\%$
test_compile_copy_flat[tensordict-compile] 0.2402ms 0.1053ms 9.4973 KOps/s 9.7851 KOps/s $\color{#d91a1a}-2.94\%$
test_compile_copy_flat[tensordict-eager] 0.1335ms 57.7456μs 17.3173 KOps/s 16.8463 KOps/s $\color{#35bf28}+2.80\%$
test_compile_copy_flat[pytree-compile] 0.1632ms 78.5821μs 12.7255 KOps/s 12.8730 KOps/s $\color{#d91a1a}-1.15\%$
test_compile_copy_flat[pytree-eager] 0.1206ms 69.2044μs 14.4499 KOps/s 14.6087 KOps/s $\color{#d91a1a}-1.09\%$
test_compile_assign_and_add[tensordict-compile] 0.4085ms 0.1970ms 5.0772 KOps/s 5.1383 KOps/s $\color{#d91a1a}-1.19\%$
test_compile_assign_and_add[tensordict-eager] 2.2179ms 1.6323ms 612.6331 Ops/s 596.4690 Ops/s $\color{#35bf28}+2.71\%$
test_compile_assign_and_add[pytree-compile] 0.2738ms 0.1974ms 5.0655 KOps/s 5.1330 KOps/s $\color{#d91a1a}-1.32\%$
test_compile_assign_and_add[pytree-eager] 1.2610ms 1.1166ms 895.5898 Ops/s 878.2217 Ops/s $\color{#35bf28}+1.98\%$
test_compile_assign_and_add_stack[compile] 0.7638ms 0.4154ms 2.4072 KOps/s 2.3449 KOps/s $\color{#35bf28}+2.65\%$
test_compile_assign_and_add_stack[eager] 3.9383ms 3.6216ms 276.1188 Ops/s 258.0824 Ops/s $\textbf{\color{#35bf28}+6.99\%}$
test_compile_indexing[tensor-tensordict-compile] 77.1450μs 34.9693μs 28.5965 KOps/s 28.4616 KOps/s $\color{#35bf28}+0.47\%$
test_compile_indexing[tensor-tensordict-eager] 0.4004s 75.1795μs 13.3015 KOps/s 20.2657 KOps/s $\textbf{\color{#d91a1a}-34.36\%}$
test_compile_indexing[tensor-tensorclass-compile] 0.1026ms 30.7955μs 32.4723 KOps/s 32.9977 KOps/s $\color{#d91a1a}-1.59\%$
test_compile_indexing[tensor-tensorclass-eager] 83.2160μs 28.3454μs 35.2791 KOps/s 34.3835 KOps/s $\color{#35bf28}+2.60\%$
test_compile_indexing[tensor-pytree-compile] 88.5760μs 31.0732μs 32.1821 KOps/s 32.9461 KOps/s $\color{#d91a1a}-2.32\%$
test_compile_indexing[tensor-pytree-eager] 79.8290μs 28.1752μs 35.4922 KOps/s 34.0101 KOps/s $\color{#35bf28}+4.36\%$
test_compile_indexing[slice-tensordict-compile] 0.1636ms 76.3105μs 13.1044 KOps/s 13.5401 KOps/s $\color{#d91a1a}-3.22\%$
test_compile_indexing[slice-tensordict-eager] 0.5738ms 27.8241μs 35.9400 KOps/s 35.7627 KOps/s $\color{#35bf28}+0.50\%$
test_compile_indexing[slice-tensorclass-compile] 0.1685ms 71.2371μs 14.0376 KOps/s 14.2861 KOps/s $\color{#d91a1a}-1.74\%$
test_compile_indexing[slice-tensorclass-eager] 70.6620μs 23.2935μs 42.9304 KOps/s 43.2406 KOps/s $\color{#d91a1a}-0.72\%$
test_compile_indexing[slice-pytree-compile] 0.1286ms 69.9958μs 14.2866 KOps/s 14.8800 KOps/s $\color{#d91a1a}-3.99\%$
test_compile_indexing[slice-pytree-eager] 74.4690μs 23.2109μs 43.0832 KOps/s 43.3988 KOps/s $\color{#d91a1a}-0.73\%$
test_compile_indexing[int-tensordict-compile] 0.1341ms 75.4622μs 13.2517 KOps/s 13.6054 KOps/s $\color{#d91a1a}-2.60\%$
test_compile_indexing[int-tensordict-eager] 1.0381ms 28.0051μs 35.7078 KOps/s 35.8726 KOps/s $\color{#d91a1a}-0.46\%$
test_compile_indexing[int-tensorclass-compile] 0.1724ms 70.8194μs 14.1204 KOps/s 14.5739 KOps/s $\color{#d91a1a}-3.11\%$
test_compile_indexing[int-tensorclass-eager] 70.1910μs 23.2447μs 43.0205 KOps/s 42.9175 KOps/s $\color{#35bf28}+0.24\%$
test_compile_indexing[int-pytree-compile] 0.1532ms 70.5461μs 14.1751 KOps/s 14.4403 KOps/s $\color{#d91a1a}-1.84\%$
test_compile_indexing[int-pytree-eager] 68.2370μs 22.9970μs 43.4839 KOps/s 43.7762 KOps/s $\color{#d91a1a}-0.67\%$
test_mod_add[eager] 73.1570μs 22.9583μs 43.5572 KOps/s 39.4356 KOps/s $\textbf{\color{#35bf28}+10.45\%}$
test_mod_add[compile] 0.1070ms 40.1728μs 24.8924 KOps/s 23.8041 KOps/s $\color{#35bf28}+4.57\%$
test_mod_add[compile-overhead] 84.7780μs 40.2789μs 24.8269 KOps/s 24.2594 KOps/s $\color{#35bf28}+2.34\%$
test_mod_wrap[eager] 0.4121ms 0.2105ms 4.7514 KOps/s 4.6807 KOps/s $\color{#35bf28}+1.51\%$
test_mod_wrap[compile] 0.3666ms 0.2352ms 4.2515 KOps/s 4.3026 KOps/s $\color{#d91a1a}-1.19\%$
test_mod_wrap[compile-overhead] 0.3240ms 0.2299ms 4.3488 KOps/s 4.3408 KOps/s $\color{#35bf28}+0.19\%$
test_mod_wrap_and_backward[eager] 11.5221ms 10.5810ms 94.5086 Ops/s 93.4037 Ops/s $\color{#35bf28}+1.18\%$
test_mod_wrap_and_backward[compile] 12.1789ms 10.7582ms 92.9523 Ops/s 88.5089 Ops/s $\textbf{\color{#35bf28}+5.02\%}$
test_mod_wrap_and_backward[compile-overhead] 11.9226ms 10.8010ms 92.5838 Ops/s 83.4666 Ops/s $\textbf{\color{#35bf28}+10.92\%}$
test_seq_add[eager] 0.1787ms 84.7723μs 11.7963 KOps/s 11.1183 KOps/s $\textbf{\color{#35bf28}+6.10\%}$
test_seq_add[compile] 0.1528ms 66.2800μs 15.0875 KOps/s 15.4125 KOps/s $\color{#d91a1a}-2.11\%$
test_seq_add[compile-overhead] 0.1369ms 63.7724μs 15.6808 KOps/s 15.9715 KOps/s $\color{#d91a1a}-1.82\%$
test_seq_wrap[eager] 0.6012ms 0.3763ms 2.6573 KOps/s 2.5957 KOps/s $\color{#35bf28}+2.37\%$
test_seq_wrap[compile] 0.5359ms 0.2685ms 3.7244 KOps/s 3.6953 KOps/s $\color{#35bf28}+0.78\%$
test_seq_wrap[compile-overhead] 0.4851ms 0.2671ms 3.7434 KOps/s 3.6885 KOps/s $\color{#35bf28}+1.49\%$
test_func_call_runtime[False-eager] 0.9439ms 0.5310ms 1.8833 KOps/s 1.9074 KOps/s $\color{#d91a1a}-1.26\%$
test_func_call_runtime[False-compile] 0.8496ms 0.5055ms 1.9784 KOps/s 1.9594 KOps/s $\color{#35bf28}+0.97\%$
test_func_call_runtime[False-compile-overhead] 0.9395ms 0.5013ms 1.9946 KOps/s 1.9764 KOps/s $\color{#35bf28}+0.92\%$
test_func_call_runtime[True-eager] 1.2784ms 0.7459ms 1.3407 KOps/s 1.3207 KOps/s $\color{#35bf28}+1.51\%$
test_func_call_runtime[True-compile] 0.6399ms 0.5158ms 1.9386 KOps/s 1.9283 KOps/s $\color{#35bf28}+0.53\%$
test_func_call_runtime[True-compile-overhead] 0.9218ms 0.5154ms 1.9401 KOps/s 1.9262 KOps/s $\color{#35bf28}+0.72\%$
test_func_call_cm_runtime[False-eager] 0.7143ms 0.5237ms 1.9094 KOps/s 1.9381 KOps/s $\color{#d91a1a}-1.48\%$
test_func_call_cm_runtime[False-compile] 0.9198ms 0.5054ms 1.9787 KOps/s 1.9775 KOps/s $\color{#35bf28}+0.06\%$
test_func_call_cm_runtime[False-compile-overhead] 0.6074ms 0.5019ms 1.9923 KOps/s 1.9616 KOps/s $\color{#35bf28}+1.57\%$
test_func_call_cm_runtime[True-eager] 1.0511ms 0.8684ms 1.1516 KOps/s 1.1306 KOps/s $\color{#35bf28}+1.86\%$
test_func_call_cm_runtime[True-compile] 0.8945ms 0.7404ms 1.3506 KOps/s 1.3408 KOps/s $\color{#35bf28}+0.73\%$
test_func_call_cm_runtime[True-compile-overhead] 0.8974ms 0.7308ms 1.3683 KOps/s 1.3203 KOps/s $\color{#35bf28}+3.64\%$
test_vmap_func_call_cm_runtime[eager] 2.6149ms 1.8730ms 533.9011 Ops/s 525.6203 Ops/s $\color{#35bf28}+1.58\%$
test_vmap_func_call_cm_runtime[compile] 2.7310ms 1.9188ms 521.1678 Ops/s 510.9846 Ops/s $\color{#35bf28}+1.99\%$
test_vmap_func_call_cm_runtime[compile-overhead] 2.8024ms 1.9145ms 522.3354 Ops/s 510.1659 Ops/s $\color{#35bf28}+2.39\%$
test_distributed 0.2255ms 0.1230ms 8.1291 KOps/s 7.8411 KOps/s $\color{#35bf28}+3.67\%$
test_tdmodule 29.3040μs 16.0796μs 62.1906 KOps/s 55.7239 KOps/s $\textbf{\color{#35bf28}+11.60\%}$
test_tdmodule_dispatch 84.9190μs 34.2183μs 29.2242 KOps/s 27.7822 KOps/s $\textbf{\color{#35bf28}+5.19\%}$
test_tdseq 54.3920μs 19.4346μs 51.4546 KOps/s 48.7185 KOps/s $\textbf{\color{#35bf28}+5.62\%}$
test_tdseq_dispatch 71.5730μs 40.3985μs 24.7534 KOps/s 24.4642 KOps/s $\color{#35bf28}+1.18\%$
test_instantiation_functorch 2.5748ms 1.6051ms 623.0157 Ops/s 621.0659 Ops/s $\color{#35bf28}+0.31\%$
test_instantiation_td 1.8812ms 1.1767ms 849.8182 Ops/s 841.6440 Ops/s $\color{#35bf28}+0.97\%$
test_exec_functorch 0.3400ms 0.1825ms 5.4787 KOps/s 5.3926 KOps/s $\color{#35bf28}+1.60\%$
test_exec_functional_call 0.3223ms 0.1759ms 5.6841 KOps/s 5.6208 KOps/s $\color{#35bf28}+1.12\%$
test_exec_td 0.3081ms 0.1676ms 5.9652 KOps/s 5.7977 KOps/s $\color{#35bf28}+2.89\%$
test_exec_td_decorator 0.8653ms 0.2239ms 4.4670 KOps/s 4.4263 KOps/s $\color{#35bf28}+0.92\%$
test_vmap_mlp_speed[True-True] 0.9647ms 0.6409ms 1.5602 KOps/s 1.5229 KOps/s $\color{#35bf28}+2.45\%$
test_vmap_mlp_speed[True-False] 0.9870ms 0.6367ms 1.5705 KOps/s 1.5283 KOps/s $\color{#35bf28}+2.76\%$
test_vmap_mlp_speed[False-True] 0.6179ms 0.4944ms 2.0228 KOps/s 1.9564 KOps/s $\color{#35bf28}+3.40\%$
test_vmap_mlp_speed[False-False] 0.9615ms 0.5092ms 1.9640 KOps/s 1.9550 KOps/s $\color{#35bf28}+0.46\%$
test_vmap_mlp_speed_decorator[True-True] 1.3066ms 0.6155ms 1.6247 KOps/s 1.5661 KOps/s $\color{#35bf28}+3.74\%$
test_vmap_mlp_speed_decorator[True-False] 0.8801ms 0.6178ms 1.6186 KOps/s 1.5573 KOps/s $\color{#35bf28}+3.94\%$
test_vmap_mlp_speed_decorator[False-True] 0.9245ms 0.5132ms 1.9486 KOps/s 1.9031 KOps/s $\color{#35bf28}+2.39\%$
test_vmap_mlp_speed_decorator[False-False] 0.6929ms 0.5093ms 1.9634 KOps/s 1.9056 KOps/s $\color{#35bf28}+3.03\%$
test_to_module_speed[True] 1.7065ms 1.2786ms 782.0872 Ops/s 771.8578 Ops/s $\color{#35bf28}+1.33\%$
test_to_module_speed[False] 1.7648ms 1.2479ms 801.3408 Ops/s 791.1296 Ops/s $\color{#35bf28}+1.29\%$
test_tc_init 87.1920μs 43.0597μs 23.2236 KOps/s 23.1329 KOps/s $\color{#35bf28}+0.39\%$
test_tc_init_nested 0.1538ms 85.9261μs 11.6379 KOps/s 11.4905 KOps/s $\color{#35bf28}+1.28\%$
test_tc_first_layer_tensor 18.8750μs 1.5948μs 627.0506 KOps/s 675.6225 KOps/s $\textbf{\color{#d91a1a}-7.19\%}$
test_tc_first_layer_nontensor 39.7840μs 4.7654μs 209.8446 KOps/s 217.0363 KOps/s $\color{#d91a1a}-3.31\%$
test_tc_second_layer_tensor 27.7620μs 2.8945μs 345.4868 KOps/s 360.2096 KOps/s $\color{#d91a1a}-4.09\%$
test_tc_second_layer_nontensor 32.1200μs 6.0721μs 164.6865 KOps/s 163.6429 KOps/s $\color{#35bf28}+0.64\%$
test_unbind 0.4572s 12.7866ms 78.2067 Ops/s 64.4978 Ops/s $\textbf{\color{#35bf28}+21.25\%}$
test_full_like 8.1585ms 6.9807ms 143.2522 Ops/s 139.5363 Ops/s $\color{#35bf28}+2.66\%$
test_zeros_like 3.2738ms 2.6615ms 375.7263 Ops/s 154.1934 Ops/s $\textbf{\color{#35bf28}+143.67\%}$
test_ones_like 14.3302ms 6.6020ms 151.4681 Ops/s 123.4016 Ops/s $\textbf{\color{#35bf28}+22.74\%}$
test_clone 13.0379ms 8.2263ms 121.5615 Ops/s 104.5588 Ops/s $\textbf{\color{#35bf28}+16.26\%}$
test_squeeze 71.7140μs 12.3976μs 80.6606 KOps/s 81.0435 KOps/s $\color{#d91a1a}-0.47\%$
test_unsqueeze 0.1672ms 90.3444μs 11.0687 KOps/s 10.8349 KOps/s $\color{#35bf28}+2.16\%$
test_split 0.4739ms 0.1953ms 5.1211 KOps/s 5.1059 KOps/s $\color{#35bf28}+0.30\%$
test_permute 0.3594ms 0.2189ms 4.5693 KOps/s 4.5661 KOps/s $\color{#35bf28}+0.07\%$
test_stack 33.2928ms 25.5657ms 39.1149 Ops/s 41.5363 Ops/s $\textbf{\color{#d91a1a}-5.83\%}$
test_cat 33.4592ms 25.3897ms 39.3861 Ops/s 41.9868 Ops/s $\textbf{\color{#d91a1a}-6.19\%}$

Copy link

github-actions bot commented Aug 9, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 228. Improved: $\large\color{#35bf28}34$. Worsened: $\large\color{#d91a1a}6$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 0.4743ms 13.7088μs 72.9459 KOps/s 68.0308 KOps/s $\textbf{\color{#35bf28}+7.22\%}$
test_plain_set_stack_nested 39.6910μs 14.0089μs 71.3831 KOps/s 67.3871 KOps/s $\textbf{\color{#35bf28}+5.93\%}$
test_plain_set_nested_inplace 45.1610μs 14.7500μs 67.7967 KOps/s 62.9640 KOps/s $\textbf{\color{#35bf28}+7.68\%}$
test_plain_set_stack_nested_inplace 47.2200μs 14.9624μs 66.8344 KOps/s 63.7270 KOps/s $\color{#35bf28}+4.88\%$
test_items 33.8010μs 2.8452μs 351.4683 KOps/s 349.6420 KOps/s $\color{#35bf28}+0.52\%$
test_items_nested 0.3637ms 0.3127ms 3.1976 KOps/s 3.2116 KOps/s $\color{#d91a1a}-0.44\%$
test_items_nested_locked 0.3737ms 0.3127ms 3.1981 KOps/s 3.1570 KOps/s $\color{#35bf28}+1.30\%$
test_items_nested_leaf 97.3210μs 62.9724μs 15.8800 KOps/s 15.9115 KOps/s $\color{#d91a1a}-0.20\%$
test_items_stack_nested 0.3733ms 0.3145ms 3.1792 KOps/s 3.2054 KOps/s $\color{#d91a1a}-0.82\%$
test_items_stack_nested_leaf 96.2210μs 64.5668μs 15.4878 KOps/s 15.9401 KOps/s $\color{#d91a1a}-2.84\%$
test_items_stack_nested_locked 0.3701ms 0.3139ms 3.1853 KOps/s 3.1956 KOps/s $\color{#d91a1a}-0.32\%$
test_keys 32.0210μs 3.3990μs 294.2083 KOps/s 294.0127 KOps/s $\color{#35bf28}+0.07\%$
test_keys_nested 83.8910μs 55.5977μs 17.9864 KOps/s 18.1646 KOps/s $\color{#d91a1a}-0.98\%$
test_keys_nested_locked 0.8248ms 60.9095μs 16.4178 KOps/s 16.7641 KOps/s $\color{#d91a1a}-2.07\%$
test_keys_nested_leaf 74.2710μs 47.2139μs 21.1802 KOps/s 21.4172 KOps/s $\color{#d91a1a}-1.11\%$
test_keys_stack_nested 86.8610μs 55.7566μs 17.9351 KOps/s 18.0189 KOps/s $\color{#d91a1a}-0.47\%$
test_keys_stack_nested_leaf 77.1710μs 47.4506μs 21.0746 KOps/s 21.4875 KOps/s $\color{#d91a1a}-1.92\%$
test_keys_stack_nested_locked 90.1110μs 60.2971μs 16.5845 KOps/s 16.6465 KOps/s $\color{#d91a1a}-0.37\%$
test_values 9.4452μs 0.8164μs 1.2249 MOps/s 1.2435 MOps/s $\color{#d91a1a}-1.50\%$
test_values_nested 52.2910μs 27.3312μs 36.5882 KOps/s 36.1003 KOps/s $\color{#35bf28}+1.35\%$
test_values_nested_locked 56.6910μs 29.5147μs 33.8814 KOps/s 34.1316 KOps/s $\color{#d91a1a}-0.73\%$
test_values_nested_leaf 51.0810μs 24.0352μs 41.6056 KOps/s 41.0027 KOps/s $\color{#35bf28}+1.47\%$
test_values_stack_nested 52.8710μs 27.9831μs 35.7359 KOps/s 36.0172 KOps/s $\color{#d91a1a}-0.78\%$
test_values_stack_nested_leaf 48.9800μs 24.5665μs 40.7058 KOps/s 41.1467 KOps/s $\color{#d91a1a}-1.07\%$
test_values_stack_nested_locked 62.8510μs 30.1648μs 33.1513 KOps/s 34.0484 KOps/s $\color{#d91a1a}-2.63\%$
test_membership 2.1315μs 0.4700μs 2.1275 MOps/s 2.1137 MOps/s $\color{#35bf28}+0.65\%$
test_membership_nested 12.1350μs 1.7935μs 557.5776 KOps/s 563.9411 KOps/s $\color{#d91a1a}-1.13\%$
test_membership_nested_leaf 11.7367μs 1.7417μs 574.1385 KOps/s 564.6197 KOps/s $\color{#35bf28}+1.69\%$
test_membership_stacked_nested 24.4900μs 1.8093μs 552.6853 KOps/s 548.6632 KOps/s $\color{#35bf28}+0.73\%$
test_membership_stacked_nested_leaf 29.8000μs 1.7990μs 555.8709 KOps/s 538.8158 KOps/s $\color{#35bf28}+3.17\%$
test_membership_nested_last 71.4110μs 2.6451μs 378.0530 KOps/s 372.1097 KOps/s $\color{#35bf28}+1.60\%$
test_membership_nested_leaf_last 26.1700μs 2.6359μs 379.3795 KOps/s 373.6340 KOps/s $\color{#35bf28}+1.54\%$
test_membership_stacked_nested_last 25.4400μs 3.0219μs 330.9174 KOps/s 380.0421 KOps/s $\textbf{\color{#d91a1a}-12.93\%}$
test_membership_stacked_nested_leaf_last 36.5800μs 3.0099μs 332.2404 KOps/s 375.3886 KOps/s $\textbf{\color{#d91a1a}-11.49\%}$
test_nested_getleaf 25.3300μs 6.1552μs 162.4633 KOps/s 164.6943 KOps/s $\color{#d91a1a}-1.35\%$
test_nested_get 34.3410μs 5.8109μs 172.0891 KOps/s 173.9414 KOps/s $\color{#d91a1a}-1.06\%$
test_stacked_getleaf 35.7400μs 6.1585μs 162.3775 KOps/s 166.6915 KOps/s $\color{#d91a1a}-2.59\%$
test_stacked_get 29.5300μs 5.7120μs 175.0697 KOps/s 174.5312 KOps/s $\color{#35bf28}+0.31\%$
test_nested_getitemleaf 26.7600μs 6.2050μs 161.1614 KOps/s 163.5467 KOps/s $\color{#d91a1a}-1.46\%$
test_nested_getitem 30.1100μs 5.8161μs 171.9362 KOps/s 173.8755 KOps/s $\color{#d91a1a}-1.12\%$
test_stacked_getitemleaf 41.3100μs 6.1506μs 162.5859 KOps/s 164.5978 KOps/s $\color{#d91a1a}-1.22\%$
test_stacked_getitem 32.0510μs 5.7191μs 174.8526 KOps/s 175.5656 KOps/s $\color{#d91a1a}-0.41\%$
test_lock_nested 1.3286ms 0.4154ms 2.4070 KOps/s 2.3809 KOps/s $\color{#35bf28}+1.10\%$
test_lock_stack_nested 0.4387ms 0.3816ms 2.6207 KOps/s 2.6232 KOps/s $\color{#d91a1a}-0.10\%$
test_unlock_nested 0.7472ms 0.3562ms 2.8078 KOps/s 2.7960 KOps/s $\color{#35bf28}+0.42\%$
test_unlock_stack_nested 0.3659ms 0.3208ms 3.1174 KOps/s 3.1205 KOps/s $\color{#d91a1a}-0.10\%$
test_flatten_speed 0.1663ms 81.8554μs 12.2167 KOps/s 12.6031 KOps/s $\color{#d91a1a}-3.07\%$
test_unflatten_speed 0.3326ms 0.2865ms 3.4904 KOps/s 3.5064 KOps/s $\color{#d91a1a}-0.46\%$
test_common_ops 92.0901ms 1.3693ms 730.3248 Ops/s 761.0714 Ops/s $\color{#d91a1a}-4.04\%$
test_creation 30.4200μs 1.5137μs 660.6469 KOps/s 659.5173 KOps/s $\color{#35bf28}+0.17\%$
test_creation_empty 50.7500μs 14.9576μs 66.8556 KOps/s 58.2012 KOps/s $\textbf{\color{#35bf28}+14.87\%}$
test_creation_nested_1 46.5800μs 16.6530μs 60.0492 KOps/s 53.0669 KOps/s $\textbf{\color{#35bf28}+13.16\%}$
test_creation_nested_2 48.8210μs 19.3630μs 51.6448 KOps/s 46.3838 KOps/s $\textbf{\color{#35bf28}+11.34\%}$
test_clone 65.5210μs 28.8232μs 34.6943 KOps/s 34.8493 KOps/s $\color{#d91a1a}-0.44\%$
test_getitem[int] 1.2619ms 16.0810μs 62.1853 KOps/s 61.0078 KOps/s $\color{#35bf28}+1.93\%$
test_getitem[slice_int] 0.1200ms 27.7694μs 36.0109 KOps/s 35.9655 KOps/s $\color{#35bf28}+0.13\%$
test_getitem[range] 0.2180ms 0.1092ms 9.1600 KOps/s 9.1329 KOps/s $\color{#35bf28}+0.30\%$
test_getitem[tuple] 0.1203ms 24.3246μs 41.1106 KOps/s 41.6137 KOps/s $\color{#d91a1a}-1.21\%$
test_getitem[list] 0.1960ms 99.0681μs 10.0941 KOps/s 10.1807 KOps/s $\color{#d91a1a}-0.85\%$
test_setitem_dim[int] 75.0410μs 50.8740μs 19.6564 KOps/s 19.0281 KOps/s $\color{#35bf28}+3.30\%$
test_setitem_dim[slice_int] 0.1084ms 74.9354μs 13.3448 KOps/s 13.1548 KOps/s $\color{#35bf28}+1.44\%$
test_setitem_dim[range] 0.1799ms 0.1344ms 7.4428 KOps/s 7.3053 KOps/s $\color{#35bf28}+1.88\%$
test_setitem_dim[tuple] 93.3210μs 67.7284μs 14.7648 KOps/s 14.6068 KOps/s $\color{#35bf28}+1.08\%$
test_setitem 81.5210μs 41.7133μs 23.9732 KOps/s 23.8213 KOps/s $\color{#35bf28}+0.64\%$
test_set 99.9520μs 41.1619μs 24.2943 KOps/s 24.3181 KOps/s $\color{#d91a1a}-0.10\%$
test_set_shared 0.3570ms 50.7299μs 19.7122 KOps/s 19.7654 KOps/s $\color{#d91a1a}-0.27\%$
test_update 84.9810μs 49.2449μs 20.3067 KOps/s 18.7318 KOps/s $\textbf{\color{#35bf28}+8.41\%}$
test_update_nested 0.1144ms 55.5403μs 18.0050 KOps/s 16.3352 KOps/s $\textbf{\color{#35bf28}+10.22\%}$
test_update__nested 0.3857ms 60.3270μs 16.5763 KOps/s 15.1637 KOps/s $\textbf{\color{#35bf28}+9.32\%}$
test_set_nested 79.7710μs 43.4074μs 23.0375 KOps/s 23.0470 KOps/s $\color{#d91a1a}-0.04\%$
test_set_nested_new 0.1266ms 47.1341μs 21.2161 KOps/s 20.9971 KOps/s $\color{#35bf28}+1.04\%$
test_select 97.9610μs 60.5858μs 16.5055 KOps/s 15.7541 KOps/s $\color{#35bf28}+4.77\%$
test_select_nested 0.2244ms 43.3117μs 23.0885 KOps/s 23.7251 KOps/s $\color{#d91a1a}-2.68\%$
test_exclude_nested 95.9410μs 60.1923μs 16.6134 KOps/s 16.7417 KOps/s $\color{#d91a1a}-0.77\%$
test_empty[True] 0.2926ms 0.2418ms 4.1354 KOps/s 4.1288 KOps/s $\color{#35bf28}+0.16\%$
test_empty[False] 2.9880μs 0.7573μs 1.3204 MOps/s 1.3510 MOps/s $\color{#d91a1a}-2.27\%$
test_to 59.6110μs 25.0426μs 39.9319 KOps/s 34.9360 KOps/s $\textbf{\color{#35bf28}+14.30\%}$
test_to_nonblocking 64.9310μs 24.3882μs 41.0035 KOps/s 41.4692 KOps/s $\color{#d91a1a}-1.12\%$
test_unbind_speed 1.5204ms 0.2770ms 3.6101 KOps/s 3.3132 KOps/s $\textbf{\color{#35bf28}+8.96\%}$
test_unbind_speed_stack0 0.3213ms 0.2751ms 3.6355 KOps/s 3.3822 KOps/s $\textbf{\color{#35bf28}+7.49\%}$
test_unbind_speed_stack1 91.5579ms 0.7161ms 1.3965 KOps/s 1.3795 KOps/s $\color{#35bf28}+1.23\%$
test_split 92.7777ms 2.1880ms 457.0444 Ops/s 459.7868 Ops/s $\color{#d91a1a}-0.60\%$
test_chunk 95.0311ms 2.2020ms 454.1397 Ops/s 458.9797 Ops/s $\color{#d91a1a}-1.05\%$
test_creation[device0] 0.3530ms 0.1272ms 7.8604 KOps/s 7.8479 KOps/s $\color{#35bf28}+0.16\%$
test_creation_from_tensor 0.3473ms 0.1325ms 7.5498 KOps/s 7.4258 KOps/s $\color{#35bf28}+1.67\%$
test_add_one[memmap_tensor0] 0.1283ms 8.9551μs 111.6688 KOps/s 115.6200 KOps/s $\color{#d91a1a}-3.42\%$
test_contiguous[memmap_tensor0] 24.9510μs 2.2105μs 452.3773 KOps/s 460.4108 KOps/s $\color{#d91a1a}-1.74\%$
test_stack[memmap_tensor0] 40.7300μs 6.6893μs 149.4923 KOps/s 145.8618 KOps/s $\color{#35bf28}+2.49\%$
test_memmaptd_index 1.0460ms 0.4226ms 2.3661 KOps/s 2.2632 KOps/s $\color{#35bf28}+4.55\%$
test_memmaptd_index_astensor 0.7376ms 0.4784ms 2.0905 KOps/s 2.0114 KOps/s $\color{#35bf28}+3.93\%$
test_memmaptd_index_op 1.4075ms 1.0093ms 990.7431 Ops/s 951.6112 Ops/s $\color{#35bf28}+4.11\%$
test_serialize_model 0.1303s 0.1295s 7.7231 Ops/s 7.7068 Ops/s $\color{#35bf28}+0.21\%$
test_serialize_model_pickle 1.3490s 1.2118s 0.8252 Ops/s 0.8249 Ops/s $\color{#35bf28}+0.04\%$
test_serialize_weights 0.1291s 0.1280s 7.8121 Ops/s 7.7651 Ops/s $\color{#35bf28}+0.60\%$
test_serialize_weights_returnearly 0.2323s 61.3899ms 16.2893 Ops/s 18.2389 Ops/s $\textbf{\color{#d91a1a}-10.69\%}$
test_serialize_weights_pickle 1.3719s 1.2164s 0.8221 Ops/s 0.8213 Ops/s $\color{#35bf28}+0.10\%$
test_reshape_pytree 0.4193ms 35.8649μs 27.8824 KOps/s 27.8446 KOps/s $\color{#35bf28}+0.14\%$
test_reshape_td 84.7310μs 41.5846μs 24.0473 KOps/s 24.6233 KOps/s $\color{#d91a1a}-2.34\%$
test_view_pytree 0.4247ms 35.2929μs 28.3343 KOps/s 28.3625 KOps/s $\color{#d91a1a}-0.10\%$
test_view_td 88.2010μs 47.3397μs 21.1239 KOps/s 21.5831 KOps/s $\color{#d91a1a}-2.13\%$
test_unbind_pytree 0.4109ms 33.7302μs 29.6470 KOps/s 29.4641 KOps/s $\color{#35bf28}+0.62\%$
test_unbind_td 0.3955ms 42.2103μs 23.6909 KOps/s 23.9867 KOps/s $\color{#d91a1a}-1.23\%$
test_split_pytree 0.4238ms 46.0956μs 21.6940 KOps/s 21.4422 KOps/s $\color{#35bf28}+1.17\%$
test_split_td 0.1520ms 56.7946μs 17.6073 KOps/s 15.5292 KOps/s $\textbf{\color{#35bf28}+13.38\%}$
test_add_pytree 0.4322ms 56.0042μs 17.8558 KOps/s 18.1090 KOps/s $\color{#d91a1a}-1.40\%$
test_add_td 0.1558ms 90.5976μs 11.0378 KOps/s 10.8596 KOps/s $\color{#35bf28}+1.64\%$
test_compile_add_one_nested[tensordict-compile] 0.4080ms 0.2123ms 4.7099 KOps/s 4.7741 KOps/s $\color{#d91a1a}-1.34\%$
test_compile_add_one_nested[tensordict-eager] 0.5497ms 0.1556ms 6.4274 KOps/s 6.3558 KOps/s $\color{#35bf28}+1.13\%$
test_compile_add_one_nested[pytree-compile] 0.5425ms 0.1437ms 6.9609 KOps/s 6.9129 KOps/s $\color{#35bf28}+0.69\%$
test_compile_add_one_nested[pytree-eager] 0.2595ms 0.1804ms 5.5421 KOps/s 5.5738 KOps/s $\color{#d91a1a}-0.57\%$
test_compile_copy_nested[tensordict-compile] 0.4206ms 21.7546μs 45.9673 KOps/s 47.1873 KOps/s $\color{#d91a1a}-2.59\%$
test_compile_copy_nested[tensordict-eager] 0.1214ms 42.7597μs 23.3865 KOps/s 22.5447 KOps/s $\color{#35bf28}+3.73\%$
test_compile_copy_nested[pytree-compile] 0.2659ms 64.3071μs 15.5504 KOps/s 15.4964 KOps/s $\color{#35bf28}+0.35\%$
test_compile_copy_nested[pytree-eager] 92.0410μs 49.1940μs 20.3277 KOps/s 20.1499 KOps/s $\color{#35bf28}+0.88\%$
test_compile_add_one_flat[tensordict-compile] 0.4564ms 0.3190ms 3.1347 KOps/s 3.1306 KOps/s $\color{#35bf28}+0.13\%$
test_compile_add_one_flat[tensordict-eager] 0.3079ms 0.2079ms 4.8101 KOps/s 4.8899 KOps/s $\color{#d91a1a}-1.63\%$
test_compile_add_one_flat[tensorclass-compile] 0.1827ms 0.1265ms 7.9065 KOps/s 7.5651 KOps/s $\color{#35bf28}+4.51\%$
test_compile_add_one_flat[tensorclass-eager] 0.1285ms 59.8053μs 16.7209 KOps/s 16.1217 KOps/s $\color{#35bf28}+3.72\%$
test_compile_add_one_flat[pytree-compile] 0.4524ms 0.3161ms 3.1637 KOps/s 3.1085 KOps/s $\color{#35bf28}+1.77\%$
test_compile_add_one_flat[pytree-eager] 0.7164ms 0.6072ms 1.6470 KOps/s 1.6555 KOps/s $\color{#d91a1a}-0.52\%$
test_compile_add_self_flat[tensordict-eager] 0.3583ms 0.2442ms 4.0946 KOps/s 4.0787 KOps/s $\color{#35bf28}+0.39\%$
test_compile_add_self_flat[tensordict-compile] 0.3577ms 0.3206ms 3.1192 KOps/s 3.0992 KOps/s $\color{#35bf28}+0.65\%$
test_compile_add_self_flat[tensorclass-eager] 0.1501ms 70.9825μs 14.0880 KOps/s 14.2611 KOps/s $\color{#d91a1a}-1.21\%$
test_compile_add_self_flat[tensorclass-compile] 0.2059ms 0.1285ms 7.7817 KOps/s 7.5357 KOps/s $\color{#35bf28}+3.26\%$
test_compile_add_self_flat[pytree-eager] 0.6159ms 0.5289ms 1.8908 KOps/s 1.9593 KOps/s $\color{#d91a1a}-3.50\%$
test_compile_add_self_flat[pytree-compile] 0.3688ms 0.3175ms 3.1495 KOps/s 3.1292 KOps/s $\color{#35bf28}+0.65\%$
test_compile_copy_flat[tensordict-compile] 69.3810μs 18.9785μs 52.6911 KOps/s 53.1097 KOps/s $\color{#d91a1a}-0.79\%$
test_compile_copy_flat[tensordict-eager] 64.7210μs 27.2402μs 36.7105 KOps/s 36.4830 KOps/s $\color{#35bf28}+0.62\%$
test_compile_copy_flat[pytree-compile] 0.1078ms 70.0636μs 14.2727 KOps/s 14.1920 KOps/s $\color{#35bf28}+0.57\%$
test_compile_copy_flat[pytree-eager] 83.6820μs 51.4170μs 19.4488 KOps/s 19.4877 KOps/s $\color{#d91a1a}-0.20\%$
test_compile_assign_and_add[tensordict-compile] 2.3257ms 0.8100ms 1.2346 KOps/s 1.1223 KOps/s $\textbf{\color{#35bf28}+10.01\%}$
test_compile_assign_and_add[tensordict-eager] 3.2530ms 3.1172ms 320.7978 Ops/s 315.4240 Ops/s $\color{#35bf28}+1.70\%$
test_compile_assign_and_add[pytree-compile] 2.2741ms 0.7977ms 1.2537 KOps/s 1.1394 KOps/s $\textbf{\color{#35bf28}+10.03\%}$
test_compile_assign_and_add[pytree-eager] 3.3088ms 3.1754ms 314.9207 Ops/s 314.4383 Ops/s $\color{#35bf28}+0.15\%$
test_compile_indexing[tensor-tensordict-compile] 0.1553ms 0.1131ms 8.8424 KOps/s 9.2014 KOps/s $\color{#d91a1a}-3.90\%$
test_compile_indexing[tensor-tensordict-eager] 0.1960ms 63.7983μs 15.6744 KOps/s 16.1381 KOps/s $\color{#d91a1a}-2.87\%$
test_compile_indexing[tensor-tensorclass-compile] 0.1453ms 0.1025ms 9.7521 KOps/s 9.3182 KOps/s $\color{#35bf28}+4.66\%$
test_compile_indexing[tensor-tensorclass-eager] 0.1669ms 44.4635μs 22.4903 KOps/s 21.5227 KOps/s $\color{#35bf28}+4.50\%$
test_compile_indexing[tensor-pytree-compile] 0.1457ms 0.1038ms 9.6372 KOps/s 9.0992 KOps/s $\textbf{\color{#35bf28}+5.91\%}$
test_compile_indexing[tensor-pytree-eager] 84.1220μs 42.8070μs 23.3606 KOps/s 21.8371 KOps/s $\textbf{\color{#35bf28}+6.98\%}$
test_compile_indexing[slice-tensordict-compile] 0.1985ms 0.1389ms 7.2010 KOps/s 7.2642 KOps/s $\color{#d91a1a}-0.87\%$
test_compile_indexing[slice-tensordict-eager] 0.1677ms 25.2651μs 39.5803 KOps/s 38.9236 KOps/s $\color{#35bf28}+1.69\%$
test_compile_indexing[slice-tensorclass-compile] 0.1801ms 0.1339ms 7.4702 KOps/s 7.4520 KOps/s $\color{#35bf28}+0.24\%$
test_compile_indexing[slice-tensorclass-eager] 65.9810μs 21.0951μs 47.4044 KOps/s 48.6960 KOps/s $\color{#d91a1a}-2.65\%$
test_compile_indexing[slice-pytree-compile] 0.2249ms 0.1333ms 7.4996 KOps/s 7.5551 KOps/s $\color{#d91a1a}-0.73\%$
test_compile_indexing[slice-pytree-eager] 53.3610μs 20.8990μs 47.8491 KOps/s 48.9941 KOps/s $\color{#d91a1a}-2.34\%$
test_compile_indexing[int-tensordict-compile] 0.1702ms 0.1379ms 7.2534 KOps/s 7.2428 KOps/s $\color{#35bf28}+0.15\%$
test_compile_indexing[int-tensordict-eager] 0.4804ms 25.0352μs 39.9438 KOps/s 39.6744 KOps/s $\color{#35bf28}+0.68\%$
test_compile_indexing[int-tensorclass-compile] 0.1760ms 0.1306ms 7.6543 KOps/s 7.5341 KOps/s $\color{#35bf28}+1.59\%$
test_compile_indexing[int-tensorclass-eager] 0.1500ms 24.4888μs 40.8350 KOps/s 48.2346 KOps/s $\textbf{\color{#d91a1a}-15.34\%}$
test_compile_indexing[int-pytree-compile] 0.1925ms 0.1311ms 7.6274 KOps/s 7.5678 KOps/s $\color{#35bf28}+0.79\%$
test_compile_indexing[int-pytree-eager] 54.2010μs 20.7069μs 48.2930 KOps/s 47.8557 KOps/s $\color{#35bf28}+0.91\%$
test_mod_add[eager] 71.6910μs 31.2504μs 31.9996 KOps/s 29.0603 KOps/s $\textbf{\color{#35bf28}+10.11\%}$
test_mod_add[compile] 0.1106ms 69.0467μs 14.4829 KOps/s 14.0093 KOps/s $\color{#35bf28}+3.38\%$
test_mod_add[compile-overhead] 0.2648ms 0.1415ms 7.0647 KOps/s 7.0641 KOps/s $+0.01\%$
test_mod_wrap[eager] 0.3329ms 0.2395ms 4.1755 KOps/s 3.8779 KOps/s $\textbf{\color{#35bf28}+7.67\%}$
test_mod_wrap[compile] 0.4440ms 0.2823ms 3.5423 KOps/s 3.3413 KOps/s $\textbf{\color{#35bf28}+6.02\%}$
test_mod_wrap[compile-overhead] 7.3511ms 3.8992ms 256.4627 Ops/s 246.2545 Ops/s $\color{#35bf28}+4.15\%$
test_mod_wrap_and_backward[eager] 1.4533ms 1.3410ms 745.7356 Ops/s 697.8158 Ops/s $\textbf{\color{#35bf28}+6.87\%}$
test_mod_wrap_and_backward[compile] 2.7359ms 1.3076ms 764.7573 Ops/s 699.5514 Ops/s $\textbf{\color{#35bf28}+9.32\%}$
test_mod_wrap_and_backward[compile-overhead] 1.2757ms 0.8850ms 1.1299 KOps/s 1.0141 KOps/s $\textbf{\color{#35bf28}+11.42\%}$
test_seq_add[eager] 0.1530ms 95.3208μs 10.4909 KOps/s 9.7707 KOps/s $\textbf{\color{#35bf28}+7.37\%}$
test_seq_add[compile] 0.3126ms 80.9966μs 12.3462 KOps/s 12.2702 KOps/s $\color{#35bf28}+0.62\%$
test_seq_add[compile-overhead] 0.1514ms 0.1141ms 8.7625 KOps/s 8.7715 KOps/s $\color{#d91a1a}-0.10\%$
test_seq_wrap[eager] 0.4708ms 0.3729ms 2.6819 KOps/s 2.5493 KOps/s $\textbf{\color{#35bf28}+5.20\%}$
test_seq_wrap[compile] 0.3657ms 0.3013ms 3.3194 KOps/s 3.2154 KOps/s $\color{#35bf28}+3.24\%$
test_seq_wrap[compile-overhead] 0.2897ms 0.2087ms 4.7906 KOps/s 4.5678 KOps/s $\color{#35bf28}+4.88\%$
test_func_call_runtime[False-eager] 0.7998ms 0.7324ms 1.3654 KOps/s 1.3429 KOps/s $\color{#35bf28}+1.68\%$
test_func_call_runtime[False-compile] 1.0383ms 0.7784ms 1.2848 KOps/s 1.2668 KOps/s $\color{#35bf28}+1.42\%$
test_func_call_runtime[False-compile-overhead] 0.4025ms 0.3462ms 2.8885 KOps/s 2.8134 KOps/s $\color{#35bf28}+2.67\%$
test_func_call_runtime[True-eager] 1.0934ms 0.8906ms 1.1229 KOps/s 1.0862 KOps/s $\color{#35bf28}+3.37\%$
test_func_call_runtime[True-compile] 0.8982ms 0.8114ms 1.2324 KOps/s 1.1464 KOps/s $\textbf{\color{#35bf28}+7.51\%}$
test_func_call_runtime[True-compile-overhead] 0.4374ms 0.3809ms 2.6252 KOps/s 2.5873 KOps/s $\color{#35bf28}+1.46\%$
test_func_call_cm_runtime[False-eager] 0.8022ms 0.7300ms 1.3698 KOps/s 1.2721 KOps/s $\textbf{\color{#35bf28}+7.69\%}$
test_func_call_cm_runtime[False-compile] 1.0469ms 0.7775ms 1.2862 KOps/s 1.1832 KOps/s $\textbf{\color{#35bf28}+8.70\%}$
test_func_call_cm_runtime[False-compile-overhead] 0.4026ms 0.3497ms 2.8593 KOps/s 2.8106 KOps/s $\color{#35bf28}+1.73\%$
test_func_call_cm_runtime[True-eager] 1.0697ms 0.9853ms 1.0150 KOps/s 995.0649 Ops/s $\color{#35bf28}+2.00\%$
test_func_call_cm_runtime[True-compile] 0.8840ms 0.8370ms 1.1948 KOps/s 1.1715 KOps/s $\color{#35bf28}+1.99\%$
test_func_call_cm_runtime[True-compile-overhead] 0.5006ms 0.4055ms 2.4663 KOps/s 2.4266 KOps/s $\color{#35bf28}+1.63\%$
test_vmap_func_call_cm_runtime[eager] 2.5596ms 2.0650ms 484.2544 Ops/s 478.7228 Ops/s $\color{#35bf28}+1.16\%$
test_vmap_func_call_cm_runtime[compile] 0.9254ms 0.8587ms 1.1645 KOps/s 1.1465 KOps/s $\color{#35bf28}+1.57\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.4985ms 0.4098ms 2.4403 KOps/s 2.3994 KOps/s $\color{#35bf28}+1.70\%$
test_distributed 1.9164ms 0.1724ms 5.7990 KOps/s 8.6855 KOps/s $\textbf{\color{#d91a1a}-33.23\%}$
test_tdmodule 42.8300μs 14.2050μs 70.3977 KOps/s 62.9597 KOps/s $\textbf{\color{#35bf28}+11.81\%}$
test_tdmodule_dispatch 56.7410μs 28.4838μs 35.1077 KOps/s 32.8340 KOps/s $\textbf{\color{#35bf28}+6.92\%}$
test_tdseq 22.1100μs 14.8570μs 67.3083 KOps/s 62.2291 KOps/s $\textbf{\color{#35bf28}+8.16\%}$
test_tdseq_dispatch 50.8710μs 30.7098μs 32.5629 KOps/s 30.2979 KOps/s $\textbf{\color{#35bf28}+7.48\%}$
test_instantiation_functorch 2.0181ms 1.8397ms 543.5619 Ops/s 532.9209 Ops/s $\color{#35bf28}+2.00\%$
test_instantiation_td 1.7883ms 1.1937ms 837.7189 Ops/s 828.2105 Ops/s $\color{#35bf28}+1.15\%$
test_exec_functorch 0.2912ms 0.2209ms 4.5260 KOps/s 4.7320 KOps/s $\color{#d91a1a}-4.35\%$
test_exec_functional_call 0.3208ms 0.2064ms 4.8438 KOps/s 4.7233 KOps/s $\color{#35bf28}+2.55\%$
test_exec_td 0.2746ms 0.2135ms 4.6842 KOps/s 4.6944 KOps/s $\color{#d91a1a}-0.22\%$
test_exec_td_decorator 0.5882ms 0.2534ms 3.9469 KOps/s 3.7262 KOps/s $\textbf{\color{#35bf28}+5.92\%}$
test_vmap_mlp_speed[True-True] 0.8492ms 0.6882ms 1.4531 KOps/s 1.4473 KOps/s $\color{#35bf28}+0.40\%$
test_vmap_mlp_speed[True-False] 0.7311ms 0.6812ms 1.4679 KOps/s 1.4593 KOps/s $\color{#35bf28}+0.59\%$
test_vmap_mlp_speed[False-True] 0.6179ms 0.5714ms 1.7500 KOps/s 1.6600 KOps/s $\textbf{\color{#35bf28}+5.42\%}$
test_vmap_mlp_speed[False-False] 0.6150ms 0.5726ms 1.7463 KOps/s 1.6999 KOps/s $\color{#35bf28}+2.73\%$
test_vmap_mlp_speed_decorator[True-True] 0.7861ms 0.6655ms 1.5026 KOps/s 1.4802 KOps/s $\color{#35bf28}+1.51\%$
test_vmap_mlp_speed_decorator[True-False] 0.8538ms 0.6665ms 1.5003 KOps/s 1.4825 KOps/s $\color{#35bf28}+1.20\%$
test_vmap_mlp_speed_decorator[False-True] 0.7006ms 0.5880ms 1.7008 KOps/s 1.6985 KOps/s $\color{#35bf28}+0.14\%$
test_vmap_mlp_speed_decorator[False-False] 0.7143ms 0.5873ms 1.7028 KOps/s 1.6981 KOps/s $\color{#35bf28}+0.27\%$
test_vmap_transformer_speed[True-True] 8.3888ms 8.3053ms 120.4055 Ops/s 119.4687 Ops/s $\color{#35bf28}+0.78\%$
test_vmap_transformer_speed[True-False] 8.2966ms 8.2468ms 121.2587 Ops/s 119.6583 Ops/s $\color{#35bf28}+1.34\%$
test_vmap_transformer_speed[False-True] 8.1725ms 8.0684ms 123.9406 Ops/s 122.8932 Ops/s $\color{#35bf28}+0.85\%$
test_vmap_transformer_speed[False-False] 8.1694ms 8.0932ms 123.5605 Ops/s 122.8291 Ops/s $\color{#35bf28}+0.60\%$
test_vmap_transformer_speed_decorator[True-True] 19.9789ms 19.3177ms 51.7659 Ops/s 51.5586 Ops/s $\color{#35bf28}+0.40\%$
test_vmap_transformer_speed_decorator[True-False] 19.4196ms 19.3385ms 51.7103 Ops/s 51.5378 Ops/s $\color{#35bf28}+0.33\%$
test_vmap_transformer_speed_decorator[False-True] 19.3169ms 19.2166ms 52.0384 Ops/s 51.8863 Ops/s $\color{#35bf28}+0.29\%$
test_vmap_transformer_speed_decorator[False-False] 19.2779ms 19.1888ms 52.1138 Ops/s 51.9142 Ops/s $\color{#35bf28}+0.38\%$
test_to_module_speed[True] 1.4368ms 0.9399ms 1.0639 KOps/s 1.0635 KOps/s $\color{#35bf28}+0.04\%$
test_to_module_speed[False] 1.2903ms 0.9185ms 1.0887 KOps/s 1.0811 KOps/s $\color{#35bf28}+0.71\%$
test_tc_init 71.2910μs 34.3988μs 29.0708 KOps/s 27.7800 KOps/s $\color{#35bf28}+4.65\%$
test_tc_init_nested 0.1066ms 71.6526μs 13.9562 KOps/s 13.4348 KOps/s $\color{#35bf28}+3.88\%$
test_tc_first_layer_tensor 4.7030μs 0.6835μs 1.4632 MOps/s 1.4461 MOps/s $\color{#35bf28}+1.18\%$
test_tc_first_layer_nontensor 28.6810μs 2.2416μs 446.1033 KOps/s 448.2151 KOps/s $\color{#d91a1a}-0.47\%$
test_tc_second_layer_tensor 7.2950μs 1.3839μs 722.5981 KOps/s 729.3432 KOps/s $\color{#d91a1a}-0.92\%$
test_tc_second_layer_nontensor 22.8600μs 2.9543μs 338.4893 KOps/s 340.2674 KOps/s $\color{#d91a1a}-0.52\%$
test_unbind 0.1919s 11.9371ms 83.7721 Ops/s 93.2137 Ops/s $\textbf{\color{#d91a1a}-10.13\%}$
test_full_like 0.6579ms 0.5739ms 1.7425 KOps/s 1.7373 KOps/s $\color{#35bf28}+0.30\%$
test_zeros_like 0.2692ms 0.1979ms 5.0520 KOps/s 5.0525 KOps/s $\color{#d91a1a}-0.01\%$
test_ones_like 0.2575ms 0.1977ms 5.0586 KOps/s 5.0532 KOps/s $\color{#35bf28}+0.11\%$
test_clone 0.4453ms 0.4131ms 2.4206 KOps/s 2.4105 KOps/s $\color{#35bf28}+0.42\%$
test_squeeze 35.3200μs 9.7534μs 102.5283 KOps/s 100.9646 KOps/s $\color{#35bf28}+1.55\%$
test_unsqueeze 0.2205ms 74.0326μs 13.5076 KOps/s 13.3637 KOps/s $\color{#35bf28}+1.08\%$
test_split 0.4135ms 0.1569ms 6.3730 KOps/s 6.3295 KOps/s $\color{#35bf28}+0.69\%$
test_permute 0.2803ms 0.1800ms 5.5558 KOps/s 5.3402 KOps/s $\color{#35bf28}+4.04\%$
test_stack 1.2561ms 0.8742ms 1.1439 KOps/s 1.1964 KOps/s $\color{#d91a1a}-4.39\%$
test_cat 1.2701ms 1.2322ms 811.5606 Ops/s 811.7761 Ops/s $\color{#d91a1a}-0.03\%$

@vmoens vmoens added the enhancement New feature or request label Sep 9, 2024
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Sep 9, 2024
ghstack-source-id: c7737107beff69fa6f5438067c795d2e1a98d45f
Pull Request resolved: #953
@vmoens vmoens merged commit eef76bf into gh/vmoens/8/base Sep 10, 2024
44 of 48 checks passed
vmoens added a commit that referenced this pull request Sep 10, 2024
ghstack-source-id: c7737107beff69fa6f5438067c795d2e1a98d45f
Pull Request resolved: #953
@vmoens vmoens deleted the gh/vmoens/8/head branch September 10, 2024 07:10
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. enhancement New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants