Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature] Make NonTensorData a callable #939

Merged
merged 3 commits into from
Aug 1, 2024
Merged

Conversation

vmoens
Copy link
Contributor

@vmoens vmoens commented Aug 1, 2024

Closes #937

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Aug 1, 2024
@vmoens vmoens added the enhancement New feature or request label Aug 1, 2024
Copy link

github-actions bot commented Aug 1, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 219. Improved: $\large\color{#35bf28}15$. Worsened: $\large\color{#d91a1a}12$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 40.7050μs 21.5569μs 46.3888 KOps/s 44.5308 KOps/s $\color{#35bf28}+4.17\%$
test_plain_set_stack_nested 49.4720μs 21.8434μs 45.7805 KOps/s 44.8696 KOps/s $\color{#35bf28}+2.03\%$
test_plain_set_nested_inplace 76.0110μs 23.4607μs 42.6244 KOps/s 41.5698 KOps/s $\color{#35bf28}+2.54\%$
test_plain_set_stack_nested_inplace 79.0170μs 23.5227μs 42.5121 KOps/s 41.7377 KOps/s $\color{#35bf28}+1.86\%$
test_items 24.5160μs 2.6080μs 383.4344 KOps/s 373.7472 KOps/s $\color{#35bf28}+2.59\%$
test_items_nested 1.4068ms 0.3437ms 2.9091 KOps/s 2.7550 KOps/s $\textbf{\color{#35bf28}+5.59\%}$
test_items_nested_locked 0.6308ms 0.3445ms 2.9025 KOps/s 2.9709 KOps/s $\color{#d91a1a}-2.30\%$
test_items_nested_leaf 0.2522ms 85.7327μs 11.6642 KOps/s 11.8759 KOps/s $\color{#d91a1a}-1.78\%$
test_items_stack_nested 0.6691ms 0.3414ms 2.9292 KOps/s 2.9189 KOps/s $\color{#35bf28}+0.35\%$
test_items_stack_nested_leaf 0.1599ms 87.1862μs 11.4697 KOps/s 11.5437 KOps/s $\color{#d91a1a}-0.64\%$
test_items_stack_nested_locked 0.5431ms 0.3426ms 2.9191 KOps/s 2.9372 KOps/s $\color{#d91a1a}-0.62\%$
test_keys 22.8420μs 3.8433μs 260.1937 KOps/s 256.1236 KOps/s $\color{#35bf28}+1.59\%$
test_keys_nested 0.2347ms 0.1420ms 7.0439 KOps/s 7.1024 KOps/s $\color{#d91a1a}-0.82\%$
test_keys_nested_locked 0.7234ms 0.1480ms 6.7552 KOps/s 6.8627 KOps/s $\color{#d91a1a}-1.57\%$
test_keys_nested_leaf 0.2118ms 0.1226ms 8.1544 KOps/s 8.3422 KOps/s $\color{#d91a1a}-2.25\%$
test_keys_stack_nested 0.2332ms 0.1420ms 7.0409 KOps/s 7.0232 KOps/s $\color{#35bf28}+0.25\%$
test_keys_stack_nested_leaf 0.2105ms 0.1213ms 8.2464 KOps/s 8.2418 KOps/s $\color{#35bf28}+0.06\%$
test_keys_stack_nested_locked 0.2932ms 0.1480ms 6.7547 KOps/s 6.6883 KOps/s $\color{#35bf28}+0.99\%$
test_values 7.5265μs 1.1748μs 851.2171 KOps/s 840.9549 KOps/s $\color{#35bf28}+1.22\%$
test_values_nested 87.7140μs 50.7283μs 19.7129 KOps/s 20.2829 KOps/s $\color{#d91a1a}-2.81\%$
test_values_nested_locked 93.5150μs 50.2023μs 19.9194 KOps/s 20.1608 KOps/s $\color{#d91a1a}-1.20\%$
test_values_nested_leaf 90.9200μs 45.7578μs 21.8542 KOps/s 22.1688 KOps/s $\color{#d91a1a}-1.42\%$
test_values_stack_nested 97.3020μs 51.3790μs 19.4632 KOps/s 19.6685 KOps/s $\color{#d91a1a}-1.04\%$
test_values_stack_nested_leaf 89.9480μs 45.0938μs 22.1760 KOps/s 22.3456 KOps/s $\color{#d91a1a}-0.76\%$
test_values_stack_nested_locked 94.0060μs 51.0928μs 19.5722 KOps/s 19.5722 KOps/s $+0.00\%$
test_membership 13.5350μs 0.9097μs 1.0993 MOps/s 1.3366 MOps/s $\textbf{\color{#d91a1a}-17.75\%}$
test_membership_nested 23.1730μs 2.6309μs 380.0954 KOps/s 387.1768 KOps/s $\color{#d91a1a}-1.83\%$
test_membership_nested_leaf 29.5150μs 2.6098μs 383.1758 KOps/s 384.0617 KOps/s $\color{#d91a1a}-0.23\%$
test_membership_stacked_nested 27.6120μs 2.6059μs 383.7463 KOps/s 389.5553 KOps/s $\color{#d91a1a}-1.49\%$
test_membership_stacked_nested_leaf 33.9540μs 2.6203μs 381.6329 KOps/s 382.5286 KOps/s $\color{#d91a1a}-0.23\%$
test_membership_nested_last 25.7380μs 3.8811μs 257.6610 KOps/s 258.5849 KOps/s $\color{#d91a1a}-0.36\%$
test_membership_nested_leaf_last 35.8070μs 3.9145μs 255.4578 KOps/s 258.5955 KOps/s $\color{#d91a1a}-1.21\%$
test_membership_stacked_nested_last 33.2320μs 3.8920μs 256.9380 KOps/s 221.2117 KOps/s $\textbf{\color{#35bf28}+16.15\%}$
test_membership_stacked_nested_leaf_last 33.3830μs 3.8919μs 256.9432 KOps/s 222.8119 KOps/s $\textbf{\color{#35bf28}+15.32\%}$
test_nested_getleaf 42.0280μs 10.6290μs 94.0822 KOps/s 97.7887 KOps/s $\color{#d91a1a}-3.79\%$
test_nested_get 34.8050μs 10.0270μs 99.7304 KOps/s 102.9391 KOps/s $\color{#d91a1a}-3.12\%$
test_stacked_getleaf 34.4550μs 10.5017μs 95.2227 KOps/s 97.6393 KOps/s $\color{#d91a1a}-2.48\%$
test_stacked_get 44.0620μs 9.9060μs 100.9484 KOps/s 103.5080 KOps/s $\color{#d91a1a}-2.47\%$
test_nested_getitemleaf 62.2090μs 10.6189μs 94.1716 KOps/s 92.0862 KOps/s $\color{#35bf28}+2.26\%$
test_nested_getitem 56.3260μs 10.0932μs 99.0769 KOps/s 100.3691 KOps/s $\color{#d91a1a}-1.29\%$
test_stacked_getitemleaf 53.0590μs 10.8761μs 91.9445 KOps/s 92.8782 KOps/s $\color{#d91a1a}-1.01\%$
test_stacked_getitem 57.3470μs 10.0391μs 99.6110 KOps/s 99.9008 KOps/s $\color{#d91a1a}-0.29\%$
test_lock_nested 7.3772ms 0.5048ms 1.9811 KOps/s 1.9993 KOps/s $\color{#d91a1a}-0.91\%$
test_lock_stack_nested 0.7058ms 0.4688ms 2.1331 KOps/s 2.1834 KOps/s $\color{#d91a1a}-2.30\%$
test_unlock_nested 0.9448ms 0.4193ms 2.3852 KOps/s 2.4381 KOps/s $\color{#d91a1a}-2.17\%$
test_unlock_stack_nested 0.5884ms 0.3830ms 2.6113 KOps/s 2.6894 KOps/s $\color{#d91a1a}-2.90\%$
test_flatten_speed 0.5809ms 0.1046ms 9.5629 KOps/s 9.6274 KOps/s $\color{#d91a1a}-0.67\%$
test_unflatten_speed 0.5141ms 0.4269ms 2.3427 KOps/s 2.3776 KOps/s $\color{#d91a1a}-1.47\%$
test_common_ops 4.3303ms 1.1097ms 901.1588 Ops/s 908.6517 Ops/s $\color{#d91a1a}-0.82\%$
test_creation 47.4190μs 2.0174μs 495.6851 KOps/s 502.3599 KOps/s $\color{#d91a1a}-1.33\%$
test_creation_empty 45.6260μs 17.6914μs 56.5247 KOps/s 53.0766 KOps/s $\textbf{\color{#35bf28}+6.50\%}$
test_creation_nested_1 67.1260μs 21.0252μs 47.5619 KOps/s 44.3145 KOps/s $\textbf{\color{#35bf28}+7.33\%}$
test_creation_nested_2 0.1083ms 24.2483μs 41.2399 KOps/s 39.0474 KOps/s $\textbf{\color{#35bf28}+5.62\%}$
test_clone 81.0220μs 17.5005μs 57.1413 KOps/s 60.1686 KOps/s $\textbf{\color{#d91a1a}-5.03\%}$
test_getitem[int] 1.2456ms 16.7161μs 59.8225 KOps/s 61.0441 KOps/s $\color{#d91a1a}-2.00\%$
test_getitem[slice_int] 0.1335ms 31.7061μs 31.5397 KOps/s 31.8432 KOps/s $\color{#d91a1a}-0.95\%$
test_getitem[range] 0.1634ms 58.6001μs 17.0648 KOps/s 17.6165 KOps/s $\color{#d91a1a}-3.13\%$
test_getitem[tuple] 0.1263ms 25.2573μs 39.5926 KOps/s 39.8977 KOps/s $\color{#d91a1a}-0.76\%$
test_getitem[list] 0.3276ms 53.2255μs 18.7880 KOps/s 19.3241 KOps/s $\color{#d91a1a}-2.77\%$
test_setitem_dim[int] 0.1102ms 39.8117μs 25.1183 KOps/s 24.2489 KOps/s $\color{#35bf28}+3.59\%$
test_setitem_dim[slice_int] 0.1124ms 70.8192μs 14.1205 KOps/s 13.6698 KOps/s $\color{#35bf28}+3.30\%$
test_setitem_dim[range] 0.1391ms 93.1622μs 10.7340 KOps/s 10.6612 KOps/s $\color{#35bf28}+0.68\%$
test_setitem_dim[tuple] 99.8970μs 56.9996μs 17.5440 KOps/s 17.0988 KOps/s $\color{#35bf28}+2.60\%$
test_setitem 0.1863ms 29.6104μs 33.7719 KOps/s 33.5519 KOps/s $\color{#35bf28}+0.66\%$
test_set 0.1537ms 28.7456μs 34.7879 KOps/s 34.3686 KOps/s $\color{#35bf28}+1.22\%$
test_set_shared 3.9911ms 0.2205ms 4.5352 KOps/s 4.6910 KOps/s $\color{#d91a1a}-3.32\%$
test_update 0.1878ms 35.6252μs 28.0700 KOps/s 27.3880 KOps/s $\color{#35bf28}+2.49\%$
test_update_nested 0.1880ms 45.5272μs 21.9649 KOps/s 21.8979 KOps/s $\color{#35bf28}+0.31\%$
test_update__nested 0.1262ms 35.2422μs 28.3750 KOps/s 30.0324 KOps/s $\textbf{\color{#d91a1a}-5.52\%}$
test_set_nested 0.1106ms 31.0913μs 32.1633 KOps/s 32.4269 KOps/s $\color{#d91a1a}-0.81\%$
test_set_nested_new 0.1624ms 35.5549μs 28.1256 KOps/s 27.7518 KOps/s $\color{#35bf28}+1.35\%$
test_select 0.1706ms 52.7070μs 18.9728 KOps/s 17.5439 KOps/s $\textbf{\color{#35bf28}+8.14\%}$
test_select_nested 0.1716ms 58.8530μs 16.9915 KOps/s 17.3405 KOps/s $\color{#d91a1a}-2.01\%$
test_exclude_nested 0.1464ms 76.7688μs 13.0261 KOps/s 13.3284 KOps/s $\color{#d91a1a}-2.27\%$
test_empty[True] 0.7413ms 0.3272ms 3.0567 KOps/s 3.1591 KOps/s $\color{#d91a1a}-3.24\%$
test_empty[False] 9.8885μs 1.1701μs 854.6266 KOps/s 859.7462 KOps/s $\color{#d91a1a}-0.60\%$
test_unbind_speed 0.6118ms 0.3159ms 3.1658 KOps/s 3.2817 KOps/s $\color{#d91a1a}-3.53\%$
test_unbind_speed_stack0 0.3810ms 0.3027ms 3.3032 KOps/s 3.3831 KOps/s $\color{#d91a1a}-2.36\%$
test_unbind_speed_stack1 86.7007ms 0.7915ms 1.2634 KOps/s 1.3898 KOps/s $\textbf{\color{#d91a1a}-9.10\%}$
test_split 90.4262ms 2.2140ms 451.6738 Ops/s 457.7297 Ops/s $\color{#d91a1a}-1.32\%$
test_chunk 89.3407ms 2.2061ms 453.2922 Ops/s 464.2747 Ops/s $\color{#d91a1a}-2.37\%$
test_creation[device0] 0.2259ms 0.1195ms 8.3699 KOps/s 8.5950 KOps/s $\color{#d91a1a}-2.62\%$
test_creation_from_tensor 4.9890ms 0.1228ms 8.1438 KOps/s 8.3873 KOps/s $\color{#d91a1a}-2.90\%$
test_add_one[memmap_tensor0] 0.2520ms 8.0000μs 125.0000 KOps/s 134.1687 KOps/s $\textbf{\color{#d91a1a}-6.83\%}$
test_contiguous[memmap_tensor0] 25.5280μs 2.0347μs 491.4848 KOps/s 503.0307 KOps/s $\color{#d91a1a}-2.30\%$
test_stack[memmap_tensor0] 51.4470μs 5.8791μs 170.0945 KOps/s 180.4658 KOps/s $\textbf{\color{#d91a1a}-5.75\%}$
test_memmaptd_index 1.2247ms 0.4170ms 2.3981 KOps/s 2.4931 KOps/s $\color{#d91a1a}-3.81\%$
test_memmaptd_index_astensor 1.0196ms 0.4953ms 2.0191 KOps/s 2.0827 KOps/s $\color{#d91a1a}-3.05\%$
test_memmaptd_index_op 1.5783ms 1.0493ms 953.0145 Ops/s 940.6712 Ops/s $\color{#35bf28}+1.31\%$
test_serialize_model 0.1296s 0.1207s 8.2816 Ops/s 8.4174 Ops/s $\color{#d91a1a}-1.61\%$
test_serialize_model_pickle 0.4512s 0.3984s 2.5098 Ops/s 2.4946 Ops/s $\color{#35bf28}+0.61\%$
test_serialize_weights 0.1262s 0.1174s 8.5199 Ops/s 8.2753 Ops/s $\color{#35bf28}+2.96\%$
test_serialize_weights_returnearly 0.1761s 0.1592s 6.2822 Ops/s 6.2068 Ops/s $\color{#35bf28}+1.21\%$
test_serialize_weights_pickle 0.4748s 0.4149s 2.4103 Ops/s 2.4107 Ops/s $\color{#d91a1a}-0.02\%$
test_serialize_weights_filesystem 0.1512s 0.1443s 6.9284 Ops/s 6.9315 Ops/s $\color{#d91a1a}-0.04\%$
test_serialize_model_filesystem 0.1587s 0.1511s 6.6193 Ops/s 5.9606 Ops/s $\textbf{\color{#35bf28}+11.05\%}$
test_reshape_pytree 0.1256ms 40.8835μs 24.4597 KOps/s 25.0678 KOps/s $\color{#d91a1a}-2.43\%$
test_reshape_td 0.1287ms 48.3593μs 20.6785 KOps/s 21.5215 KOps/s $\color{#d91a1a}-3.92\%$
test_view_pytree 88.8060μs 40.1958μs 24.8782 KOps/s 25.3787 KOps/s $\color{#d91a1a}-1.97\%$
test_view_td 0.1273ms 56.0395μs 17.8445 KOps/s 18.3612 KOps/s $\color{#d91a1a}-2.81\%$
test_unbind_pytree 93.8550μs 37.3476μs 26.7755 KOps/s 27.3981 KOps/s $\color{#d91a1a}-2.27\%$
test_unbind_td 0.3891ms 46.9443μs 21.3018 KOps/s 22.0642 KOps/s $\color{#d91a1a}-3.46\%$
test_split_pytree 0.1018ms 41.5156μs 24.0873 KOps/s 26.1249 KOps/s $\textbf{\color{#d91a1a}-7.80\%}$
test_split_td 96.8688ms 70.6164μs 14.1610 KOps/s 16.7641 KOps/s $\textbf{\color{#d91a1a}-15.53\%}$
test_add_pytree 0.1233ms 47.5035μs 21.0511 KOps/s 21.3560 KOps/s $\color{#d91a1a}-1.43\%$
test_add_td 0.2916ms 84.0921μs 11.8917 KOps/s 11.4911 KOps/s $\color{#35bf28}+3.49\%$
test_compile_add_one_nested[tensordict-compile] 0.1305ms 54.3348μs 18.4044 KOps/s 18.5606 KOps/s $\color{#d91a1a}-0.84\%$
test_compile_add_one_nested[tensordict-eager] 0.4051ms 0.1920ms 5.2089 KOps/s 5.2792 KOps/s $\color{#d91a1a}-1.33\%$
test_compile_add_one_nested[pytree-compile] 0.1313ms 53.9153μs 18.5476 KOps/s 18.1840 KOps/s $\color{#35bf28}+2.00\%$
test_compile_add_one_nested[pytree-eager] 0.2892ms 0.1474ms 6.7826 KOps/s 6.9429 KOps/s $\color{#d91a1a}-2.31\%$
test_compile_copy_nested[tensordict-compile] 62.4570μs 20.1888μs 49.5323 KOps/s 49.3478 KOps/s $\color{#35bf28}+0.37\%$
test_compile_copy_nested[tensordict-eager] 0.1394ms 64.6130μs 15.4768 KOps/s 15.7144 KOps/s $\color{#d91a1a}-1.51\%$
test_compile_copy_nested[pytree-compile] 0.1812ms 78.4430μs 12.7481 KOps/s 12.8168 KOps/s $\color{#d91a1a}-0.54\%$
test_compile_copy_nested[pytree-eager] 0.1506ms 70.5486μs 14.1746 KOps/s 14.3762 KOps/s $\color{#d91a1a}-1.40\%$
test_compile_add_one_flat[tensordict-compile] 0.2651ms 0.1754ms 5.7014 KOps/s 5.7318 KOps/s $\color{#d91a1a}-0.53\%$
test_compile_add_one_flat[tensordict-eager] 0.2702ms 0.1972ms 5.0707 KOps/s 5.1257 KOps/s $\color{#d91a1a}-1.07\%$
test_compile_add_one_flat[tensorclass-compile] 0.1003ms 38.8485μs 25.7410 KOps/s 25.7172 KOps/s $\color{#35bf28}+0.09\%$
test_compile_add_one_flat[tensorclass-eager] 0.5232ms 70.8336μs 14.1176 KOps/s 14.0872 KOps/s $\color{#35bf28}+0.22\%$
test_compile_add_one_flat[pytree-compile] 0.3407ms 0.1716ms 5.8290 KOps/s 5.7484 KOps/s $\color{#35bf28}+1.40\%$
test_compile_add_one_flat[pytree-eager] 0.5674ms 0.3027ms 3.3031 KOps/s 3.3904 KOps/s $\color{#d91a1a}-2.58\%$
test_compile_add_self_flat[tensordict-eager] 6.3838ms 0.2130ms 4.6943 KOps/s 4.8521 KOps/s $\color{#d91a1a}-3.25\%$
test_compile_add_self_flat[tensordict-compile] 0.3644ms 0.1784ms 5.6063 KOps/s 5.6191 KOps/s $\color{#d91a1a}-0.23\%$
test_compile_add_self_flat[tensorclass-eager] 0.2084ms 62.8483μs 15.9113 KOps/s 15.7077 KOps/s $\color{#35bf28}+1.30\%$
test_compile_add_self_flat[tensorclass-compile] 0.1267ms 40.3882μs 24.7597 KOps/s 25.6440 KOps/s $\color{#d91a1a}-3.45\%$
test_compile_add_self_flat[pytree-eager] 0.4512ms 0.2477ms 4.0363 KOps/s 4.2010 KOps/s $\color{#d91a1a}-3.92\%$
test_compile_add_self_flat[pytree-compile] 0.3299ms 0.1750ms 5.7138 KOps/s 5.7750 KOps/s $\color{#d91a1a}-1.06\%$
test_compile_copy_flat[tensordict-compile] 0.2191ms 0.1086ms 9.2106 KOps/s 9.2989 KOps/s $\color{#d91a1a}-0.95\%$
test_compile_copy_flat[tensordict-eager] 0.1241ms 56.1217μs 17.8184 KOps/s 17.7553 KOps/s $\color{#35bf28}+0.36\%$
test_compile_copy_flat[pytree-compile] 0.1747ms 80.7593μs 12.3825 KOps/s 12.5978 KOps/s $\color{#d91a1a}-1.71\%$
test_compile_copy_flat[pytree-eager] 0.1820ms 69.9831μs 14.2892 KOps/s 14.1623 KOps/s $\color{#35bf28}+0.90\%$
test_compile_assign_and_add[tensordict-compile] 0.3736ms 0.1904ms 5.2514 KOps/s 5.2074 KOps/s $\color{#35bf28}+0.84\%$
test_compile_assign_and_add[tensordict-eager] 2.0403ms 1.7141ms 583.3964 Ops/s 590.1051 Ops/s $\color{#d91a1a}-1.14\%$
test_compile_assign_and_add[pytree-compile] 0.3569ms 0.1881ms 5.3150 KOps/s 5.2458 KOps/s $\color{#35bf28}+1.32\%$
test_compile_assign_and_add[pytree-eager] 1.3869ms 1.1266ms 887.6533 Ops/s 917.2488 Ops/s $\color{#d91a1a}-3.23\%$
test_compile_assign_and_add_stack[compile] 0.8776ms 0.4150ms 2.4094 KOps/s 2.3903 KOps/s $\color{#35bf28}+0.80\%$
test_compile_assign_and_add_stack[eager] 5.8708ms 3.8980ms 256.5420 Ops/s 251.6770 Ops/s $\color{#35bf28}+1.93\%$
test_compile_indexing[tensor-tensordict-compile] 99.4360μs 33.2194μs 30.1029 KOps/s 31.2415 KOps/s $\color{#d91a1a}-3.64\%$
test_compile_indexing[tensor-tensordict-eager] 1.1343ms 49.5521μs 20.1808 KOps/s 20.3900 KOps/s $\color{#d91a1a}-1.03\%$
test_compile_indexing[tensor-tensorclass-compile] 90.4990μs 29.0471μs 34.4269 KOps/s 32.3230 KOps/s $\textbf{\color{#35bf28}+6.51\%}$
test_compile_indexing[tensor-tensorclass-eager] 0.7140ms 30.6675μs 32.6078 KOps/s 32.6198 KOps/s $\color{#d91a1a}-0.04\%$
test_compile_indexing[tensor-pytree-compile] 93.2950μs 28.4875μs 35.1031 KOps/s 34.5012 KOps/s $\color{#35bf28}+1.74\%$
test_compile_indexing[tensor-pytree-eager] 0.1073ms 30.7802μs 32.4885 KOps/s 32.8464 KOps/s $\color{#d91a1a}-1.09\%$
test_compile_indexing[slice-tensordict-compile] 0.1451ms 72.8632μs 13.7243 KOps/s 13.5732 KOps/s $\color{#35bf28}+1.11\%$
test_compile_indexing[slice-tensordict-eager] 0.6333ms 28.4422μs 35.1590 KOps/s 35.3381 KOps/s $\color{#d91a1a}-0.51\%$
test_compile_indexing[slice-tensorclass-compile] 0.1472ms 67.0904μs 14.9053 KOps/s 14.2523 KOps/s $\color{#35bf28}+4.58\%$
test_compile_indexing[slice-tensorclass-eager] 73.4370μs 24.3223μs 41.1145 KOps/s 41.5409 KOps/s $\color{#d91a1a}-1.03\%$
test_compile_indexing[slice-pytree-compile] 0.1388ms 66.6636μs 15.0007 KOps/s 14.5385 KOps/s $\color{#35bf28}+3.18\%$
test_compile_indexing[slice-pytree-eager] 71.2940μs 24.2202μs 41.2879 KOps/s 41.7347 KOps/s $\color{#d91a1a}-1.07\%$
test_compile_indexing[int-tensordict-compile] 0.1503ms 72.3581μs 13.8202 KOps/s 13.6199 KOps/s $\color{#35bf28}+1.47\%$
test_compile_indexing[int-tensordict-eager] 0.6815ms 27.9952μs 35.7204 KOps/s 35.3895 KOps/s $\color{#35bf28}+0.94\%$
test_compile_indexing[int-tensorclass-compile] 0.1465ms 66.6252μs 15.0093 KOps/s 14.6943 KOps/s $\color{#35bf28}+2.14\%$
test_compile_indexing[int-tensorclass-eager] 82.1640μs 23.8756μs 41.8838 KOps/s 42.2386 KOps/s $\color{#d91a1a}-0.84\%$
test_compile_indexing[int-pytree-compile] 0.1405ms 67.1260μs 14.8974 KOps/s 14.6162 KOps/s $\color{#35bf28}+1.92\%$
test_compile_indexing[int-pytree-eager] 67.6060μs 24.3808μs 41.0159 KOps/s 42.6611 KOps/s $\color{#d91a1a}-3.86\%$
test_mod_add[eager] 89.0560μs 24.7645μs 40.3803 KOps/s 40.3881 KOps/s $\color{#d91a1a}-0.02\%$
test_mod_add[compile] 84.7890μs 37.6297μs 26.5748 KOps/s 27.8011 KOps/s $\color{#d91a1a}-4.41\%$
test_mod_add[compile-overhead] 82.6750μs 36.8976μs 27.1021 KOps/s 27.6141 KOps/s $\color{#d91a1a}-1.85\%$
test_mod_wrap[eager] 0.4130ms 0.2071ms 4.8297 KOps/s 4.9370 KOps/s $\color{#d91a1a}-2.17\%$
test_mod_wrap[compile] 1.7839ms 0.2317ms 4.3168 KOps/s 4.3673 KOps/s $\color{#d91a1a}-1.16\%$
test_mod_wrap[compile-overhead] 0.3686ms 0.2271ms 4.4030 KOps/s 4.4364 KOps/s $\color{#d91a1a}-0.75\%$
test_mod_wrap_and_backward[eager] 14.6133ms 11.8606ms 84.3129 Ops/s 82.8495 Ops/s $\color{#35bf28}+1.77\%$
test_mod_wrap_and_backward[compile] 20.8235ms 12.5109ms 79.9303 Ops/s 80.5995 Ops/s $\color{#d91a1a}-0.83\%$
test_mod_wrap_and_backward[compile-overhead] 12.2360ms 11.0317ms 90.6481 Ops/s 83.8290 Ops/s $\textbf{\color{#35bf28}+8.13\%}$
test_seq_add[eager] 0.1442ms 88.8205μs 11.2587 KOps/s 11.3980 KOps/s $\color{#d91a1a}-1.22\%$
test_seq_add[compile] 0.1561ms 61.3212μs 16.3076 KOps/s 16.3142 KOps/s $\color{#d91a1a}-0.04\%$
test_seq_add[compile-overhead] 0.1508ms 58.5460μs 17.0806 KOps/s 17.1085 KOps/s $\color{#d91a1a}-0.16\%$
test_seq_wrap[eager] 0.5941ms 0.3661ms 2.7312 KOps/s 2.6613 KOps/s $\color{#35bf28}+2.63\%$
test_seq_wrap[compile] 0.6478ms 0.2654ms 3.7674 KOps/s 3.8032 KOps/s $\color{#d91a1a}-0.94\%$
test_seq_wrap[compile-overhead] 0.5013ms 0.2663ms 3.7547 KOps/s 3.8044 KOps/s $\color{#d91a1a}-1.31\%$
test_func_call_runtime[False-eager] 1.0041ms 0.5475ms 1.8265 KOps/s 1.9463 KOps/s $\textbf{\color{#d91a1a}-6.15\%}$
test_func_call_runtime[False-compile] 0.8290ms 0.4984ms 2.0064 KOps/s 2.0215 KOps/s $\color{#d91a1a}-0.75\%$
test_func_call_runtime[False-compile-overhead] 0.7181ms 0.4987ms 2.0052 KOps/s 2.0335 KOps/s $\color{#d91a1a}-1.39\%$
test_func_call_runtime[True-eager] 1.1129ms 0.7680ms 1.3021 KOps/s 1.3480 KOps/s $\color{#d91a1a}-3.40\%$
test_func_call_runtime[True-compile] 0.9340ms 0.5180ms 1.9304 KOps/s 1.9645 KOps/s $\color{#d91a1a}-1.73\%$
test_func_call_runtime[True-compile-overhead] 0.7107ms 0.5136ms 1.9469 KOps/s 1.9706 KOps/s $\color{#d91a1a}-1.20\%$
test_func_call_cm_runtime[False-eager] 0.8946ms 0.5382ms 1.8581 KOps/s 1.9538 KOps/s $\color{#d91a1a}-4.90\%$
test_func_call_cm_runtime[False-compile] 0.6810ms 0.4945ms 2.0222 KOps/s 2.0328 KOps/s $\color{#d91a1a}-0.52\%$
test_func_call_cm_runtime[False-compile-overhead] 0.9222ms 0.5044ms 1.9825 KOps/s 2.0226 KOps/s $\color{#d91a1a}-1.98\%$
test_func_call_cm_runtime[True-eager] 1.0974ms 0.8962ms 1.1158 KOps/s 1.1499 KOps/s $\color{#d91a1a}-2.96\%$
test_func_call_cm_runtime[True-compile] 0.9551ms 0.8462ms 1.1817 KOps/s 1.2110 KOps/s $\color{#d91a1a}-2.42\%$
test_func_call_cm_runtime[True-compile-overhead] 1.1926ms 0.8474ms 1.1801 KOps/s 1.2123 KOps/s $\color{#d91a1a}-2.66\%$
test_distributed 0.2703ms 0.1327ms 7.5331 KOps/s 7.4536 KOps/s $\color{#35bf28}+1.07\%$
test_tdmodule 32.5300μs 17.3746μs 57.5552 KOps/s 54.9233 KOps/s $\color{#35bf28}+4.79\%$
test_tdmodule_dispatch 65.5130μs 36.3523μs 27.5086 KOps/s 26.1153 KOps/s $\textbf{\color{#35bf28}+5.34\%}$
test_tdseq 53.5500μs 19.4651μs 51.3740 KOps/s 49.8866 KOps/s $\color{#35bf28}+2.98\%$
test_tdseq_dispatch 71.5830μs 40.5816μs 24.6417 KOps/s 23.7677 KOps/s $\color{#35bf28}+3.68\%$
test_instantiation_functorch 2.0274ms 1.6682ms 599.4592 Ops/s 613.3389 Ops/s $\color{#d91a1a}-2.26\%$
test_instantiation_td 2.2676ms 1.1987ms 834.2380 Ops/s 846.2210 Ops/s $\color{#d91a1a}-1.42\%$
test_exec_functorch 0.2849ms 0.1843ms 5.4274 KOps/s 5.7553 KOps/s $\textbf{\color{#d91a1a}-5.70\%}$
test_exec_functional_call 0.3115ms 0.1760ms 5.6811 KOps/s 5.9405 KOps/s $\color{#d91a1a}-4.37\%$
test_exec_td 0.3125ms 0.1813ms 5.5164 KOps/s 5.9209 KOps/s $\textbf{\color{#d91a1a}-6.83\%}$
test_exec_td_decorator 1.0112ms 0.2311ms 4.3268 KOps/s 4.5336 KOps/s $\color{#d91a1a}-4.56\%$
test_vmap_mlp_speed[True-True] 0.8991ms 0.5777ms 1.7311 KOps/s 1.7608 KOps/s $\color{#d91a1a}-1.68\%$
test_vmap_mlp_speed[True-False] 1.0702ms 0.5709ms 1.7517 KOps/s 1.7893 KOps/s $\color{#d91a1a}-2.10\%$
test_vmap_mlp_speed[False-True] 0.6545ms 0.4718ms 2.1197 KOps/s 2.1217 KOps/s $\color{#d91a1a}-0.09\%$
test_vmap_mlp_speed[False-False] 0.8501ms 0.4781ms 2.0915 KOps/s 2.1408 KOps/s $\color{#d91a1a}-2.30\%$
test_vmap_mlp_speed_decorator[True-True] 1.0166ms 0.6286ms 1.5907 KOps/s 1.6108 KOps/s $\color{#d91a1a}-1.24\%$
test_vmap_mlp_speed_decorator[True-False] 0.9544ms 0.6238ms 1.6030 KOps/s 1.6255 KOps/s $\color{#d91a1a}-1.38\%$
test_vmap_mlp_speed_decorator[False-True] 0.8601ms 0.5172ms 1.9335 KOps/s 1.9566 KOps/s $\color{#d91a1a}-1.18\%$
test_vmap_mlp_speed_decorator[False-False] 0.9105ms 0.5188ms 1.9275 KOps/s 1.9620 KOps/s $\color{#d91a1a}-1.76\%$
test_to_module_speed[True] 1.6293ms 1.3267ms 753.7726 Ops/s 764.6880 Ops/s $\color{#d91a1a}-1.43\%$
test_to_module_speed[False] 1.7291ms 1.2978ms 770.5110 Ops/s 780.4181 Ops/s $\color{#d91a1a}-1.27\%$
test_tc_init 97.6330μs 46.0590μs 21.7113 KOps/s 21.8749 KOps/s $\color{#d91a1a}-0.75\%$
test_tc_init_nested 0.1814ms 92.5699μs 10.8026 KOps/s 11.0323 KOps/s $\color{#d91a1a}-2.08\%$
test_tc_first_layer_tensor 35.2860μs 1.4577μs 686.0063 KOps/s 713.7629 KOps/s $\color{#d91a1a}-3.89\%$
test_tc_first_layer_nontensor 38.2420μs 4.2698μs 234.2012 KOps/s 236.5951 KOps/s $\color{#d91a1a}-1.01\%$
test_tc_second_layer_tensor 28.2030μs 2.7424μs 364.6457 KOps/s 384.7982 KOps/s $\textbf{\color{#d91a1a}-5.24\%}$
test_tc_second_layer_nontensor 33.0610μs 5.5571μs 179.9511 KOps/s 186.0553 KOps/s $\color{#d91a1a}-3.28\%$
test_unbind 0.4735s 14.4470ms 69.2183 Ops/s 64.9068 Ops/s $\textbf{\color{#35bf28}+6.64\%}$
test_full_like 14.2006ms 8.5700ms 116.6857 Ops/s 120.7441 Ops/s $\color{#d91a1a}-3.36\%$
test_zeros_like 11.6672ms 7.0305ms 142.2375 Ops/s 131.7384 Ops/s $\textbf{\color{#35bf28}+7.97\%}$
test_ones_like 13.2507ms 7.5986ms 131.6033 Ops/s 124.0464 Ops/s $\textbf{\color{#35bf28}+6.09\%}$
test_clone 19.8587ms 9.2960ms 107.5737 Ops/s 102.2571 Ops/s $\textbf{\color{#35bf28}+5.20\%}$
test_squeeze 97.1620μs 12.7882μs 78.1970 KOps/s 76.9457 KOps/s $\color{#35bf28}+1.63\%$
test_unsqueeze 0.1526ms 93.5205μs 10.6928 KOps/s 10.7456 KOps/s $\color{#d91a1a}-0.49\%$
test_split 0.3560ms 0.2015ms 4.9628 KOps/s 5.0547 KOps/s $\color{#d91a1a}-1.82\%$
test_permute 0.3158ms 0.2187ms 4.5716 KOps/s 4.6183 KOps/s $\color{#d91a1a}-1.01\%$
test_stack 33.3918ms 25.8647ms 38.6628 Ops/s 39.0705 Ops/s $\color{#d91a1a}-1.04\%$
test_cat 29.9482ms 25.5082ms 39.2031 Ops/s 38.7598 Ops/s $\color{#35bf28}+1.14\%$

Copy link

github-actions bot commented Aug 1, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 225. Improved: $\large\color{#35bf28}12$. Worsened: $\large\color{#d91a1a}27$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 0.1452ms 17.1688μs 58.2453 KOps/s 60.5768 KOps/s $\color{#d91a1a}-3.85\%$
test_plain_set_stack_nested 35.1900μs 17.3523μs 57.6291 KOps/s 60.5483 KOps/s $\color{#d91a1a}-4.82\%$
test_plain_set_nested_inplace 38.7710μs 18.2326μs 54.8468 KOps/s 56.5368 KOps/s $\color{#d91a1a}-2.99\%$
test_plain_set_stack_nested_inplace 36.9800μs 18.2983μs 54.6498 KOps/s 56.8306 KOps/s $\color{#d91a1a}-3.84\%$
test_items 31.5100μs 4.6626μs 214.4743 KOps/s 215.5523 KOps/s $\color{#d91a1a}-0.50\%$
test_items_nested 0.4119ms 0.3618ms 2.7638 KOps/s 2.7175 KOps/s $\color{#35bf28}+1.70\%$
test_items_nested_locked 0.4196ms 0.3652ms 2.7381 KOps/s 2.6799 KOps/s $\color{#35bf28}+2.17\%$
test_items_nested_leaf 0.1096ms 84.0798μs 11.8935 KOps/s 11.9255 KOps/s $\color{#d91a1a}-0.27\%$
test_items_stack_nested 0.4204ms 0.3677ms 2.7194 KOps/s 2.7285 KOps/s $\color{#d91a1a}-0.33\%$
test_items_stack_nested_leaf 0.1087ms 85.7896μs 11.6564 KOps/s 11.8423 KOps/s $\color{#d91a1a}-1.57\%$
test_items_stack_nested_locked 0.4244ms 0.3694ms 2.7069 KOps/s 2.7554 KOps/s $\color{#d91a1a}-1.76\%$
test_keys 18.7900μs 4.3641μs 229.1444 KOps/s 229.4179 KOps/s $\color{#d91a1a}-0.12\%$
test_keys_nested 88.6710μs 65.7711μs 15.2042 KOps/s 14.9951 KOps/s $\color{#35bf28}+1.39\%$
test_keys_nested_locked 0.6789ms 72.7758μs 13.7408 KOps/s 13.8304 KOps/s $\color{#d91a1a}-0.65\%$
test_keys_nested_leaf 79.2110μs 55.8647μs 17.9004 KOps/s 18.0039 KOps/s $\color{#d91a1a}-0.57\%$
test_keys_stack_nested 85.6910μs 66.1650μs 15.1137 KOps/s 15.1377 KOps/s $\color{#d91a1a}-0.16\%$
test_keys_stack_nested_leaf 76.2710μs 58.0801μs 17.2176 KOps/s 17.3717 KOps/s $\color{#d91a1a}-0.89\%$
test_keys_stack_nested_locked 0.1029ms 72.1056μs 13.8686 KOps/s 14.0025 KOps/s $\color{#d91a1a}-0.96\%$
test_values 9.3637μs 1.8029μs 554.6688 KOps/s 567.9603 KOps/s $\color{#d91a1a}-2.34\%$
test_values_nested 52.3810μs 33.7206μs 29.6554 KOps/s 29.9826 KOps/s $\color{#d91a1a}-1.09\%$
test_values_nested_locked 53.2410μs 35.9476μs 27.8182 KOps/s 28.3084 KOps/s $\color{#d91a1a}-1.73\%$
test_values_nested_leaf 54.1910μs 30.0964μs 33.2265 KOps/s 33.5133 KOps/s $\color{#d91a1a}-0.86\%$
test_values_stack_nested 53.8810μs 34.3808μs 29.0860 KOps/s 29.1728 KOps/s $\color{#d91a1a}-0.30\%$
test_values_stack_nested_leaf 54.4600μs 30.8498μs 32.4151 KOps/s 32.5653 KOps/s $\color{#d91a1a}-0.46\%$
test_values_stack_nested_locked 53.9110μs 36.3649μs 27.4991 KOps/s 27.5257 KOps/s $\color{#d91a1a}-0.10\%$
test_membership 1.3156μs 0.5560μs 1.7986 MOps/s 1.8573 MOps/s $\color{#d91a1a}-3.16\%$
test_membership_nested 9.3405μs 1.9382μs 515.9308 KOps/s 500.8193 KOps/s $\color{#35bf28}+3.02\%$
test_membership_nested_leaf 9.3205μs 1.9364μs 516.4120 KOps/s 516.4297 KOps/s $-0.00\%$
test_membership_stacked_nested 25.3700μs 2.0017μs 499.5769 KOps/s 505.3808 KOps/s $\color{#d91a1a}-1.15\%$
test_membership_stacked_nested_leaf 20.7600μs 2.0029μs 499.2725 KOps/s 499.8722 KOps/s $\color{#d91a1a}-0.12\%$
test_membership_nested_last 16.4900μs 2.9111μs 343.5124 KOps/s 351.5697 KOps/s $\color{#d91a1a}-2.29\%$
test_membership_nested_leaf_last 19.4500μs 2.8770μs 347.5890 KOps/s 344.0058 KOps/s $\color{#35bf28}+1.04\%$
test_membership_stacked_nested_last 21.2900μs 2.9297μs 341.3327 KOps/s 182.8060 KOps/s $\textbf{\color{#35bf28}+86.72\%}$
test_membership_stacked_nested_leaf_last 17.1600μs 2.9429μs 339.8044 KOps/s 183.2134 KOps/s $\textbf{\color{#35bf28}+85.47\%}$
test_nested_getleaf 21.4000μs 8.0410μs 124.3630 KOps/s 125.6175 KOps/s $\color{#d91a1a}-1.00\%$
test_nested_get 22.2900μs 7.4337μs 134.5232 KOps/s 134.0402 KOps/s $\color{#35bf28}+0.36\%$
test_stacked_getleaf 29.7700μs 7.9738μs 125.4112 KOps/s 124.6191 KOps/s $\color{#35bf28}+0.64\%$
test_stacked_get 23.3910μs 7.4892μs 133.5263 KOps/s 133.9974 KOps/s $\color{#d91a1a}-0.35\%$
test_nested_getitemleaf 22.5200μs 8.1951μs 122.0246 KOps/s 123.3105 KOps/s $\color{#d91a1a}-1.04\%$
test_nested_getitem 22.5500μs 7.6398μs 130.8928 KOps/s 131.1404 KOps/s $\color{#d91a1a}-0.19\%$
test_stacked_getitemleaf 24.6010μs 8.1110μs 123.2894 KOps/s 122.2887 KOps/s $\color{#35bf28}+0.82\%$
test_stacked_getitem 22.2300μs 7.6494μs 130.7296 KOps/s 131.2759 KOps/s $\color{#d91a1a}-0.42\%$
test_lock_nested 9.9408ms 0.4823ms 2.0736 KOps/s 2.1431 KOps/s $\color{#d91a1a}-3.24\%$
test_lock_stack_nested 0.4812ms 0.4294ms 2.3287 KOps/s 2.3852 KOps/s $\color{#d91a1a}-2.37\%$
test_unlock_nested 0.8714ms 0.3910ms 2.5577 KOps/s 2.5755 KOps/s $\color{#d91a1a}-0.69\%$
test_unlock_stack_nested 0.4137ms 0.3483ms 2.8707 KOps/s 2.9523 KOps/s $\color{#d91a1a}-2.76\%$
test_flatten_speed 0.5281ms 0.1055ms 9.4752 KOps/s 9.5622 KOps/s $\color{#d91a1a}-0.91\%$
test_unflatten_speed 0.3544ms 0.2877ms 3.4763 KOps/s 3.4636 KOps/s $\color{#35bf28}+0.37\%$
test_common_ops 1.6404ms 1.3127ms 761.8032 Ops/s 743.8186 Ops/s $\color{#35bf28}+2.42\%$
test_creation 17.6310μs 1.6418μs 609.0933 KOps/s 612.3384 KOps/s $\color{#d91a1a}-0.53\%$
test_creation_empty 36.1610μs 17.5815μs 56.8779 KOps/s 61.6590 KOps/s $\textbf{\color{#d91a1a}-7.75\%}$
test_creation_nested_1 43.7700μs 19.6216μs 50.9642 KOps/s 54.4367 KOps/s $\textbf{\color{#d91a1a}-6.38\%}$
test_creation_nested_2 42.9600μs 22.2310μs 44.9822 KOps/s 49.0208 KOps/s $\textbf{\color{#d91a1a}-8.24\%}$
test_clone 58.4410μs 30.1915μs 33.1219 KOps/s 32.0291 KOps/s $\color{#35bf28}+3.41\%$
test_getitem[int] 1.0026ms 17.3358μs 57.6841 KOps/s 60.6296 KOps/s $\color{#d91a1a}-4.86\%$
test_getitem[slice_int] 0.1686ms 29.7400μs 33.6248 KOps/s 35.6359 KOps/s $\textbf{\color{#d91a1a}-5.64\%}$
test_getitem[range] 0.2937ms 0.1148ms 8.7083 KOps/s 8.9239 KOps/s $\color{#d91a1a}-2.42\%$
test_getitem[tuple] 0.1556ms 25.9135μs 38.5899 KOps/s 40.7794 KOps/s $\textbf{\color{#d91a1a}-5.37\%}$
test_getitem[list] 91.7430ms 0.1289ms 7.7563 KOps/s 9.2480 KOps/s $\textbf{\color{#d91a1a}-16.13\%}$
test_setitem_dim[int] 74.9510μs 54.0628μs 18.4970 KOps/s 18.4409 KOps/s $\color{#35bf28}+0.30\%$
test_setitem_dim[slice_int] 98.5310μs 79.3509μs 12.6023 KOps/s 13.4630 KOps/s $\textbf{\color{#d91a1a}-6.39\%}$
test_setitem_dim[range] 0.1739ms 0.1437ms 6.9588 KOps/s 7.3433 KOps/s $\textbf{\color{#d91a1a}-5.24\%}$
test_setitem_dim[tuple] 93.8910μs 71.6000μs 13.9665 KOps/s 14.9486 KOps/s $\textbf{\color{#d91a1a}-6.57\%}$
test_setitem 85.2110μs 43.4212μs 23.0303 KOps/s 23.6018 KOps/s $\color{#d91a1a}-2.42\%$
test_set 73.8710μs 42.8305μs 23.3478 KOps/s 22.6652 KOps/s $\color{#35bf28}+3.01\%$
test_set_shared 0.3711ms 53.8839μs 18.5584 KOps/s 18.8704 KOps/s $\color{#d91a1a}-1.65\%$
test_update 88.8310μs 52.6576μs 18.9906 KOps/s 20.6831 KOps/s $\textbf{\color{#d91a1a}-8.18\%}$
test_update_nested 84.1110μs 60.1112μs 16.6358 KOps/s 16.7196 KOps/s $\color{#d91a1a}-0.50\%$
test_update__nested 94.9720μs 62.8184μs 15.9189 KOps/s 15.4769 KOps/s $\color{#35bf28}+2.86\%$
test_set_nested 67.3210μs 44.8438μs 22.2996 KOps/s 21.1556 KOps/s $\textbf{\color{#35bf28}+5.41\%}$
test_set_nested_new 96.5020μs 49.0605μs 20.3830 KOps/s 19.7921 KOps/s $\color{#35bf28}+2.99\%$
test_select 85.8020μs 64.9340μs 15.4003 KOps/s 15.1343 KOps/s $\color{#35bf28}+1.76\%$
test_select_nested 71.8010μs 53.2929μs 18.7642 KOps/s 19.2987 KOps/s $\color{#d91a1a}-2.77\%$
test_exclude_nested 0.1090ms 70.5635μs 14.1716 KOps/s 14.2565 KOps/s $\color{#d91a1a}-0.60\%$
test_empty[True] 0.3372ms 0.2860ms 3.4967 KOps/s 3.5757 KOps/s $\color{#d91a1a}-2.21\%$
test_empty[False] 2.3511μs 0.8547μs 1.1699 MOps/s 1.1451 MOps/s $\color{#35bf28}+2.16\%$
test_to 66.8010μs 39.8922μs 25.0675 KOps/s 25.2802 KOps/s $\color{#d91a1a}-0.84\%$
test_to_nonblocking 47.5610μs 25.7135μs 38.8900 KOps/s 40.8593 KOps/s $\color{#d91a1a}-4.82\%$
test_unbind_speed 1.4269ms 0.3046ms 3.2825 KOps/s 3.3723 KOps/s $\color{#d91a1a}-2.66\%$
test_unbind_speed_stack0 0.3527ms 0.3019ms 3.3129 KOps/s 3.3891 KOps/s $\color{#d91a1a}-2.25\%$
test_unbind_speed_stack1 90.7687ms 0.7689ms 1.3005 KOps/s 1.3249 KOps/s $\color{#d91a1a}-1.84\%$
test_split 91.4186ms 2.3634ms 423.1141 Ops/s 444.1235 Ops/s $\color{#d91a1a}-4.73\%$
test_chunk 2.3244ms 2.1654ms 461.8020 Ops/s 441.4737 Ops/s $\color{#35bf28}+4.60\%$
test_creation[device0] 0.1559ms 0.1028ms 9.7245 KOps/s 9.6952 KOps/s $\color{#35bf28}+0.30\%$
test_creation_from_tensor 0.1610ms 0.1012ms 9.8813 KOps/s 9.4207 KOps/s $\color{#35bf28}+4.89\%$
test_add_one[memmap_tensor0] 78.7910μs 8.6094μs 116.1524 KOps/s 113.3005 KOps/s $\color{#35bf28}+2.52\%$
test_contiguous[memmap_tensor0] 27.4610μs 2.1585μs 463.2862 KOps/s 462.7545 KOps/s $\color{#35bf28}+0.11\%$
test_stack[memmap_tensor0] 31.0100μs 6.5435μs 152.8236 KOps/s 156.6894 KOps/s $\color{#d91a1a}-2.47\%$
test_memmaptd_index 1.0657ms 0.4296ms 2.3276 KOps/s 2.3722 KOps/s $\color{#d91a1a}-1.88\%$
test_memmaptd_index_astensor 0.8477ms 0.4968ms 2.0130 KOps/s 2.0503 KOps/s $\color{#d91a1a}-1.82\%$
test_memmaptd_index_op 1.4515ms 1.0597ms 943.7055 Ops/s 1.0060 KOps/s $\textbf{\color{#d91a1a}-6.19\%}$
test_serialize_model 93.3712ms 89.3092ms 11.1971 Ops/s 10.8638 Ops/s $\color{#35bf28}+3.07\%$
test_serialize_model_pickle 1.3540s 1.2363s 0.8089 Ops/s 0.8084 Ops/s $\color{#35bf28}+0.06\%$
test_serialize_weights 89.8706ms 85.9446ms 11.6354 Ops/s 11.0624 Ops/s $\textbf{\color{#35bf28}+5.18\%}$
test_serialize_weights_returnearly 0.2098s 65.1776ms 15.3427 Ops/s 19.1596 Ops/s $\textbf{\color{#d91a1a}-19.92\%}$
test_serialize_weights_pickle 1.4079s 1.2439s 0.8039 Ops/s 0.8082 Ops/s $\color{#d91a1a}-0.53\%$
test_reshape_pytree 0.2151ms 38.9845μs 25.6512 KOps/s 26.4096 KOps/s $\color{#d91a1a}-2.87\%$
test_reshape_td 78.8610μs 44.0584μs 22.6972 KOps/s 23.4596 KOps/s $\color{#d91a1a}-3.25\%$
test_view_pytree 56.1000μs 37.6972μs 26.5271 KOps/s 26.9682 KOps/s $\color{#d91a1a}-1.64\%$
test_view_td 78.1220μs 50.5385μs 19.7869 KOps/s 19.9865 KOps/s $\color{#d91a1a}-1.00\%$
test_unbind_pytree 70.7710μs 36.8133μs 27.1641 KOps/s 27.6955 KOps/s $\color{#d91a1a}-1.92\%$
test_unbind_td 0.4215ms 46.0442μs 21.7183 KOps/s 22.8011 KOps/s $\color{#d91a1a}-4.75\%$
test_split_pytree 82.9420μs 51.9336μs 19.2554 KOps/s 19.7798 KOps/s $\color{#d91a1a}-2.65\%$
test_split_td 92.0757ms 69.9481μs 14.2963 KOps/s 17.5668 KOps/s $\textbf{\color{#d91a1a}-18.62\%}$
test_add_pytree 0.1119ms 60.2328μs 16.6023 KOps/s 17.2957 KOps/s $\color{#d91a1a}-4.01\%$
test_add_td 0.1401ms 97.9020μs 10.2143 KOps/s 10.6486 KOps/s $\color{#d91a1a}-4.08\%$
test_compile_add_one_nested[tensordict-compile] 0.4190ms 0.2179ms 4.5886 KOps/s 4.6433 KOps/s $\color{#d91a1a}-1.18\%$
test_compile_add_one_nested[tensordict-eager] 0.2619ms 0.1731ms 5.7764 KOps/s 5.7550 KOps/s $\color{#35bf28}+0.37\%$
test_compile_add_one_nested[pytree-compile] 0.1918ms 0.1483ms 6.7411 KOps/s 6.7852 KOps/s $\color{#d91a1a}-0.65\%$
test_compile_add_one_nested[pytree-eager] 0.2497ms 0.1978ms 5.0557 KOps/s 5.2897 KOps/s $\color{#d91a1a}-4.42\%$
test_compile_copy_nested[tensordict-compile] 49.1510μs 22.6222μs 44.2044 KOps/s 44.0308 KOps/s $\color{#35bf28}+0.39\%$
test_compile_copy_nested[tensordict-eager] 74.4010μs 47.8051μs 20.9183 KOps/s 20.9825 KOps/s $\color{#d91a1a}-0.31\%$
test_compile_copy_nested[pytree-compile] 0.1297ms 75.0283μs 13.3283 KOps/s 13.3850 KOps/s $\color{#d91a1a}-0.42\%$
test_compile_copy_nested[pytree-eager] 84.1410μs 60.1435μs 16.6269 KOps/s 16.7399 KOps/s $\color{#d91a1a}-0.68\%$
test_compile_add_one_flat[tensordict-compile] 0.4261ms 0.3355ms 2.9808 KOps/s 3.0255 KOps/s $\color{#d91a1a}-1.48\%$
test_compile_add_one_flat[tensordict-eager] 0.2977ms 0.2236ms 4.4718 KOps/s 4.5507 KOps/s $\color{#d91a1a}-1.73\%$
test_compile_add_one_flat[tensorclass-compile] 0.2074ms 0.1332ms 7.5090 KOps/s 7.3362 KOps/s $\color{#35bf28}+2.36\%$
test_compile_add_one_flat[tensorclass-eager] 0.1363ms 62.5719μs 15.9816 KOps/s 15.7176 KOps/s $\color{#35bf28}+1.68\%$
test_compile_add_one_flat[pytree-compile] 0.4132ms 0.3318ms 3.0137 KOps/s 3.0406 KOps/s $\color{#d91a1a}-0.88\%$
test_compile_add_one_flat[pytree-eager] 0.7074ms 0.6445ms 1.5516 KOps/s 1.5560 KOps/s $\color{#d91a1a}-0.29\%$
test_compile_add_self_flat[tensordict-eager] 0.3208ms 0.2702ms 3.7006 KOps/s 3.7477 KOps/s $\color{#d91a1a}-1.26\%$
test_compile_add_self_flat[tensordict-compile] 0.3764ms 0.3347ms 2.9874 KOps/s 3.0114 KOps/s $\color{#d91a1a}-0.80\%$
test_compile_add_self_flat[tensorclass-eager] 0.1628ms 74.8124μs 13.3668 KOps/s 13.5527 KOps/s $\color{#d91a1a}-1.37\%$
test_compile_add_self_flat[tensorclass-compile] 0.1833ms 0.1321ms 7.5705 KOps/s 7.3801 KOps/s $\color{#35bf28}+2.58\%$
test_compile_add_self_flat[pytree-eager] 0.6300ms 0.5494ms 1.8200 KOps/s 1.9179 KOps/s $\textbf{\color{#d91a1a}-5.10\%}$
test_compile_add_self_flat[pytree-compile] 0.3947ms 0.3318ms 3.0137 KOps/s 3.0455 KOps/s $\color{#d91a1a}-1.04\%$
test_compile_copy_flat[tensordict-compile] 45.7710μs 19.2900μs 51.8403 KOps/s 54.6969 KOps/s $\textbf{\color{#d91a1a}-5.22\%}$
test_compile_copy_flat[tensordict-eager] 57.9210μs 32.3982μs 30.8659 KOps/s 30.7354 KOps/s $\color{#35bf28}+0.42\%$
test_compile_copy_flat[pytree-compile] 0.1037ms 76.7806μs 13.0241 KOps/s 13.0837 KOps/s $\color{#d91a1a}-0.46\%$
test_compile_copy_flat[pytree-eager] 83.6120μs 60.7289μs 16.4666 KOps/s 16.6099 KOps/s $\color{#d91a1a}-0.86\%$
test_compile_assign_and_add[tensordict-compile] 2.4802ms 0.8560ms 1.1682 KOps/s 1.1048 KOps/s $\textbf{\color{#35bf28}+5.74\%}$
test_compile_assign_and_add[tensordict-eager] 3.8207ms 3.3642ms 297.2489 Ops/s 314.9497 Ops/s $\textbf{\color{#d91a1a}-5.62\%}$
test_compile_assign_and_add[pytree-compile] 2.4269ms 0.8429ms 1.1863 KOps/s 1.1109 KOps/s $\textbf{\color{#35bf28}+6.79\%}$
test_compile_assign_and_add[pytree-eager] 3.7679ms 3.4242ms 292.0373 Ops/s 316.4868 Ops/s $\textbf{\color{#d91a1a}-7.73\%}$
test_compile_indexing[tensor-tensordict-compile] 0.2566ms 0.1151ms 8.6916 KOps/s 8.8621 KOps/s $\color{#d91a1a}-1.92\%$
test_compile_indexing[tensor-tensordict-eager] 0.2245ms 61.1816μs 16.3448 KOps/s 15.1674 KOps/s $\textbf{\color{#35bf28}+7.76\%}$
test_compile_indexing[tensor-tensorclass-compile] 0.1569ms 0.1044ms 9.5797 KOps/s 9.6138 KOps/s $\color{#d91a1a}-0.35\%$
test_compile_indexing[tensor-tensorclass-eager] 86.4910μs 48.3259μs 20.6928 KOps/s 22.2454 KOps/s $\textbf{\color{#d91a1a}-6.98\%}$
test_compile_indexing[tensor-pytree-compile] 0.1753ms 0.1091ms 9.1622 KOps/s 9.6090 KOps/s $\color{#d91a1a}-4.65\%$
test_compile_indexing[tensor-pytree-eager] 76.4610μs 45.5733μs 21.9427 KOps/s 22.2738 KOps/s $\color{#d91a1a}-1.49\%$
test_compile_indexing[slice-tensordict-compile] 0.1839ms 0.1409ms 7.0961 KOps/s 7.1595 KOps/s $\color{#d91a1a}-0.89\%$
test_compile_indexing[slice-tensordict-eager] 0.1807ms 25.8464μs 38.6900 KOps/s 39.5489 KOps/s $\color{#d91a1a}-2.17\%$
test_compile_indexing[slice-tensorclass-compile] 0.1699ms 0.1320ms 7.5758 KOps/s 7.5693 KOps/s $\color{#35bf28}+0.09\%$
test_compile_indexing[slice-tensorclass-eager] 45.2510μs 22.7165μs 44.0209 KOps/s 45.6780 KOps/s $\color{#d91a1a}-3.63\%$
test_compile_indexing[slice-pytree-compile] 0.1703ms 0.1322ms 7.5663 KOps/s 7.5829 KOps/s $\color{#d91a1a}-0.22\%$
test_compile_indexing[slice-pytree-eager] 48.4900μs 22.5793μs 44.2884 KOps/s 45.1927 KOps/s $\color{#d91a1a}-2.00\%$
test_compile_indexing[int-tensordict-compile] 0.2229ms 0.1402ms 7.1342 KOps/s 7.2107 KOps/s $\color{#d91a1a}-1.06\%$
test_compile_indexing[int-tensordict-eager] 0.4835ms 28.0589μs 35.6393 KOps/s 39.6466 KOps/s $\textbf{\color{#d91a1a}-10.11\%}$
test_compile_indexing[int-tensorclass-compile] 0.1721ms 0.1320ms 7.5753 KOps/s 7.5761 KOps/s $\color{#d91a1a}-0.01\%$
test_compile_indexing[int-tensorclass-eager] 52.3110μs 22.2452μs 44.9536 KOps/s 45.3700 KOps/s $\color{#d91a1a}-0.92\%$
test_compile_indexing[int-pytree-compile] 0.1673ms 0.1320ms 7.5785 KOps/s 7.6427 KOps/s $\color{#d91a1a}-0.84\%$
test_compile_indexing[int-pytree-eager] 44.7910μs 22.4287μs 44.5858 KOps/s 45.6774 KOps/s $\color{#d91a1a}-2.39\%$
test_mod_add[eager] 74.8610μs 38.6019μs 25.9054 KOps/s 27.3775 KOps/s $\textbf{\color{#d91a1a}-5.38\%}$
test_mod_add[compile] 0.1125ms 73.7329μs 13.5625 KOps/s 14.4370 KOps/s $\textbf{\color{#d91a1a}-6.06\%}$
test_mod_add[compile-overhead] 0.2817ms 0.1382ms 7.2375 KOps/s 6.5637 KOps/s $\textbf{\color{#35bf28}+10.27\%}$
test_mod_wrap[eager] 0.3290ms 0.2509ms 3.9852 KOps/s 3.9837 KOps/s $\color{#35bf28}+0.04\%$
test_mod_wrap[compile] 1.0594ms 0.2931ms 3.4115 KOps/s 3.3005 KOps/s $\color{#35bf28}+3.36\%$
test_mod_wrap[compile-overhead] 8.2427ms 4.3623ms 229.2383 Ops/s 232.0157 Ops/s $\color{#d91a1a}-1.20\%$
test_mod_wrap_and_backward[eager] 1.4371ms 1.3303ms 751.7312 Ops/s 703.4139 Ops/s $\textbf{\color{#35bf28}+6.87\%}$
test_mod_wrap_and_backward[compile] 1.4543ms 1.3319ms 750.8101 Ops/s 693.6235 Ops/s $\textbf{\color{#35bf28}+8.24\%}$
test_mod_wrap_and_backward[compile-overhead] 1.3245ms 0.8977ms 1.1140 KOps/s 994.6725 Ops/s $\textbf{\color{#35bf28}+11.99\%}$
test_seq_add[eager] 0.1650ms 0.1116ms 8.9587 KOps/s 9.1435 KOps/s $\color{#d91a1a}-2.02\%$
test_seq_add[compile] 0.1528ms 88.8043μs 11.2607 KOps/s 11.6042 KOps/s $\color{#d91a1a}-2.96\%$
test_seq_add[compile-overhead] 0.1626ms 0.1236ms 8.0938 KOps/s 8.0839 KOps/s $\color{#35bf28}+0.12\%$
test_seq_wrap[eager] 0.5089ms 0.4365ms 2.2911 KOps/s 2.4010 KOps/s $\color{#d91a1a}-4.58\%$
test_seq_wrap[compile] 0.3933ms 0.3304ms 3.0267 KOps/s 3.0466 KOps/s $\color{#d91a1a}-0.65\%$
test_seq_wrap[compile-overhead] 0.1926s 89.5843ms 11.1627 Ops/s 7.9022 Ops/s $\textbf{\color{#35bf28}+41.26\%}$
test_func_call_runtime[False-eager] 0.8144ms 0.7446ms 1.3430 KOps/s 1.3081 KOps/s $\color{#35bf28}+2.67\%$
test_func_call_runtime[False-compile] 0.9099ms 0.8126ms 1.2307 KOps/s 1.2464 KOps/s $\color{#d91a1a}-1.26\%$
test_func_call_runtime[False-compile-overhead] 0.4215ms 0.3687ms 2.7120 KOps/s 2.7149 KOps/s $\color{#d91a1a}-0.10\%$
test_func_call_runtime[True-eager] 1.0939ms 0.9495ms 1.0532 KOps/s 1.0845 KOps/s $\color{#d91a1a}-2.88\%$
test_func_call_runtime[True-compile] 0.9740ms 0.8561ms 1.1681 KOps/s 1.1792 KOps/s $\color{#d91a1a}-0.93\%$
test_func_call_runtime[True-compile-overhead] 0.4605ms 0.4166ms 2.4001 KOps/s 2.4202 KOps/s $\color{#d91a1a}-0.83\%$
test_func_call_cm_runtime[False-eager] 0.8703ms 0.7974ms 1.2541 KOps/s 1.2613 KOps/s $\color{#d91a1a}-0.57\%$
test_func_call_cm_runtime[False-compile] 0.9059ms 0.8401ms 1.1904 KOps/s 1.2320 KOps/s $\color{#d91a1a}-3.38\%$
test_func_call_cm_runtime[False-compile-overhead] 0.5535ms 0.3708ms 2.6968 KOps/s 2.7328 KOps/s $\color{#d91a1a}-1.32\%$
test_func_call_cm_runtime[True-eager] 1.1769ms 1.0550ms 947.8695 Ops/s 963.8055 Ops/s $\color{#d91a1a}-1.65\%$
test_func_call_cm_runtime[True-compile] 1.1360ms 1.0408ms 960.8086 Ops/s 987.6768 Ops/s $\color{#d91a1a}-2.72\%$
test_func_call_cm_runtime[True-compile-overhead] 1.1253ms 1.0187ms 981.6905 Ops/s 989.1326 Ops/s $\color{#d91a1a}-0.75\%$
test_distributed 1.1791ms 72.9342μs 13.7110 KOps/s 14.2549 KOps/s $\color{#d91a1a}-3.82\%$
test_tdmodule 32.2200μs 16.6801μs 59.9515 KOps/s 64.7004 KOps/s $\textbf{\color{#d91a1a}-7.34\%}$
test_tdmodule_dispatch 49.9810μs 33.8524μs 29.5400 KOps/s 31.8952 KOps/s $\textbf{\color{#d91a1a}-7.38\%}$
test_tdseq 33.5210μs 16.9521μs 58.9898 KOps/s 62.5089 KOps/s $\textbf{\color{#d91a1a}-5.63\%}$
test_tdseq_dispatch 52.9400μs 35.4881μs 28.1785 KOps/s 29.9008 KOps/s $\textbf{\color{#d91a1a}-5.76\%}$
test_instantiation_functorch 2.1209ms 2.0238ms 494.1125 Ops/s 496.1680 Ops/s $\color{#d91a1a}-0.41\%$
test_instantiation_td 2.0000ms 1.3180ms 758.7345 Ops/s 768.0839 Ops/s $\color{#d91a1a}-1.22\%$
test_exec_functorch 0.2801ms 0.2200ms 4.5452 KOps/s 4.6882 KOps/s $\color{#d91a1a}-3.05\%$
test_exec_functional_call 0.2976ms 0.2247ms 4.4512 KOps/s 4.7883 KOps/s $\textbf{\color{#d91a1a}-7.04\%}$
test_exec_td 0.2864ms 0.2257ms 4.4314 KOps/s 4.6417 KOps/s $\color{#d91a1a}-4.53\%$
test_exec_td_decorator 0.4238ms 0.2700ms 3.7036 KOps/s 3.7312 KOps/s $\color{#d91a1a}-0.74\%$
test_vmap_mlp_speed[True-True] 1.1925ms 0.6431ms 1.5549 KOps/s 1.5603 KOps/s $\color{#d91a1a}-0.35\%$
test_vmap_mlp_speed[True-False] 0.7164ms 0.6383ms 1.5666 KOps/s 1.5901 KOps/s $\color{#d91a1a}-1.48\%$
test_vmap_mlp_speed[False-True] 0.6816ms 0.5727ms 1.7462 KOps/s 1.7924 KOps/s $\color{#d91a1a}-2.58\%$
test_vmap_mlp_speed[False-False] 0.6498ms 0.5757ms 1.7370 KOps/s 1.7493 KOps/s $\color{#d91a1a}-0.70\%$
test_vmap_mlp_speed_decorator[True-True] 1.2953ms 0.6850ms 1.4599 KOps/s 1.4680 KOps/s $\color{#d91a1a}-0.55\%$
test_vmap_mlp_speed_decorator[True-False] 0.7845ms 0.6856ms 1.4586 KOps/s 1.4627 KOps/s $\color{#d91a1a}-0.28\%$
test_vmap_mlp_speed_decorator[False-True] 0.7534ms 0.6025ms 1.6596 KOps/s 1.6794 KOps/s $\color{#d91a1a}-1.17\%$
test_vmap_mlp_speed_decorator[False-False] 0.7386ms 0.6032ms 1.6578 KOps/s 1.6720 KOps/s $\color{#d91a1a}-0.85\%$
test_vmap_transformer_speed[True-True] 8.8291ms 8.5372ms 117.1349 Ops/s 117.8211 Ops/s $\color{#d91a1a}-0.58\%$
test_vmap_transformer_speed[True-False] 9.9652ms 8.5758ms 116.6072 Ops/s 117.7688 Ops/s $\color{#d91a1a}-0.99\%$
test_vmap_transformer_speed[False-True] 8.8983ms 8.5014ms 117.6283 Ops/s 119.4049 Ops/s $\color{#d91a1a}-1.49\%$
test_vmap_transformer_speed[False-False] 8.9407ms 8.4480ms 118.3708 Ops/s 119.1965 Ops/s $\color{#d91a1a}-0.69\%$
test_vmap_transformer_speed_decorator[True-True] 21.0236ms 20.1986ms 49.5083 Ops/s 49.9248 Ops/s $\color{#d91a1a}-0.83\%$
test_vmap_transformer_speed_decorator[True-False] 21.0009ms 20.1243ms 49.6912 Ops/s 50.0024 Ops/s $\color{#d91a1a}-0.62\%$
test_vmap_transformer_speed_decorator[False-True] 21.4750ms 20.0143ms 49.9642 Ops/s 50.4852 Ops/s $\color{#d91a1a}-1.03\%$
test_vmap_transformer_speed_decorator[False-False] 20.8289ms 19.9782ms 50.0546 Ops/s 50.4436 Ops/s $\color{#d91a1a}-0.77\%$
test_to_module_speed[True] 1.2859ms 1.1592ms 862.6663 Ops/s 872.9088 Ops/s $\color{#d91a1a}-1.17\%$
test_to_module_speed[False] 1.2239ms 1.1246ms 889.1840 Ops/s 877.9578 Ops/s $\color{#35bf28}+1.28\%$
test_tc_init 59.7210μs 39.7704μs 25.1443 KOps/s 27.0998 KOps/s $\textbf{\color{#d91a1a}-7.22\%}$
test_tc_init_nested 0.1442ms 81.1498μs 12.3229 KOps/s 12.9413 KOps/s $\color{#d91a1a}-4.78\%$
test_tc_first_layer_tensor 3.4167μs 0.8096μs 1.2352 MOps/s 1.2645 MOps/s $\color{#d91a1a}-2.32\%$
test_tc_first_layer_nontensor 18.2210μs 2.5377μs 394.0649 KOps/s 391.9895 KOps/s $\color{#35bf28}+0.53\%$
test_tc_second_layer_tensor 6.8467μs 1.5967μs 626.2874 KOps/s 620.2630 KOps/s $\color{#35bf28}+0.97\%$
test_tc_second_layer_nontensor 15.5910μs 3.3882μs 295.1440 KOps/s 298.8527 KOps/s $\color{#d91a1a}-1.24\%$
test_unbind 0.1814s 12.0960ms 82.6717 Ops/s 83.5464 Ops/s $\color{#d91a1a}-1.05\%$
test_full_like 0.6566ms 0.5768ms 1.7336 KOps/s 1.7359 KOps/s $\color{#d91a1a}-0.13\%$
test_zeros_like 0.2727ms 0.1978ms 5.0559 KOps/s 5.0568 KOps/s $\color{#d91a1a}-0.02\%$
test_ones_like 0.2312ms 0.1976ms 5.0601 KOps/s 5.0627 KOps/s $\color{#d91a1a}-0.05\%$
test_clone 0.4455ms 0.4154ms 2.4076 KOps/s 2.4160 KOps/s $\color{#d91a1a}-0.35\%$
test_squeeze 27.3500μs 10.5843μs 94.4799 KOps/s 90.0719 KOps/s $\color{#35bf28}+4.89\%$
test_unsqueeze 0.2478ms 80.5709μs 12.4114 KOps/s 12.4533 KOps/s $\color{#d91a1a}-0.34\%$
test_split 0.4673ms 0.1770ms 5.6485 KOps/s 5.7955 KOps/s $\color{#d91a1a}-2.54\%$
test_permute 0.2439ms 0.1904ms 5.2522 KOps/s 5.3502 KOps/s $\color{#d91a1a}-1.83\%$
test_stack 1.2557ms 0.8936ms 1.1191 KOps/s 1.1069 KOps/s $\color{#35bf28}+1.10\%$
test_cat 1.2622ms 1.2312ms 812.1852 Ops/s 812.2604 Ops/s $-0.01\%$

@vmoens vmoens merged commit b988f25 into main Aug 1, 2024
51 of 55 checks passed
@vmoens vmoens deleted the make-non-tensor-callable branch August 1, 2024 20:21
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. enhancement New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[Feature Request] functiondict/opdict: TensorDict of Callables for apply()
2 participants