Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BugFix] Fix number check in array parsing (np>=2 compatibility) #999

Merged
merged 3 commits into from
Sep 17, 2024

Conversation

vmoens
Copy link
Contributor

@vmoens vmoens commented Sep 17, 2024

[ghstack-poisoned]
vmoens added a commit that referenced this pull request Sep 17, 2024
ghstack-source-id: fa8f983381fd1019f9bd371389f728d84140a80c
Pull Request resolved: #999
@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Sep 17, 2024
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Sep 17, 2024
ghstack-source-id: 867b59d12577ce4e84a0aa0bd17afcc135ed291c
Pull Request resolved: #999
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Sep 17, 2024
ghstack-source-id: bdf9ab4f4c61ef5a407d76ebd5433d0d6a897b59
Pull Request resolved: #999
Copy link

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 222. Improved: $\large\color{#35bf28}29$. Worsened: $\large\color{#d91a1a}12$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 38.4210μs 19.8095μs 50.4809 KOps/s 48.5027 KOps/s $\color{#35bf28}+4.08\%$
test_plain_set_stack_nested 71.1930μs 19.7886μs 50.5341 KOps/s 48.0801 KOps/s $\textbf{\color{#35bf28}+5.10\%}$
test_plain_set_nested_inplace 65.4920μs 21.6906μs 46.1029 KOps/s 44.6459 KOps/s $\color{#35bf28}+3.26\%$
test_plain_set_stack_nested_inplace 0.1504ms 21.8656μs 45.7339 KOps/s 44.0759 KOps/s $\color{#35bf28}+3.76\%$
test_items 42.6990μs 4.3780μs 228.4152 KOps/s 243.9068 KOps/s $\textbf{\color{#d91a1a}-6.35\%}$
test_items_nested 0.7459ms 0.3567ms 2.8037 KOps/s 2.8016 KOps/s $\color{#35bf28}+0.07\%$
test_items_nested_locked 0.8277ms 0.3576ms 2.7963 KOps/s 2.7966 KOps/s $\color{#d91a1a}-0.01\%$
test_items_nested_leaf 0.1527ms 69.6365μs 14.3603 KOps/s 14.5355 KOps/s $\color{#d91a1a}-1.21\%$
test_items_stack_nested 0.6451ms 0.3593ms 2.7834 KOps/s 2.7460 KOps/s $\color{#35bf28}+1.36\%$
test_items_stack_nested_leaf 0.1301ms 71.0671μs 14.0712 KOps/s 14.3239 KOps/s $\color{#d91a1a}-1.76\%$
test_items_stack_nested_locked 0.6248ms 0.3578ms 2.7950 KOps/s 2.7856 KOps/s $\color{#35bf28}+0.34\%$
test_keys 37.8710μs 3.9717μs 251.7826 KOps/s 282.8754 KOps/s $\textbf{\color{#d91a1a}-10.99\%}$
test_keys_nested 0.1769ms 0.1027ms 9.7410 KOps/s 10.1439 KOps/s $\color{#d91a1a}-3.97\%$
test_keys_nested_locked 0.7169ms 0.1075ms 9.2983 KOps/s 9.5089 KOps/s $\color{#d91a1a}-2.22\%$
test_keys_nested_leaf 0.1522ms 82.4981μs 12.1215 KOps/s 12.1692 KOps/s $\color{#d91a1a}-0.39\%$
test_keys_stack_nested 0.1695ms 99.6175μs 10.0384 KOps/s 10.0796 KOps/s $\color{#d91a1a}-0.41\%$
test_keys_stack_nested_leaf 0.1416ms 81.3795μs 12.2881 KOps/s 12.2503 KOps/s $\color{#35bf28}+0.31\%$
test_keys_stack_nested_locked 0.1825ms 0.1061ms 9.4223 KOps/s 9.5436 KOps/s $\color{#d91a1a}-1.27\%$
test_values 11.2335μs 1.1671μs 856.8564 KOps/s 900.5919 KOps/s $\color{#d91a1a}-4.86\%$
test_values_nested 0.1236ms 74.5054μs 13.4218 KOps/s 13.5544 KOps/s $\color{#d91a1a}-0.98\%$
test_values_nested_locked 0.1362ms 74.2625μs 13.4657 KOps/s 13.6504 KOps/s $\color{#d91a1a}-1.35\%$
test_values_nested_leaf 0.1105ms 61.8846μs 16.1591 KOps/s 16.0812 KOps/s $\color{#35bf28}+0.48\%$
test_values_stack_nested 0.1581ms 74.9483μs 13.3425 KOps/s 13.6492 KOps/s $\color{#d91a1a}-2.25\%$
test_values_stack_nested_leaf 0.1140ms 61.6335μs 16.2249 KOps/s 16.1042 KOps/s $\color{#35bf28}+0.75\%$
test_values_stack_nested_locked 0.1365ms 75.3324μs 13.2745 KOps/s 13.6343 KOps/s $\color{#d91a1a}-2.64\%$
test_membership 30.0160μs 0.8471μs 1.1804 MOps/s 1.1144 MOps/s $\textbf{\color{#35bf28}+5.93\%}$
test_membership_nested 31.4280μs 2.6828μs 372.7439 KOps/s 365.2778 KOps/s $\color{#35bf28}+2.04\%$
test_membership_nested_leaf 22.4620μs 2.7145μs 368.3910 KOps/s 362.7005 KOps/s $\color{#35bf28}+1.57\%$
test_membership_stacked_nested 30.5280μs 2.7370μs 365.3681 KOps/s 363.2645 KOps/s $\color{#35bf28}+0.58\%$
test_membership_stacked_nested_leaf 36.2580μs 2.7246μs 367.0199 KOps/s 359.9597 KOps/s $\color{#35bf28}+1.96\%$
test_membership_nested_last 22.9720μs 3.8578μs 259.2134 KOps/s 251.1625 KOps/s $\color{#35bf28}+3.21\%$
test_membership_nested_leaf_last 32.5010μs 3.8884μs 257.1783 KOps/s 249.7245 KOps/s $\color{#35bf28}+2.98\%$
test_membership_stacked_nested_last 43.7020μs 3.8478μs 259.8899 KOps/s 253.2176 KOps/s $\color{#35bf28}+2.64\%$
test_membership_stacked_nested_leaf_last 29.1550μs 3.8901μs 257.0639 KOps/s 254.1571 KOps/s $\color{#35bf28}+1.14\%$
test_nested_getleaf 0.1401ms 10.6369μs 94.0121 KOps/s 95.3581 KOps/s $\color{#d91a1a}-1.41\%$
test_nested_get 0.1646ms 10.3509μs 96.6101 KOps/s 99.8596 KOps/s $\color{#d91a1a}-3.25\%$
test_stacked_getleaf 0.1723ms 10.8947μs 91.7874 KOps/s 95.7451 KOps/s $\color{#d91a1a}-4.13\%$
test_stacked_get 44.6530μs 10.0426μs 99.5753 KOps/s 100.9735 KOps/s $\color{#d91a1a}-1.38\%$
test_nested_getitemleaf 54.0710μs 10.9716μs 91.1446 KOps/s 90.4454 KOps/s $\color{#35bf28}+0.77\%$
test_nested_getitem 45.8260μs 10.2669μs 97.4004 KOps/s 99.6818 KOps/s $\color{#d91a1a}-2.29\%$
test_stacked_getitemleaf 0.2400ms 10.8786μs 91.9234 KOps/s 90.9965 KOps/s $\color{#35bf28}+1.02\%$
test_stacked_getitem 0.1277ms 10.2610μs 97.4563 KOps/s 99.1017 KOps/s $\color{#d91a1a}-1.66\%$
test_lock_nested 91.5955ms 0.5788ms 1.7277 KOps/s 2.0753 KOps/s $\textbf{\color{#d91a1a}-16.75\%}$
test_lock_stack_nested 0.6038ms 0.4594ms 2.1769 KOps/s 2.1951 KOps/s $\color{#d91a1a}-0.83\%$
test_unlock_nested 99.2661ms 0.5035ms 1.9859 KOps/s 2.4915 KOps/s $\textbf{\color{#d91a1a}-20.29\%}$
test_unlock_stack_nested 0.5561ms 0.3735ms 2.6776 KOps/s 2.6616 KOps/s $\color{#35bf28}+0.60\%$
test_flatten_speed 0.2197ms 89.5695μs 11.1645 KOps/s 11.3618 KOps/s $\color{#d91a1a}-1.74\%$
test_unflatten_speed 0.7446ms 0.4657ms 2.1473 KOps/s 2.1781 KOps/s $\color{#d91a1a}-1.42\%$
test_common_ops 2.2889ms 1.1151ms 896.7781 Ops/s 863.6591 Ops/s $\color{#35bf28}+3.83\%$
test_creation 27.5120μs 2.1971μs 455.1432 KOps/s 469.7966 KOps/s $\color{#d91a1a}-3.12\%$
test_creation_empty 85.5100μs 17.4895μs 57.1771 KOps/s 54.0268 KOps/s $\textbf{\color{#35bf28}+5.83\%}$
test_creation_nested_1 0.1002ms 20.8838μs 47.8840 KOps/s 45.7117 KOps/s $\color{#35bf28}+4.75\%$
test_creation_nested_2 79.4080μs 24.7883μs 40.3416 KOps/s 38.4066 KOps/s $\textbf{\color{#35bf28}+5.04\%}$
test_clone 1.3805ms 16.4232μs 60.8896 KOps/s 58.6224 KOps/s $\color{#35bf28}+3.87\%$
test_getitem[int] 0.7365ms 16.8905μs 59.2049 KOps/s 60.3035 KOps/s $\color{#d91a1a}-1.82\%$
test_getitem[slice_int] 0.1595ms 30.0362μs 33.2932 KOps/s 32.9113 KOps/s $\color{#35bf28}+1.16\%$
test_getitem[range] 0.2027ms 55.1203μs 18.1421 KOps/s 17.2967 KOps/s $\color{#35bf28}+4.89\%$
test_getitem[tuple] 0.1670ms 25.2241μs 39.6447 KOps/s 39.2543 KOps/s $\color{#35bf28}+0.99\%$
test_getitem[list] 0.1976ms 51.1505μs 19.5501 KOps/s 18.8818 KOps/s $\color{#35bf28}+3.54\%$
test_setitem_dim[int] 54.5110μs 31.3594μs 31.8884 KOps/s 30.0019 KOps/s $\textbf{\color{#35bf28}+6.29\%}$
test_setitem_dim[slice_int] 0.1292ms 60.6903μs 16.4771 KOps/s 16.2869 KOps/s $\color{#35bf28}+1.17\%$
test_setitem_dim[range] 0.2054ms 83.4793μs 11.9790 KOps/s 11.9230 KOps/s $\color{#35bf28}+0.47\%$
test_setitem_dim[tuple] 0.1094ms 48.8976μs 20.4509 KOps/s 20.5505 KOps/s $\color{#d91a1a}-0.48\%$
test_setitem 96.8810μs 29.0209μs 34.4579 KOps/s 32.9649 KOps/s $\color{#35bf28}+4.53\%$
test_set 0.3155ms 27.9440μs 35.7859 KOps/s 33.7095 KOps/s $\textbf{\color{#35bf28}+6.16\%}$
test_set_shared 7.4537ms 0.2132ms 4.6905 KOps/s 4.5531 KOps/s $\color{#35bf28}+3.02\%$
test_update 0.2912ms 35.5220μs 28.1516 KOps/s 26.8587 KOps/s $\color{#35bf28}+4.81\%$
test_update_nested 1.0785ms 45.8770μs 21.7974 KOps/s 20.9965 KOps/s $\color{#35bf28}+3.81\%$
test_update__nested 0.2996ms 33.2037μs 30.1171 KOps/s 29.1089 KOps/s $\color{#35bf28}+3.46\%$
test_set_nested 0.3518ms 30.9465μs 32.3138 KOps/s 31.3773 KOps/s $\color{#35bf28}+2.98\%$
test_set_nested_new 0.1123ms 35.7686μs 27.9575 KOps/s 26.5033 KOps/s $\textbf{\color{#35bf28}+5.49\%}$
test_select 0.1294ms 53.3484μs 18.7447 KOps/s 18.4849 KOps/s $\color{#35bf28}+1.41\%$
test_select_nested 0.1409ms 59.3664μs 16.8446 KOps/s 16.8785 KOps/s $\color{#d91a1a}-0.20\%$
test_exclude_nested 0.1477ms 74.7407μs 13.3796 KOps/s 13.2731 KOps/s $\color{#35bf28}+0.80\%$
test_empty[True] 0.5894ms 0.3158ms 3.1665 KOps/s 3.1693 KOps/s $\color{#d91a1a}-0.09\%$
test_empty[False] 14.0035μs 1.1948μs 836.9519 KOps/s 843.0833 KOps/s $\color{#d91a1a}-0.73\%$
test_unbind_speed 0.6177ms 0.2959ms 3.3795 KOps/s 3.3383 KOps/s $\color{#35bf28}+1.24\%$
test_unbind_speed_stack0 0.4481ms 0.2934ms 3.4081 KOps/s 3.3953 KOps/s $\color{#35bf28}+0.38\%$
test_unbind_speed_stack1 97.5352ms 0.8022ms 1.2466 KOps/s 1.3188 KOps/s $\textbf{\color{#d91a1a}-5.48\%}$
test_split 3.2370ms 1.9370ms 516.2752 Ops/s 456.5424 Ops/s $\textbf{\color{#35bf28}+13.08\%}$
test_chunk 94.1987ms 2.2860ms 437.4456 Ops/s 452.0026 Ops/s $\color{#d91a1a}-3.22\%$
test_creation[device0] 0.1906ms 0.1137ms 8.7989 KOps/s 8.2714 KOps/s $\textbf{\color{#35bf28}+6.38\%}$
test_creation_from_tensor 4.0315ms 0.1167ms 8.5669 KOps/s 8.4651 KOps/s $\color{#35bf28}+1.20\%$
test_add_one[memmap_tensor0] 0.1163ms 6.9679μs 143.5151 KOps/s 136.4298 KOps/s $\textbf{\color{#35bf28}+5.19\%}$
test_contiguous[memmap_tensor0] 25.0570μs 1.8996μs 526.4320 KOps/s 526.9050 KOps/s $\color{#d91a1a}-0.09\%$
test_stack[memmap_tensor0] 57.1560μs 5.4772μs 182.5754 KOps/s 175.8322 KOps/s $\color{#35bf28}+3.84\%$
test_memmaptd_index 1.0679ms 0.4007ms 2.4955 KOps/s 2.5881 KOps/s $\color{#d91a1a}-3.58\%$
test_memmaptd_index_astensor 0.7754ms 0.4766ms 2.0982 KOps/s 2.1482 KOps/s $\color{#d91a1a}-2.33\%$
test_memmaptd_index_op 1.5025ms 0.9918ms 1.0083 KOps/s 981.3538 Ops/s $\color{#35bf28}+2.75\%$
test_serialize_model 0.1246s 0.1182s 8.4591 Ops/s 8.2158 Ops/s $\color{#35bf28}+2.96\%$
test_serialize_model_pickle 0.4706s 0.4003s 2.4981 Ops/s 2.5249 Ops/s $\color{#d91a1a}-1.06\%$
test_serialize_weights 0.1257s 0.1163s 8.5992 Ops/s 8.3921 Ops/s $\color{#35bf28}+2.47\%$
test_serialize_weights_returnearly 0.2604s 0.1756s 5.6940 Ops/s 6.3004 Ops/s $\textbf{\color{#d91a1a}-9.63\%}$
test_serialize_weights_pickle 0.4787s 0.4060s 2.4633 Ops/s 1.1605 Ops/s $\textbf{\color{#35bf28}+112.26\%}$
test_serialize_weights_filesystem 0.1502s 0.1456s 6.8704 Ops/s 6.4637 Ops/s $\textbf{\color{#35bf28}+6.29\%}$
test_serialize_model_filesystem 0.2487s 0.1646s 6.0764 Ops/s 6.9851 Ops/s $\textbf{\color{#d91a1a}-13.01\%}$
test_reshape_pytree 84.9890μs 37.8978μs 26.3867 KOps/s 25.8687 KOps/s $\color{#35bf28}+2.00\%$
test_reshape_td 88.5760μs 44.7040μs 22.3694 KOps/s 21.6863 KOps/s $\color{#35bf28}+3.15\%$
test_view_pytree 81.0420μs 38.0059μs 26.3117 KOps/s 25.8671 KOps/s $\color{#35bf28}+1.72\%$
test_view_td 0.1227ms 49.9245μs 20.0303 KOps/s 18.9367 KOps/s $\textbf{\color{#35bf28}+5.77\%}$
test_unbind_pytree 73.6170μs 35.1762μs 28.4283 KOps/s 28.1853 KOps/s $\color{#35bf28}+0.86\%$
test_unbind_td 0.3465ms 43.4008μs 23.0410 KOps/s 22.3616 KOps/s $\color{#35bf28}+3.04\%$
test_split_pytree 99.9860μs 39.4832μs 25.3272 KOps/s 26.5076 KOps/s $\color{#d91a1a}-4.45\%$
test_split_td 0.2373ms 55.6600μs 17.9662 KOps/s 17.4131 KOps/s $\color{#35bf28}+3.18\%$
test_add_pytree 95.8290μs 41.9446μs 23.8410 KOps/s 22.3741 KOps/s $\textbf{\color{#35bf28}+6.56\%}$
test_add_td 0.1595ms 77.2134μs 12.9511 KOps/s 12.2399 KOps/s $\textbf{\color{#35bf28}+5.81\%}$
test_compile_add_one_nested[tensordict-compile] 0.1167ms 57.1063μs 17.5112 KOps/s 17.5569 KOps/s $\color{#d91a1a}-0.26\%$
test_compile_add_one_nested[tensordict-eager] 0.3657ms 0.1749ms 5.7179 KOps/s 5.6942 KOps/s $\color{#35bf28}+0.42\%$
test_compile_add_one_nested[pytree-compile] 0.1085ms 56.1045μs 17.8239 KOps/s 12.7760 KOps/s $\textbf{\color{#35bf28}+39.51\%}$
test_compile_add_one_nested[pytree-eager] 0.3873ms 0.1377ms 7.2636 KOps/s 7.0910 KOps/s $\color{#35bf28}+2.43\%$
test_compile_copy_nested[tensordict-compile] 85.9100μs 21.8433μs 45.7805 KOps/s 47.6833 KOps/s $\color{#d91a1a}-3.99\%$
test_compile_copy_nested[tensordict-eager] 0.1297ms 66.1382μs 15.1199 KOps/s 14.6962 KOps/s $\color{#35bf28}+2.88\%$
test_compile_copy_nested[pytree-compile] 0.1538ms 74.8147μs 13.3664 KOps/s 13.2806 KOps/s $\color{#35bf28}+0.65\%$
test_compile_copy_nested[pytree-eager] 0.1299ms 67.8455μs 14.7394 KOps/s 14.8104 KOps/s $\color{#d91a1a}-0.48\%$
test_compile_add_one_flat[tensordict-compile] 0.2864ms 0.1718ms 5.8212 KOps/s 5.7798 KOps/s $\color{#35bf28}+0.72\%$
test_compile_add_one_flat[tensordict-eager] 0.3630ms 0.1887ms 5.3005 KOps/s 5.3493 KOps/s $\color{#d91a1a}-0.91\%$
test_compile_add_one_flat[tensorclass-compile] 0.1001ms 46.3930μs 21.5550 KOps/s 21.2853 KOps/s $\color{#35bf28}+1.27\%$
test_compile_add_one_flat[tensorclass-eager] 0.1274ms 67.8163μs 14.7457 KOps/s 14.9630 KOps/s $\color{#d91a1a}-1.45\%$
test_compile_add_one_flat[pytree-compile] 0.2981ms 0.1738ms 5.7533 KOps/s 5.7654 KOps/s $\color{#d91a1a}-0.21\%$
test_compile_add_one_flat[pytree-eager] 0.6424ms 0.2809ms 3.5604 KOps/s 3.5208 KOps/s $\color{#35bf28}+1.13\%$
test_compile_add_self_flat[tensordict-eager] 0.3820ms 0.2018ms 4.9564 KOps/s 5.0024 KOps/s $\color{#d91a1a}-0.92\%$
test_compile_add_self_flat[tensordict-compile] 0.3473ms 0.1727ms 5.7919 KOps/s 5.7743 KOps/s $\color{#35bf28}+0.30\%$
test_compile_add_self_flat[tensorclass-eager] 0.1572ms 62.1164μs 16.0988 KOps/s 15.9552 KOps/s $\color{#35bf28}+0.90\%$
test_compile_add_self_flat[tensorclass-compile] 0.1153ms 46.5925μs 21.4627 KOps/s 21.0159 KOps/s $\color{#35bf28}+2.13\%$
test_compile_add_self_flat[pytree-eager] 0.4893ms 0.2283ms 4.3811 KOps/s 4.2267 KOps/s $\color{#35bf28}+3.65\%$
test_compile_add_self_flat[pytree-compile] 0.2878ms 0.1736ms 5.7615 KOps/s 5.7599 KOps/s $\color{#35bf28}+0.03\%$
test_compile_copy_flat[tensordict-compile] 0.2079ms 0.1022ms 9.7875 KOps/s 9.6263 KOps/s $\color{#35bf28}+1.67\%$
test_compile_copy_flat[tensordict-eager] 0.1234ms 56.6895μs 17.6400 KOps/s 16.7330 KOps/s $\textbf{\color{#35bf28}+5.42\%}$
test_compile_copy_flat[pytree-compile] 0.1893ms 78.1116μs 12.8022 KOps/s 13.0306 KOps/s $\color{#d91a1a}-1.75\%$
test_compile_copy_flat[pytree-eager] 0.1926ms 68.3245μs 14.6360 KOps/s 14.8148 KOps/s $\color{#d91a1a}-1.21\%$
test_compile_assign_and_add[tensordict-compile] 0.2954ms 0.1972ms 5.0698 KOps/s 5.1704 KOps/s $\color{#d91a1a}-1.94\%$
test_compile_assign_and_add[tensordict-eager] 2.0262ms 1.6306ms 613.2840 Ops/s 617.2924 Ops/s $\color{#d91a1a}-0.65\%$
test_compile_assign_and_add[pytree-compile] 0.3477ms 0.1939ms 5.1561 KOps/s 5.2351 KOps/s $\color{#d91a1a}-1.51\%$
test_compile_assign_and_add[pytree-eager] 1.3702ms 1.0670ms 937.1771 Ops/s 937.8228 Ops/s $\color{#d91a1a}-0.07\%$
test_compile_assign_and_add_stack[compile] 0.7292ms 0.4228ms 2.3654 KOps/s 2.4054 KOps/s $\color{#d91a1a}-1.66\%$
test_compile_assign_and_add_stack[eager] 3.8164ms 3.6704ms 272.4523 Ops/s 264.2362 Ops/s $\color{#35bf28}+3.11\%$
test_compile_indexing[tensor-tensordict-compile] 0.2675ms 34.4220μs 29.0512 KOps/s 28.9684 KOps/s $\color{#35bf28}+0.29\%$
test_compile_indexing[tensor-tensordict-eager] 0.6658ms 46.2419μs 21.6254 KOps/s 21.1766 KOps/s $\color{#35bf28}+2.12\%$
test_compile_indexing[tensor-tensorclass-compile] 0.1071ms 28.8531μs 34.6584 KOps/s 33.7492 KOps/s $\color{#35bf28}+2.69\%$
test_compile_indexing[tensor-tensorclass-eager] 75.2410μs 27.4688μs 36.4049 KOps/s 36.2255 KOps/s $\color{#35bf28}+0.50\%$
test_compile_indexing[tensor-pytree-compile] 93.9960μs 28.6806μs 34.8668 KOps/s 33.6290 KOps/s $\color{#35bf28}+3.68\%$
test_compile_indexing[tensor-pytree-eager] 83.0950μs 26.6157μs 37.5718 KOps/s 36.1356 KOps/s $\color{#35bf28}+3.97\%$
test_compile_indexing[slice-tensordict-compile] 0.1889ms 73.5696μs 13.5926 KOps/s 13.6240 KOps/s $\color{#d91a1a}-0.23\%$
test_compile_indexing[slice-tensordict-eager] 0.5356ms 26.6651μs 37.5021 KOps/s 36.2076 KOps/s $\color{#35bf28}+3.58\%$
test_compile_indexing[slice-tensorclass-compile] 0.1316ms 67.6838μs 14.7746 KOps/s 14.5442 KOps/s $\color{#35bf28}+1.58\%$
test_compile_indexing[slice-tensorclass-eager] 95.6990μs 22.7910μs 43.8769 KOps/s 43.2312 KOps/s $\color{#35bf28}+1.49\%$
test_compile_indexing[slice-pytree-compile] 0.1517ms 68.3635μs 14.6277 KOps/s 14.4986 KOps/s $\color{#35bf28}+0.89\%$
test_compile_indexing[slice-pytree-eager] 0.1914ms 22.6583μs 44.1338 KOps/s 43.5674 KOps/s $\color{#35bf28}+1.30\%$
test_compile_indexing[int-tensordict-compile] 0.1509ms 74.0370μs 13.5068 KOps/s 13.8683 KOps/s $\color{#d91a1a}-2.61\%$
test_compile_indexing[int-tensordict-eager] 0.9247ms 26.6509μs 37.5221 KOps/s 36.3597 KOps/s $\color{#35bf28}+3.20\%$
test_compile_indexing[int-tensorclass-compile] 0.1688ms 67.4333μs 14.8295 KOps/s 14.1776 KOps/s $\color{#35bf28}+4.60\%$
test_compile_indexing[int-tensorclass-eager] 0.1060ms 22.0382μs 45.3758 KOps/s 43.3151 KOps/s $\color{#35bf28}+4.76\%$
test_compile_indexing[int-pytree-compile] 0.4127ms 67.8408μs 14.7404 KOps/s 14.5413 KOps/s $\color{#35bf28}+1.37\%$
test_compile_indexing[int-pytree-eager] 85.2990μs 22.1556μs 45.1354 KOps/s 43.4301 KOps/s $\color{#35bf28}+3.93\%$
test_mod_add[eager] 87.4640μs 25.1605μs 39.7449 KOps/s 38.3365 KOps/s $\color{#35bf28}+3.67\%$
test_mod_add[compile] 0.1153ms 39.0774μs 25.5902 KOps/s 26.7957 KOps/s $\color{#d91a1a}-4.50\%$
test_mod_add[compile-overhead] 0.1039ms 40.0235μs 24.9853 KOps/s 26.3384 KOps/s $\textbf{\color{#d91a1a}-5.14\%}$
test_mod_wrap[eager] 0.3832ms 0.2060ms 4.8537 KOps/s 5.0260 KOps/s $\color{#d91a1a}-3.43\%$
test_mod_wrap[compile] 0.3135ms 0.2262ms 4.4206 KOps/s 4.4141 KOps/s $\color{#35bf28}+0.15\%$
test_mod_wrap[compile-overhead] 0.3898ms 0.2204ms 4.5368 KOps/s 4.3870 KOps/s $\color{#35bf28}+3.41\%$
test_mod_wrap_and_backward[eager] 12.3492ms 10.9749ms 91.1172 Ops/s 90.5053 Ops/s $\color{#35bf28}+0.68\%$
test_mod_wrap_and_backward[compile] 12.1706ms 10.8229ms 92.3971 Ops/s 78.8386 Ops/s $\textbf{\color{#35bf28}+17.20\%}$
test_mod_wrap_and_backward[compile-overhead] 12.2722ms 10.8651ms 92.0377 Ops/s 78.4642 Ops/s $\textbf{\color{#35bf28}+17.30\%}$
test_seq_add[eager] 0.1632ms 89.3984μs 11.1859 KOps/s 11.0824 KOps/s $\color{#35bf28}+0.93\%$
test_seq_add[compile] 0.1150ms 63.1836μs 15.8269 KOps/s 15.6747 KOps/s $\color{#35bf28}+0.97\%$
test_seq_add[compile-overhead] 0.1282ms 62.3210μs 16.0460 KOps/s 15.9222 KOps/s $\color{#35bf28}+0.78\%$
test_seq_wrap[eager] 0.6285ms 0.3665ms 2.7284 KOps/s 2.6047 KOps/s $\color{#35bf28}+4.75\%$
test_seq_wrap[compile] 5.3693ms 0.2614ms 3.8258 KOps/s 3.7625 KOps/s $\color{#35bf28}+1.68\%$
test_seq_wrap[compile-overhead] 1.3734ms 0.2589ms 3.8621 KOps/s 3.7922 KOps/s $\color{#35bf28}+1.84\%$
test_func_call_runtime[False-eager] 0.8626ms 0.4848ms 2.0628 KOps/s 2.0023 KOps/s $\color{#35bf28}+3.02\%$
test_func_call_runtime[False-compile] 1.0396ms 0.4870ms 2.0536 KOps/s 1.9944 KOps/s $\color{#35bf28}+2.97\%$
test_func_call_runtime[False-compile-overhead] 0.6029ms 0.4799ms 2.0838 KOps/s 1.9903 KOps/s $\color{#35bf28}+4.70\%$
test_func_call_runtime[True-eager] 0.8715ms 0.6984ms 1.4317 KOps/s 1.3762 KOps/s $\color{#35bf28}+4.03\%$
test_func_call_runtime[True-compile] 0.8278ms 0.4996ms 2.0014 KOps/s 1.9938 KOps/s $\color{#35bf28}+0.38\%$
test_func_call_runtime[True-compile-overhead] 0.8650ms 0.4977ms 2.0094 KOps/s 1.9721 KOps/s $\color{#35bf28}+1.89\%$
test_func_call_cm_runtime[False-eager] 0.8523ms 0.4862ms 2.0568 KOps/s 2.0368 KOps/s $\color{#35bf28}+0.98\%$
test_func_call_cm_runtime[False-compile] 1.0492ms 0.5081ms 1.9679 KOps/s 1.9693 KOps/s $\color{#d91a1a}-0.07\%$
test_func_call_cm_runtime[False-compile-overhead] 0.6494ms 0.4818ms 2.0754 KOps/s 1.9541 KOps/s $\textbf{\color{#35bf28}+6.21\%}$
test_func_call_cm_runtime[True-eager] 0.9952ms 0.8267ms 1.2096 KOps/s 1.1695 KOps/s $\color{#35bf28}+3.43\%$
test_func_call_cm_runtime[True-compile] 1.0022ms 0.6945ms 1.4400 KOps/s 1.3501 KOps/s $\textbf{\color{#35bf28}+6.66\%}$
test_func_call_cm_runtime[True-compile-overhead] 0.9582ms 0.6997ms 1.4291 KOps/s 1.3862 KOps/s $\color{#35bf28}+3.09\%$
test_vmap_func_call_cm_runtime[eager] 2.5390ms 1.8258ms 547.6911 Ops/s 531.2093 Ops/s $\color{#35bf28}+3.10\%$
test_vmap_func_call_cm_runtime[compile] 3.1950ms 1.8901ms 529.0835 Ops/s 520.9236 Ops/s $\color{#35bf28}+1.57\%$
test_vmap_func_call_cm_runtime[compile-overhead] 2.9753ms 1.8809ms 531.6536 Ops/s 517.3321 Ops/s $\color{#35bf28}+2.77\%$
test_distributed 0.5119ms 0.1255ms 7.9701 KOps/s 7.4780 KOps/s $\textbf{\color{#35bf28}+6.58\%}$
test_tdmodule 33.3620μs 17.7086μs 56.4696 KOps/s 51.9032 KOps/s $\textbf{\color{#35bf28}+8.80\%}$
test_tdmodule_dispatch 66.6950μs 35.6516μs 28.0492 KOps/s 26.5659 KOps/s $\textbf{\color{#35bf28}+5.58\%}$
test_tdseq 37.6100μs 19.9119μs 50.2212 KOps/s 47.4015 KOps/s $\textbf{\color{#35bf28}+5.95\%}$
test_tdseq_dispatch 68.3880μs 40.4125μs 24.7448 KOps/s 23.5739 KOps/s $\color{#35bf28}+4.97\%$
test_instantiation_functorch 1.8356ms 1.5225ms 656.8141 Ops/s 632.0929 Ops/s $\color{#35bf28}+3.91\%$
test_instantiation_td 1.9061ms 1.1497ms 869.7969 Ops/s 854.6135 Ops/s $\color{#35bf28}+1.78\%$
test_exec_functorch 0.6122ms 0.1793ms 5.5772 KOps/s 5.4501 KOps/s $\color{#35bf28}+2.33\%$
test_exec_functional_call 0.3490ms 0.1676ms 5.9663 KOps/s 6.0465 KOps/s $\color{#d91a1a}-1.33\%$
test_exec_td 0.4079ms 0.1633ms 6.1253 KOps/s 6.1739 KOps/s $\color{#d91a1a}-0.79\%$
test_exec_td_decorator 0.9330ms 0.2142ms 4.6680 KOps/s 4.6070 KOps/s $\color{#35bf28}+1.32\%$
test_vmap_mlp_speed[True-True] 0.9924ms 0.6351ms 1.5745 KOps/s 1.5681 KOps/s $\color{#35bf28}+0.40\%$
test_vmap_mlp_speed[True-False] 0.9405ms 0.6260ms 1.5974 KOps/s 1.6047 KOps/s $\color{#d91a1a}-0.45\%$
test_vmap_mlp_speed[False-True] 0.8661ms 0.4892ms 2.0439 KOps/s 2.0785 KOps/s $\color{#d91a1a}-1.66\%$
test_vmap_mlp_speed[False-False] 0.9294ms 0.4857ms 2.0588 KOps/s 2.0636 KOps/s $\color{#d91a1a}-0.24\%$
test_vmap_mlp_speed_decorator[True-True] 1.5362ms 0.6101ms 1.6391 KOps/s 1.6442 KOps/s $\color{#d91a1a}-0.31\%$
test_vmap_mlp_speed_decorator[True-False] 0.8088ms 0.6040ms 1.6557 KOps/s 1.6300 KOps/s $\color{#35bf28}+1.57\%$
test_vmap_mlp_speed_decorator[False-True] 0.7737ms 0.4985ms 2.0059 KOps/s 1.9981 KOps/s $\color{#35bf28}+0.39\%$
test_vmap_mlp_speed_decorator[False-False] 0.7422ms 0.4971ms 2.0118 KOps/s 2.0033 KOps/s $\color{#35bf28}+0.42\%$
test_to_module_speed[True] 2.5293ms 1.2769ms 783.1576 Ops/s 760.0149 Ops/s $\color{#35bf28}+3.05\%$
test_to_module_speed[False] 2.0390ms 1.2733ms 785.3554 Ops/s 776.7833 Ops/s $\color{#35bf28}+1.10\%$
test_tc_init 82.5140μs 43.4190μs 23.0314 KOps/s 21.4822 KOps/s $\textbf{\color{#35bf28}+7.21\%}$
test_tc_init_nested 0.1493ms 84.4776μs 11.8375 KOps/s 10.8325 KOps/s $\textbf{\color{#35bf28}+9.28\%}$
test_tc_first_layer_tensor 20.9900μs 1.5500μs 645.1497 KOps/s 643.4956 KOps/s $\color{#35bf28}+0.26\%$
test_tc_first_layer_nontensor 42.6190μs 4.9622μs 201.5249 KOps/s 208.9628 KOps/s $\color{#d91a1a}-3.56\%$
test_tc_second_layer_tensor 24.3850μs 2.8342μs 352.8279 KOps/s 346.0527 KOps/s $\color{#35bf28}+1.96\%$
test_tc_second_layer_nontensor 37.2500μs 6.3441μs 157.6261 KOps/s 161.9332 KOps/s $\color{#d91a1a}-2.66\%$
test_unbind 0.4753s 13.4613ms 74.2869 Ops/s 63.7104 Ops/s $\textbf{\color{#35bf28}+16.60\%}$
test_full_like 20.6506ms 12.4465ms 80.3436 Ops/s 102.9791 Ops/s $\textbf{\color{#d91a1a}-21.98\%}$
test_zeros_like 14.3546ms 7.6277ms 131.1009 Ops/s 324.4049 Ops/s $\textbf{\color{#d91a1a}-59.59\%}$
test_ones_like 13.3937ms 7.5087ms 133.1793 Ops/s 299.6888 Ops/s $\textbf{\color{#d91a1a}-55.56\%}$
test_clone 14.9913ms 9.1281ms 109.5513 Ops/s 170.5978 Ops/s $\textbf{\color{#d91a1a}-35.78\%}$
test_squeeze 62.9570μs 12.7457μs 78.4579 KOps/s 76.3815 KOps/s $\color{#35bf28}+2.72\%$
test_unsqueeze 0.2924ms 90.8720μs 11.0045 KOps/s 10.5551 KOps/s $\color{#35bf28}+4.26\%$
test_split 0.3391ms 0.1899ms 5.2651 KOps/s 5.0114 KOps/s $\textbf{\color{#35bf28}+5.06\%}$
test_permute 0.3654ms 0.2138ms 4.6774 KOps/s 4.5000 KOps/s $\color{#35bf28}+3.94\%$
test_stack 27.9960ms 25.8100ms 38.7447 Ops/s 39.5397 Ops/s $\color{#d91a1a}-2.01\%$
test_cat 31.1312ms 25.4561ms 39.2834 Ops/s 38.9400 Ops/s $\color{#35bf28}+0.88\%$

Copy link

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 228. Improved: $\large\color{#35bf28}13$. Worsened: $\large\color{#d91a1a}7$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 0.1147ms 14.7169μs 67.9493 KOps/s 70.7691 KOps/s $\color{#d91a1a}-3.98\%$
test_plain_set_stack_nested 45.9400μs 14.4911μs 69.0077 KOps/s 71.3100 KOps/s $\color{#d91a1a}-3.23\%$
test_plain_set_nested_inplace 54.5110μs 15.8968μs 62.9058 KOps/s 65.6420 KOps/s $\color{#d91a1a}-4.17\%$
test_plain_set_stack_nested_inplace 44.6510μs 15.4091μs 64.8965 KOps/s 66.1759 KOps/s $\color{#d91a1a}-1.93\%$
test_items 29.9110μs 2.8541μs 350.3716 KOps/s 342.5932 KOps/s $\color{#35bf28}+2.27\%$
test_items_nested 0.3720ms 0.3295ms 3.0345 KOps/s 3.0112 KOps/s $\color{#35bf28}+0.77\%$
test_items_nested_locked 0.3739ms 0.3279ms 3.0494 KOps/s 3.0246 KOps/s $\color{#35bf28}+0.82\%$
test_items_nested_leaf 81.8510μs 55.5066μs 18.0159 KOps/s 17.8671 KOps/s $\color{#35bf28}+0.83\%$
test_items_stack_nested 0.3821ms 0.3239ms 3.0878 KOps/s 3.0165 KOps/s $\color{#35bf28}+2.36\%$
test_items_stack_nested_leaf 88.9510μs 56.5590μs 17.6807 KOps/s 17.3524 KOps/s $\color{#35bf28}+1.89\%$
test_items_stack_nested_locked 0.3840ms 0.3315ms 3.0164 KOps/s 3.0552 KOps/s $\color{#d91a1a}-1.27\%$
test_keys 33.0400μs 3.4239μs 292.0682 KOps/s 293.6231 KOps/s $\color{#d91a1a}-0.53\%$
test_keys_nested 83.6620μs 56.4525μs 17.7140 KOps/s 18.2813 KOps/s $\color{#d91a1a}-3.10\%$
test_keys_nested_locked 2.5685ms 62.7144μs 15.9453 KOps/s 16.1089 KOps/s $\color{#d91a1a}-1.02\%$
test_keys_nested_leaf 0.1086ms 47.0493μs 21.2543 KOps/s 21.0757 KOps/s $\color{#35bf28}+0.85\%$
test_keys_stack_nested 85.7010μs 55.7596μs 17.9341 KOps/s 18.2275 KOps/s $\color{#d91a1a}-1.61\%$
test_keys_stack_nested_leaf 97.5620μs 48.0839μs 20.7970 KOps/s 20.7079 KOps/s $\color{#35bf28}+0.43\%$
test_keys_stack_nested_locked 94.0210μs 60.7620μs 16.4577 KOps/s 16.2829 KOps/s $\color{#35bf28}+1.07\%$
test_values 4.0583μs 0.8472μs 1.1804 MOps/s 1.2011 MOps/s $\color{#d91a1a}-1.73\%$
test_values_nested 58.4910μs 40.8789μs 24.4625 KOps/s 24.5124 KOps/s $\color{#d91a1a}-0.20\%$
test_values_nested_locked 71.0310μs 42.7490μs 23.3924 KOps/s 23.2944 KOps/s $\color{#35bf28}+0.42\%$
test_values_nested_leaf 59.9510μs 35.4260μs 28.2278 KOps/s 28.2037 KOps/s $\color{#35bf28}+0.09\%$
test_values_stack_nested 75.2320μs 41.4817μs 24.1070 KOps/s 23.9840 KOps/s $\color{#35bf28}+0.51\%$
test_values_stack_nested_leaf 77.6220μs 35.5863μs 28.1007 KOps/s 28.2026 KOps/s $\color{#d91a1a}-0.36\%$
test_values_stack_nested_locked 74.1110μs 43.1447μs 23.1778 KOps/s 22.9853 KOps/s $\color{#35bf28}+0.84\%$
test_membership 1.6421μs 0.4995μs 2.0022 MOps/s 1.9897 MOps/s $\color{#35bf28}+0.63\%$
test_membership_nested 17.1405μs 1.9076μs 524.2231 KOps/s 531.2675 KOps/s $\color{#d91a1a}-1.33\%$
test_membership_nested_leaf 15.9755μs 1.9118μs 523.0743 KOps/s 540.0284 KOps/s $\color{#d91a1a}-3.14\%$
test_membership_stacked_nested 24.5600μs 1.9432μs 514.6210 KOps/s 525.8887 KOps/s $\color{#d91a1a}-2.14\%$
test_membership_stacked_nested_leaf 33.7910μs 1.9423μs 514.8476 KOps/s 518.7014 KOps/s $\color{#d91a1a}-0.74\%$
test_membership_nested_last 27.1400μs 2.8642μs 349.1398 KOps/s 359.1361 KOps/s $\color{#d91a1a}-2.78\%$
test_membership_nested_leaf_last 27.4100μs 2.8131μs 355.4755 KOps/s 363.1856 KOps/s $\color{#d91a1a}-2.12\%$
test_membership_stacked_nested_last 41.1210μs 7.9590μs 125.6443 KOps/s 210.8753 KOps/s $\textbf{\color{#d91a1a}-40.42\%}$
test_membership_stacked_nested_leaf_last 24.5800μs 7.8146μs 127.9657 KOps/s 212.7440 KOps/s $\textbf{\color{#d91a1a}-39.85\%}$
test_nested_getleaf 29.5310μs 6.0637μs 164.9160 KOps/s 164.6661 KOps/s $\color{#35bf28}+0.15\%$
test_nested_get 27.4110μs 5.7208μs 174.7994 KOps/s 175.8635 KOps/s $\color{#d91a1a}-0.61\%$
test_stacked_getleaf 32.0500μs 6.0475μs 165.3581 KOps/s 163.3499 KOps/s $\color{#35bf28}+1.23\%$
test_stacked_get 25.1610μs 5.7209μs 174.7992 KOps/s 178.8267 KOps/s $\color{#d91a1a}-2.25\%$
test_nested_getitemleaf 29.7110μs 6.1337μs 163.0327 KOps/s 161.8152 KOps/s $\color{#35bf28}+0.75\%$
test_nested_getitem 27.7300μs 5.7853μs 172.8531 KOps/s 172.4962 KOps/s $\color{#35bf28}+0.21\%$
test_stacked_getitemleaf 36.9210μs 6.0832μs 164.3883 KOps/s 163.8443 KOps/s $\color{#35bf28}+0.33\%$
test_stacked_getitem 34.2210μs 5.6870μs 175.8384 KOps/s 173.5926 KOps/s $\color{#35bf28}+1.29\%$
test_lock_nested 4.7097ms 0.4244ms 2.3561 KOps/s 2.3866 KOps/s $\color{#d91a1a}-1.27\%$
test_lock_stack_nested 0.4180ms 0.3708ms 2.6966 KOps/s 2.6860 KOps/s $\color{#35bf28}+0.39\%$
test_unlock_nested 0.7722ms 0.3591ms 2.7848 KOps/s 2.8036 KOps/s $\color{#d91a1a}-0.67\%$
test_unlock_stack_nested 0.3521ms 0.3088ms 3.2382 KOps/s 3.2134 KOps/s $\color{#35bf28}+0.77\%$
test_flatten_speed 0.1462ms 69.3596μs 14.4176 KOps/s 14.2669 KOps/s $\color{#35bf28}+1.06\%$
test_unflatten_speed 0.3459ms 0.2867ms 3.4882 KOps/s 3.5050 KOps/s $\color{#d91a1a}-0.48\%$
test_common_ops 1.5612ms 1.2724ms 785.9400 Ops/s 796.8171 Ops/s $\color{#d91a1a}-1.37\%$
test_creation 26.4700μs 1.4954μs 668.7048 KOps/s 669.1785 KOps/s $\color{#d91a1a}-0.07\%$
test_creation_empty 44.3210μs 16.7462μs 59.7151 KOps/s 62.5183 KOps/s $\color{#d91a1a}-4.48\%$
test_creation_nested_1 53.4410μs 18.5056μs 54.0378 KOps/s 57.0517 KOps/s $\textbf{\color{#d91a1a}-5.28\%}$
test_creation_nested_2 47.2110μs 21.0210μs 47.5715 KOps/s 48.9843 KOps/s $\color{#d91a1a}-2.88\%$
test_clone 59.0110μs 29.4012μs 34.0122 KOps/s 33.5641 KOps/s $\color{#35bf28}+1.34\%$
test_getitem[int] 1.3069ms 15.9602μs 62.6559 KOps/s 61.6586 KOps/s $\color{#35bf28}+1.62\%$
test_getitem[slice_int] 0.1196ms 27.1910μs 36.7769 KOps/s 37.0718 KOps/s $\color{#d91a1a}-0.80\%$
test_getitem[range] 0.2190ms 0.1101ms 9.0797 KOps/s 9.1494 KOps/s $\color{#d91a1a}-0.76\%$
test_getitem[tuple] 0.1166ms 23.5987μs 42.3752 KOps/s 42.2090 KOps/s $\color{#35bf28}+0.39\%$
test_getitem[list] 0.1988ms 97.9544μs 10.2088 KOps/s 10.1963 KOps/s $\color{#35bf28}+0.12\%$
test_setitem_dim[int] 67.6210μs 45.3942μs 22.0292 KOps/s 22.1552 KOps/s $\color{#d91a1a}-0.57\%$
test_setitem_dim[slice_int] 0.1001ms 66.5719μs 15.0214 KOps/s 14.8444 KOps/s $\color{#35bf28}+1.19\%$
test_setitem_dim[range] 0.1738ms 0.1268ms 7.8880 KOps/s 7.8812 KOps/s $\color{#35bf28}+0.09\%$
test_setitem_dim[tuple] 88.0110μs 61.0650μs 16.3760 KOps/s 16.3399 KOps/s $\color{#35bf28}+0.22\%$
test_setitem 82.9120μs 43.0192μs 23.2454 KOps/s 23.5532 KOps/s $\color{#d91a1a}-1.31\%$
test_set 83.0020μs 42.4016μs 23.5840 KOps/s 23.9533 KOps/s $\color{#d91a1a}-1.54\%$
test_set_shared 0.3083ms 51.3038μs 19.4917 KOps/s 19.2732 KOps/s $\color{#35bf28}+1.13\%$
test_update 93.7510μs 51.2990μs 19.4935 KOps/s 19.8010 KOps/s $\color{#d91a1a}-1.55\%$
test_update_nested 91.2820μs 58.6683μs 17.0450 KOps/s 17.6523 KOps/s $\color{#d91a1a}-3.44\%$
test_update__nested 97.2410μs 60.5091μs 16.5264 KOps/s 16.8470 KOps/s $\color{#d91a1a}-1.90\%$
test_set_nested 80.5320μs 44.7858μs 22.3285 KOps/s 22.8653 KOps/s $\color{#d91a1a}-2.35\%$
test_set_nested_new 84.2720μs 48.2859μs 20.7100 KOps/s 20.9243 KOps/s $\color{#d91a1a}-1.02\%$
test_select 0.1017ms 61.9894μs 16.1318 KOps/s 16.3019 KOps/s $\color{#d91a1a}-1.04\%$
test_select_nested 0.5746ms 42.6454μs 23.4492 KOps/s 23.0602 KOps/s $\color{#35bf28}+1.69\%$
test_exclude_nested 95.7420μs 60.3953μs 16.5576 KOps/s 16.7444 KOps/s $\color{#d91a1a}-1.12\%$
test_empty[True] 0.2947ms 0.2443ms 4.0929 KOps/s 4.0742 KOps/s $\color{#35bf28}+0.46\%$
test_empty[False] 2.9130μs 0.7409μs 1.3497 MOps/s 1.3164 MOps/s $\color{#35bf28}+2.53\%$
test_to 55.1410μs 25.5370μs 39.1589 KOps/s 39.3872 KOps/s $\color{#d91a1a}-0.58\%$
test_to_nonblocking 53.5010μs 23.6045μs 42.3647 KOps/s 40.7979 KOps/s $\color{#35bf28}+3.84\%$
test_unbind_speed 0.3418ms 0.2806ms 3.5641 KOps/s 3.5801 KOps/s $\color{#d91a1a}-0.45\%$
test_unbind_speed_stack0 0.3401ms 0.2690ms 3.7171 KOps/s 3.7067 KOps/s $\color{#35bf28}+0.28\%$
test_unbind_speed_stack1 0.6790ms 0.6381ms 1.5671 KOps/s 1.4394 KOps/s $\textbf{\color{#35bf28}+8.87\%}$
test_split 94.0752ms 2.1532ms 464.4348 Ops/s 464.4564 Ops/s $-0.00\%$
test_chunk 93.4650ms 2.1602ms 462.9161 Ops/s 459.2503 Ops/s $\color{#35bf28}+0.80\%$
test_creation[device0] 0.3413ms 0.1251ms 7.9952 KOps/s 7.9681 KOps/s $\color{#35bf28}+0.34\%$
test_creation_from_tensor 0.3378ms 0.1274ms 7.8464 KOps/s 7.8061 KOps/s $\color{#35bf28}+0.52\%$
test_add_one[memmap_tensor0] 0.2403ms 8.7728μs 113.9892 KOps/s 117.4325 KOps/s $\color{#d91a1a}-2.93\%$
test_contiguous[memmap_tensor0] 30.2710μs 2.2207μs 450.3147 KOps/s 454.5871 KOps/s $\color{#d91a1a}-0.94\%$
test_stack[memmap_tensor0] 39.4910μs 6.6089μs 151.3100 KOps/s 150.6485 KOps/s $\color{#35bf28}+0.44\%$
test_memmaptd_index 1.1517ms 0.4204ms 2.3788 KOps/s 2.3813 KOps/s $\color{#d91a1a}-0.10\%$
test_memmaptd_index_astensor 0.9534ms 0.4778ms 2.0929 KOps/s 2.0832 KOps/s $\color{#35bf28}+0.46\%$
test_memmaptd_index_op 1.4215ms 1.0489ms 953.3441 Ops/s 976.2743 Ops/s $\color{#d91a1a}-2.35\%$
test_serialize_model 0.1298s 0.1292s 7.7403 Ops/s 7.7165 Ops/s $\color{#35bf28}+0.31\%$
test_serialize_model_pickle 1.3606s 1.2189s 0.8204 Ops/s 0.8247 Ops/s $\color{#d91a1a}-0.52\%$
test_serialize_weights 0.2213s 0.1419s 7.0465 Ops/s 7.6758 Ops/s $\textbf{\color{#d91a1a}-8.20\%}$
test_serialize_weights_returnearly 0.2162s 55.6281ms 17.9765 Ops/s 16.2658 Ops/s $\textbf{\color{#35bf28}+10.52\%}$
test_serialize_weights_pickle 1.3757s 1.2166s 0.8220 Ops/s 0.8139 Ops/s $\color{#35bf28}+1.00\%$
test_reshape_pytree 71.2610μs 35.7951μs 27.9368 KOps/s 28.2200 KOps/s $\color{#d91a1a}-1.00\%$
test_reshape_td 0.4226ms 42.2475μs 23.6701 KOps/s 23.8460 KOps/s $\color{#d91a1a}-0.74\%$
test_view_pytree 62.2510μs 35.4919μs 28.1755 KOps/s 28.0069 KOps/s $\color{#35bf28}+0.60\%$
test_view_td 0.4388ms 47.5306μs 21.0391 KOps/s 21.0635 KOps/s $\color{#d91a1a}-0.12\%$
test_unbind_pytree 75.2110μs 34.8218μs 28.7176 KOps/s 29.0371 KOps/s $\color{#d91a1a}-1.10\%$
test_unbind_td 0.4249ms 43.7699μs 22.8467 KOps/s 23.6352 KOps/s $\color{#d91a1a}-3.34\%$
test_split_pytree 90.6010μs 46.3859μs 21.5583 KOps/s 21.4822 KOps/s $\color{#35bf28}+0.35\%$
test_split_td 0.6660ms 56.0938μs 17.8273 KOps/s 18.0502 KOps/s $\color{#d91a1a}-1.24\%$
test_add_pytree 95.3910μs 57.9688μs 17.2507 KOps/s 17.4350 KOps/s $\color{#d91a1a}-1.06\%$
test_add_td 0.4929ms 93.8494μs 10.6554 KOps/s 10.8605 KOps/s $\color{#d91a1a}-1.89\%$
test_compile_add_one_nested[tensordict-compile] 0.4131ms 0.2105ms 4.7503 KOps/s 4.7879 KOps/s $\color{#d91a1a}-0.78\%$
test_compile_add_one_nested[tensordict-eager] 0.5491ms 0.1505ms 6.6465 KOps/s 6.6148 KOps/s $\color{#35bf28}+0.48\%$
test_compile_add_one_nested[pytree-compile] 0.1858ms 0.1455ms 6.8744 KOps/s 6.8691 KOps/s $\color{#35bf28}+0.08\%$
test_compile_add_one_nested[pytree-eager] 0.5856ms 0.1869ms 5.3491 KOps/s 5.0325 KOps/s $\textbf{\color{#35bf28}+6.29\%}$
test_compile_copy_nested[tensordict-compile] 49.4210μs 21.4795μs 46.5560 KOps/s 46.5285 KOps/s $\color{#35bf28}+0.06\%$
test_compile_copy_nested[tensordict-eager] 0.4347ms 43.0468μs 23.2306 KOps/s 22.6780 KOps/s $\color{#35bf28}+2.44\%$
test_compile_copy_nested[pytree-compile] 0.2624ms 64.9106μs 15.4058 KOps/s 15.8480 KOps/s $\color{#d91a1a}-2.79\%$
test_compile_copy_nested[pytree-eager] 0.4329ms 49.7606μs 20.0962 KOps/s 20.4889 KOps/s $\color{#d91a1a}-1.92\%$
test_compile_add_one_flat[tensordict-compile] 0.7154ms 0.3187ms 3.1376 KOps/s 3.1542 KOps/s $\color{#d91a1a}-0.52\%$
test_compile_add_one_flat[tensordict-eager] 0.6047ms 0.2088ms 4.7882 KOps/s 4.6235 KOps/s $\color{#35bf28}+3.56\%$
test_compile_add_one_flat[tensorclass-compile] 0.1649ms 0.1284ms 7.7898 KOps/s 7.3918 KOps/s $\textbf{\color{#35bf28}+5.38\%}$
test_compile_add_one_flat[tensorclass-eager] 0.4643ms 59.8903μs 16.6972 KOps/s 15.7980 KOps/s $\textbf{\color{#35bf28}+5.69\%}$
test_compile_add_one_flat[pytree-compile] 0.4905ms 0.3188ms 3.1369 KOps/s 3.1498 KOps/s $\color{#d91a1a}-0.41\%$
test_compile_add_one_flat[pytree-eager] 0.7238ms 0.6291ms 1.5895 KOps/s 1.5918 KOps/s $\color{#d91a1a}-0.15\%$
test_compile_add_self_flat[tensordict-eager] 0.3576ms 0.2480ms 4.0330 KOps/s 4.0085 KOps/s $\color{#35bf28}+0.61\%$
test_compile_add_self_flat[tensordict-compile] 0.3651ms 0.3164ms 3.1608 KOps/s 3.0211 KOps/s $\color{#35bf28}+4.62\%$
test_compile_add_self_flat[tensorclass-eager] 0.1217ms 69.8409μs 14.3183 KOps/s 13.8600 KOps/s $\color{#35bf28}+3.31\%$
test_compile_add_self_flat[tensorclass-compile] 0.1916ms 0.1282ms 7.7998 KOps/s 7.7777 KOps/s $\color{#35bf28}+0.28\%$
test_compile_add_self_flat[pytree-eager] 0.6264ms 0.5411ms 1.8482 KOps/s 1.8550 KOps/s $\color{#d91a1a}-0.36\%$
test_compile_add_self_flat[pytree-compile] 0.3715ms 0.3156ms 3.1690 KOps/s 3.1587 KOps/s $\color{#35bf28}+0.33\%$
test_compile_copy_flat[tensordict-compile] 89.4310μs 18.0793μs 55.3118 KOps/s 54.4360 KOps/s $\color{#35bf28}+1.61\%$
test_compile_copy_flat[tensordict-eager] 0.1000ms 26.7165μs 37.4301 KOps/s 36.7716 KOps/s $\color{#35bf28}+1.79\%$
test_compile_copy_flat[pytree-compile] 0.1322ms 70.0555μs 14.2744 KOps/s 14.2311 KOps/s $\color{#35bf28}+0.30\%$
test_compile_copy_flat[pytree-eager] 84.8920μs 51.2267μs 19.5211 KOps/s 19.5633 KOps/s $\color{#d91a1a}-0.22\%$
test_compile_assign_and_add[tensordict-compile] 2.2972ms 0.8119ms 1.2316 KOps/s 1.1235 KOps/s $\textbf{\color{#35bf28}+9.62\%}$
test_compile_assign_and_add[tensordict-eager] 3.5458ms 3.2310ms 309.5042 Ops/s 319.4211 Ops/s $\color{#d91a1a}-3.10\%$
test_compile_assign_and_add[pytree-compile] 2.2627ms 0.8079ms 1.2378 KOps/s 1.1250 KOps/s $\textbf{\color{#35bf28}+10.02\%}$
test_compile_assign_and_add[pytree-eager] 3.2521ms 3.1809ms 314.3755 Ops/s 311.3846 Ops/s $\color{#35bf28}+0.96\%$
test_compile_indexing[tensor-tensordict-compile] 0.1669ms 0.1099ms 9.1024 KOps/s 9.1847 KOps/s $\color{#d91a1a}-0.90\%$
test_compile_indexing[tensor-tensordict-eager] 0.1864ms 60.6886μs 16.4776 KOps/s 16.3259 KOps/s $\color{#35bf28}+0.93\%$
test_compile_indexing[tensor-tensorclass-compile] 0.1381ms 0.1038ms 9.6375 KOps/s 9.6208 KOps/s $\color{#35bf28}+0.17\%$
test_compile_indexing[tensor-tensorclass-eager] 0.1540ms 42.8503μs 23.3370 KOps/s 21.8671 KOps/s $\textbf{\color{#35bf28}+6.72\%}$
test_compile_indexing[tensor-pytree-compile] 0.1626ms 0.1049ms 9.5312 KOps/s 9.2340 KOps/s $\color{#35bf28}+3.22\%$
test_compile_indexing[tensor-pytree-eager] 93.4420μs 42.4760μs 23.5427 KOps/s 22.8597 KOps/s $\color{#35bf28}+2.99\%$
test_compile_indexing[slice-tensordict-compile] 0.1784ms 0.1374ms 7.2784 KOps/s 7.2634 KOps/s $\color{#35bf28}+0.21\%$
test_compile_indexing[slice-tensordict-eager] 0.1666ms 24.8982μs 40.1636 KOps/s 39.9213 KOps/s $\color{#35bf28}+0.61\%$
test_compile_indexing[slice-tensorclass-compile] 0.1792ms 0.1302ms 7.6829 KOps/s 7.6432 KOps/s $\color{#35bf28}+0.52\%$
test_compile_indexing[slice-tensorclass-eager] 57.9310μs 20.6954μs 48.3199 KOps/s 47.8878 KOps/s $\color{#35bf28}+0.90\%$
test_compile_indexing[slice-pytree-compile] 0.1815ms 0.1313ms 7.6177 KOps/s 7.4496 KOps/s $\color{#35bf28}+2.26\%$
test_compile_indexing[slice-pytree-eager] 84.4710μs 20.6005μs 48.5426 KOps/s 47.0439 KOps/s $\color{#35bf28}+3.19\%$
test_compile_indexing[int-tensordict-compile] 0.1753ms 0.1383ms 7.2296 KOps/s 7.2190 KOps/s $\color{#35bf28}+0.15\%$
test_compile_indexing[int-tensordict-eager] 0.5322ms 24.6684μs 40.5377 KOps/s 39.6789 KOps/s $\color{#35bf28}+2.16\%$
test_compile_indexing[int-tensorclass-compile] 0.2829ms 0.1313ms 7.6178 KOps/s 7.5412 KOps/s $\color{#35bf28}+1.02\%$
test_compile_indexing[int-tensorclass-eager] 56.2700μs 20.6794μs 48.3572 KOps/s 48.5992 KOps/s $\color{#d91a1a}-0.50\%$
test_compile_indexing[int-pytree-compile] 0.1957ms 0.1311ms 7.6266 KOps/s 7.3933 KOps/s $\color{#35bf28}+3.15\%$
test_compile_indexing[int-pytree-eager] 57.1010μs 20.3913μs 49.0405 KOps/s 48.1782 KOps/s $\color{#35bf28}+1.79\%$
test_mod_add[eager] 79.2720μs 32.1709μs 31.0840 KOps/s 31.4233 KOps/s $\color{#d91a1a}-1.08\%$
test_mod_add[compile] 0.2232ms 71.1515μs 14.0545 KOps/s 13.6349 KOps/s $\color{#35bf28}+3.08\%$
test_mod_add[compile-overhead] 0.2671ms 0.1362ms 7.3406 KOps/s 7.0298 KOps/s $\color{#35bf28}+4.42\%$
test_mod_wrap[eager] 0.3231ms 0.2370ms 4.2194 KOps/s 3.9099 KOps/s $\textbf{\color{#35bf28}+7.92\%}$
test_mod_wrap[compile] 0.6992ms 0.2990ms 3.3440 KOps/s 3.3648 KOps/s $\color{#d91a1a}-0.62\%$
test_mod_wrap[compile-overhead] 7.5751ms 3.9957ms 250.2710 Ops/s 253.7883 Ops/s $\color{#d91a1a}-1.39\%$
test_mod_wrap_and_backward[eager] 1.7440ms 1.3481ms 741.7587 Ops/s 687.2420 Ops/s $\textbf{\color{#35bf28}+7.93\%}$
test_mod_wrap_and_backward[compile] 1.7169ms 1.3176ms 758.9669 Ops/s 695.9793 Ops/s $\textbf{\color{#35bf28}+9.05\%}$
test_mod_wrap_and_backward[compile-overhead] 1.3112ms 0.8915ms 1.1217 KOps/s 947.5567 Ops/s $\textbf{\color{#35bf28}+18.37\%}$
test_seq_add[eager] 0.1506ms 97.8678μs 10.2179 KOps/s 10.0308 KOps/s $\color{#35bf28}+1.86\%$
test_seq_add[compile] 0.1388ms 81.4139μs 12.2829 KOps/s 12.1687 KOps/s $\color{#35bf28}+0.94\%$
test_seq_add[compile-overhead] 0.1633ms 0.1134ms 8.8203 KOps/s 8.6802 KOps/s $\color{#35bf28}+1.61\%$
test_seq_wrap[eager] 0.4745ms 0.3751ms 2.6662 KOps/s 2.5769 KOps/s $\color{#35bf28}+3.47\%$
test_seq_wrap[compile] 0.3993ms 0.3090ms 3.2362 KOps/s 3.1599 KOps/s $\color{#35bf28}+2.41\%$
test_seq_wrap[compile-overhead] 0.2862ms 0.2187ms 4.5724 KOps/s 4.5844 KOps/s $\color{#d91a1a}-0.26\%$
test_func_call_runtime[False-eager] 0.8098ms 0.7151ms 1.3983 KOps/s 1.3426 KOps/s $\color{#35bf28}+4.15\%$
test_func_call_runtime[False-compile] 0.9267ms 0.7719ms 1.2955 KOps/s 1.2705 KOps/s $\color{#35bf28}+1.97\%$
test_func_call_runtime[False-compile-overhead] 0.4022ms 0.3549ms 2.8179 KOps/s 2.7991 KOps/s $\color{#35bf28}+0.67\%$
test_func_call_runtime[True-eager] 1.0246ms 0.8821ms 1.1336 KOps/s 1.1020 KOps/s $\color{#35bf28}+2.87\%$
test_func_call_runtime[True-compile] 1.0332ms 0.7953ms 1.2574 KOps/s 1.2380 KOps/s $\color{#35bf28}+1.57\%$
test_func_call_runtime[True-compile-overhead] 0.5146ms 0.3766ms 2.6551 KOps/s 2.6598 KOps/s $\color{#d91a1a}-0.18\%$
test_func_call_cm_runtime[False-eager] 0.8581ms 0.7175ms 1.3936 KOps/s 1.3472 KOps/s $\color{#35bf28}+3.45\%$
test_func_call_cm_runtime[False-compile] 0.9773ms 0.7732ms 1.2934 KOps/s 1.2647 KOps/s $\color{#35bf28}+2.27\%$
test_func_call_cm_runtime[False-compile-overhead] 0.4929ms 0.3569ms 2.8016 KOps/s 2.7923 KOps/s $\color{#35bf28}+0.33\%$
test_func_call_cm_runtime[True-eager] 1.1196ms 0.9679ms 1.0332 KOps/s 992.2607 Ops/s $\color{#35bf28}+4.12\%$
test_func_call_cm_runtime[True-compile] 0.9570ms 0.8239ms 1.2137 KOps/s 1.1907 KOps/s $\color{#35bf28}+1.93\%$
test_func_call_cm_runtime[True-compile-overhead] 0.4763ms 0.4024ms 2.4853 KOps/s 2.4770 KOps/s $\color{#35bf28}+0.33\%$
test_vmap_func_call_cm_runtime[eager] 2.4851ms 2.0521ms 487.3107 Ops/s 477.7378 Ops/s $\color{#35bf28}+2.00\%$
test_vmap_func_call_cm_runtime[compile] 1.2466ms 0.8392ms 1.1916 KOps/s 1.1149 KOps/s $\textbf{\color{#35bf28}+6.88\%}$
test_vmap_func_call_cm_runtime[compile-overhead] 0.4934ms 0.4068ms 2.4585 KOps/s 2.4390 KOps/s $\color{#35bf28}+0.80\%$
test_distributed 4.2306ms 0.1971ms 5.0746 KOps/s 8.4422 KOps/s $\textbf{\color{#d91a1a}-39.89\%}$
test_tdmodule 55.1310μs 15.4623μs 64.6734 KOps/s 61.6996 KOps/s $\color{#35bf28}+4.82\%$
test_tdmodule_dispatch 50.0610μs 30.4262μs 32.8664 KOps/s 34.1244 KOps/s $\color{#d91a1a}-3.69\%$
test_tdseq 37.9310μs 15.9731μs 62.6051 KOps/s 62.1004 KOps/s $\color{#35bf28}+0.81\%$
test_tdseq_dispatch 56.8510μs 32.6544μs 30.6237 KOps/s 30.9775 KOps/s $\color{#d91a1a}-1.14\%$
test_instantiation_functorch 1.9589ms 1.8483ms 541.0362 Ops/s 534.0351 Ops/s $\color{#35bf28}+1.31\%$
test_instantiation_td 1.8533ms 1.2240ms 816.9686 Ops/s 834.2453 Ops/s $\color{#d91a1a}-2.07\%$
test_exec_functorch 0.3182ms 0.2187ms 4.5725 KOps/s 4.7479 KOps/s $\color{#d91a1a}-3.69\%$
test_exec_functional_call 0.2782ms 0.2213ms 4.5193 KOps/s 4.7969 KOps/s $\textbf{\color{#d91a1a}-5.79\%}$
test_exec_td 0.2833ms 0.2250ms 4.4436 KOps/s 4.6224 KOps/s $\color{#d91a1a}-3.87\%$
test_exec_td_decorator 0.5717ms 0.2674ms 3.7395 KOps/s 3.8682 KOps/s $\color{#d91a1a}-3.33\%$
test_vmap_mlp_speed[True-True] 0.8220ms 0.6957ms 1.4374 KOps/s 1.4595 KOps/s $\color{#d91a1a}-1.51\%$
test_vmap_mlp_speed[True-False] 0.7656ms 0.6825ms 1.4651 KOps/s 1.4558 KOps/s $\color{#35bf28}+0.64\%$
test_vmap_mlp_speed[False-True] 0.6333ms 0.5700ms 1.7543 KOps/s 1.7303 KOps/s $\color{#35bf28}+1.39\%$
test_vmap_mlp_speed[False-False] 0.6179ms 0.5716ms 1.7493 KOps/s 1.7335 KOps/s $\color{#35bf28}+0.92\%$
test_vmap_mlp_speed_decorator[True-True] 1.2009ms 0.6691ms 1.4946 KOps/s 1.4809 KOps/s $\color{#35bf28}+0.93\%$
test_vmap_mlp_speed_decorator[True-False] 0.7925ms 0.6707ms 1.4910 KOps/s 1.4746 KOps/s $\color{#35bf28}+1.11\%$
test_vmap_mlp_speed_decorator[False-True] 0.6917ms 0.5853ms 1.7084 KOps/s 1.6868 KOps/s $\color{#35bf28}+1.28\%$
test_vmap_mlp_speed_decorator[False-False] 0.7121ms 0.5908ms 1.6925 KOps/s 1.6893 KOps/s $\color{#35bf28}+0.19\%$
test_vmap_transformer_speed[True-True] 9.1597ms 8.3333ms 120.0008 Ops/s 119.0690 Ops/s $\color{#35bf28}+0.78\%$
test_vmap_transformer_speed[True-False] 8.3100ms 8.2573ms 121.1046 Ops/s 119.3712 Ops/s $\color{#35bf28}+1.45\%$
test_vmap_transformer_speed[False-True] 8.1274ms 8.0612ms 124.0511 Ops/s 122.3798 Ops/s $\color{#35bf28}+1.37\%$
test_vmap_transformer_speed[False-False] 8.1691ms 8.0779ms 123.7947 Ops/s 122.0096 Ops/s $\color{#35bf28}+1.46\%$
test_vmap_transformer_speed_decorator[True-True] 20.0525ms 19.3018ms 51.8087 Ops/s 51.3629 Ops/s $\color{#35bf28}+0.87\%$
test_vmap_transformer_speed_decorator[True-False] 19.4321ms 19.3628ms 51.6453 Ops/s 51.0118 Ops/s $\color{#35bf28}+1.24\%$
test_vmap_transformer_speed_decorator[False-True] 19.2859ms 19.2113ms 52.0527 Ops/s 51.7465 Ops/s $\color{#35bf28}+0.59\%$
test_vmap_transformer_speed_decorator[False-False] 19.3091ms 19.2040ms 52.0724 Ops/s 51.6986 Ops/s $\color{#35bf28}+0.72\%$
test_to_module_speed[True] 2.0240ms 0.9417ms 1.0620 KOps/s 1.0371 KOps/s $\color{#35bf28}+2.40\%$
test_to_module_speed[False] 1.0160ms 0.9171ms 1.0903 KOps/s 1.0695 KOps/s $\color{#35bf28}+1.95\%$
test_tc_init 73.1410μs 36.2384μs 27.5950 KOps/s 28.1108 KOps/s $\color{#d91a1a}-1.83\%$
test_tc_init_nested 0.1055ms 73.1124μs 13.6776 KOps/s 13.6855 KOps/s $\color{#d91a1a}-0.06\%$
test_tc_first_layer_tensor 15.5131μs 0.6906μs 1.4479 MOps/s 1.4508 MOps/s $\color{#d91a1a}-0.20\%$
test_tc_first_layer_nontensor 33.1700μs 2.2495μs 444.5510 KOps/s 447.5195 KOps/s $\color{#d91a1a}-0.66\%$
test_tc_second_layer_tensor 7.2525μs 1.3792μs 725.0404 KOps/s 722.4722 KOps/s $\color{#35bf28}+0.36\%$
test_tc_second_layer_nontensor 28.9710μs 2.9239μs 342.0086 KOps/s 340.4652 KOps/s $\color{#35bf28}+0.45\%$
test_unbind 0.1952s 10.9018ms 91.7278 Ops/s 101.4254 Ops/s $\textbf{\color{#d91a1a}-9.56\%}$
test_full_like 0.6491ms 0.5739ms 1.7424 KOps/s 1.7374 KOps/s $\color{#35bf28}+0.29\%$
test_zeros_like 0.2635ms 0.1979ms 5.0543 KOps/s 5.0523 KOps/s $\color{#35bf28}+0.04\%$
test_ones_like 0.2319ms 0.1977ms 5.0591 KOps/s 5.0570 KOps/s $\color{#35bf28}+0.04\%$
test_clone 0.4577ms 0.4144ms 2.4131 KOps/s 2.4140 KOps/s $\color{#d91a1a}-0.04\%$
test_squeeze 36.1910μs 9.6456μs 103.6747 KOps/s 102.9745 KOps/s $\color{#35bf28}+0.68\%$
test_unsqueeze 0.2167ms 75.1298μs 13.3103 KOps/s 13.3092 KOps/s $+0.01\%$
test_split 0.4489ms 0.1599ms 6.2557 KOps/s 6.3777 KOps/s $\color{#d91a1a}-1.91\%$
test_permute 0.2489ms 0.1804ms 5.5438 KOps/s 5.5986 KOps/s $\color{#d91a1a}-0.98\%$
test_stack 1.2680ms 0.8554ms 1.1690 KOps/s 1.1415 KOps/s $\color{#35bf28}+2.41\%$
test_cat 1.2654ms 1.2315ms 812.0003 Ops/s 811.8053 Ops/s $\color{#35bf28}+0.02\%$

@vmoens vmoens added the bug Something isn't working label Sep 17, 2024
@vmoens vmoens merged commit 4a0b18c into gh/vmoens/23/base Sep 17, 2024
40 of 45 checks passed
vmoens added a commit that referenced this pull request Sep 17, 2024
ghstack-source-id: bdf9ab4f4c61ef5a407d76ebd5433d0d6a897b59
Pull Request resolved: #999
@vmoens vmoens deleted the gh/vmoens/23/head branch September 17, 2024 18:11
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants