Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BugFix] Fix get for nestedkeys with default in tensorclass #1211

Merged
merged 2 commits into from
Feb 6, 2025

Conversation

vmoens
Copy link
Contributor

@vmoens vmoens commented Feb 6, 2025

[ghstack-poisoned]
@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Feb 6, 2025
@vmoens vmoens added bug Something isn't working suitable for minor labels Feb 6, 2025
[ghstack-poisoned]
Copy link

github-actions bot commented Feb 6, 2025

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 217. Improved: $\large\color{#35bf28}10$. Worsened: $\large\color{#d91a1a}21$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 41.2970μs 21.0975μs 47.3990 KOps/s 48.7173 KOps/s $\color{#d91a1a}-2.71\%$
test_plain_set_stack_nested 56.5150μs 21.3987μs 46.7319 KOps/s 48.5732 KOps/s $\color{#d91a1a}-3.79\%$
test_plain_set_nested_inplace 56.8660μs 23.2200μs 43.0664 KOps/s 44.4481 KOps/s $\color{#d91a1a}-3.11\%$
test_plain_set_stack_nested_inplace 53.8410μs 23.1042μs 43.2821 KOps/s 44.2785 KOps/s $\color{#d91a1a}-2.25\%$
test_items 33.9630μs 4.2105μs 237.5005 KOps/s 244.7400 KOps/s $\color{#d91a1a}-2.96\%$
test_items_nested 0.5749ms 0.4041ms 2.4745 KOps/s 2.4863 KOps/s $\color{#d91a1a}-0.48\%$
test_items_nested_locked 0.7719ms 0.4058ms 2.4645 KOps/s 2.4737 KOps/s $\color{#d91a1a}-0.37\%$
test_items_nested_leaf 0.1616ms 77.4093μs 12.9183 KOps/s 13.0079 KOps/s $\color{#d91a1a}-0.69\%$
test_items_stack_nested 0.7416ms 0.4112ms 2.4318 KOps/s 2.4763 KOps/s $\color{#d91a1a}-1.80\%$
test_items_stack_nested_leaf 0.1605ms 80.7940μs 12.3772 KOps/s 12.7532 KOps/s $\color{#d91a1a}-2.95\%$
test_items_stack_nested_locked 0.5002ms 0.4050ms 2.4691 KOps/s 2.4623 KOps/s $\color{#35bf28}+0.27\%$
test_keys 29.7650μs 3.5113μs 284.7918 KOps/s 286.4989 KOps/s $\color{#d91a1a}-0.60\%$
test_keys_nested 0.2078ms 0.1628ms 6.1416 KOps/s 6.1448 KOps/s $\color{#d91a1a}-0.05\%$
test_keys_nested_locked 0.8682ms 0.1685ms 5.9363 KOps/s 5.9409 KOps/s $\color{#d91a1a}-0.08\%$
test_keys_nested_leaf 0.1817ms 0.1424ms 7.0203 KOps/s 7.0989 KOps/s $\color{#d91a1a}-1.11\%$
test_keys_stack_nested 0.2116ms 0.1608ms 6.2188 KOps/s 6.2346 KOps/s $\color{#d91a1a}-0.25\%$
test_keys_stack_nested_leaf 0.1772ms 0.1411ms 7.0869 KOps/s 7.3530 KOps/s $\color{#d91a1a}-3.62\%$
test_keys_stack_nested_locked 0.2122ms 0.1678ms 5.9607 KOps/s 6.0575 KOps/s $\color{#d91a1a}-1.60\%$
test_values 8.5635μs 1.0530μs 949.7033 KOps/s 952.7548 KOps/s $\color{#d91a1a}-0.32\%$
test_values_nested 0.1189ms 62.0936μs 16.1047 KOps/s 16.3184 KOps/s $\color{#d91a1a}-1.31\%$
test_values_nested_locked 0.1243ms 62.0653μs 16.1121 KOps/s 15.6481 KOps/s $\color{#35bf28}+2.97\%$
test_values_nested_leaf 0.1338ms 71.6576μs 13.9553 KOps/s 14.0624 KOps/s $\color{#d91a1a}-0.76\%$
test_values_stack_nested 0.1203ms 63.5883μs 15.7262 KOps/s 15.6414 KOps/s $\color{#35bf28}+0.54\%$
test_values_stack_nested_leaf 0.1399ms 71.7497μs 13.9373 KOps/s 14.4991 KOps/s $\color{#d91a1a}-3.87\%$
test_values_stack_nested_locked 0.1317ms 63.2394μs 15.8129 KOps/s 15.7646 KOps/s $\color{#35bf28}+0.31\%$
test_membership 15.8900μs 0.8641μs 1.1573 MOps/s 1.4071 MOps/s $\textbf{\color{#d91a1a}-17.76\%}$
test_membership_nested 35.8070μs 2.8958μs 345.3293 KOps/s 347.7956 KOps/s $\color{#d91a1a}-0.71\%$
test_membership_nested_leaf 19.6570μs 2.9183μs 342.6698 KOps/s 346.8222 KOps/s $\color{#d91a1a}-1.20\%$
test_membership_stacked_nested 23.4230μs 2.9208μs 342.3741 KOps/s 348.0158 KOps/s $\color{#d91a1a}-1.62\%$
test_membership_stacked_nested_leaf 23.0930μs 2.8896μs 346.0683 KOps/s 347.5217 KOps/s $\color{#d91a1a}-0.42\%$
test_membership_nested_last 51.8090μs 4.3600μs 229.3552 KOps/s 228.1391 KOps/s $\color{#35bf28}+0.53\%$
test_membership_nested_leaf_last 40.4530μs 4.4096μs 226.7801 KOps/s 224.1548 KOps/s $\color{#35bf28}+1.17\%$
test_membership_stacked_nested_last 33.7830μs 5.2530μs 190.3681 KOps/s 74.8151 KOps/s $\textbf{\color{#35bf28}+154.45\%}$
test_membership_stacked_nested_leaf_last 31.8200μs 5.2786μs 189.4451 KOps/s 74.9572 KOps/s $\textbf{\color{#35bf28}+152.74\%}$
test_nested_getleaf 33.0320μs 11.3096μs 88.4202 KOps/s 96.3593 KOps/s $\textbf{\color{#d91a1a}-8.24\%}$
test_nested_get 30.6870μs 10.8774μs 91.9335 KOps/s 101.9186 KOps/s $\textbf{\color{#d91a1a}-9.80\%}$
test_stacked_getleaf 48.8740μs 11.4385μs 87.4239 KOps/s 97.7895 KOps/s $\textbf{\color{#d91a1a}-10.60\%}$
test_stacked_get 29.9360μs 10.8370μs 92.2767 KOps/s 102.1108 KOps/s $\textbf{\color{#d91a1a}-9.63\%}$
test_nested_getitemleaf 35.2360μs 12.1437μs 82.3471 KOps/s 91.9894 KOps/s $\textbf{\color{#d91a1a}-10.48\%}$
test_nested_getitem 45.4750μs 11.5327μs 86.7101 KOps/s 96.4582 KOps/s $\textbf{\color{#d91a1a}-10.11\%}$
test_stacked_getitemleaf 31.5490μs 11.9127μs 83.9440 KOps/s 93.0179 KOps/s $\textbf{\color{#d91a1a}-9.76\%}$
test_stacked_getitem 37.0190μs 11.4882μs 87.0456 KOps/s 96.7908 KOps/s $\textbf{\color{#d91a1a}-10.07\%}$
test_lock_nested 6.7322ms 0.4229ms 2.3647 KOps/s 2.4320 KOps/s $\color{#d91a1a}-2.77\%$
test_lock_stack_nested 0.6584ms 0.4228ms 2.3655 KOps/s 2.4516 KOps/s $\color{#d91a1a}-3.51\%$
test_unlock_nested 0.4260ms 0.3342ms 2.9919 KOps/s 2.9446 KOps/s $\color{#35bf28}+1.61\%$
test_unlock_stack_nested 0.5310ms 0.3376ms 2.9618 KOps/s 3.0314 KOps/s $\color{#d91a1a}-2.30\%$
test_flatten_speed 0.2079ms 0.1006ms 9.9387 KOps/s 9.8463 KOps/s $\color{#35bf28}+0.94\%$
test_unflatten_speed 0.6352ms 0.5350ms 1.8691 KOps/s 1.9718 KOps/s $\textbf{\color{#d91a1a}-5.21\%}$
test_common_ops 0.9726ms 0.8128ms 1.2304 KOps/s 1.1957 KOps/s $\color{#35bf28}+2.90\%$
test_creation 33.6130μs 2.6105μs 383.0665 KOps/s 405.9510 KOps/s $\textbf{\color{#d91a1a}-5.64\%}$
test_creation_empty 38.9020μs 13.3447μs 74.9360 KOps/s 79.9108 KOps/s $\textbf{\color{#d91a1a}-6.23\%}$
test_creation_nested_1 41.1570μs 16.3145μs 61.2951 KOps/s 64.2953 KOps/s $\color{#d91a1a}-4.67\%$
test_creation_nested_2 53.7400μs 20.9747μs 47.6765 KOps/s 49.6301 KOps/s $\color{#d91a1a}-3.94\%$
test_clone 64.5900μs 13.7813μs 72.5619 KOps/s 73.7167 KOps/s $\color{#d91a1a}-1.57\%$
test_getitem[int] 0.9051ms 13.1561μs 76.0105 KOps/s 78.5636 KOps/s $\color{#d91a1a}-3.25\%$
test_getitem[slice_int] 0.1279ms 25.0587μs 39.9063 KOps/s 40.6341 KOps/s $\color{#d91a1a}-1.79\%$
test_getitem[range] 0.1606ms 52.4879μs 19.0520 KOps/s 20.5570 KOps/s $\textbf{\color{#d91a1a}-7.32\%}$
test_getitem[tuple] 0.1241ms 20.7158μs 48.2723 KOps/s 49.4695 KOps/s $\color{#d91a1a}-2.42\%$
test_getitem[list] 0.1595ms 46.4327μs 21.5365 KOps/s 23.0080 KOps/s $\textbf{\color{#d91a1a}-6.40\%}$
test_setitem_dim[int] 60.6430μs 25.0930μs 39.8517 KOps/s 39.6207 KOps/s $\color{#35bf28}+0.58\%$
test_setitem_dim[slice_int] 94.5160μs 50.7685μs 19.6972 KOps/s 20.0573 KOps/s $\color{#d91a1a}-1.80\%$
test_setitem_dim[range] 0.1381ms 78.7211μs 12.7031 KOps/s 13.1409 KOps/s $\color{#d91a1a}-3.33\%$
test_setitem_dim[tuple] 73.9080μs 40.1818μs 24.8869 KOps/s 25.0630 KOps/s $\color{#d91a1a}-0.70\%$
test_setitem 71.5230μs 21.4361μs 46.6504 KOps/s 47.7877 KOps/s $\color{#d91a1a}-2.38\%$
test_set 86.1310μs 21.1249μs 47.3375 KOps/s 48.9219 KOps/s $\color{#d91a1a}-3.24\%$
test_set_shared 0.3662ms 0.1808ms 5.5297 KOps/s 5.4479 KOps/s $\color{#35bf28}+1.50\%$
test_update 0.1109ms 24.3781μs 41.0205 KOps/s 42.0779 KOps/s $\color{#d91a1a}-2.51\%$
test_update_nested 94.0550μs 33.8700μs 29.5247 KOps/s 29.5284 KOps/s $\color{#d91a1a}-0.01\%$
test_update__nested 0.4651ms 33.9695μs 29.4381 KOps/s 29.5119 KOps/s $\color{#d91a1a}-0.25\%$
test_set_nested 72.8660μs 23.3041μs 42.9108 KOps/s 43.9302 KOps/s $\color{#d91a1a}-2.32\%$
test_set_nested_new 80.0090μs 27.7783μs 35.9994 KOps/s 36.5824 KOps/s $\color{#d91a1a}-1.59\%$
test_select 0.9736ms 43.9297μs 22.7636 KOps/s 22.5518 KOps/s $\color{#35bf28}+0.94\%$
test_select_nested 0.1319ms 63.7027μs 15.6979 KOps/s 15.9148 KOps/s $\color{#d91a1a}-1.36\%$
test_exclude_nested 0.1517ms 81.7923μs 12.2261 KOps/s 12.4380 KOps/s $\color{#d91a1a}-1.70\%$
test_empty[True] 0.7307ms 0.4121ms 2.4266 KOps/s 2.4639 KOps/s $\color{#d91a1a}-1.52\%$
test_empty[False] 9.3272μs 1.3798μs 724.7238 KOps/s 706.5737 KOps/s $\color{#35bf28}+2.57\%$
test_unbind_speed 0.3841ms 0.2730ms 3.6630 KOps/s 3.6468 KOps/s $\color{#35bf28}+0.44\%$
test_unbind_speed_stack0 0.4707ms 0.2693ms 3.7132 KOps/s 3.8329 KOps/s $\color{#d91a1a}-3.12\%$
test_unbind_speed_stack1 0.1024s 0.7269ms 1.3757 KOps/s 1.2810 KOps/s $\textbf{\color{#35bf28}+7.40\%}$
test_split 0.1018s 1.7673ms 565.8323 Ops/s 579.1086 Ops/s $\color{#d91a1a}-2.29\%$
test_chunk 0.1047s 1.7684ms 565.4789 Ops/s 638.6596 Ops/s $\textbf{\color{#d91a1a}-11.46\%}$
test_consolidate_njt[False-None] 8.5542ms 8.1495ms 122.7072 Ops/s 108.2749 Ops/s $\textbf{\color{#35bf28}+13.33\%}$
test_creation[device0] 3.4058ms 92.1524μs 10.8516 KOps/s 10.8434 KOps/s $\color{#35bf28}+0.08\%$
test_creation_from_tensor 0.2150ms 93.9803μs 10.6405 KOps/s 10.5186 KOps/s $\color{#35bf28}+1.16\%$
test_add_one[memmap_tensor0] 0.1003ms 4.9960μs 200.1596 KOps/s 202.0771 KOps/s $\color{#d91a1a}-0.95\%$
test_contiguous[memmap_tensor0] 10.4890μs 0.5373μs 1.8613 MOps/s 1.9492 MOps/s $\color{#d91a1a}-4.51\%$
test_stack[memmap_tensor0] 27.5410μs 3.4495μs 289.8979 KOps/s 300.6608 KOps/s $\color{#d91a1a}-3.58\%$
test_memmaptd_index 1.2487ms 0.2283ms 4.3794 KOps/s 4.4565 KOps/s $\color{#d91a1a}-1.73\%$
test_memmaptd_index_astensor 0.5273ms 0.3147ms 3.1780 KOps/s 3.2484 KOps/s $\color{#d91a1a}-2.17\%$
test_memmaptd_index_op 1.0497ms 0.6063ms 1.6493 KOps/s 1.6703 KOps/s $\color{#d91a1a}-1.26\%$
test_serialize_model 0.2090s 0.1283s 7.7962 Ops/s 8.7276 Ops/s $\textbf{\color{#d91a1a}-10.67\%}$
test_serialize_model_pickle 0.4340s 0.3879s 2.5777 Ops/s 2.5362 Ops/s $\color{#35bf28}+1.64\%$
test_serialize_weights 0.1212s 0.1143s 8.7460 Ops/s 8.9305 Ops/s $\color{#d91a1a}-2.07\%$
test_serialize_weights_returnearly 0.3295s 0.1833s 5.4555 Ops/s 5.7023 Ops/s $\color{#d91a1a}-4.33\%$
test_serialize_weights_pickle 1.0544s 0.7027s 1.4232 Ops/s 1.2027 Ops/s $\textbf{\color{#35bf28}+18.33\%}$
test_serialize_weights_filesystem 0.2557s 0.1546s 6.4667 Ops/s 6.9826 Ops/s $\textbf{\color{#d91a1a}-7.39\%}$
test_serialize_model_filesystem 0.1488s 0.1408s 7.1009 Ops/s 6.7794 Ops/s $\color{#35bf28}+4.74\%$
test_reshape_pytree 68.5580μs 26.1064μs 38.3048 KOps/s 38.1752 KOps/s $\color{#35bf28}+0.34\%$
test_reshape_td 72.2150μs 33.1188μs 30.1944 KOps/s 29.8519 KOps/s $\color{#35bf28}+1.15\%$
test_view_pytree 66.2440μs 26.3745μs 37.9154 KOps/s 38.2273 KOps/s $\color{#d91a1a}-0.82\%$
test_view_td 0.1036ms 38.8200μs 25.7599 KOps/s 26.5242 KOps/s $\color{#d91a1a}-2.88\%$
test_unbind_pytree 74.9300μs 29.8135μs 33.5419 KOps/s 33.3249 KOps/s $\color{#35bf28}+0.65\%$
test_unbind_td 0.3489ms 40.0786μs 24.9510 KOps/s 25.0972 KOps/s $\color{#d91a1a}-0.58\%$
test_split_pytree 75.5010μs 29.3133μs 34.1142 KOps/s 34.2521 KOps/s $\color{#d91a1a}-0.40\%$
test_split_td 0.5208ms 45.6766μs 21.8931 KOps/s 22.3884 KOps/s $\color{#d91a1a}-2.21\%$
test_add_pytree 0.1079ms 34.9200μs 28.6369 KOps/s 28.2312 KOps/s $\color{#35bf28}+1.44\%$
test_add_td 0.1244ms 59.5397μs 16.7955 KOps/s 17.1717 KOps/s $\color{#d91a1a}-2.19\%$
test_compile_add_one_nested[tensordict-compile] 0.1231ms 67.2236μs 14.8757 KOps/s 15.0449 KOps/s $\color{#d91a1a}-1.12\%$
test_compile_add_one_nested[tensordict-eager] 0.6439ms 0.1729ms 5.7847 KOps/s 5.9455 KOps/s $\color{#d91a1a}-2.71\%$
test_compile_add_one_nested[pytree-compile] 0.1440ms 45.5173μs 21.9697 KOps/s 21.6422 KOps/s $\color{#35bf28}+1.51\%$
test_compile_add_one_nested[pytree-eager] 0.2301ms 0.1178ms 8.4901 KOps/s 8.4195 KOps/s $\color{#35bf28}+0.84\%$
test_compile_copy_nested[tensordict-compile] 89.5270μs 28.9546μs 34.5368 KOps/s 35.8037 KOps/s $\color{#d91a1a}-3.54\%$
test_compile_copy_nested[tensordict-eager] 0.1278ms 59.6610μs 16.7614 KOps/s 17.2739 KOps/s $\color{#d91a1a}-2.97\%$
test_compile_copy_nested[pytree-compile] 0.1518ms 79.6792μs 12.5503 KOps/s 12.6727 KOps/s $\color{#d91a1a}-0.97\%$
test_compile_copy_nested[pytree-eager] 0.1287ms 66.6680μs 14.9997 KOps/s 15.0754 KOps/s $\color{#d91a1a}-0.50\%$
test_compile_add_one_flat[tensordict-compile] 0.1992ms 0.1092ms 9.1607 KOps/s 9.2914 KOps/s $\color{#d91a1a}-1.41\%$
test_compile_add_one_flat[tensordict-eager] 0.4213ms 0.2168ms 4.6131 KOps/s 4.6696 KOps/s $\color{#d91a1a}-1.21\%$
test_compile_add_one_flat[tensorclass-compile] 0.2234ms 48.4961μs 20.6202 KOps/s 20.6775 KOps/s $\color{#d91a1a}-0.28\%$
test_compile_add_one_flat[tensorclass-eager] 0.2005ms 66.8540μs 14.9580 KOps/s 15.2795 KOps/s $\color{#d91a1a}-2.10\%$
test_compile_add_one_flat[pytree-compile] 0.1847ms 0.1011ms 9.8866 KOps/s 9.9411 KOps/s $\color{#d91a1a}-0.55\%$
test_compile_add_one_flat[pytree-eager] 0.3456ms 0.2022ms 4.9448 KOps/s 5.0205 KOps/s $\color{#d91a1a}-1.51\%$
test_compile_add_self_flat[tensordict-eager] 0.4253ms 0.2337ms 4.2788 KOps/s 4.2830 KOps/s $\color{#d91a1a}-0.10\%$
test_compile_add_self_flat[tensordict-compile] 0.2162ms 0.1122ms 8.9091 KOps/s 9.2443 KOps/s $\color{#d91a1a}-3.63\%$
test_compile_add_self_flat[tensorclass-eager] 0.3038ms 63.5556μs 15.7343 KOps/s 16.1193 KOps/s $\color{#d91a1a}-2.39\%$
test_compile_add_self_flat[tensorclass-compile] 0.1749ms 49.6059μs 20.1589 KOps/s 19.9647 KOps/s $\color{#35bf28}+0.97\%$
test_compile_add_self_flat[pytree-eager] 0.2440ms 0.1583ms 6.3153 KOps/s 6.3949 KOps/s $\color{#d91a1a}-1.25\%$
test_compile_add_self_flat[pytree-compile] 0.2323ms 0.1026ms 9.7468 KOps/s 9.8907 KOps/s $\color{#d91a1a}-1.46\%$
test_compile_copy_flat[tensordict-compile] 49.8540μs 22.0536μs 45.3441 KOps/s 46.6687 KOps/s $\color{#d91a1a}-2.84\%$
test_compile_copy_flat[tensordict-eager] 0.1341ms 69.5944μs 14.3690 KOps/s 15.1530 KOps/s $\textbf{\color{#d91a1a}-5.17\%}$
test_compile_copy_flat[pytree-compile] 0.1820ms 83.6681μs 11.9520 KOps/s 12.2432 KOps/s $\color{#d91a1a}-2.38\%$
test_compile_copy_flat[pytree-eager] 0.1587ms 68.3786μs 14.6245 KOps/s 14.8903 KOps/s $\color{#d91a1a}-1.79\%$
test_compile_assign_and_add[tensordict-compile] 0.3002ms 0.2166ms 4.6176 KOps/s 4.6085 KOps/s $\color{#35bf28}+0.20\%$
test_compile_assign_and_add[tensordict-eager] 1.9870ms 1.3857ms 721.6512 Ops/s 716.8705 Ops/s $\color{#35bf28}+0.67\%$
test_compile_assign_and_add[pytree-compile] 0.4880ms 0.2099ms 4.7647 KOps/s 4.6527 KOps/s $\color{#35bf28}+2.41\%$
test_compile_assign_and_add[pytree-eager] 1.0333ms 0.8231ms 1.2149 KOps/s 1.2293 KOps/s $\color{#d91a1a}-1.17\%$
test_compile_assign_and_add_stack[compile] 0.8475ms 0.4595ms 2.1763 KOps/s 2.1809 KOps/s $\color{#d91a1a}-0.21\%$
test_compile_assign_and_add_stack[eager] 3.6995ms 2.8335ms 352.9180 Ops/s 353.1213 Ops/s $\color{#d91a1a}-0.06\%$
test_compile_indexing[tensor-tensordict-compile] 0.1025ms 38.5586μs 25.9346 KOps/s 25.3787 KOps/s $\color{#35bf28}+2.19\%$
test_compile_indexing[tensor-tensordict-eager] 0.5307ms 34.1947μs 29.2443 KOps/s 30.0857 KOps/s $\color{#d91a1a}-2.80\%$
test_compile_indexing[tensor-tensorclass-compile] 0.1005ms 30.8458μs 32.4193 KOps/s 32.3329 KOps/s $\color{#35bf28}+0.27\%$
test_compile_indexing[tensor-tensorclass-eager] 70.6510μs 23.1574μs 43.1828 KOps/s 41.9026 KOps/s $\color{#35bf28}+3.06\%$
test_compile_indexing[tensor-pytree-compile] 91.8910μs 31.5819μs 31.6637 KOps/s 31.2978 KOps/s $\color{#35bf28}+1.17\%$
test_compile_indexing[tensor-pytree-eager] 55.2930μs 23.1212μs 43.2504 KOps/s 41.6993 KOps/s $\color{#35bf28}+3.72\%$
test_compile_indexing[slice-tensordict-compile] 0.1033ms 53.5708μs 18.6669 KOps/s 18.6865 KOps/s $\color{#d91a1a}-0.11\%$
test_compile_indexing[slice-tensordict-eager] 0.4695ms 20.1524μs 49.6219 KOps/s 48.4865 KOps/s $\color{#35bf28}+2.34\%$
test_compile_indexing[slice-tensorclass-compile] 0.1017ms 45.8213μs 21.8239 KOps/s 21.7831 KOps/s $\color{#35bf28}+0.19\%$
test_compile_indexing[slice-tensorclass-eager] 56.9260μs 19.1201μs 52.3009 KOps/s 52.3075 KOps/s $\color{#d91a1a}-0.01\%$
test_compile_indexing[slice-pytree-compile] 97.3920μs 46.3834μs 21.5594 KOps/s 21.2538 KOps/s $\color{#35bf28}+1.44\%$
test_compile_indexing[slice-pytree-eager] 52.6890μs 18.9554μs 52.7553 KOps/s 52.9796 KOps/s $\color{#d91a1a}-0.42\%$
test_compile_indexing[int-tensordict-compile] 0.1087ms 55.0297μs 18.1720 KOps/s 18.1987 KOps/s $\color{#d91a1a}-0.15\%$
test_compile_indexing[int-tensordict-eager] 0.9171ms 20.0102μs 49.9744 KOps/s 49.7976 KOps/s $\color{#35bf28}+0.36\%$
test_compile_indexing[int-tensorclass-compile] 94.9770μs 46.4628μs 21.5226 KOps/s 20.9122 KOps/s $\color{#35bf28}+2.92\%$
test_compile_indexing[int-tensorclass-eager] 51.4960μs 19.1640μs 52.1811 KOps/s 52.6326 KOps/s $\color{#d91a1a}-0.86\%$
test_compile_indexing[int-pytree-compile] 95.4580μs 46.3050μs 21.5959 KOps/s 21.0491 KOps/s $\color{#35bf28}+2.60\%$
test_compile_indexing[int-pytree-eager] 55.2730μs 18.9914μs 52.6554 KOps/s 53.2244 KOps/s $\color{#d91a1a}-1.07\%$
test_mod_add[eager] 83.8160μs 35.8859μs 27.8661 KOps/s 27.2295 KOps/s $\color{#35bf28}+2.34\%$
test_mod_add[compile] 0.2084ms 68.3220μs 14.6366 KOps/s 14.8205 KOps/s $\color{#d91a1a}-1.24\%$
test_mod_add[compile-overhead] 0.1358ms 64.8709μs 15.4152 KOps/s 15.0624 KOps/s $\color{#35bf28}+2.34\%$
test_mod_wrap[eager] 0.4440ms 0.2197ms 4.5514 KOps/s 4.5288 KOps/s $\color{#35bf28}+0.50\%$
test_mod_wrap[compile] 2.7032ms 0.2347ms 4.2614 KOps/s 4.2879 KOps/s $\color{#d91a1a}-0.62\%$
test_mod_wrap[compile-overhead] 0.4460ms 0.2322ms 4.3063 KOps/s 4.3043 KOps/s $\color{#35bf28}+0.05\%$
test_mod_wrap_and_backward[eager] 16.0338ms 13.2525ms 75.4576 Ops/s 91.2782 Ops/s $\textbf{\color{#d91a1a}-17.33\%}$
test_mod_wrap_and_backward[compile] 13.0434ms 11.3132ms 88.3921 Ops/s 92.9863 Ops/s $\color{#d91a1a}-4.94\%$
test_mod_wrap_and_backward[compile-overhead] 12.8760ms 11.1205ms 89.9236 Ops/s 92.6948 Ops/s $\color{#d91a1a}-2.99\%$
test_seq_add[eager] 0.2000ms 0.1180ms 8.4714 KOps/s 8.2910 KOps/s $\color{#35bf28}+2.18\%$
test_seq_add[compile] 0.1341ms 79.8150μs 12.5290 KOps/s 12.7388 KOps/s $\color{#d91a1a}-1.65\%$
test_seq_add[compile-overhead] 0.1503ms 78.0501μs 12.8123 KOps/s 13.0877 KOps/s $\color{#d91a1a}-2.10\%$
test_seq_wrap[eager] 0.6013ms 0.4466ms 2.2392 KOps/s 2.1528 KOps/s $\color{#35bf28}+4.01\%$
test_seq_wrap[compile] 0.3388ms 0.2433ms 4.1101 KOps/s 3.9961 KOps/s $\color{#35bf28}+2.85\%$
test_seq_wrap[compile-overhead] 0.3761ms 0.2429ms 4.1168 KOps/s 4.0083 KOps/s $\color{#35bf28}+2.71\%$
test_func_call_runtime[False-eager] 0.6485ms 0.5398ms 1.8526 KOps/s 1.8397 KOps/s $\color{#35bf28}+0.70\%$
test_func_call_runtime[False-compile] 0.8035ms 0.4447ms 2.2486 KOps/s 2.2342 KOps/s $\color{#35bf28}+0.65\%$
test_func_call_runtime[False-compile-overhead] 0.6141ms 0.4414ms 2.2655 KOps/s 2.2373 KOps/s $\color{#35bf28}+1.26\%$
test_func_call_runtime[True-eager] 1.1938ms 0.7537ms 1.3269 KOps/s 1.3064 KOps/s $\color{#35bf28}+1.57\%$
test_func_call_runtime[True-compile] 0.5744ms 0.4627ms 2.1611 KOps/s 2.1255 KOps/s $\color{#35bf28}+1.68\%$
test_func_call_runtime[True-compile-overhead] 0.9547ms 0.4646ms 2.1523 KOps/s 2.1301 KOps/s $\color{#35bf28}+1.05\%$
test_func_call_cm_runtime[False-eager] 0.7342ms 0.5357ms 1.8668 KOps/s 1.8532 KOps/s $\color{#35bf28}+0.74\%$
test_func_call_cm_runtime[False-compile] 0.6183ms 0.4405ms 2.2704 KOps/s 2.2113 KOps/s $\color{#35bf28}+2.67\%$
test_func_call_cm_runtime[False-compile-overhead] 0.5648ms 0.4408ms 2.2686 KOps/s 2.2209 KOps/s $\color{#35bf28}+2.15\%$
test_func_call_cm_runtime[True-eager] 1.6005ms 0.8977ms 1.1140 KOps/s 1.1034 KOps/s $\color{#35bf28}+0.96\%$
test_func_call_cm_runtime[True-compile] 1.3541ms 0.7891ms 1.2672 KOps/s 1.2284 KOps/s $\color{#35bf28}+3.16\%$
test_func_call_cm_runtime[True-compile-overhead] 0.9016ms 0.7919ms 1.2628 KOps/s 1.2066 KOps/s $\color{#35bf28}+4.66\%$
test_vmap_func_call_cm_runtime[eager] 2.8874ms 1.9109ms 523.3113 Ops/s 513.9987 Ops/s $\color{#35bf28}+1.81\%$
test_vmap_func_call_cm_runtime[compile] 1.1331ms 0.5399ms 1.8523 KOps/s 1.7629 KOps/s $\textbf{\color{#35bf28}+5.07\%}$
test_vmap_func_call_cm_runtime[compile-overhead] 0.9546ms 0.5423ms 1.8438 KOps/s 1.8245 KOps/s $\color{#35bf28}+1.06\%$
test_distributed 0.2386ms 0.1289ms 7.7588 KOps/s 7.8443 KOps/s $\color{#d91a1a}-1.09\%$
test_tdmodule 83.1450μs 27.5251μs 36.3304 KOps/s 35.9063 KOps/s $\color{#35bf28}+1.18\%$
test_tdmodule_dispatch 89.4770μs 54.9102μs 18.2116 KOps/s 19.8699 KOps/s $\textbf{\color{#d91a1a}-8.35\%}$
test_tdseq 58.9000μs 30.5722μs 32.7094 KOps/s 31.9034 KOps/s $\color{#35bf28}+2.53\%$
test_tdseq_dispatch 83.9670μs 57.7461μs 17.3172 KOps/s 17.8809 KOps/s $\color{#d91a1a}-3.15\%$
test_instantiation_functorch 2.3201ms 1.5555ms 642.8891 Ops/s 662.4767 Ops/s $\color{#d91a1a}-2.96\%$
test_exec_functorch 0.4237ms 0.1828ms 5.4690 KOps/s 5.6156 KOps/s $\color{#d91a1a}-2.61\%$
test_exec_functional_call 0.2827ms 0.1705ms 5.8653 KOps/s 5.8744 KOps/s $\color{#d91a1a}-0.15\%$
test_exec_td_decorator 0.4681ms 0.2330ms 4.2927 KOps/s 4.3855 KOps/s $\color{#d91a1a}-2.12\%$
test_vmap_mlp_speed_decorator[True-True] 1.0276ms 0.6666ms 1.5002 KOps/s 1.5023 KOps/s $\color{#d91a1a}-0.14\%$
test_vmap_mlp_speed_decorator[True-False] 0.9001ms 0.6619ms 1.5108 KOps/s 1.5057 KOps/s $\color{#35bf28}+0.34\%$
test_vmap_mlp_speed_decorator[False-True] 0.8563ms 0.5389ms 1.8555 KOps/s 1.8713 KOps/s $\color{#d91a1a}-0.84\%$
test_vmap_mlp_speed_decorator[False-False] 0.8671ms 0.5333ms 1.8753 KOps/s 1.8701 KOps/s $\color{#35bf28}+0.28\%$
test_to_module_speed[True] 2.2263ms 1.3814ms 723.8916 Ops/s 753.5332 Ops/s $\color{#d91a1a}-3.93\%$
test_to_module_speed[False] 1.8452ms 1.3390ms 746.8121 Ops/s 776.1250 Ops/s $\color{#d91a1a}-3.78\%$
test_tc_init 0.1044ms 47.3964μs 21.0987 KOps/s 20.1085 KOps/s $\color{#35bf28}+4.92\%$
test_tc_init_nested 0.2028ms 95.9423μs 10.4229 KOps/s 10.1108 KOps/s $\color{#35bf28}+3.09\%$
test_tc_first_layer_tensor 16.6110μs 1.5540μs 643.5008 KOps/s 580.0745 KOps/s $\textbf{\color{#35bf28}+10.93\%}$
test_tc_first_layer_nontensor 44.0620μs 4.9823μs 200.7096 KOps/s 201.8775 KOps/s $\color{#d91a1a}-0.58\%$
test_tc_second_layer_tensor 23.2530μs 2.8655μs 348.9752 KOps/s 311.1409 KOps/s $\textbf{\color{#35bf28}+12.16\%}$
test_tc_second_layer_nontensor 47.7390μs 6.2784μs 159.2752 KOps/s 154.6483 KOps/s $\color{#35bf28}+2.99\%$
test_unbind 0.2352s 13.8955ms 71.9658 Ops/s 77.6679 Ops/s $\textbf{\color{#d91a1a}-7.34\%}$
test_full_like 9.3345ms 7.9109ms 126.4081 Ops/s 128.0069 Ops/s $\color{#d91a1a}-1.25\%$
test_zeros_like 5.9790ms 2.8308ms 353.2629 Ops/s 360.9436 Ops/s $\color{#d91a1a}-2.13\%$
test_ones_like 4.6266ms 3.1823ms 314.2426 Ops/s 319.6287 Ops/s $\color{#d91a1a}-1.69\%$
test_clone 7.0485ms 5.3164ms 188.0975 Ops/s 139.6760 Ops/s $\textbf{\color{#35bf28}+34.67\%}$
test_squeeze 59.4600μs 12.3594μs 80.9099 KOps/s 81.6699 KOps/s $\color{#d91a1a}-0.93\%$
test_unsqueeze 0.1841ms 91.4457μs 10.9355 KOps/s 10.9102 KOps/s $\color{#35bf28}+0.23\%$
test_split 0.4854ms 0.1959ms 5.1046 KOps/s 5.2305 KOps/s $\color{#d91a1a}-2.41\%$
test_permute 0.3480ms 0.2012ms 4.9693 KOps/s 4.9301 KOps/s $\color{#35bf28}+0.80\%$
test_stack 32.4199ms 25.7551ms 38.8273 Ops/s 35.7972 Ops/s $\textbf{\color{#35bf28}+8.46\%}$
test_cat 27.5516ms 25.4399ms 39.3083 Ops/s 38.8349 Ops/s $\color{#35bf28}+1.22\%$

Copy link

github-actions bot commented Feb 6, 2025

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 229. Improved: $\large\color{#35bf28}39$. Worsened: $\large\color{#d91a1a}12$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 27.4410μs 11.1697μs 89.5282 KOps/s 75.3457 KOps/s $\textbf{\color{#35bf28}+18.82\%}$
test_plain_set_stack_nested 35.1100μs 11.2417μs 88.9548 KOps/s 74.5748 KOps/s $\textbf{\color{#35bf28}+19.28\%}$
test_plain_set_nested_inplace 40.0300μs 12.1366μs 82.3954 KOps/s 70.1996 KOps/s $\textbf{\color{#35bf28}+17.37\%}$
test_plain_set_stack_nested_inplace 35.3510μs 12.2272μs 81.7849 KOps/s 70.0550 KOps/s $\textbf{\color{#35bf28}+16.74\%}$
test_items 27.7210μs 2.9220μs 342.2362 KOps/s 347.1277 KOps/s $\color{#d91a1a}-1.41\%$
test_items_nested 0.4462ms 0.3752ms 2.6654 KOps/s 2.6712 KOps/s $\color{#d91a1a}-0.22\%$
test_items_nested_locked 0.4481ms 0.3746ms 2.6693 KOps/s 2.6953 KOps/s $\color{#d91a1a}-0.97\%$
test_items_nested_leaf 86.0600μs 57.7176μs 17.3257 KOps/s 17.1050 KOps/s $\color{#35bf28}+1.29\%$
test_items_stack_nested 0.5237ms 0.3783ms 2.6437 KOps/s 2.6506 KOps/s $\color{#d91a1a}-0.26\%$
test_items_stack_nested_leaf 90.0010μs 58.0536μs 17.2255 KOps/s 16.6786 KOps/s $\color{#35bf28}+3.28\%$
test_items_stack_nested_locked 0.4057ms 0.3781ms 2.6451 KOps/s 2.6671 KOps/s $\color{#d91a1a}-0.82\%$
test_keys 28.1000μs 3.4359μs 291.0449 KOps/s 290.5446 KOps/s $\color{#35bf28}+0.17\%$
test_keys_nested 0.1173ms 87.6921μs 11.4035 KOps/s 11.0728 KOps/s $\color{#35bf28}+2.99\%$
test_keys_nested_locked 0.8381ms 93.6892μs 10.6736 KOps/s 10.3798 KOps/s $\color{#35bf28}+2.83\%$
test_keys_nested_leaf 0.1216ms 78.2304μs 12.7828 KOps/s 12.3569 KOps/s $\color{#35bf28}+3.45\%$
test_keys_stack_nested 0.1264ms 87.8747μs 11.3798 KOps/s 10.9458 KOps/s $\color{#35bf28}+3.96\%$
test_keys_stack_nested_leaf 0.1084ms 78.2725μs 12.7759 KOps/s 12.1726 KOps/s $\color{#35bf28}+4.96\%$
test_keys_stack_nested_locked 0.1312ms 94.0990μs 10.6271 KOps/s 10.4499 KOps/s $\color{#35bf28}+1.70\%$
test_values 3.7271μs 0.8399μs 1.1906 MOps/s 750.0695 KOps/s $\textbf{\color{#35bf28}+58.74\%}$
test_values_nested 65.7100μs 37.6754μs 26.5425 KOps/s 26.2856 KOps/s $\color{#35bf28}+0.98\%$
test_values_nested_locked 66.9010μs 39.4061μs 25.3768 KOps/s 24.9847 KOps/s $\color{#35bf28}+1.57\%$
test_values_nested_leaf 66.9700μs 42.0306μs 23.7922 KOps/s 23.1012 KOps/s $\color{#35bf28}+2.99\%$
test_values_stack_nested 68.3810μs 37.5723μs 26.6153 KOps/s 25.5577 KOps/s $\color{#35bf28}+4.14\%$
test_values_stack_nested_leaf 67.8400μs 42.3646μs 23.6046 KOps/s 22.9873 KOps/s $\color{#35bf28}+2.69\%$
test_values_stack_nested_locked 71.6100μs 39.8196μs 25.1132 KOps/s 24.6220 KOps/s $\color{#35bf28}+2.00\%$
test_membership 1.9035μs 0.5055μs 1.9782 MOps/s 1.9362 MOps/s $\color{#35bf28}+2.17\%$
test_membership_nested 19.0000μs 2.0242μs 494.0144 KOps/s 473.9095 KOps/s $\color{#35bf28}+4.24\%$
test_membership_nested_leaf 16.7155μs 2.0572μs 486.0945 KOps/s 488.5031 KOps/s $\color{#d91a1a}-0.49\%$
test_membership_stacked_nested 27.1210μs 2.1301μs 469.4662 KOps/s 465.1070 KOps/s $\color{#35bf28}+0.94\%$
test_membership_stacked_nested_leaf 26.9900μs 2.1269μs 470.1583 KOps/s 474.0185 KOps/s $\color{#d91a1a}-0.81\%$
test_membership_nested_last 39.1400μs 3.1274μs 319.7574 KOps/s 320.8407 KOps/s $\color{#d91a1a}-0.34\%$
test_membership_nested_leaf_last 38.4400μs 3.0721μs 325.5103 KOps/s 324.1483 KOps/s $\color{#35bf28}+0.42\%$
test_membership_stacked_nested_last 33.6410μs 3.2043μs 312.0831 KOps/s 182.8469 KOps/s $\textbf{\color{#35bf28}+70.68\%}$
test_membership_stacked_nested_leaf_last 30.6710μs 3.1618μs 316.2751 KOps/s 185.2401 KOps/s $\textbf{\color{#35bf28}+70.74\%}$
test_nested_getleaf 41.8400μs 6.0778μs 164.5338 KOps/s 161.3863 KOps/s $\color{#35bf28}+1.95\%$
test_nested_get 33.3300μs 5.8707μs 170.3380 KOps/s 170.7787 KOps/s $\color{#d91a1a}-0.26\%$
test_stacked_getleaf 32.6310μs 6.1627μs 162.2662 KOps/s 162.3358 KOps/s $\color{#d91a1a}-0.04\%$
test_stacked_get 35.1410μs 5.8107μs 172.0969 KOps/s 171.7535 KOps/s $\color{#35bf28}+0.20\%$
test_nested_getitemleaf 40.5600μs 6.3747μs 156.8696 KOps/s 153.4432 KOps/s $\color{#35bf28}+2.23\%$
test_nested_getitem 38.0000μs 6.0750μs 164.6085 KOps/s 160.9475 KOps/s $\color{#35bf28}+2.27\%$
test_stacked_getitemleaf 43.2500μs 6.4579μs 154.8488 KOps/s 154.6110 KOps/s $\color{#35bf28}+0.15\%$
test_stacked_getitem 31.1700μs 6.0572μs 165.0917 KOps/s 164.1889 KOps/s $\color{#35bf28}+0.55\%$
test_lock_nested 0.3968ms 0.3320ms 3.0118 KOps/s 2.9335 KOps/s $\color{#35bf28}+2.67\%$
test_lock_stack_nested 0.3783ms 0.3366ms 2.9713 KOps/s 2.9860 KOps/s $\color{#d91a1a}-0.49\%$
test_unlock_nested 0.3591ms 0.2726ms 3.6687 KOps/s 3.6422 KOps/s $\color{#35bf28}+0.73\%$
test_unlock_stack_nested 0.3784ms 0.2757ms 3.6271 KOps/s 3.6425 KOps/s $\color{#d91a1a}-0.42\%$
test_flatten_speed 0.1144ms 74.4272μs 13.4359 KOps/s 13.0673 KOps/s $\color{#35bf28}+2.82\%$
test_unflatten_speed 0.3776ms 0.3261ms 3.0669 KOps/s 3.0442 KOps/s $\color{#35bf28}+0.74\%$
test_common_ops 0.7068ms 0.5626ms 1.7774 KOps/s 1.4871 KOps/s $\textbf{\color{#35bf28}+19.52\%}$
test_creation 80.5410μs 1.7387μs 575.1429 KOps/s 571.2719 KOps/s $\color{#35bf28}+0.68\%$
test_creation_empty 30.5700μs 6.4425μs 155.2197 KOps/s 99.8913 KOps/s $\textbf{\color{#35bf28}+55.39\%}$
test_creation_nested_1 30.2900μs 7.9544μs 125.7158 KOps/s 87.3140 KOps/s $\textbf{\color{#35bf28}+43.98\%}$
test_creation_nested_2 38.6500μs 10.5840μs 94.4820 KOps/s 69.9811 KOps/s $\textbf{\color{#35bf28}+35.01\%}$
test_clone 37.8610μs 10.1899μs 98.1360 KOps/s 96.1003 KOps/s $\color{#35bf28}+2.12\%$
test_getitem[int] 1.4158ms 10.6170μs 94.1882 KOps/s 96.9382 KOps/s $\color{#d91a1a}-2.84\%$
test_getitem[slice_int] 0.1298ms 20.6536μs 48.4178 KOps/s 49.6060 KOps/s $\color{#d91a1a}-2.40\%$
test_getitem[range] 0.1235ms 36.7199μs 27.2332 KOps/s 27.7197 KOps/s $\color{#d91a1a}-1.76\%$
test_getitem[tuple] 0.1050ms 17.9746μs 55.6340 KOps/s 57.0815 KOps/s $\color{#d91a1a}-2.54\%$
test_getitem[list] 0.1213ms 31.7110μs 31.5348 KOps/s 31.2410 KOps/s $\color{#35bf28}+0.94\%$
test_setitem_dim[int] 36.8510μs 19.0273μs 52.5561 KOps/s 52.2826 KOps/s $\color{#35bf28}+0.52\%$
test_setitem_dim[slice_int] 58.9600μs 37.6643μs 26.5503 KOps/s 26.3784 KOps/s $\color{#35bf28}+0.65\%$
test_setitem_dim[range] 76.4000μs 51.2338μs 19.5184 KOps/s 19.0364 KOps/s $\color{#35bf28}+2.53\%$
test_setitem_dim[tuple] 54.5900μs 32.2150μs 31.0415 KOps/s 30.9984 KOps/s $\color{#35bf28}+0.14\%$
test_setitem 73.9610μs 13.7255μs 72.8572 KOps/s 62.7656 KOps/s $\textbf{\color{#35bf28}+16.08\%}$
test_set 73.2910μs 13.0159μs 76.8289 KOps/s 64.5667 KOps/s $\textbf{\color{#35bf28}+18.99\%}$
test_set_shared 0.5091ms 0.1572ms 6.3633 KOps/s 6.3553 KOps/s $\color{#35bf28}+0.12\%$
test_update 0.4067ms 15.0321μs 66.5242 KOps/s 51.4918 KOps/s $\textbf{\color{#35bf28}+29.19\%}$
test_update_nested 81.8900μs 20.2381μs 49.4117 KOps/s 40.0459 KOps/s $\textbf{\color{#35bf28}+23.39\%}$
test_update__nested 0.5769ms 24.6458μs 40.5749 KOps/s 39.7688 KOps/s $\color{#35bf28}+2.03\%$
test_set_nested 87.3200μs 14.5424μs 68.7645 KOps/s 59.6960 KOps/s $\textbf{\color{#35bf28}+15.19\%}$
test_set_nested_new 90.8300μs 16.6145μs 60.1883 KOps/s 52.0346 KOps/s $\textbf{\color{#35bf28}+15.67\%}$
test_select 58.0610μs 28.4056μs 35.2043 KOps/s 31.9785 KOps/s $\textbf{\color{#35bf28}+10.09\%}$
test_select_nested 85.2410μs 45.5571μs 21.9505 KOps/s 22.3122 KOps/s $\color{#d91a1a}-1.62\%$
test_exclude_nested 88.3910μs 63.9595μs 15.6349 KOps/s 15.7172 KOps/s $\color{#d91a1a}-0.52\%$
test_empty[True] 0.6890ms 0.2993ms 3.3409 KOps/s 3.3593 KOps/s $\color{#d91a1a}-0.55\%$
test_empty[False] 3.3141μs 0.8358μs 1.1964 MOps/s 1.1924 MOps/s $\color{#35bf28}+0.34\%$
test_to 86.4910μs 56.5176μs 17.6936 KOps/s 17.8370 KOps/s $\color{#d91a1a}-0.80\%$
test_to_nonblocking 90.0710μs 48.4065μs 20.6584 KOps/s 21.3313 KOps/s $\color{#d91a1a}-3.15\%$
test_unbind_speed 0.2712ms 0.2323ms 4.3049 KOps/s 4.2215 KOps/s $\color{#35bf28}+1.97\%$
test_unbind_speed_stack0 0.2656ms 0.2349ms 4.2567 KOps/s 4.3003 KOps/s $\color{#d91a1a}-1.01\%$
test_unbind_speed_stack1 94.0365ms 0.7223ms 1.3845 KOps/s 1.3894 KOps/s $\color{#d91a1a}-0.36\%$
test_split 94.9062ms 1.5949ms 627.0097 Ops/s 631.5403 Ops/s $\color{#d91a1a}-0.72\%$
test_chunk 96.9287ms 1.6097ms 621.2372 Ops/s 627.7284 Ops/s $\color{#d91a1a}-1.03\%$
test_consolidate[False-None] 97.1368ms 2.9735ms 336.3081 Ops/s 368.6059 Ops/s $\textbf{\color{#d91a1a}-8.76\%}$
test_consolidate[default-None] 1.7922ms 1.6899ms 591.7410 Ops/s 598.9217 Ops/s $\color{#d91a1a}-1.20\%$
test_consolidate[reduce-overhead-None] 1.8322ms 1.7460ms 572.7241 Ops/s 587.2988 Ops/s $\color{#d91a1a}-2.48\%$
test_consolidate_njt[False-None] 6.8182ms 6.5004ms 153.8377 Ops/s 155.5010 Ops/s $\color{#d91a1a}-1.07\%$
test_to[False-False-None] 1.8469ms 1.7604ms 568.0559 Ops/s 573.3153 Ops/s $\color{#d91a1a}-0.92\%$
test_to[True-False-None] 1.5540ms 1.3239ms 755.3310 Ops/s 781.0233 Ops/s $\color{#d91a1a}-3.29\%$
test_to[within-False-None] 4.3965ms 4.1636ms 240.1770 Ops/s 178.5545 Ops/s $\textbf{\color{#35bf28}+34.51\%}$
test_to[True-default-None] 5.5866ms 5.2945ms 188.8745 Ops/s 190.6869 Ops/s $\color{#d91a1a}-0.95\%$
test_to_njt[False-False-None] 7.1104ms 6.8268ms 146.4824 Ops/s 145.1540 Ops/s $\color{#35bf28}+0.92\%$
test_to_njt[True-False-None] 5.6310ms 5.4240ms 184.3646 Ops/s 179.7739 Ops/s $\color{#35bf28}+2.55\%$
test_to_njt[within-False-None] 12.3183ms 12.1423ms 82.3570 Ops/s 82.0778 Ops/s $\color{#35bf28}+0.34\%$
test_creation[device0] 0.4549ms 78.6757μs 12.7104 KOps/s 12.6673 KOps/s $\color{#35bf28}+0.34\%$
test_creation_from_tensor 0.6177ms 82.2467μs 12.1585 KOps/s 12.0369 KOps/s $\color{#35bf28}+1.01\%$
test_add_one[memmap_tensor0] 0.2529ms 6.4319μs 155.4739 KOps/s 154.2897 KOps/s $\color{#35bf28}+0.77\%$
test_contiguous[memmap_tensor0] 1.8675μs 0.3992μs 2.5052 MOps/s 2.4973 MOps/s $\color{#35bf28}+0.32\%$
test_stack[memmap_tensor0] 43.4410μs 4.2602μs 234.7289 KOps/s 240.7432 KOps/s $\color{#d91a1a}-2.50\%$
test_memmaptd_index 1.5017ms 0.2341ms 4.2721 KOps/s 4.3795 KOps/s $\color{#d91a1a}-2.45\%$
test_memmaptd_index_astensor 0.4417ms 0.2971ms 3.3660 KOps/s 3.3898 KOps/s $\color{#d91a1a}-0.70\%$
test_memmaptd_index_op 0.7189ms 0.5274ms 1.8960 KOps/s 1.6998 KOps/s $\textbf{\color{#35bf28}+11.54\%}$
test_serialize_model 0.1324s 0.1305s 7.6603 Ops/s 7.6585 Ops/s $\color{#35bf28}+0.02\%$
test_serialize_model_pickle 1.3494s 1.1858s 0.8433 Ops/s 0.8263 Ops/s $\color{#35bf28}+2.06\%$
test_serialize_weights 0.1306s 0.1298s 7.7049 Ops/s 7.7108 Ops/s $\color{#d91a1a}-0.08\%$
test_serialize_weights_returnearly 0.4292s 69.8852ms 14.3092 Ops/s 23.7892 Ops/s $\textbf{\color{#d91a1a}-39.85\%}$
test_serialize_weights_pickle 1.3759s 1.2189s 0.8204 Ops/s 0.8236 Ops/s $\color{#d91a1a}-0.39\%$
test_reshape_pytree 67.2400μs 21.8590μs 45.7477 KOps/s 46.0054 KOps/s $\color{#d91a1a}-0.56\%$
test_reshape_td 67.7900μs 27.1268μs 36.8639 KOps/s 36.9385 KOps/s $\color{#d91a1a}-0.20\%$
test_view_pytree 55.5710μs 21.7536μs 45.9695 KOps/s 45.8628 KOps/s $\color{#35bf28}+0.23\%$
test_view_td 84.8800μs 31.7017μs 31.5440 KOps/s 31.3744 KOps/s $\color{#35bf28}+0.54\%$
test_unbind_pytree 55.8200μs 27.5697μs 36.2718 KOps/s 36.6240 KOps/s $\color{#d91a1a}-0.96\%$
test_unbind_td 0.7722ms 35.7984μs 27.9342 KOps/s 28.3139 KOps/s $\color{#d91a1a}-1.34\%$
test_split_pytree 0.1218ms 29.7880μs 33.5706 KOps/s 34.0007 KOps/s $\color{#d91a1a}-1.26\%$
test_split_td 0.9405ms 38.2931μs 26.1144 KOps/s 26.1243 KOps/s $\color{#d91a1a}-0.04\%$
test_add_pytree 74.6910μs 33.5654μs 29.7926 KOps/s 30.0502 KOps/s $\color{#d91a1a}-0.86\%$
test_add_td 88.6500μs 47.6117μs 21.0032 KOps/s 19.8561 KOps/s $\textbf{\color{#35bf28}+5.78\%}$
test_compile_add_one_nested[tensordict-compile] 0.1769ms 0.1230ms 8.1327 KOps/s 7.7041 KOps/s $\textbf{\color{#35bf28}+5.56\%}$
test_compile_add_one_nested[tensordict-eager] 0.2556ms 0.1338ms 7.4712 KOps/s 7.4875 KOps/s $\color{#d91a1a}-0.22\%$
test_compile_add_one_nested[pytree-compile] 0.1409ms 94.8735μs 10.5404 KOps/s 10.3661 KOps/s $\color{#35bf28}+1.68\%$
test_compile_add_one_nested[pytree-eager] 0.2165ms 0.1489ms 6.7162 KOps/s 6.7255 KOps/s $\color{#d91a1a}-0.14\%$
test_compile_copy_nested[tensordict-compile] 57.0510μs 24.4590μs 40.8847 KOps/s 41.9335 KOps/s $\color{#d91a1a}-2.50\%$
test_compile_copy_nested[tensordict-eager] 54.9710μs 29.1360μs 34.3219 KOps/s 33.6024 KOps/s $\color{#35bf28}+2.14\%$
test_compile_copy_nested[pytree-compile] 0.1216ms 63.5038μs 15.7471 KOps/s 15.4458 KOps/s $\color{#35bf28}+1.95\%$
test_compile_copy_nested[pytree-eager] 79.0910μs 48.6915μs 20.5374 KOps/s 20.1245 KOps/s $\color{#35bf28}+2.05\%$
test_compile_add_one_flat[tensordict-compile] 0.2015ms 0.1411ms 7.0860 KOps/s 7.0892 KOps/s $\color{#d91a1a}-0.04\%$
test_compile_add_one_flat[tensordict-eager] 0.3341ms 0.2177ms 4.5924 KOps/s 4.6136 KOps/s $\color{#d91a1a}-0.46\%$
test_compile_add_one_flat[tensorclass-compile] 0.1507ms 97.2148μs 10.2865 KOps/s 9.9613 KOps/s $\color{#35bf28}+3.26\%$
test_compile_add_one_flat[tensorclass-eager] 0.1185ms 53.9431μs 18.5380 KOps/s 17.5657 KOps/s $\textbf{\color{#35bf28}+5.54\%}$
test_compile_add_one_flat[pytree-compile] 0.1749ms 0.1346ms 7.4308 KOps/s 7.4062 KOps/s $\color{#35bf28}+0.33\%$
test_compile_add_one_flat[pytree-eager] 0.6132ms 0.4822ms 2.0738 KOps/s 2.0760 KOps/s $\color{#d91a1a}-0.11\%$
test_compile_add_self_flat[tensordict-eager] 0.3958ms 0.2609ms 3.8322 KOps/s 3.7987 KOps/s $\color{#35bf28}+0.88\%$
test_compile_add_self_flat[tensordict-compile] 0.1826ms 0.1433ms 6.9801 KOps/s 7.0075 KOps/s $\color{#d91a1a}-0.39\%$
test_compile_add_self_flat[tensorclass-eager] 0.1471ms 65.9313μs 15.1673 KOps/s 14.6398 KOps/s $\color{#35bf28}+3.60\%$
test_compile_add_self_flat[tensorclass-compile] 0.1420ms 99.0031μs 10.1007 KOps/s 10.0451 KOps/s $\color{#35bf28}+0.55\%$
test_compile_add_self_flat[pytree-eager] 0.4697ms 0.4114ms 2.4305 KOps/s 2.4266 KOps/s $\color{#35bf28}+0.16\%$
test_compile_add_self_flat[pytree-compile] 0.1734ms 0.1344ms 7.4392 KOps/s 7.3051 KOps/s $\color{#35bf28}+1.84\%$
test_compile_copy_flat[tensordict-compile] 49.7100μs 18.6694μs 53.5637 KOps/s 53.6566 KOps/s $\color{#d91a1a}-0.17\%$
test_compile_copy_flat[tensordict-eager] 63.4100μs 31.2058μs 32.0454 KOps/s 31.5155 KOps/s $\color{#35bf28}+1.68\%$
test_compile_copy_flat[pytree-compile] 0.1258ms 69.8558μs 14.3152 KOps/s 14.3603 KOps/s $\color{#d91a1a}-0.31\%$
test_compile_copy_flat[pytree-eager] 82.2300μs 51.1754μs 19.5406 KOps/s 19.3033 KOps/s $\color{#35bf28}+1.23\%$
test_compile_assign_and_add[tensordict-compile] 1.6298ms 0.3915ms 2.5542 KOps/s 2.1978 KOps/s $\textbf{\color{#35bf28}+16.21\%}$
test_compile_assign_and_add[tensordict-eager] 2.7436ms 2.6245ms 381.0279 Ops/s 371.1404 Ops/s $\color{#35bf28}+2.66\%$
test_compile_assign_and_add[pytree-compile] 1.5856ms 0.4370ms 2.2884 KOps/s 2.2799 KOps/s $\color{#35bf28}+0.37\%$
test_compile_assign_and_add[pytree-eager] 2.8462ms 2.6785ms 373.3404 Ops/s 371.3994 Ops/s $\color{#35bf28}+0.52\%$
test_compile_indexing[tensor-tensordict-compile] 0.1715ms 0.1218ms 8.2080 KOps/s 8.5446 KOps/s $\color{#d91a1a}-3.94\%$
test_compile_indexing[tensor-tensordict-eager] 0.5618ms 82.1589μs 12.1715 KOps/s 12.6749 KOps/s $\color{#d91a1a}-3.97\%$
test_compile_indexing[tensor-tensorclass-compile] 0.1701ms 0.1142ms 8.7564 KOps/s 9.3180 KOps/s $\textbf{\color{#d91a1a}-6.03\%}$
test_compile_indexing[tensor-tensorclass-eager] 0.1206ms 67.4788μs 14.8195 KOps/s 14.7407 KOps/s $\color{#35bf28}+0.53\%$
test_compile_indexing[tensor-pytree-compile] 0.1773ms 0.1135ms 8.8128 KOps/s 9.3419 KOps/s $\textbf{\color{#d91a1a}-5.66\%}$
test_compile_indexing[tensor-pytree-eager] 0.1208ms 71.4401μs 13.9977 KOps/s 14.8065 KOps/s $\textbf{\color{#d91a1a}-5.46\%}$
test_compile_indexing[slice-tensordict-compile] 0.1506ms 0.1055ms 9.4816 KOps/s 9.8664 KOps/s $\color{#d91a1a}-3.90\%$
test_compile_indexing[slice-tensordict-eager] 0.1455ms 17.1335μs 58.3653 KOps/s 57.0590 KOps/s $\color{#35bf28}+2.29\%$
test_compile_indexing[slice-tensorclass-compile] 0.1442ms 98.9408μs 10.1071 KOps/s 10.2143 KOps/s $\color{#d91a1a}-1.05\%$
test_compile_indexing[slice-tensorclass-eager] 54.4300μs 15.7741μs 63.3952 KOps/s 63.9281 KOps/s $\color{#d91a1a}-0.83\%$
test_compile_indexing[slice-pytree-compile] 0.1822ms 95.5610μs 10.4645 KOps/s 10.2452 KOps/s $\color{#35bf28}+2.14\%$
test_compile_indexing[slice-pytree-eager] 56.2910μs 16.5159μs 60.5477 KOps/s 63.3007 KOps/s $\color{#d91a1a}-4.35\%$
test_compile_indexing[int-tensordict-compile] 0.1684ms 0.1056ms 9.4706 KOps/s 9.8496 KOps/s $\color{#d91a1a}-3.85\%$
test_compile_indexing[int-tensordict-eager] 0.5977ms 17.3693μs 57.5729 KOps/s 58.1931 KOps/s $\color{#d91a1a}-1.07\%$
test_compile_indexing[int-tensorclass-compile] 0.1467ms 95.6479μs 10.4550 KOps/s 10.2145 KOps/s $\color{#35bf28}+2.35\%$
test_compile_indexing[int-tensorclass-eager] 43.2700μs 15.6841μs 63.7589 KOps/s 63.8172 KOps/s $\color{#d91a1a}-0.09\%$
test_compile_indexing[int-pytree-compile] 0.1542ms 95.4416μs 10.4776 KOps/s 10.3028 KOps/s $\color{#35bf28}+1.70\%$
test_compile_indexing[int-pytree-eager] 53.1010μs 16.5292μs 60.4990 KOps/s 64.3438 KOps/s $\textbf{\color{#d91a1a}-5.98\%}$
test_mod_add[eager] 89.0600μs 35.5178μs 28.1549 KOps/s 25.7233 KOps/s $\textbf{\color{#35bf28}+9.45\%}$
test_mod_add[compile] 0.1278ms 78.1809μs 12.7908 KOps/s 11.8328 KOps/s $\textbf{\color{#35bf28}+8.10\%}$
test_mod_add[compile-overhead] 0.3302ms 0.1676ms 5.9651 KOps/s 5.6940 KOps/s $\color{#35bf28}+4.76\%$
test_mod_wrap[eager] 0.3301ms 0.2472ms 4.0459 KOps/s 3.8232 KOps/s $\textbf{\color{#35bf28}+5.82\%}$
test_mod_wrap[compile] 0.3895ms 0.2817ms 3.5497 KOps/s 3.5084 KOps/s $\color{#35bf28}+1.18\%$
test_mod_wrap[compile-overhead] 7.1334ms 3.7857ms 264.1518 Ops/s 263.7635 Ops/s $\color{#35bf28}+0.15\%$
test_mod_wrap_and_backward[eager] 1.4596ms 1.3539ms 738.6278 Ops/s 683.6757 Ops/s $\textbf{\color{#35bf28}+8.04\%}$
test_mod_wrap_and_backward[compile] 1.5969ms 1.2677ms 788.8249 Ops/s 722.6421 Ops/s $\textbf{\color{#35bf28}+9.16\%}$
test_mod_wrap_and_backward[compile-overhead] 1.3750ms 0.9290ms 1.0765 KOps/s 956.6002 Ops/s $\textbf{\color{#35bf28}+12.53\%}$
test_seq_add[eager] 0.1580ms 0.1112ms 8.9921 KOps/s 8.4671 KOps/s $\textbf{\color{#35bf28}+6.20\%}$
test_seq_add[compile] 0.1342ms 90.7204μs 11.0229 KOps/s 11.5442 KOps/s $\color{#d91a1a}-4.52\%$
test_seq_add[compile-overhead] 0.1698ms 0.1284ms 7.7874 KOps/s 7.4837 KOps/s $\color{#35bf28}+4.06\%$
test_seq_wrap[eager] 0.5216ms 0.4354ms 2.2965 KOps/s 2.3200 KOps/s $\color{#d91a1a}-1.01\%$
test_seq_wrap[compile] 0.3655ms 0.3011ms 3.3216 KOps/s 3.1290 KOps/s $\textbf{\color{#35bf28}+6.15\%}$
test_seq_wrap[compile-overhead] 0.2943ms 0.2220ms 4.5047 KOps/s 4.2576 KOps/s $\textbf{\color{#35bf28}+5.80\%}$
test_func_call_runtime[False-eager] 0.7871ms 0.7283ms 1.3730 KOps/s 1.3291 KOps/s $\color{#35bf28}+3.31\%$
test_func_call_runtime[False-compile] 0.8041ms 0.7351ms 1.3604 KOps/s 1.3521 KOps/s $\color{#35bf28}+0.62\%$
test_func_call_runtime[False-compile-overhead] 0.4433ms 0.3601ms 2.7773 KOps/s 2.7768 KOps/s $\color{#35bf28}+0.02\%$
test_func_call_runtime[True-eager] 0.9760ms 0.8893ms 1.1245 KOps/s 1.1097 KOps/s $\color{#35bf28}+1.33\%$
test_func_call_runtime[True-compile] 0.8295ms 0.7586ms 1.3181 KOps/s 1.2673 KOps/s $\color{#35bf28}+4.01\%$
test_func_call_runtime[True-compile-overhead] 0.4346ms 0.3811ms 2.6239 KOps/s 2.6103 KOps/s $\color{#35bf28}+0.52\%$
test_func_call_cm_runtime[False-eager] 0.7878ms 0.7295ms 1.3707 KOps/s 1.3625 KOps/s $\color{#35bf28}+0.60\%$
test_func_call_cm_runtime[False-compile] 0.9030ms 0.7411ms 1.3494 KOps/s 1.3428 KOps/s $\color{#35bf28}+0.49\%$
test_func_call_cm_runtime[False-compile-overhead] 0.4668ms 0.3652ms 2.7386 KOps/s 2.7660 KOps/s $\color{#d91a1a}-0.99\%$
test_func_call_cm_runtime[True-eager] 1.0756ms 0.9984ms 1.0016 KOps/s 986.5633 Ops/s $\color{#35bf28}+1.52\%$
test_func_call_cm_runtime[True-compile] 1.1180ms 0.9856ms 1.0146 KOps/s 1.0150 KOps/s $\color{#d91a1a}-0.04\%$
test_func_call_cm_runtime[True-compile-overhead] 1.0629ms 0.9795ms 1.0209 KOps/s 1.0093 KOps/s $\color{#35bf28}+1.16\%$
test_vmap_func_call_cm_runtime[eager] 2.5085ms 2.0845ms 479.7335 Ops/s 470.9893 Ops/s $\color{#35bf28}+1.86\%$
test_vmap_func_call_cm_runtime[compile] 0.8546ms 0.7992ms 1.2513 KOps/s 1.2306 KOps/s $\color{#35bf28}+1.68\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.4645ms 0.4139ms 2.4158 KOps/s 2.4099 KOps/s $\color{#35bf28}+0.24\%$
test_distributed 3.0981ms 0.2008ms 4.9792 KOps/s 8.4217 KOps/s $\textbf{\color{#d91a1a}-40.88\%}$
test_tdmodule 0.5739ms 18.9320μs 52.8206 KOps/s 47.0875 KOps/s $\textbf{\color{#35bf28}+12.18\%}$
test_tdmodule_dispatch 0.1723ms 33.0443μs 30.2624 KOps/s 26.3316 KOps/s $\textbf{\color{#35bf28}+14.93\%}$
test_tdseq 41.1010μs 19.5371μs 51.1847 KOps/s 45.6837 KOps/s $\textbf{\color{#35bf28}+12.04\%}$
test_tdseq_dispatch 63.9510μs 35.9214μs 27.8385 KOps/s 24.3729 KOps/s $\textbf{\color{#35bf28}+14.22\%}$
test_instantiation_functorch 1.6498ms 1.5463ms 646.6924 Ops/s 646.5730 Ops/s $\color{#35bf28}+0.02\%$
test_exec_functorch 0.1846ms 0.1435ms 6.9692 KOps/s 6.9945 KOps/s $\color{#d91a1a}-0.36\%$
test_exec_functional_call 0.1751ms 0.1334ms 7.4965 KOps/s 7.5182 KOps/s $\color{#d91a1a}-0.29\%$
test_exec_td_decorator 0.3751ms 0.1831ms 5.4613 KOps/s 5.3969 KOps/s $\color{#35bf28}+1.19\%$
test_vmap_mlp_speed_decorator[True-True] 0.7485ms 0.6751ms 1.4812 KOps/s 1.4468 KOps/s $\color{#35bf28}+2.38\%$
test_vmap_mlp_speed_decorator[True-False] 0.7952ms 0.6788ms 1.4732 KOps/s 1.4439 KOps/s $\color{#35bf28}+2.03\%$
test_vmap_mlp_speed_decorator[False-True] 0.7084ms 0.5972ms 1.6744 KOps/s 1.6646 KOps/s $\color{#35bf28}+0.59\%$
test_vmap_mlp_speed_decorator[False-False] 0.7090ms 0.5956ms 1.6791 KOps/s 1.6667 KOps/s $\color{#35bf28}+0.75\%$
test_vmap_transformer_speed_decorator[True-True] 19.2919ms 19.1858ms 52.1218 Ops/s 51.7255 Ops/s $\color{#35bf28}+0.77\%$
test_vmap_transformer_speed_decorator[True-False] 19.2285ms 19.1594ms 52.1937 Ops/s 51.8416 Ops/s $\color{#35bf28}+0.68\%$
test_vmap_transformer_speed_decorator[False-True] 19.2333ms 19.0298ms 52.5491 Ops/s 52.1104 Ops/s $\color{#35bf28}+0.84\%$
test_vmap_transformer_speed_decorator[False-False] 19.2681ms 18.9839ms 52.6761 Ops/s 52.2277 Ops/s $\color{#35bf28}+0.86\%$
test_to_module_speed[True] 1.4445ms 0.9676ms 1.0335 KOps/s 1.0192 KOps/s $\color{#35bf28}+1.40\%$
test_to_module_speed[False] 1.0135ms 0.9457ms 1.0574 KOps/s 1.0382 KOps/s $\color{#35bf28}+1.85\%$
test_tc_init 63.1300μs 33.6806μs 29.6907 KOps/s 26.0085 KOps/s $\textbf{\color{#35bf28}+14.16\%}$
test_tc_init_nested 97.9710μs 68.6676μs 14.5629 KOps/s 12.6657 KOps/s $\textbf{\color{#35bf28}+14.98\%}$
test_tc_first_layer_tensor 31.6910μs 0.7951μs 1.2578 MOps/s 1.4612 MOps/s $\textbf{\color{#d91a1a}-13.92\%}$
test_tc_first_layer_nontensor 24.1000μs 2.2287μs 448.7002 KOps/s 452.2342 KOps/s $\color{#d91a1a}-0.78\%$
test_tc_second_layer_tensor 21.6800μs 1.3809μs 724.1467 KOps/s 713.9297 KOps/s $\color{#35bf28}+1.43\%$
test_tc_second_layer_nontensor 25.4200μs 2.9270μs 341.6432 KOps/s 338.8124 KOps/s $\color{#35bf28}+0.84\%$
test_unbind 0.2158s 12.0190ms 83.2016 Ops/s 143.6766 Ops/s $\textbf{\color{#d91a1a}-42.09\%}$
test_full_like 9.2554ms 9.0940ms 109.9628 Ops/s 107.6753 Ops/s $\color{#35bf28}+2.12\%$
test_zeros_like 9.2868ms 7.1551ms 139.7601 Ops/s 231.4381 Ops/s $\textbf{\color{#d91a1a}-39.61\%}$
test_ones_like 9.2139ms 7.2330ms 138.2550 Ops/s 231.0697 Ops/s $\textbf{\color{#d91a1a}-40.17\%}$
test_clone 6.7601ms 6.3410ms 157.7026 Ops/s 157.2195 Ops/s $\color{#35bf28}+0.31\%$
test_squeeze 79.0910μs 9.5283μs 104.9506 KOps/s 107.2107 KOps/s $\color{#d91a1a}-2.11\%$
test_unsqueeze 0.1225ms 71.7254μs 13.9421 KOps/s 13.6734 KOps/s $\color{#35bf28}+1.97\%$
test_split 0.3695ms 0.1583ms 6.3188 KOps/s 6.2742 KOps/s $\color{#35bf28}+0.71\%$
test_permute 0.2401ms 0.1869ms 5.3502 KOps/s 5.6490 KOps/s $\textbf{\color{#d91a1a}-5.29\%}$
test_stack 50.2264ms 50.0182ms 19.9927 Ops/s 19.9692 Ops/s $\color{#35bf28}+0.12\%$
test_cat 50.2454ms 49.8336ms 20.0668 Ops/s 19.9517 Ops/s $\color{#35bf28}+0.58\%$

@vmoens vmoens merged commit 6f84874 into gh/vmoens/47/base Feb 6, 2025
51 checks passed
vmoens added a commit that referenced this pull request Feb 6, 2025
ghstack-source-id: 1ab62b35bb34b0a0ddaa8b1afc06a48702f41fdf
Pull Request resolved: #1211
@vmoens vmoens deleted the gh/vmoens/47/head branch February 6, 2025 13:53
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. suitable for minor
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants