Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BugFix] Better repr of lazy stacks #1076

Merged
merged 1 commit into from
Nov 6, 2024
Merged

Conversation

vmoens
Copy link
Contributor

@vmoens vmoens commented Nov 6, 2024

[ghstack-poisoned]
@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Nov 6, 2024
Copy link

github-actions bot commented Nov 6, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 217. Improved: $\large\color{#35bf28}25$. Worsened: $\large\color{#d91a1a}13$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 53.6810μs 17.3672μs 57.5799 KOps/s 54.9837 KOps/s $\color{#35bf28}+4.72\%$
test_plain_set_stack_nested 41.1870μs 17.5980μs 56.8248 KOps/s 54.7021 KOps/s $\color{#35bf28}+3.88\%$
test_plain_set_nested_inplace 59.9120μs 19.0114μs 52.6000 KOps/s 50.2596 KOps/s $\color{#35bf28}+4.66\%$
test_plain_set_stack_nested_inplace 44.7440μs 18.8513μs 53.0467 KOps/s 50.0457 KOps/s $\textbf{\color{#35bf28}+6.00\%}$
test_items 22.4320μs 4.0742μs 245.4488 KOps/s 241.3768 KOps/s $\color{#35bf28}+1.69\%$
test_items_nested 0.5947ms 0.3411ms 2.9321 KOps/s 2.9789 KOps/s $\color{#d91a1a}-1.57\%$
test_items_nested_locked 0.4588ms 0.3401ms 2.9403 KOps/s 2.9728 KOps/s $\color{#d91a1a}-1.09\%$
test_items_nested_leaf 0.1340ms 71.1671μs 14.0514 KOps/s 14.2940 KOps/s $\color{#d91a1a}-1.70\%$
test_items_stack_nested 0.5241ms 0.3431ms 2.9150 KOps/s 2.9531 KOps/s $\color{#d91a1a}-1.29\%$
test_items_stack_nested_leaf 0.1355ms 73.2988μs 13.6428 KOps/s 13.9255 KOps/s $\color{#d91a1a}-2.03\%$
test_items_stack_nested_locked 0.4950ms 0.3420ms 2.9237 KOps/s 2.9553 KOps/s $\color{#d91a1a}-1.07\%$
test_keys 21.0290μs 3.4740μs 287.8495 KOps/s 284.2116 KOps/s $\color{#35bf28}+1.28\%$
test_keys_nested 0.2304ms 0.1374ms 7.2757 KOps/s 7.3662 KOps/s $\color{#d91a1a}-1.23\%$
test_keys_nested_locked 1.9482ms 0.1409ms 7.0957 KOps/s 6.9683 KOps/s $\color{#35bf28}+1.83\%$
test_keys_nested_leaf 0.2213ms 0.1148ms 8.7073 KOps/s 8.4672 KOps/s $\color{#35bf28}+2.84\%$
test_keys_stack_nested 0.2329ms 0.1335ms 7.4907 KOps/s 7.3516 KOps/s $\color{#35bf28}+1.89\%$
test_keys_stack_nested_leaf 0.2039ms 0.1131ms 8.8382 KOps/s 8.5105 KOps/s $\color{#35bf28}+3.85\%$
test_keys_stack_nested_locked 0.2030ms 0.1390ms 7.1932 KOps/s 6.9487 KOps/s $\color{#35bf28}+3.52\%$
test_values 4.5004μs 1.0347μs 966.4913 KOps/s 974.1011 KOps/s $\color{#d91a1a}-0.78\%$
test_values_nested 0.1091ms 55.6029μs 17.9847 KOps/s 18.1615 KOps/s $\color{#d91a1a}-0.97\%$
test_values_nested_locked 0.1258ms 55.4226μs 18.0432 KOps/s 18.1436 KOps/s $\color{#d91a1a}-0.55\%$
test_values_nested_leaf 0.1128ms 59.5822μs 16.7835 KOps/s 16.7788 KOps/s $\color{#35bf28}+0.03\%$
test_values_stack_nested 0.1071ms 57.0728μs 17.5215 KOps/s 17.3921 KOps/s $\color{#35bf28}+0.74\%$
test_values_stack_nested_leaf 0.1133ms 59.7491μs 16.7366 KOps/s 16.8043 KOps/s $\color{#d91a1a}-0.40\%$
test_values_stack_nested_locked 0.1049ms 57.4968μs 17.3923 KOps/s 17.8985 KOps/s $\color{#d91a1a}-2.83\%$
test_membership 4.0490μs 0.7469μs 1.3388 MOps/s 1.4400 MOps/s $\textbf{\color{#d91a1a}-7.03\%}$
test_membership_nested 36.5780μs 2.7506μs 363.5538 KOps/s 370.2335 KOps/s $\color{#d91a1a}-1.80\%$
test_membership_nested_leaf 21.3700μs 2.7202μs 367.6247 KOps/s 364.2112 KOps/s $\color{#35bf28}+0.94\%$
test_membership_stacked_nested 28.4530μs 2.7398μs 364.9898 KOps/s 372.5485 KOps/s $\color{#d91a1a}-2.03\%$
test_membership_stacked_nested_leaf 29.2950μs 2.6992μs 370.4795 KOps/s 364.8796 KOps/s $\color{#35bf28}+1.53\%$
test_membership_nested_last 23.1640μs 4.0713μs 245.6230 KOps/s 246.7485 KOps/s $\color{#d91a1a}-0.46\%$
test_membership_nested_leaf_last 29.7560μs 4.1055μs 243.5753 KOps/s 246.1091 KOps/s $\color{#d91a1a}-1.03\%$
test_membership_stacked_nested_last 28.9640μs 5.6645μs 176.5390 KOps/s 247.4631 KOps/s $\textbf{\color{#d91a1a}-28.66\%}$
test_membership_stacked_nested_leaf_last 36.1280μs 5.6434μs 177.1977 KOps/s 247.6163 KOps/s $\textbf{\color{#d91a1a}-28.44\%}$
test_nested_getleaf 25.7780μs 10.4823μs 95.3986 KOps/s 95.2165 KOps/s $\color{#35bf28}+0.19\%$
test_nested_get 28.7130μs 9.9091μs 100.9168 KOps/s 100.3478 KOps/s $\color{#35bf28}+0.57\%$
test_stacked_getleaf 37.8420μs 10.6578μs 93.8283 KOps/s 95.5410 KOps/s $\color{#d91a1a}-1.79\%$
test_stacked_get 34.5250μs 9.9531μs 100.4707 KOps/s 100.8608 KOps/s $\color{#d91a1a}-0.39\%$
test_nested_getitemleaf 35.0560μs 11.0832μs 90.2269 KOps/s 92.2387 KOps/s $\color{#d91a1a}-2.18\%$
test_nested_getitem 38.8730μs 10.2293μs 97.7583 KOps/s 98.5538 KOps/s $\color{#d91a1a}-0.81\%$
test_stacked_getitemleaf 32.0800μs 11.1268μs 89.8727 KOps/s 91.7299 KOps/s $\color{#d91a1a}-2.02\%$
test_stacked_getitem 31.4090μs 10.1262μs 98.7539 KOps/s 97.5147 KOps/s $\color{#35bf28}+1.27\%$
test_lock_nested 5.2066ms 0.4418ms 2.2633 KOps/s 2.2521 KOps/s $\color{#35bf28}+0.50\%$
test_lock_stack_nested 0.5021ms 0.4068ms 2.4582 KOps/s 2.4097 KOps/s $\color{#35bf28}+2.01\%$
test_unlock_nested 0.8956ms 0.3598ms 2.7793 KOps/s 2.7377 KOps/s $\color{#35bf28}+1.52\%$
test_unlock_stack_nested 0.6949ms 0.3255ms 3.0724 KOps/s 2.9741 KOps/s $\color{#35bf28}+3.31\%$
test_flatten_speed 0.1560ms 91.4200μs 10.9385 KOps/s 11.0594 KOps/s $\color{#d91a1a}-1.09\%$
test_unflatten_speed 0.6367ms 0.4638ms 2.1562 KOps/s 2.1259 KOps/s $\color{#35bf28}+1.42\%$
test_common_ops 4.3356ms 0.7571ms 1.3208 KOps/s 1.2343 KOps/s $\textbf{\color{#35bf28}+7.01\%}$
test_creation 21.1190μs 2.0658μs 484.0628 KOps/s 484.5022 KOps/s $\color{#d91a1a}-0.09\%$
test_creation_empty 38.4630μs 10.1870μs 98.1642 KOps/s 83.0823 KOps/s $\textbf{\color{#35bf28}+18.15\%}$
test_creation_nested_1 42.1790μs 12.9270μs 77.3574 KOps/s 66.1088 KOps/s $\textbf{\color{#35bf28}+17.02\%}$
test_creation_nested_2 62.1860μs 16.7476μs 59.7101 KOps/s 52.4644 KOps/s $\textbf{\color{#35bf28}+13.81\%}$
test_clone 60.0730μs 13.2553μs 75.4414 KOps/s 74.4847 KOps/s $\color{#35bf28}+1.28\%$
test_getitem[int] 1.4174ms 12.4440μs 80.3603 KOps/s 81.0942 KOps/s $\color{#d91a1a}-0.90\%$
test_getitem[slice_int] 0.1404ms 23.0669μs 43.3522 KOps/s 41.7612 KOps/s $\color{#35bf28}+3.81\%$
test_getitem[range] 0.2421ms 47.4819μs 21.0607 KOps/s 20.3268 KOps/s $\color{#35bf28}+3.61\%$
test_getitem[tuple] 0.1394ms 19.4104μs 51.5188 KOps/s 51.5902 KOps/s $\color{#d91a1a}-0.14\%$
test_getitem[list] 0.1616ms 42.9891μs 23.2617 KOps/s 22.9004 KOps/s $\color{#35bf28}+1.58\%$
test_setitem_dim[int] 70.6220μs 25.1133μs 39.8195 KOps/s 38.9654 KOps/s $\color{#35bf28}+2.19\%$
test_setitem_dim[slice_int] 0.2076ms 50.9687μs 19.6199 KOps/s 19.8236 KOps/s $\color{#d91a1a}-1.03\%$
test_setitem_dim[range] 0.1372ms 74.2355μs 13.4706 KOps/s 13.0615 KOps/s $\color{#35bf28}+3.13\%$
test_setitem_dim[tuple] 77.5660μs 40.0243μs 24.9848 KOps/s 24.7311 KOps/s $\color{#35bf28}+1.03\%$
test_setitem 71.1240μs 19.3557μs 51.6643 KOps/s 46.5998 KOps/s $\textbf{\color{#35bf28}+10.87\%}$
test_set 62.1670μs 18.6339μs 53.6656 KOps/s 47.4178 KOps/s $\textbf{\color{#35bf28}+13.18\%}$
test_set_shared 2.1318ms 0.1681ms 5.9505 KOps/s 5.9288 KOps/s $\color{#35bf28}+0.37\%$
test_update 0.1149ms 21.1724μs 47.2313 KOps/s 41.4448 KOps/s $\textbf{\color{#35bf28}+13.96\%}$
test_update_nested 86.6830μs 30.9064μs 32.3557 KOps/s 29.2080 KOps/s $\textbf{\color{#35bf28}+10.78\%}$
test_update__nested 1.0200ms 32.1639μs 31.0908 KOps/s 30.1694 KOps/s $\color{#35bf28}+3.05\%$
test_set_nested 71.3640μs 20.9079μs 47.8289 KOps/s 43.4973 KOps/s $\textbf{\color{#35bf28}+9.96\%}$
test_set_nested_new 81.9840μs 25.7332μs 38.8604 KOps/s 36.0729 KOps/s $\textbf{\color{#35bf28}+7.73\%}$
test_select 93.5850μs 41.5270μs 24.0807 KOps/s 22.5931 KOps/s $\textbf{\color{#35bf28}+6.58\%}$
test_select_nested 0.1405ms 59.3845μs 16.8394 KOps/s 17.0250 KOps/s $\color{#d91a1a}-1.09\%$
test_exclude_nested 0.3482ms 74.5153μs 13.4201 KOps/s 13.5408 KOps/s $\color{#d91a1a}-0.89\%$
test_empty[True] 0.6310ms 0.3452ms 2.8965 KOps/s 2.9054 KOps/s $\color{#d91a1a}-0.31\%$
test_empty[False] 8.9395μs 1.2660μs 789.8834 KOps/s 810.9634 KOps/s $\color{#d91a1a}-2.60\%$
test_unbind_speed 0.3568ms 0.2601ms 3.8450 KOps/s 3.8368 KOps/s $\color{#35bf28}+0.21\%$
test_unbind_speed_stack0 0.4710ms 0.2531ms 3.9514 KOps/s 3.8890 KOps/s $\color{#35bf28}+1.61\%$
test_unbind_speed_stack1 0.1076s 0.7524ms 1.3291 KOps/s 1.5890 KOps/s $\textbf{\color{#d91a1a}-16.35\%}$
test_split 1.7902ms 1.5602ms 640.9603 Ops/s 587.1769 Ops/s $\textbf{\color{#35bf28}+9.16\%}$
test_chunk 92.1879ms 1.8445ms 542.1530 Ops/s 592.5612 Ops/s $\textbf{\color{#d91a1a}-8.51\%}$
test_consolidate_njt[False-None] 8.7679ms 8.0550ms 124.1467 Ops/s 121.7984 Ops/s $\color{#35bf28}+1.93\%$
test_creation[device0] 0.1767ms 90.7818μs 11.0154 KOps/s 10.6997 KOps/s $\color{#35bf28}+2.95\%$
test_creation_from_tensor 3.8072ms 93.6932μs 10.6731 KOps/s 10.5949 KOps/s $\color{#35bf28}+0.74\%$
test_add_one[memmap_tensor0] 0.1070ms 4.9944μs 200.2240 KOps/s 192.5943 KOps/s $\color{#35bf28}+3.96\%$
test_contiguous[memmap_tensor0] 18.9560μs 0.5067μs 1.9734 MOps/s 1.9586 MOps/s $\color{#35bf28}+0.76\%$
test_stack[memmap_tensor0] 33.7630μs 3.5167μs 284.3579 KOps/s 281.8057 KOps/s $\color{#35bf28}+0.91\%$
test_memmaptd_index 1.1275ms 0.2353ms 4.2491 KOps/s 4.1674 KOps/s $\color{#35bf28}+1.96\%$
test_memmaptd_index_astensor 0.6735ms 0.3138ms 3.1869 KOps/s 3.2051 KOps/s $\color{#d91a1a}-0.57\%$
test_memmaptd_index_op 1.0831ms 0.5812ms 1.7207 KOps/s 1.6193 KOps/s $\textbf{\color{#35bf28}+6.26\%}$
test_serialize_model 0.1216s 0.1150s 8.6969 Ops/s 7.7138 Ops/s $\textbf{\color{#35bf28}+12.74\%}$
test_serialize_model_pickle 0.4548s 0.3851s 2.5966 Ops/s 2.4769 Ops/s $\color{#35bf28}+4.83\%$
test_serialize_weights 0.2055s 0.1236s 8.0895 Ops/s 8.9597 Ops/s $\textbf{\color{#d91a1a}-9.71\%}$
test_serialize_weights_returnearly 0.1873s 0.1594s 6.2752 Ops/s 6.3521 Ops/s $\color{#d91a1a}-1.21\%$
test_serialize_weights_pickle 0.5559s 0.4410s 2.2677 Ops/s 2.5107 Ops/s $\textbf{\color{#d91a1a}-9.68\%}$
test_serialize_weights_filesystem 0.1499s 0.1407s 7.1094 Ops/s 6.5418 Ops/s $\textbf{\color{#35bf28}+8.68\%}$
test_serialize_model_filesystem 0.2332s 0.1603s 6.2392 Ops/s 6.7176 Ops/s $\textbf{\color{#d91a1a}-7.12\%}$
test_reshape_pytree 85.8720μs 26.8985μs 37.1768 KOps/s 35.7071 KOps/s $\color{#35bf28}+4.12\%$
test_reshape_td 71.4840μs 32.5468μs 30.7250 KOps/s 32.1192 KOps/s $\color{#d91a1a}-4.34\%$
test_view_pytree 0.1123ms 27.1066μs 36.8914 KOps/s 36.8051 KOps/s $\color{#35bf28}+0.23\%$
test_view_td 84.7990μs 36.4604μs 27.4270 KOps/s 27.5490 KOps/s $\color{#d91a1a}-0.44\%$
test_unbind_pytree 67.3170μs 29.9419μs 33.3980 KOps/s 33.7211 KOps/s $\color{#d91a1a}-0.96\%$
test_unbind_td 0.3282ms 38.5956μs 25.9097 KOps/s 25.7003 KOps/s $\color{#35bf28}+0.81\%$
test_split_pytree 75.1410μs 29.4062μs 34.0064 KOps/s 33.7189 KOps/s $\color{#35bf28}+0.85\%$
test_split_td 0.5271ms 43.0583μs 23.2243 KOps/s 22.9252 KOps/s $\color{#35bf28}+1.30\%$
test_add_pytree 93.6360μs 36.2789μs 27.5642 KOps/s 26.9395 KOps/s $\color{#35bf28}+2.32\%$
test_add_td 0.1542ms 54.1474μs 18.4681 KOps/s 16.6301 KOps/s $\textbf{\color{#35bf28}+11.05\%}$
test_compile_add_one_nested[tensordict-compile] 0.1117ms 61.9473μs 16.1427 KOps/s 15.9877 KOps/s $\color{#35bf28}+0.97\%$
test_compile_add_one_nested[tensordict-eager] 0.3403ms 0.1604ms 6.2348 KOps/s 6.2356 KOps/s $\color{#d91a1a}-0.01\%$
test_compile_add_one_nested[pytree-compile] 0.1078ms 45.5828μs 21.9381 KOps/s 21.5461 KOps/s $\color{#35bf28}+1.82\%$
test_compile_add_one_nested[pytree-eager] 0.2612ms 0.1208ms 8.2753 KOps/s 8.1832 KOps/s $\color{#35bf28}+1.13\%$
test_compile_copy_nested[tensordict-compile] 57.3470μs 25.4410μs 39.3066 KOps/s 38.2567 KOps/s $\color{#35bf28}+2.74\%$
test_compile_copy_nested[tensordict-eager] 0.1428ms 52.8366μs 18.9263 KOps/s 18.7316 KOps/s $\color{#35bf28}+1.04\%$
test_compile_copy_nested[pytree-compile] 0.1670ms 78.2755μs 12.7754 KOps/s 12.5046 KOps/s $\color{#35bf28}+2.17\%$
test_compile_copy_nested[pytree-eager] 0.1304ms 67.6613μs 14.7795 KOps/s 14.6502 KOps/s $\color{#35bf28}+0.88\%$
test_compile_add_one_flat[tensordict-compile] 0.2206ms 0.1047ms 9.5489 KOps/s 9.4646 KOps/s $\color{#35bf28}+0.89\%$
test_compile_add_one_flat[tensordict-eager] 0.3390ms 0.2000ms 4.9998 KOps/s 4.9424 KOps/s $\color{#35bf28}+1.16\%$
test_compile_add_one_flat[tensorclass-compile] 0.1151ms 44.6318μs 22.4055 KOps/s 21.7748 KOps/s $\color{#35bf28}+2.90\%$
test_compile_add_one_flat[tensorclass-eager] 0.4666ms 61.6442μs 16.2221 KOps/s 15.8419 KOps/s $\color{#35bf28}+2.40\%$
test_compile_add_one_flat[pytree-compile] 0.1854ms 0.1023ms 9.7716 KOps/s 9.5790 KOps/s $\color{#35bf28}+2.01\%$
test_compile_add_one_flat[pytree-eager] 0.3883ms 0.2057ms 4.8625 KOps/s 4.8244 KOps/s $\color{#35bf28}+0.79\%$
test_compile_add_self_flat[tensordict-eager] 0.3377ms 0.2125ms 4.7062 KOps/s 4.6870 KOps/s $\color{#35bf28}+0.41\%$
test_compile_add_self_flat[tensordict-compile] 0.2387ms 0.1093ms 9.1509 KOps/s 9.5041 KOps/s $\color{#d91a1a}-3.72\%$
test_compile_add_self_flat[tensorclass-eager] 0.2012ms 54.4157μs 18.3770 KOps/s 18.2391 KOps/s $\color{#35bf28}+0.76\%$
test_compile_add_self_flat[tensorclass-compile] 0.2192ms 46.8783μs 21.3318 KOps/s 21.4608 KOps/s $\color{#d91a1a}-0.60\%$
test_compile_add_self_flat[pytree-eager] 0.2023ms 0.1644ms 6.0812 KOps/s 6.1947 KOps/s $\color{#d91a1a}-1.83\%$
test_compile_add_self_flat[pytree-compile] 0.1857ms 0.1044ms 9.5741 KOps/s 9.4624 KOps/s $\color{#35bf28}+1.18\%$
test_compile_copy_flat[tensordict-compile] 77.9360μs 21.4325μs 46.6581 KOps/s 48.2293 KOps/s $\color{#d91a1a}-3.26\%$
test_compile_copy_flat[tensordict-eager] 0.1279ms 57.6577μs 17.3437 KOps/s 16.9433 KOps/s $\color{#35bf28}+2.36\%$
test_compile_copy_flat[pytree-compile] 0.1447ms 82.5293μs 12.1169 KOps/s 12.1978 KOps/s $\color{#d91a1a}-0.66\%$
test_compile_copy_flat[pytree-eager] 0.1301ms 69.8363μs 14.3192 KOps/s 14.5506 KOps/s $\color{#d91a1a}-1.59\%$
test_compile_assign_and_add[tensordict-compile] 0.3026ms 0.2096ms 4.7716 KOps/s 4.7223 KOps/s $\color{#35bf28}+1.04\%$
test_compile_assign_and_add[tensordict-eager] 1.4128ms 1.2896ms 775.4208 Ops/s 766.6530 Ops/s $\color{#35bf28}+1.14\%$
test_compile_assign_and_add[pytree-compile] 0.2803ms 0.2033ms 4.9196 KOps/s 4.9012 KOps/s $\color{#35bf28}+0.37\%$
test_compile_assign_and_add[pytree-eager] 1.6469ms 0.7938ms 1.2597 KOps/s 1.2422 KOps/s $\color{#35bf28}+1.41\%$
test_compile_assign_and_add_stack[compile] 0.7599ms 0.4557ms 2.1945 KOps/s 2.1827 KOps/s $\color{#35bf28}+0.54\%$
test_compile_assign_and_add_stack[eager] 4.5998ms 2.6544ms 376.7362 Ops/s 367.0909 Ops/s $\color{#35bf28}+2.63\%$
test_compile_indexing[tensor-tensordict-compile] 0.1029ms 37.3042μs 26.8066 KOps/s 27.9354 KOps/s $\color{#d91a1a}-4.04\%$
test_compile_indexing[tensor-tensordict-eager] 0.4334ms 33.4739μs 29.8740 KOps/s 30.4202 KOps/s $\color{#d91a1a}-1.80\%$
test_compile_indexing[tensor-tensorclass-compile] 86.4920μs 29.9267μs 33.4149 KOps/s 34.1300 KOps/s $\color{#d91a1a}-2.10\%$
test_compile_indexing[tensor-tensorclass-eager] 84.2180μs 23.6075μs 42.3594 KOps/s 41.8739 KOps/s $\color{#35bf28}+1.16\%$
test_compile_indexing[tensor-pytree-compile] 72.9170μs 30.6449μs 32.6319 KOps/s 33.3198 KOps/s $\color{#d91a1a}-2.06\%$
test_compile_indexing[tensor-pytree-eager] 73.1070μs 23.8572μs 41.9160 KOps/s 42.5251 KOps/s $\color{#d91a1a}-1.43\%$
test_compile_indexing[slice-tensordict-compile] 0.1496ms 52.5662μs 19.0236 KOps/s 19.7320 KOps/s $\color{#d91a1a}-3.59\%$
test_compile_indexing[slice-tensordict-eager] 0.4422ms 18.5012μs 54.0505 KOps/s 50.1013 KOps/s $\textbf{\color{#35bf28}+7.88\%}$
test_compile_indexing[slice-tensorclass-compile] 0.1114ms 44.1594μs 22.6452 KOps/s 22.8040 KOps/s $\color{#d91a1a}-0.70\%$
test_compile_indexing[slice-tensorclass-eager] 69.3610μs 18.7885μs 53.2240 KOps/s 52.1680 KOps/s $\color{#35bf28}+2.02\%$
test_compile_indexing[slice-pytree-compile] 0.1287ms 45.2598μs 22.0946 KOps/s 22.6154 KOps/s $\color{#d91a1a}-2.30\%$
test_compile_indexing[slice-pytree-eager] 59.6410μs 18.8256μs 53.1190 KOps/s 52.6519 KOps/s $\color{#35bf28}+0.89\%$
test_compile_indexing[int-tensordict-compile] 0.1324ms 53.6252μs 18.6479 KOps/s 19.4865 KOps/s $\color{#d91a1a}-4.30\%$
test_compile_indexing[int-tensordict-eager] 0.8563ms 18.6833μs 53.5238 KOps/s 50.3021 KOps/s $\textbf{\color{#35bf28}+6.40\%}$
test_compile_indexing[int-tensorclass-compile] 0.1421ms 45.5081μs 21.9741 KOps/s 22.5851 KOps/s $\color{#d91a1a}-2.71\%$
test_compile_indexing[int-tensorclass-eager] 58.8400μs 18.8832μs 52.9572 KOps/s 53.1122 KOps/s $\color{#d91a1a}-0.29\%$
test_compile_indexing[int-pytree-compile] 0.1116ms 45.4677μs 21.9936 KOps/s 22.5833 KOps/s $\color{#d91a1a}-2.61\%$
test_compile_indexing[int-pytree-eager] 52.9290μs 18.6766μs 53.5428 KOps/s 53.3860 KOps/s $\color{#35bf28}+0.29\%$
test_mod_add[eager] 85.1590μs 26.5656μs 37.6427 KOps/s 36.4452 KOps/s $\color{#35bf28}+3.29\%$
test_mod_add[compile] 87.7040μs 44.9307μs 22.2565 KOps/s 22.6155 KOps/s $\color{#d91a1a}-1.59\%$
test_mod_add[compile-overhead] 0.1090ms 44.6655μs 22.3886 KOps/s 22.2786 KOps/s $\color{#35bf28}+0.49\%$
test_mod_wrap[eager] 0.4061ms 0.2103ms 4.7548 KOps/s 4.7084 KOps/s $\color{#35bf28}+0.98\%$
test_mod_wrap[compile] 1.3104ms 0.2052ms 4.8721 KOps/s 4.9012 KOps/s $\color{#d91a1a}-0.59\%$
test_mod_wrap[compile-overhead] 1.3946ms 0.1993ms 5.0171 KOps/s 4.9667 KOps/s $\color{#35bf28}+1.01\%$
test_mod_wrap_and_backward[eager] 16.1648ms 11.7337ms 85.2249 Ops/s 84.4811 Ops/s $\color{#35bf28}+0.88\%$
test_mod_wrap_and_backward[compile] 15.7440ms 13.1389ms 76.1101 Ops/s 73.8649 Ops/s $\color{#35bf28}+3.04\%$
test_mod_wrap_and_backward[compile-overhead] 19.5961ms 12.9941ms 76.9582 Ops/s 77.1895 Ops/s $\color{#d91a1a}-0.30\%$
test_seq_add[eager] 0.2049ms 93.8364μs 10.6568 KOps/s 10.5469 KOps/s $\color{#35bf28}+1.04\%$
test_seq_add[compile] 0.1514ms 60.2857μs 16.5877 KOps/s 16.1993 KOps/s $\color{#35bf28}+2.40\%$
test_seq_add[compile-overhead] 0.1171ms 58.2528μs 17.1666 KOps/s 16.4974 KOps/s $\color{#35bf28}+4.06\%$
test_seq_wrap[eager] 0.7235ms 0.3869ms 2.5846 KOps/s 2.5015 KOps/s $\color{#35bf28}+3.32\%$
test_seq_wrap[compile] 0.4066ms 0.2259ms 4.4261 KOps/s 4.3583 KOps/s $\color{#35bf28}+1.55\%$
test_seq_wrap[compile-overhead] 0.3126ms 0.2216ms 4.5133 KOps/s 4.3252 KOps/s $\color{#35bf28}+4.35\%$
test_func_call_runtime[False-eager] 0.9583ms 0.5403ms 1.8508 KOps/s 1.8136 KOps/s $\color{#35bf28}+2.05\%$
test_func_call_runtime[False-compile] 0.7971ms 0.4278ms 2.3374 KOps/s 2.2955 KOps/s $\color{#35bf28}+1.83\%$
test_func_call_runtime[False-compile-overhead] 0.5384ms 0.4224ms 2.3675 KOps/s 2.2901 KOps/s $\color{#35bf28}+3.38\%$
test_func_call_runtime[True-eager] 0.9169ms 0.7474ms 1.3380 KOps/s 1.3125 KOps/s $\color{#35bf28}+1.94\%$
test_func_call_runtime[True-compile] 0.8573ms 0.4714ms 2.1212 KOps/s 2.1104 KOps/s $\color{#35bf28}+0.51\%$
test_func_call_runtime[True-compile-overhead] 0.5785ms 0.4698ms 2.1288 KOps/s 2.0770 KOps/s $\color{#35bf28}+2.49\%$
test_func_call_cm_runtime[False-eager] 1.0600ms 0.5440ms 1.8382 KOps/s 1.8427 KOps/s $\color{#d91a1a}-0.24\%$
test_func_call_cm_runtime[False-compile] 0.5424ms 0.4279ms 2.3372 KOps/s 2.3054 KOps/s $\color{#35bf28}+1.38\%$
test_func_call_cm_runtime[False-compile-overhead] 0.5278ms 0.4285ms 2.3337 KOps/s 2.3092 KOps/s $\color{#35bf28}+1.06\%$
test_func_call_cm_runtime[True-eager] 1.4430ms 0.8948ms 1.1175 KOps/s 1.1149 KOps/s $\color{#35bf28}+0.24\%$
test_func_call_cm_runtime[True-compile] 0.7507ms 0.4935ms 2.0263 KOps/s 1.9907 KOps/s $\color{#35bf28}+1.79\%$
test_func_call_cm_runtime[True-compile-overhead] 0.8861ms 0.4935ms 2.0262 KOps/s 1.9926 KOps/s $\color{#35bf28}+1.69\%$
test_vmap_func_call_cm_runtime[eager] 2.7281ms 1.8595ms 537.7791 Ops/s 521.5600 Ops/s $\color{#35bf28}+3.11\%$
test_vmap_func_call_cm_runtime[compile] 0.9260ms 0.5147ms 1.9429 KOps/s 1.8686 KOps/s $\color{#35bf28}+3.98\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.6265ms 0.5126ms 1.9510 KOps/s 1.8625 KOps/s $\color{#35bf28}+4.75\%$
test_distributed 0.3312ms 0.1253ms 7.9835 KOps/s 7.6101 KOps/s $\color{#35bf28}+4.91\%$
test_tdmodule 71.9840μs 18.3626μs 54.4585 KOps/s 46.7693 KOps/s $\textbf{\color{#35bf28}+16.44\%}$
test_tdmodule_dispatch 55.9450μs 36.4539μs 27.4319 KOps/s 25.3525 KOps/s $\textbf{\color{#35bf28}+8.20\%}$
test_tdseq 44.6130μs 21.3200μs 46.9042 KOps/s 43.5719 KOps/s $\textbf{\color{#35bf28}+7.65\%}$
test_tdseq_dispatch 63.7290μs 41.9255μs 23.8519 KOps/s 22.3642 KOps/s $\textbf{\color{#35bf28}+6.65\%}$
test_instantiation_functorch 2.0149ms 1.5228ms 656.6669 Ops/s 655.2430 Ops/s $\color{#35bf28}+0.22\%$
test_exec_functorch 0.2898ms 0.1790ms 5.5866 KOps/s 5.5526 KOps/s $\color{#35bf28}+0.61\%$
test_exec_functional_call 0.4236ms 0.1715ms 5.8300 KOps/s 5.8456 KOps/s $\color{#d91a1a}-0.27\%$
test_exec_td_decorator 0.4430ms 0.2256ms 4.4332 KOps/s 4.4517 KOps/s $\color{#d91a1a}-0.42\%$
test_vmap_mlp_speed_decorator[True-True] 0.9670ms 0.6262ms 1.5970 KOps/s 1.5327 KOps/s $\color{#35bf28}+4.20\%$
test_vmap_mlp_speed_decorator[True-False] 1.1542ms 0.6348ms 1.5752 KOps/s 1.4984 KOps/s $\textbf{\color{#35bf28}+5.12\%}$
test_vmap_mlp_speed_decorator[False-True] 0.6928ms 0.5108ms 1.9579 KOps/s 1.8724 KOps/s $\color{#35bf28}+4.56\%$
test_vmap_mlp_speed_decorator[False-False] 0.6299ms 0.5102ms 1.9601 KOps/s 1.8768 KOps/s $\color{#35bf28}+4.44\%$
test_to_module_speed[True] 2.0509ms 1.3051ms 766.2158 Ops/s 769.4951 Ops/s $\color{#d91a1a}-0.43\%$
test_to_module_speed[False] 1.6795ms 1.2638ms 791.2914 Ops/s 799.8932 Ops/s $\color{#d91a1a}-1.08\%$
test_tc_init 0.1331ms 44.7368μs 22.3530 KOps/s 21.8502 KOps/s $\color{#35bf28}+2.30\%$
test_tc_init_nested 0.1581ms 84.7442μs 11.8002 KOps/s 10.8135 KOps/s $\textbf{\color{#35bf28}+9.12\%}$
test_tc_first_layer_tensor 37.4300μs 1.5050μs 664.4648 KOps/s 659.1101 KOps/s $\color{#35bf28}+0.81\%$
test_tc_first_layer_nontensor 26.1490μs 4.6314μs 215.9176 KOps/s 213.6458 KOps/s $\color{#35bf28}+1.06\%$
test_tc_second_layer_tensor 38.3110μs 2.8013μs 356.9792 KOps/s 360.9306 KOps/s $\color{#d91a1a}-1.09\%$
test_tc_second_layer_nontensor 35.1650μs 5.9543μs 167.9464 KOps/s 168.7486 KOps/s $\color{#d91a1a}-0.48\%$
test_unbind 0.2154s 13.3603ms 74.8488 Ops/s 83.4037 Ops/s $\textbf{\color{#d91a1a}-10.26\%}$
test_full_like 14.7735ms 12.9095ms 77.4623 Ops/s 143.0219 Ops/s $\textbf{\color{#d91a1a}-45.84\%}$
test_zeros_like 11.5579ms 7.2392ms 138.1371 Ops/s 371.8699 Ops/s $\textbf{\color{#d91a1a}-62.85\%}$
test_ones_like 14.7841ms 7.4488ms 134.2490 Ops/s 319.7593 Ops/s $\textbf{\color{#d91a1a}-58.02\%}$
test_clone 14.3271ms 8.7731ms 113.9852 Ops/s 204.6920 Ops/s $\textbf{\color{#d91a1a}-44.31\%}$
test_squeeze 57.6580μs 11.7628μs 85.0141 KOps/s 85.2087 KOps/s $\color{#d91a1a}-0.23\%$
test_unsqueeze 0.1601ms 90.1386μs 11.0940 KOps/s 11.3528 KOps/s $\color{#d91a1a}-2.28\%$
test_split 0.4636ms 0.1870ms 5.3475 KOps/s 5.3382 KOps/s $\color{#35bf28}+0.17\%$
test_permute 0.4036ms 0.2182ms 4.5826 KOps/s 4.6172 KOps/s $\color{#d91a1a}-0.75\%$
test_stack 26.9269ms 24.0736ms 41.5393 Ops/s 41.2286 Ops/s $\color{#35bf28}+0.75\%$
test_cat 24.3831ms 23.9502ms 41.7534 Ops/s 42.0149 Ops/s $\color{#d91a1a}-0.62\%$

Copy link

github-actions bot commented Nov 6, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 229. Improved: $\large\color{#35bf28}10$. Worsened: $\large\color{#d91a1a}14$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 34.6020μs 10.1823μs 98.2093 KOps/s 96.0977 KOps/s $\color{#35bf28}+2.20\%$
test_plain_set_stack_nested 43.5830μs 10.2484μs 97.5760 KOps/s 95.5481 KOps/s $\color{#35bf28}+2.12\%$
test_plain_set_nested_inplace 72.4540μs 11.0967μs 90.1167 KOps/s 88.5409 KOps/s $\color{#35bf28}+1.78\%$
test_plain_set_stack_nested_inplace 38.1620μs 11.1174μs 89.9490 KOps/s 88.9636 KOps/s $\color{#35bf28}+1.11\%$
test_items 0.1199ms 3.0279μs 330.2618 KOps/s 341.5931 KOps/s $\color{#d91a1a}-3.32\%$
test_items_nested 0.3460ms 0.3212ms 3.1131 KOps/s 3.1514 KOps/s $\color{#d91a1a}-1.22\%$
test_items_nested_locked 0.3646ms 0.3205ms 3.1197 KOps/s 3.1312 KOps/s $\color{#d91a1a}-0.37\%$
test_items_nested_leaf 0.1249ms 58.4615μs 17.1053 KOps/s 17.2946 KOps/s $\color{#d91a1a}-1.09\%$
test_items_stack_nested 0.3677ms 0.3231ms 3.0949 KOps/s 3.1258 KOps/s $\color{#d91a1a}-0.99\%$
test_items_stack_nested_leaf 0.1721ms 59.1416μs 16.9086 KOps/s 16.8794 KOps/s $\color{#35bf28}+0.17\%$
test_items_stack_nested_locked 0.4062ms 0.3220ms 3.1052 KOps/s 3.1143 KOps/s $\color{#d91a1a}-0.29\%$
test_keys 30.8210μs 3.4441μs 290.3512 KOps/s 290.2077 KOps/s $\color{#35bf28}+0.05\%$
test_keys_nested 0.1417ms 70.3943μs 14.2057 KOps/s 14.2523 KOps/s $\color{#d91a1a}-0.33\%$
test_keys_nested_locked 2.8584ms 75.7368μs 13.2036 KOps/s 13.1367 KOps/s $\color{#35bf28}+0.51\%$
test_keys_nested_leaf 91.6750μs 61.2345μs 16.3307 KOps/s 16.2369 KOps/s $\color{#35bf28}+0.58\%$
test_keys_stack_nested 0.1120ms 70.7067μs 14.1429 KOps/s 13.9740 KOps/s $\color{#35bf28}+1.21\%$
test_keys_stack_nested_leaf 0.1000ms 61.7243μs 16.2011 KOps/s 15.8775 KOps/s $\color{#35bf28}+2.04\%$
test_keys_stack_nested_locked 0.1318ms 76.1715μs 13.1283 KOps/s 13.1489 KOps/s $\color{#d91a1a}-0.16\%$
test_values 6.6837μs 0.8438μs 1.1851 MOps/s 1.1869 MOps/s $\color{#d91a1a}-0.15\%$
test_values_nested 54.5230μs 31.2133μs 32.0377 KOps/s 32.1917 KOps/s $\color{#d91a1a}-0.48\%$
test_values_nested_locked 55.8930μs 32.7872μs 30.4997 KOps/s 30.7028 KOps/s $\color{#d91a1a}-0.66\%$
test_values_nested_leaf 0.1004ms 33.8820μs 29.5142 KOps/s 29.8666 KOps/s $\color{#d91a1a}-1.18\%$
test_values_stack_nested 69.0540μs 31.6986μs 31.5471 KOps/s 31.4536 KOps/s $\color{#35bf28}+0.30\%$
test_values_stack_nested_leaf 57.8930μs 34.2329μs 29.2117 KOps/s 29.1563 KOps/s $\color{#35bf28}+0.19\%$
test_values_stack_nested_locked 64.3630μs 33.0975μs 30.2138 KOps/s 30.1436 KOps/s $\color{#35bf28}+0.23\%$
test_membership 1.9426μs 0.5048μs 1.9810 MOps/s 1.9814 MOps/s $\color{#d91a1a}-0.02\%$
test_membership_nested 14.5755μs 1.9122μs 522.9489 KOps/s 520.4250 KOps/s $\color{#35bf28}+0.48\%$
test_membership_nested_leaf 93.8245μs 1.9286μs 518.5127 KOps/s 509.5950 KOps/s $\color{#35bf28}+1.75\%$
test_membership_stacked_nested 26.3020μs 2.0122μs 496.9590 KOps/s 503.2057 KOps/s $\color{#d91a1a}-1.24\%$
test_membership_stacked_nested_leaf 18.0610μs 2.0180μs 495.5288 KOps/s 498.3972 KOps/s $\color{#d91a1a}-0.58\%$
test_membership_nested_last 28.6720μs 2.8333μs 352.9478 KOps/s 356.4412 KOps/s $\color{#d91a1a}-0.98\%$
test_membership_nested_leaf_last 25.7510μs 2.8615μs 349.4680 KOps/s 349.9063 KOps/s $\color{#d91a1a}-0.13\%$
test_membership_stacked_nested_last 27.6110μs 3.2908μs 303.8770 KOps/s 305.7269 KOps/s $\color{#d91a1a}-0.61\%$
test_membership_stacked_nested_leaf_last 31.9920μs 3.2971μs 303.2939 KOps/s 307.6735 KOps/s $\color{#d91a1a}-1.42\%$
test_nested_getleaf 27.2920μs 5.9865μs 167.0418 KOps/s 165.3164 KOps/s $\color{#35bf28}+1.04\%$
test_nested_get 30.8820μs 5.7015μs 175.3922 KOps/s 174.9582 KOps/s $\color{#35bf28}+0.25\%$
test_stacked_getleaf 88.8750μs 6.0001μs 166.6630 KOps/s 167.2222 KOps/s $\color{#d91a1a}-0.33\%$
test_stacked_get 38.8620μs 5.6623μs 176.6064 KOps/s 174.3474 KOps/s $\color{#35bf28}+1.30\%$
test_nested_getitemleaf 30.7310μs 6.0775μs 164.5410 KOps/s 163.6658 KOps/s $\color{#35bf28}+0.53\%$
test_nested_getitem 34.7420μs 5.7650μs 173.4608 KOps/s 171.2445 KOps/s $\color{#35bf28}+1.29\%$
test_stacked_getitemleaf 31.6310μs 6.0850μs 164.3392 KOps/s 163.9898 KOps/s $\color{#35bf28}+0.21\%$
test_stacked_getitem 0.1676ms 5.7515μs 173.8664 KOps/s 173.8906 KOps/s $\color{#d91a1a}-0.01\%$
test_lock_nested 7.3868ms 0.3735ms 2.6775 KOps/s 2.7504 KOps/s $\color{#d91a1a}-2.65\%$
test_lock_stack_nested 0.4638ms 0.3367ms 2.9699 KOps/s 3.0369 KOps/s $\color{#d91a1a}-2.21\%$
test_unlock_nested 0.7822ms 0.3076ms 3.2509 KOps/s 3.3455 KOps/s $\color{#d91a1a}-2.83\%$
test_unlock_stack_nested 0.3093ms 0.2769ms 3.6117 KOps/s 3.7370 KOps/s $\color{#d91a1a}-3.35\%$
test_flatten_speed 94.5150μs 72.1598μs 13.8581 KOps/s 13.9019 KOps/s $\color{#d91a1a}-0.31\%$
test_unflatten_speed 0.3245ms 0.2914ms 3.4314 KOps/s 3.4526 KOps/s $\color{#d91a1a}-0.61\%$
test_common_ops 1.7633ms 0.5807ms 1.7221 KOps/s 1.7360 KOps/s $\color{#d91a1a}-0.80\%$
test_creation 0.1605ms 1.5040μs 664.9141 KOps/s 666.3993 KOps/s $\color{#d91a1a}-0.22\%$
test_creation_empty 39.2020μs 6.7715μs 147.6772 KOps/s 141.0319 KOps/s $\color{#35bf28}+4.71\%$
test_creation_nested_1 31.9920μs 8.2945μs 120.5611 KOps/s 117.2644 KOps/s $\color{#35bf28}+2.81\%$
test_creation_nested_2 44.5730μs 10.8119μs 92.4910 KOps/s 89.8967 KOps/s $\color{#35bf28}+2.89\%$
test_clone 45.8920μs 10.8592μs 92.0881 KOps/s 97.4668 KOps/s $\textbf{\color{#d91a1a}-5.52\%}$
test_getitem[int] 1.4610ms 11.1550μs 89.6456 KOps/s 95.5430 KOps/s $\textbf{\color{#d91a1a}-6.17\%}$
test_getitem[slice_int] 0.1919ms 21.5484μs 46.4072 KOps/s 46.8747 KOps/s $\color{#d91a1a}-1.00\%$
test_getitem[range] 0.1455ms 39.5678μs 25.2730 KOps/s 26.4961 KOps/s $\color{#d91a1a}-4.62\%$
test_getitem[tuple] 0.2542ms 18.8717μs 52.9893 KOps/s 54.1124 KOps/s $\color{#d91a1a}-2.08\%$
test_getitem[list] 0.2540ms 33.8003μs 29.5855 KOps/s 30.4636 KOps/s $\color{#d91a1a}-2.88\%$
test_setitem_dim[int] 91.4140μs 19.7121μs 50.7303 KOps/s 51.5893 KOps/s $\color{#d91a1a}-1.67\%$
test_setitem_dim[slice_int] 60.4930μs 38.6624μs 25.8649 KOps/s 24.4998 KOps/s $\textbf{\color{#35bf28}+5.57\%}$
test_setitem_dim[range] 0.1878ms 55.1011μs 18.1485 KOps/s 17.9889 KOps/s $\color{#35bf28}+0.89\%$
test_setitem_dim[tuple] 71.0030μs 33.6152μs 29.7484 KOps/s 31.6501 KOps/s $\textbf{\color{#d91a1a}-6.01\%}$
test_setitem 92.8640μs 14.6095μs 68.4487 KOps/s 70.7637 KOps/s $\color{#d91a1a}-3.27\%$
test_set 88.2950μs 14.2060μs 70.3927 KOps/s 72.6478 KOps/s $\color{#d91a1a}-3.10\%$
test_set_shared 1.6216ms 0.1469ms 6.8083 KOps/s 6.8398 KOps/s $\color{#d91a1a}-0.46\%$
test_update 0.4915ms 17.2070μs 58.1159 KOps/s 64.1102 KOps/s $\textbf{\color{#d91a1a}-9.35\%}$
test_update_nested 0.2869ms 21.0562μs 47.4919 KOps/s 49.8554 KOps/s $\color{#d91a1a}-4.74\%$
test_update__nested 0.1381ms 24.3879μs 41.0040 KOps/s 42.5880 KOps/s $\color{#d91a1a}-3.72\%$
test_set_nested 89.4440μs 15.1303μs 66.0927 KOps/s 68.8897 KOps/s $\color{#d91a1a}-4.06\%$
test_set_nested_new 85.3740μs 17.4302μs 57.3718 KOps/s 59.1747 KOps/s $\color{#d91a1a}-3.05\%$
test_select 0.1669ms 28.8220μs 34.6957 KOps/s 35.4121 KOps/s $\color{#d91a1a}-2.02\%$
test_select_nested 74.6340μs 41.9757μs 23.8233 KOps/s 23.9014 KOps/s $\color{#d91a1a}-0.33\%$
test_exclude_nested 89.1850μs 59.3023μs 16.8627 KOps/s 16.8871 KOps/s $\color{#d91a1a}-0.14\%$
test_empty[True] 0.2988ms 0.2562ms 3.9036 KOps/s 3.8635 KOps/s $\color{#35bf28}+1.04\%$
test_empty[False] 3.5592μs 0.7370μs 1.3568 MOps/s 1.3372 MOps/s $\color{#35bf28}+1.46\%$
test_to 86.7140μs 54.6000μs 18.3150 KOps/s 18.2256 KOps/s $\color{#35bf28}+0.49\%$
test_to_nonblocking 0.1991ms 49.3272μs 20.2728 KOps/s 20.5791 KOps/s $\color{#d91a1a}-1.49\%$
test_unbind_speed 0.2765ms 0.2361ms 4.2355 KOps/s 4.3327 KOps/s $\color{#d91a1a}-2.24\%$
test_unbind_speed_stack0 0.3613ms 0.2321ms 4.3091 KOps/s 4.3425 KOps/s $\color{#d91a1a}-0.77\%$
test_unbind_speed_stack1 94.1598ms 0.6549ms 1.5269 KOps/s 1.5511 KOps/s $\color{#d91a1a}-1.56\%$
test_split 96.2714ms 1.5898ms 629.0144 Ops/s 624.6448 Ops/s $\color{#35bf28}+0.70\%$
test_chunk 96.3091ms 1.6105ms 620.9378 Ops/s 623.5865 Ops/s $\color{#d91a1a}-0.42\%$
test_consolidate[False-None] 97.7003ms 2.9383ms 340.3313 Ops/s 337.4637 Ops/s $\color{#35bf28}+0.85\%$
test_consolidate[default-None] 2.0288ms 1.6364ms 611.0972 Ops/s 605.2933 Ops/s $\color{#35bf28}+0.96\%$
test_consolidate[reduce-overhead-None] 1.8039ms 1.6567ms 603.6228 Ops/s 589.9326 Ops/s $\color{#35bf28}+2.32\%$
test_consolidate_njt[False-None] 7.1847ms 6.6851ms 149.5854 Ops/s 151.3051 Ops/s $\color{#d91a1a}-1.14\%$
test_to[False-False-None] 2.1584ms 1.7508ms 571.1705 Ops/s 574.0497 Ops/s $\color{#d91a1a}-0.50\%$
test_to[True-False-None] 1.7964ms 1.3909ms 718.9706 Ops/s 743.3893 Ops/s $\color{#d91a1a}-3.28\%$
test_to[within-False-None] 0.3449s 5.5174ms 181.2453 Ops/s 243.9720 Ops/s $\textbf{\color{#d91a1a}-25.71\%}$
test_to[True-default-None] 5.7705ms 5.2938ms 188.9005 Ops/s 198.8959 Ops/s $\textbf{\color{#d91a1a}-5.03\%}$
test_to_njt[False-False-None] 7.5253ms 7.0966ms 140.9134 Ops/s 143.9289 Ops/s $\color{#d91a1a}-2.10\%$
test_to_njt[True-False-None] 6.2410ms 5.6222ms 177.8663 Ops/s 173.3391 Ops/s $\color{#35bf28}+2.61\%$
test_to_njt[within-False-None] 13.1502ms 12.5742ms 79.5280 Ops/s 77.2664 Ops/s $\color{#35bf28}+2.93\%$
test_creation[device0] 0.4768ms 83.8392μs 11.9276 KOps/s 12.0450 KOps/s $\color{#d91a1a}-0.97\%$
test_creation_from_tensor 0.5124ms 87.5003μs 11.4285 KOps/s 11.3559 KOps/s $\color{#35bf28}+0.64\%$
test_add_one[memmap_tensor0] 0.3972ms 7.3130μs 136.7423 KOps/s 144.7304 KOps/s $\textbf{\color{#d91a1a}-5.52\%}$
test_contiguous[memmap_tensor0] 19.5835μs 0.4237μs 2.3601 MOps/s 2.3249 MOps/s $\color{#35bf28}+1.52\%$
test_stack[memmap_tensor0] 0.4015ms 4.6134μs 216.7609 KOps/s 228.9296 KOps/s $\textbf{\color{#d91a1a}-5.32\%}$
test_memmaptd_index 2.0482ms 0.2595ms 3.8534 KOps/s 4.0430 KOps/s $\color{#d91a1a}-4.69\%$
test_memmaptd_index_astensor 0.5922ms 0.3209ms 3.1158 KOps/s 3.2634 KOps/s $\color{#d91a1a}-4.52\%$
test_memmaptd_index_op 0.9944ms 0.5790ms 1.7271 KOps/s 1.7834 KOps/s $\color{#d91a1a}-3.16\%$
test_serialize_model 0.1305s 0.1298s 7.7029 Ops/s 5.3945 Ops/s $\textbf{\color{#35bf28}+42.79\%}$
test_serialize_model_pickle 1.3625s 1.2181s 0.8210 Ops/s 0.8230 Ops/s $\color{#d91a1a}-0.25\%$
test_serialize_weights 0.1303s 0.1295s 7.7201 Ops/s 7.6822 Ops/s $\color{#35bf28}+0.49\%$
test_serialize_weights_returnearly 0.3742s 68.5731ms 14.5830 Ops/s 23.4360 Ops/s $\textbf{\color{#d91a1a}-37.78\%}$
test_serialize_weights_pickle 1.3485s 1.2114s 0.8255 Ops/s 0.8152 Ops/s $\color{#35bf28}+1.26\%$
test_reshape_pytree 0.1569ms 22.3953μs 44.6522 KOps/s 44.5657 KOps/s $\color{#35bf28}+0.19\%$
test_reshape_td 0.1596ms 27.0213μs 37.0079 KOps/s 37.6589 KOps/s $\color{#d91a1a}-1.73\%$
test_view_pytree 0.1348ms 22.1948μs 45.0555 KOps/s 45.5099 KOps/s $\color{#d91a1a}-1.00\%$
test_view_td 0.1678ms 29.7875μs 33.5712 KOps/s 35.0475 KOps/s $\color{#d91a1a}-4.21\%$
test_unbind_pytree 80.1440μs 27.8077μs 35.9613 KOps/s 36.2657 KOps/s $\color{#d91a1a}-0.84\%$
test_unbind_td 0.6383ms 35.4146μs 28.2369 KOps/s 28.1515 KOps/s $\color{#35bf28}+0.30\%$
test_split_pytree 0.1484ms 29.8239μs 33.5301 KOps/s 34.0598 KOps/s $\color{#d91a1a}-1.56\%$
test_split_td 0.7539ms 38.6296μs 25.8869 KOps/s 25.9459 KOps/s $\color{#d91a1a}-0.23\%$
test_add_pytree 0.1490ms 34.6833μs 28.8323 KOps/s 28.3500 KOps/s $\color{#35bf28}+1.70\%$
test_add_td 0.1514ms 45.9897μs 21.7440 KOps/s 20.6874 KOps/s $\textbf{\color{#35bf28}+5.11\%}$
test_compile_add_one_nested[tensordict-compile] 0.1703ms 0.1210ms 8.2634 KOps/s 7.9847 KOps/s $\color{#35bf28}+3.49\%$
test_compile_add_one_nested[tensordict-eager] 0.2697ms 0.1249ms 8.0091 KOps/s 7.7121 KOps/s $\color{#35bf28}+3.85\%$
test_compile_add_one_nested[pytree-compile] 0.2426ms 0.1010ms 9.8961 KOps/s 9.8052 KOps/s $\color{#35bf28}+0.93\%$
test_compile_add_one_nested[pytree-eager] 1.1111ms 0.1540ms 6.4951 KOps/s 6.3616 KOps/s $\color{#35bf28}+2.10\%$
test_compile_copy_nested[tensordict-compile] 0.1040ms 23.1066μs 43.2777 KOps/s 42.1182 KOps/s $\color{#35bf28}+2.75\%$
test_compile_copy_nested[tensordict-eager] 0.1146ms 27.0778μs 36.9307 KOps/s 35.3962 KOps/s $\color{#35bf28}+4.34\%$
test_compile_copy_nested[pytree-compile] 0.2246ms 64.7517μs 15.4436 KOps/s 15.2846 KOps/s $\color{#35bf28}+1.04\%$
test_compile_copy_nested[pytree-eager] 0.1656ms 49.5018μs 20.2013 KOps/s 20.2421 KOps/s $\color{#d91a1a}-0.20\%$
test_compile_add_one_flat[tensordict-compile] 0.2933ms 0.1458ms 6.8600 KOps/s 6.7018 KOps/s $\color{#35bf28}+2.36\%$
test_compile_add_one_flat[tensordict-eager] 0.3531ms 0.2100ms 4.7609 KOps/s 4.7469 KOps/s $\color{#35bf28}+0.29\%$
test_compile_add_one_flat[tensorclass-compile] 0.2471ms 0.1005ms 9.9519 KOps/s 10.0067 KOps/s $\color{#d91a1a}-0.55\%$
test_compile_add_one_flat[tensorclass-eager] 0.1980ms 53.6061μs 18.6546 KOps/s 17.8991 KOps/s $\color{#35bf28}+4.22\%$
test_compile_add_one_flat[pytree-compile] 0.3273ms 0.1443ms 6.9318 KOps/s 6.8194 KOps/s $\color{#35bf28}+1.65\%$
test_compile_add_one_flat[pytree-eager] 0.9038ms 0.5054ms 1.9788 KOps/s 1.9415 KOps/s $\color{#35bf28}+1.92\%$
test_compile_add_self_flat[tensordict-eager] 0.6205ms 0.2495ms 4.0086 KOps/s 3.9558 KOps/s $\color{#35bf28}+1.33\%$
test_compile_add_self_flat[tensordict-compile] 0.2340ms 0.1444ms 6.9260 KOps/s 6.9294 KOps/s $\color{#d91a1a}-0.05\%$
test_compile_add_self_flat[tensorclass-eager] 0.4552ms 62.8659μs 15.9069 KOps/s 15.5624 KOps/s $\color{#35bf28}+2.21\%$
test_compile_add_self_flat[tensorclass-compile] 0.5121ms 99.3646μs 10.0639 KOps/s 10.1912 KOps/s $\color{#d91a1a}-1.25\%$
test_compile_add_self_flat[pytree-eager] 0.6062ms 0.4256ms 2.3495 KOps/s 2.3089 KOps/s $\color{#35bf28}+1.76\%$
test_compile_add_self_flat[pytree-compile] 0.5355ms 0.1402ms 7.1328 KOps/s 7.0582 KOps/s $\color{#35bf28}+1.06\%$
test_compile_copy_flat[tensordict-compile] 0.4056ms 19.5685μs 51.1025 KOps/s 54.2296 KOps/s $\textbf{\color{#d91a1a}-5.77\%}$
test_compile_copy_flat[tensordict-eager] 0.4084ms 27.5388μs 36.3124 KOps/s 35.0560 KOps/s $\color{#35bf28}+3.58\%$
test_compile_copy_flat[pytree-compile] 0.4429ms 70.4073μs 14.2031 KOps/s 14.4243 KOps/s $\color{#d91a1a}-1.53\%$
test_compile_copy_flat[pytree-eager] 0.4184ms 51.7424μs 19.3265 KOps/s 19.3247 KOps/s $+0.01\%$
test_compile_assign_and_add[tensordict-compile] 1.7011ms 0.4591ms 2.1783 KOps/s 2.1778 KOps/s $\color{#35bf28}+0.02\%$
test_compile_assign_and_add[tensordict-eager] 3.0322ms 2.7899ms 358.4299 Ops/s 374.9893 Ops/s $\color{#d91a1a}-4.42\%$
test_compile_assign_and_add[pytree-compile] 1.6423ms 0.4476ms 2.2339 KOps/s 2.2163 KOps/s $\color{#35bf28}+0.79\%$
test_compile_assign_and_add[pytree-eager] 3.0144ms 2.8023ms 356.8500 Ops/s 360.9072 Ops/s $\color{#d91a1a}-1.12\%$
test_compile_indexing[tensor-tensordict-compile] 0.2734ms 0.1208ms 8.2792 KOps/s 8.1871 KOps/s $\color{#35bf28}+1.12\%$
test_compile_indexing[tensor-tensordict-eager] 0.5702ms 83.8492μs 11.9262 KOps/s 11.3951 KOps/s $\color{#35bf28}+4.66\%$
test_compile_indexing[tensor-tensorclass-compile] 0.2627ms 0.1122ms 8.9088 KOps/s 9.0075 KOps/s $\color{#d91a1a}-1.10\%$
test_compile_indexing[tensor-tensorclass-eager] 0.2549ms 71.2261μs 14.0398 KOps/s 13.5603 KOps/s $\color{#35bf28}+3.54\%$
test_compile_indexing[tensor-pytree-compile] 0.2660ms 0.1175ms 8.5115 KOps/s 8.8215 KOps/s $\color{#d91a1a}-3.51\%$
test_compile_indexing[tensor-pytree-eager] 0.2637ms 74.0152μs 13.5107 KOps/s 13.5663 KOps/s $\color{#d91a1a}-0.41\%$
test_compile_indexing[slice-tensordict-compile] 0.2555ms 0.1035ms 9.6611 KOps/s 9.6162 KOps/s $\color{#35bf28}+0.47\%$
test_compile_indexing[slice-tensordict-eager] 0.4057ms 17.7829μs 56.2337 KOps/s 56.9402 KOps/s $\color{#d91a1a}-1.24\%$
test_compile_indexing[slice-tensorclass-compile] 0.5017ms 0.1038ms 9.6355 KOps/s 10.1254 KOps/s $\color{#d91a1a}-4.84\%$
test_compile_indexing[slice-tensorclass-eager] 0.1175ms 16.1418μs 61.9509 KOps/s 62.9286 KOps/s $\color{#d91a1a}-1.55\%$
test_compile_indexing[slice-pytree-compile] 0.5156ms 0.1048ms 9.5451 KOps/s 10.0390 KOps/s $\color{#d91a1a}-4.92\%$
test_compile_indexing[slice-pytree-eager] 0.1346ms 15.8791μs 62.9757 KOps/s 63.7282 KOps/s $\color{#d91a1a}-1.18\%$
test_compile_indexing[int-tensordict-compile] 0.5059ms 0.1089ms 9.1790 KOps/s 9.1823 KOps/s $\color{#d91a1a}-0.04\%$
test_compile_indexing[int-tensordict-eager] 0.5981ms 17.2729μs 57.8942 KOps/s 57.7925 KOps/s $\color{#35bf28}+0.18\%$
test_compile_indexing[int-tensorclass-compile] 0.4964ms 99.2797μs 10.0726 KOps/s 10.0095 KOps/s $\color{#35bf28}+0.63\%$
test_compile_indexing[int-tensorclass-eager] 0.1247ms 15.9224μs 62.8045 KOps/s 62.4770 KOps/s $\color{#35bf28}+0.52\%$
test_compile_indexing[int-pytree-compile] 0.2378ms 98.5960μs 10.1424 KOps/s 10.0179 KOps/s $\color{#35bf28}+1.24\%$
test_compile_indexing[int-pytree-eager] 0.1608ms 15.9523μs 62.6871 KOps/s 63.5667 KOps/s $\color{#d91a1a}-1.38\%$
test_mod_add[eager] 0.2003ms 31.2572μs 31.9927 KOps/s 30.6392 KOps/s $\color{#35bf28}+4.42\%$
test_mod_add[compile] 0.4629ms 77.7420μs 12.8631 KOps/s 12.6423 KOps/s $\color{#35bf28}+1.75\%$
test_mod_add[compile-overhead] 0.3125ms 0.1629ms 6.1374 KOps/s 5.7304 KOps/s $\textbf{\color{#35bf28}+7.10\%}$
test_mod_wrap[eager] 0.3791ms 0.2477ms 4.0366 KOps/s 3.7391 KOps/s $\textbf{\color{#35bf28}+7.96\%}$
test_mod_wrap[compile] 1.5775ms 0.2880ms 3.4716 KOps/s 3.4564 KOps/s $\color{#35bf28}+0.44\%$
test_mod_wrap[compile-overhead] 7.4462ms 3.9852ms 250.9308 Ops/s 321.0841 Ops/s $\textbf{\color{#d91a1a}-21.85\%}$
test_mod_wrap_and_backward[eager] 1.5617ms 1.3740ms 727.7842 Ops/s 695.9980 Ops/s $\color{#35bf28}+4.57\%$
test_mod_wrap_and_backward[compile] 1.4270ms 1.2773ms 782.8928 Ops/s 777.2548 Ops/s $\color{#35bf28}+0.73\%$
test_mod_wrap_and_backward[compile-overhead] 1.3693ms 0.9261ms 1.0798 KOps/s 1.0662 KOps/s $\color{#35bf28}+1.27\%$
test_seq_add[eager] 0.2825ms 94.6481μs 10.5655 KOps/s 9.8468 KOps/s $\textbf{\color{#35bf28}+7.30\%}$
test_seq_add[compile] 0.2367ms 88.5423μs 11.2940 KOps/s 11.3599 KOps/s $\color{#d91a1a}-0.58\%$
test_seq_add[compile-overhead] 0.2808ms 0.1298ms 7.7067 KOps/s 7.7647 KOps/s $\color{#d91a1a}-0.75\%$
test_seq_wrap[eager] 0.5327ms 0.3843ms 2.6023 KOps/s 2.4393 KOps/s $\textbf{\color{#35bf28}+6.68\%}$
test_seq_wrap[compile] 0.4590ms 0.3031ms 3.2993 KOps/s 3.2621 KOps/s $\color{#35bf28}+1.14\%$
test_seq_wrap[compile-overhead] 0.3588ms 0.2257ms 4.4305 KOps/s 4.4286 KOps/s $\color{#35bf28}+0.04\%$
test_func_call_runtime[False-eager] 0.9579ms 0.7640ms 1.3089 KOps/s 1.2604 KOps/s $\color{#35bf28}+3.85\%$
test_func_call_runtime[False-compile] 1.1019ms 0.7583ms 1.3188 KOps/s 1.3159 KOps/s $\color{#35bf28}+0.22\%$
test_func_call_runtime[False-compile-overhead] 0.4602ms 0.3662ms 2.7304 KOps/s 2.7248 KOps/s $\color{#35bf28}+0.21\%$
test_func_call_runtime[True-eager] 1.0699ms 0.9175ms 1.0899 KOps/s 1.0682 KOps/s $\color{#35bf28}+2.04\%$
test_func_call_runtime[True-compile] 0.9641ms 0.7872ms 1.2703 KOps/s 1.2770 KOps/s $\color{#d91a1a}-0.52\%$
test_func_call_runtime[True-compile-overhead] 0.5331ms 0.3858ms 2.5923 KOps/s 2.5912 KOps/s $\color{#35bf28}+0.04\%$
test_func_call_cm_runtime[False-eager] 0.9178ms 0.7572ms 1.3206 KOps/s 1.3188 KOps/s $\color{#35bf28}+0.14\%$
test_func_call_cm_runtime[False-compile] 1.0349ms 0.7694ms 1.2997 KOps/s 1.3166 KOps/s $\color{#d91a1a}-1.29\%$
test_func_call_cm_runtime[False-compile-overhead] 0.4432ms 0.3660ms 2.7325 KOps/s 2.7192 KOps/s $\color{#35bf28}+0.49\%$
test_func_call_cm_runtime[True-eager] 1.1885ms 1.0254ms 975.2053 Ops/s 968.4188 Ops/s $\color{#35bf28}+0.70\%$
test_func_call_cm_runtime[True-compile] 0.9796ms 0.8178ms 1.2228 KOps/s 1.2288 KOps/s $\color{#d91a1a}-0.49\%$
test_func_call_cm_runtime[True-compile-overhead] 0.5762ms 0.4122ms 2.4259 KOps/s 2.4147 KOps/s $\color{#35bf28}+0.46\%$
test_vmap_func_call_cm_runtime[eager] 2.6079ms 2.1031ms 475.4776 Ops/s 471.5288 Ops/s $\color{#35bf28}+0.84\%$
test_vmap_func_call_cm_runtime[compile] 0.9912ms 0.8245ms 1.2129 KOps/s 1.2201 KOps/s $\color{#d91a1a}-0.59\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.5309ms 0.4153ms 2.4077 KOps/s 2.4049 KOps/s $\color{#35bf28}+0.11\%$
test_distributed 5.4884ms 0.1749ms 5.7172 KOps/s 8.7089 KOps/s $\textbf{\color{#d91a1a}-34.35\%}$
test_tdmodule 26.1620μs 12.9235μs 77.3781 KOps/s 70.3322 KOps/s $\textbf{\color{#35bf28}+10.02\%}$
test_tdmodule_dispatch 0.3216ms 25.7678μs 38.8081 KOps/s 37.4646 KOps/s $\color{#35bf28}+3.59\%$
test_tdseq 32.8510μs 14.4655μs 69.1298 KOps/s 65.5719 KOps/s $\textbf{\color{#35bf28}+5.43\%}$
test_tdseq_dispatch 49.8230μs 29.0729μs 34.3963 KOps/s 33.1810 KOps/s $\color{#35bf28}+3.66\%$
test_instantiation_functorch 1.6529ms 1.5717ms 636.2679 Ops/s 643.0600 Ops/s $\color{#d91a1a}-1.06\%$
test_exec_functorch 0.1993ms 0.1477ms 6.7702 KOps/s 6.8733 KOps/s $\color{#d91a1a}-1.50\%$
test_exec_functional_call 0.2785ms 0.1422ms 7.0333 KOps/s 7.1324 KOps/s $\color{#d91a1a}-1.39\%$
test_exec_td_decorator 0.3802ms 0.1858ms 5.3815 KOps/s 5.4126 KOps/s $\color{#d91a1a}-0.58\%$
test_vmap_mlp_speed_decorator[True-True] 0.8284ms 0.6785ms 1.4739 KOps/s 1.4753 KOps/s $\color{#d91a1a}-0.10\%$
test_vmap_mlp_speed_decorator[True-False] 0.8379ms 0.6767ms 1.4778 KOps/s 1.4731 KOps/s $\color{#35bf28}+0.32\%$
test_vmap_mlp_speed_decorator[False-True] 0.8382ms 0.5968ms 1.6755 KOps/s 1.6796 KOps/s $\color{#d91a1a}-0.24\%$
test_vmap_mlp_speed_decorator[False-False] 0.7723ms 0.5992ms 1.6688 KOps/s 1.6787 KOps/s $\color{#d91a1a}-0.59\%$
test_vmap_transformer_speed_decorator[True-True] 19.7322ms 19.5683ms 51.1032 Ops/s 50.5350 Ops/s $\color{#35bf28}+1.12\%$
test_vmap_transformer_speed_decorator[True-False] 20.2447ms 19.6038ms 51.0105 Ops/s 50.6313 Ops/s $\color{#35bf28}+0.75\%$
test_vmap_transformer_speed_decorator[False-True] 19.6611ms 19.4615ms 51.3835 Ops/s 49.7990 Ops/s $\color{#35bf28}+3.18\%$
test_vmap_transformer_speed_decorator[False-False] 19.6521ms 19.4745ms 51.3491 Ops/s 50.7458 Ops/s $\color{#35bf28}+1.19\%$
test_to_module_speed[True] 1.0477ms 0.9309ms 1.0742 KOps/s 1.0749 KOps/s $\color{#d91a1a}-0.06\%$
test_to_module_speed[False] 1.2732ms 0.9150ms 1.0928 KOps/s 1.0919 KOps/s $\color{#35bf28}+0.09\%$
test_tc_init 67.8130μs 33.9797μs 29.4294 KOps/s 29.4953 KOps/s $\color{#d91a1a}-0.22\%$
test_tc_init_nested 0.1188ms 70.9259μs 14.0992 KOps/s 14.5826 KOps/s $\color{#d91a1a}-3.32\%$
test_tc_first_layer_tensor 4.8017μs 0.7196μs 1.3896 MOps/s 1.3061 MOps/s $\textbf{\color{#35bf28}+6.39\%}$
test_tc_first_layer_nontensor 30.8820μs 2.3950μs 417.5402 KOps/s 423.4711 KOps/s $\color{#d91a1a}-1.40\%$
test_tc_second_layer_tensor 10.8340μs 1.4900μs 671.1400 KOps/s 655.3326 KOps/s $\color{#35bf28}+2.41\%$
test_tc_second_layer_nontensor 24.8610μs 3.2227μs 310.2968 KOps/s 321.7122 KOps/s $\color{#d91a1a}-3.55\%$
test_unbind 0.2371s 10.1079ms 98.9329 Ops/s 148.2861 Ops/s $\textbf{\color{#d91a1a}-33.28\%}$
test_full_like 10.3272ms 9.5750ms 104.4384 Ops/s 103.6498 Ops/s $\color{#35bf28}+0.76\%$
test_zeros_like 5.2907ms 4.3677ms 228.9526 Ops/s 232.5578 Ops/s $\color{#d91a1a}-1.55\%$
test_ones_like 9.7273ms 7.3665ms 135.7505 Ops/s 234.2589 Ops/s $\textbf{\color{#d91a1a}-42.05\%}$
test_clone 7.3848ms 6.7799ms 147.4949 Ops/s 146.9658 Ops/s $\color{#35bf28}+0.36\%$
test_squeeze 0.1135ms 9.7661μs 102.3953 KOps/s 105.5985 KOps/s $\color{#d91a1a}-3.03\%$
test_unsqueeze 0.1813ms 72.0271μs 13.8837 KOps/s 13.7451 KOps/s $\color{#35bf28}+1.01\%$
test_split 0.4007ms 0.1699ms 5.8867 KOps/s 6.1737 KOps/s $\color{#d91a1a}-4.65\%$
test_permute 0.3147ms 0.1776ms 5.6307 KOps/s 5.5457 KOps/s $\color{#35bf28}+1.53\%$
test_stack 54.7683ms 54.3620ms 18.3952 Ops/s 18.4875 Ops/s $\color{#d91a1a}-0.50\%$
test_cat 52.6036ms 51.8209ms 19.2972 Ops/s 19.1144 Ops/s $\color{#35bf28}+0.96\%$

@vmoens vmoens merged commit 51e8018 into gh/vmoens/35/base Nov 6, 2024
46 of 55 checks passed
vmoens added a commit that referenced this pull request Nov 6, 2024
ghstack-source-id: 7256b4c95b239bf9e6467c0ea687abe2c9179922
Pull Request resolved: #1076
@vmoens vmoens deleted the gh/vmoens/35/head branch November 6, 2024 17:36
vmoens added a commit that referenced this pull request Nov 14, 2024
ghstack-source-id: 7256b4c95b239bf9e6467c0ea687abe2c9179922
Pull Request resolved: #1076

(cherry picked from commit eaba711)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants