[BugFix] select_out_keys for Prob sequential #1103

Merged Nov 23, 2024 (7 commits)

vmoens (Contributor) commented Nov 22, 2024

[ghstack-poisoned]
vmoens added a commit that referenced this pull request Nov 22, 2024
ghstack-source-id: fc4868eab4153c5a2df9cbda2d7ec1c08a7ed377
Pull Request resolved: #1103
facebook-github-bot added the CLA Signed label Nov 22, 2024

github-actions bot commented Nov 22, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 217. Improved: $\large\color{#35bf28}5$. Worsened: $\large\color{#d91a1a}34$.

Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 32.3110μs 17.3479μs 57.6440 KOps/s 57.1023 KOps/s $\color{#35bf28}+0.95\%$
test_plain_set_stack_nested 45.8160μs 18.0353μs 55.4468 KOps/s 56.3722 KOps/s $\color{#d91a1a}-1.64\%$
test_plain_set_nested_inplace 60.8740μs 19.5831μs 51.0645 KOps/s 51.8488 KOps/s $\color{#d91a1a}-1.51\%$
test_plain_set_stack_nested_inplace 69.0990μs 19.4454μs 51.4262 KOps/s 51.7727 KOps/s $\color{#d91a1a}-0.67\%$
test_items 22.7330μs 4.1281μs 242.2427 KOps/s 238.0223 KOps/s $\color{#35bf28}+1.77\%$
test_items_nested 0.4292ms 0.3456ms 2.8934 KOps/s 2.9296 KOps/s $\color{#d91a1a}-1.24\%$
test_items_nested_locked 0.6053ms 0.3470ms 2.8820 KOps/s 2.9196 KOps/s $\color{#d91a1a}-1.29\%$
test_items_nested_leaf 0.1217ms 71.5122μs 13.9836 KOps/s 14.1453 KOps/s $\color{#d91a1a}-1.14\%$
test_items_stack_nested 0.4038ms 0.3495ms 2.8614 KOps/s 2.8975 KOps/s $\color{#d91a1a}-1.24\%$
test_items_stack_nested_leaf 0.1448ms 74.1514μs 13.4859 KOps/s 13.5657 KOps/s $\color{#d91a1a}-0.59\%$
test_items_stack_nested_locked 0.5582ms 0.3474ms 2.8785 KOps/s 2.8690 KOps/s $\color{#35bf28}+0.33\%$
test_keys 33.2220μs 3.4886μs 286.6491 KOps/s 285.2909 KOps/s $\color{#35bf28}+0.48\%$
test_keys_nested 0.2603ms 0.1376ms 7.2678 KOps/s 7.4014 KOps/s $\color{#d91a1a}-1.81\%$
test_keys_nested_locked 1.9458ms 0.1415ms 7.0688 KOps/s 7.0585 KOps/s $\color{#35bf28}+0.15\%$
test_keys_nested_leaf 0.2392ms 0.1167ms 8.5667 KOps/s 8.5216 KOps/s $\color{#35bf28}+0.53\%$
test_keys_stack_nested 0.2240ms 0.1369ms 7.3035 KOps/s 7.3084 KOps/s $\color{#d91a1a}-0.07\%$
test_keys_stack_nested_leaf 0.1714ms 0.1171ms 8.5408 KOps/s 8.5010 KOps/s $\color{#35bf28}+0.47\%$
test_keys_stack_nested_locked 0.2761ms 0.1418ms 7.0524 KOps/s 7.0321 KOps/s $\color{#35bf28}+0.29\%$
test_values 6.9110μs 1.0307μs 970.2143 KOps/s 953.6190 KOps/s $\color{#35bf28}+1.74\%$
test_values_nested 0.1079ms 54.7850μs 18.2532 KOps/s 18.3544 KOps/s $\color{#d91a1a}-0.55\%$
test_values_nested_locked 0.1177ms 55.3582μs 18.0642 KOps/s 18.2408 KOps/s $\color{#d91a1a}-0.97\%$
test_values_nested_leaf 0.1534ms 59.3564μs 16.8474 KOps/s 16.6743 KOps/s $\color{#35bf28}+1.04\%$
test_values_stack_nested 0.1037ms 56.0683μs 17.8354 KOps/s 18.0357 KOps/s $\color{#d91a1a}-1.11\%$
test_values_stack_nested_leaf 0.1212ms 60.7833μs 16.4519 KOps/s 16.5090 KOps/s $\color{#d91a1a}-0.35\%$
test_values_stack_nested_locked 0.1151ms 56.3557μs 17.7444 KOps/s 17.9323 KOps/s $\color{#d91a1a}-1.05\%$
test_membership 4.8863μs 0.7664μs 1.3048 MOps/s 1.4025 MOps/s $\textbf{\color{#d91a1a}-6.97\%}$
test_membership_nested 29.1650μs 2.7567μs 362.7554 KOps/s 367.9181 KOps/s $\color{#d91a1a}-1.40\%$
test_membership_nested_leaf 29.8760μs 2.8225μs 354.2998 KOps/s 366.1705 KOps/s $\color{#d91a1a}-3.24\%$
test_membership_stacked_nested 30.5570μs 2.7352μs 365.6096 KOps/s 373.6079 KOps/s $\color{#d91a1a}-2.14\%$
test_membership_stacked_nested_leaf 49.6430μs 2.7703μs 360.9754 KOps/s 370.2828 KOps/s $\color{#d91a1a}-2.51\%$
test_membership_nested_last 31.3590μs 4.0374μs 247.6821 KOps/s 248.2188 KOps/s $\color{#d91a1a}-0.22\%$
test_membership_nested_leaf_last 32.8410μs 4.0932μs 244.3100 KOps/s 245.2776 KOps/s $\color{#d91a1a}-0.39\%$
test_membership_stacked_nested_last 33.7730μs 4.0750μs 245.3988 KOps/s 252.2685 KOps/s $\color{#d91a1a}-2.72\%$
test_membership_stacked_nested_leaf_last 46.3570μs 4.1075μs 243.4576 KOps/s 244.6066 KOps/s $\color{#d91a1a}-0.47\%$
test_nested_getleaf 39.7750μs 10.9783μs 91.0887 KOps/s 92.8976 KOps/s $\color{#d91a1a}-1.95\%$
test_nested_get 40.3760μs 10.2269μs 97.7818 KOps/s 98.6833 KOps/s $\color{#d91a1a}-0.91\%$
test_stacked_getleaf 41.7390μs 10.8252μs 92.3770 KOps/s 94.5397 KOps/s $\color{#d91a1a}-2.29\%$
test_stacked_get 49.1220μs 10.2685μs 97.3855 KOps/s 98.6733 KOps/s $\color{#d91a1a}-1.31\%$
test_nested_getitemleaf 38.0710μs 11.2391μs 88.9748 KOps/s 90.6488 KOps/s $\color{#d91a1a}-1.85\%$
test_nested_getitem 57.3280μs 10.4744μs 95.4708 KOps/s 98.5424 KOps/s $\color{#d91a1a}-3.12\%$
test_stacked_getitemleaf 71.5940μs 10.8999μs 91.7436 KOps/s 90.2474 KOps/s $\color{#35bf28}+1.66\%$
test_stacked_getitem 37.8410μs 10.3358μs 96.7509 KOps/s 97.0833 KOps/s $\color{#d91a1a}-0.34\%$
test_lock_nested 2.8479ms 0.4558ms 2.1938 KOps/s 2.2651 KOps/s $\color{#d91a1a}-3.15\%$
test_lock_stack_nested 0.6901ms 0.4173ms 2.3964 KOps/s 2.4372 KOps/s $\color{#d91a1a}-1.67\%$
test_unlock_nested 0.8101ms 0.3676ms 2.7201 KOps/s 2.6819 KOps/s $\color{#35bf28}+1.42\%$
test_unlock_stack_nested 0.5199ms 0.3364ms 2.9723 KOps/s 3.0088 KOps/s $\color{#d91a1a}-1.21\%$
test_flatten_speed 0.1580ms 91.6496μs 10.9111 KOps/s 11.1302 KOps/s $\color{#d91a1a}-1.97\%$
test_unflatten_speed 0.8746ms 0.4752ms 2.1042 KOps/s 2.0677 KOps/s $\color{#35bf28}+1.77\%$
test_common_ops 4.1297ms 0.7743ms 1.2915 KOps/s 1.3403 KOps/s $\color{#d91a1a}-3.64\%$
test_creation 23.3840μs 2.1026μs 475.5939 KOps/s 477.4025 KOps/s $\color{#d91a1a}-0.38\%$
test_creation_empty 33.3530μs 10.3727μs 96.4067 KOps/s 105.2997 KOps/s $\textbf{\color{#d91a1a}-8.45\%}$
test_creation_nested_1 1.2526ms 13.2475μs 75.4860 KOps/s 81.1270 KOps/s $\textbf{\color{#d91a1a}-6.95\%}$
test_creation_nested_2 56.0050μs 17.2080μs 58.1125 KOps/s 60.4695 KOps/s $\color{#d91a1a}-3.90\%$
test_clone 0.1397ms 13.0528μs 76.6117 KOps/s 79.1631 KOps/s $\color{#d91a1a}-3.22\%$
test_getitem[int] 0.7819ms 12.8968μs 77.5386 KOps/s 81.6404 KOps/s $\textbf{\color{#d91a1a}-5.02\%}$
test_getitem[slice_int] 0.1462ms 24.6929μs 40.4974 KOps/s 44.1075 KOps/s $\textbf{\color{#d91a1a}-8.18\%}$
test_getitem[range] 0.2480ms 50.5019μs 19.8012 KOps/s 20.7851 KOps/s $\color{#d91a1a}-4.73\%$
test_getitem[tuple] 0.1269ms 20.5027μs 48.7742 KOps/s 51.0078 KOps/s $\color{#d91a1a}-4.38\%$
test_getitem[list] 0.2248ms 45.0789μs 22.1833 KOps/s 23.0812 KOps/s $\color{#d91a1a}-3.89\%$
test_setitem_dim[int] 52.9100μs 27.2346μs 36.7181 KOps/s 40.9042 KOps/s $\textbf{\color{#d91a1a}-10.23\%}$
test_setitem_dim[slice_int] 0.1164ms 55.7554μs 17.9355 KOps/s 19.8156 KOps/s $\textbf{\color{#d91a1a}-9.49\%}$
test_setitem_dim[range] 0.1212ms 74.5165μs 13.4198 KOps/s 13.4798 KOps/s $\color{#d91a1a}-0.44\%$
test_setitem_dim[tuple] 0.1025ms 42.4246μs 23.5712 KOps/s 25.2379 KOps/s $\textbf{\color{#d91a1a}-6.60\%}$
test_setitem 76.5330μs 19.6116μs 50.9904 KOps/s 53.0372 KOps/s $\color{#d91a1a}-3.86\%$
test_set 0.1371ms 19.4080μs 51.5253 KOps/s 54.3913 KOps/s $\textbf{\color{#d91a1a}-5.27\%}$
test_set_shared 1.1729ms 0.1688ms 5.9257 KOps/s 6.0086 KOps/s $\color{#d91a1a}-1.38\%$
test_update 0.2212ms 21.7689μs 45.9370 KOps/s 49.0469 KOps/s $\textbf{\color{#d91a1a}-6.34\%}$
test_update_nested 0.1068ms 31.9084μs 31.3397 KOps/s 33.4903 KOps/s $\textbf{\color{#d91a1a}-6.42\%}$
test_update__nested 0.5186ms 32.6552μs 30.6230 KOps/s 31.4231 KOps/s $\color{#d91a1a}-2.55\%$
test_set_nested 97.2920μs 21.4150μs 46.6962 KOps/s 48.7241 KOps/s $\color{#d91a1a}-4.16\%$
test_set_nested_new 93.7650μs 26.0730μs 38.3538 KOps/s 39.9530 KOps/s $\color{#d91a1a}-4.00\%$
test_select 0.1474ms 42.0244μs 23.7957 KOps/s 24.8398 KOps/s $\color{#d91a1a}-4.20\%$
test_select_nested 0.1146ms 59.6663μs 16.7599 KOps/s 17.2092 KOps/s $\color{#d91a1a}-2.61\%$
test_exclude_nested 0.1138ms 74.8725μs 13.3560 KOps/s 13.3650 KOps/s $\color{#d91a1a}-0.07\%$
test_empty[True] 0.6548ms 0.3516ms 2.8441 KOps/s 2.8719 KOps/s $\color{#d91a1a}-0.97\%$
test_empty[False] 11.1910μs 1.2510μs 799.3329 KOps/s 820.1727 KOps/s $\color{#d91a1a}-2.54\%$
test_unbind_speed 0.5906ms 0.2640ms 3.7882 KOps/s 3.8413 KOps/s $\color{#d91a1a}-1.38\%$
test_unbind_speed_stack0 0.5000ms 0.2620ms 3.8174 KOps/s 3.8940 KOps/s $\color{#d91a1a}-1.97\%$
test_unbind_speed_stack1 0.1069s 0.7846ms 1.2746 KOps/s 1.4095 KOps/s $\textbf{\color{#d91a1a}-9.57\%}$
test_split 0.1060s 1.7464ms 572.6140 Ops/s 584.1621 Ops/s $\color{#d91a1a}-1.98\%$
test_chunk 0.1101s 1.7628ms 567.2686 Ops/s 585.4421 Ops/s $\color{#d91a1a}-3.10\%$
test_consolidate_njt[False-None] 8.9875ms 8.3498ms 119.7639 Ops/s 123.9488 Ops/s $\color{#d91a1a}-3.38\%$
test_creation[device0] 3.4660ms 95.6332μs 10.4566 KOps/s 10.7706 KOps/s $\color{#d91a1a}-2.92\%$
test_creation_from_tensor 0.3113ms 96.5410μs 10.3583 KOps/s 10.5986 KOps/s $\color{#d91a1a}-2.27\%$
test_add_one[memmap_tensor0] 0.2721ms 5.1090μs 195.7328 KOps/s 202.6682 KOps/s $\color{#d91a1a}-3.42\%$
test_contiguous[memmap_tensor0] 16.6910μs 0.5085μs 1.9667 MOps/s 1.9323 MOps/s $\color{#35bf28}+1.78\%$
test_stack[memmap_tensor0] 45.5450μs 3.4863μs 286.8374 KOps/s 297.2361 KOps/s $\color{#d91a1a}-3.50\%$
test_memmaptd_index 1.0473ms 0.2436ms 4.1056 KOps/s 4.2906 KOps/s $\color{#d91a1a}-4.31\%$
test_memmaptd_index_astensor 0.6648ms 0.3235ms 3.0907 KOps/s 3.1652 KOps/s $\color{#d91a1a}-2.35\%$
test_memmaptd_index_op 1.0070ms 0.6000ms 1.6667 KOps/s 1.7762 KOps/s $\textbf{\color{#d91a1a}-6.16\%}$
test_serialize_model 0.1271s 0.1194s 8.3784 Ops/s 7.3560 Ops/s $\textbf{\color{#35bf28}+13.90\%}$
test_serialize_model_pickle 0.4434s 0.3904s 2.5613 Ops/s 2.4870 Ops/s $\color{#35bf28}+2.99\%$
test_serialize_weights 0.1214s 0.1143s 8.7498 Ops/s 8.6854 Ops/s $\color{#35bf28}+0.74\%$
test_serialize_weights_returnearly 0.1682s 0.1592s 6.2818 Ops/s 6.2167 Ops/s $\color{#35bf28}+1.05\%$
test_serialize_weights_pickle 0.6402s 0.4902s 2.0399 Ops/s 2.5706 Ops/s $\textbf{\color{#d91a1a}-20.65\%}$
test_serialize_weights_filesystem 0.1560s 0.1434s 6.9740 Ops/s 6.8921 Ops/s $\color{#35bf28}+1.19\%$
test_serialize_model_filesystem 0.1690s 0.1469s 6.8073 Ops/s 6.6175 Ops/s $\color{#35bf28}+2.87\%$
test_reshape_pytree 72.6160μs 27.0898μs 36.9142 KOps/s 37.8091 KOps/s $\color{#d91a1a}-2.37\%$
test_reshape_td 89.2880μs 33.8179μs 29.5701 KOps/s 32.1012 KOps/s $\textbf{\color{#d91a1a}-7.88\%}$
test_view_pytree 72.8860μs 26.8187μs 37.2873 KOps/s 37.8218 KOps/s $\color{#d91a1a}-1.41\%$
test_view_td 87.0240μs 38.0932μs 26.2514 KOps/s 26.8866 KOps/s $\color{#d91a1a}-2.36\%$
test_unbind_pytree 66.2440μs 30.2798μs 33.0253 KOps/s 34.4440 KOps/s $\color{#d91a1a}-4.12\%$
test_unbind_td 0.3398ms 39.2204μs 25.4969 KOps/s 26.2799 KOps/s $\color{#d91a1a}-2.98\%$
test_split_pytree 80.0000μs 29.6696μs 33.7045 KOps/s 34.0655 KOps/s $\color{#d91a1a}-1.06\%$
test_split_td 0.1999ms 44.7705μs 22.3361 KOps/s 23.5823 KOps/s $\textbf{\color{#d91a1a}-5.28\%}$
test_add_pytree 85.6410μs 36.4245μs 27.4541 KOps/s 27.9030 KOps/s $\color{#d91a1a}-1.61\%$
test_add_td 0.1749ms 54.4902μs 18.3519 KOps/s 18.7403 KOps/s $\color{#d91a1a}-2.07\%$
test_compile_add_one_nested[tensordict-compile] 0.1477ms 62.7537μs 15.9353 KOps/s 16.2917 KOps/s $\color{#d91a1a}-2.19\%$
test_compile_add_one_nested[tensordict-eager] 0.4061ms 0.1643ms 6.0869 KOps/s 6.1697 KOps/s $\color{#d91a1a}-1.34\%$
test_compile_add_one_nested[pytree-compile] 0.1100ms 46.1818μs 21.6535 KOps/s 22.0789 KOps/s $\color{#d91a1a}-1.93\%$
test_compile_add_one_nested[pytree-eager] 0.2624ms 0.1206ms 8.2913 KOps/s 7.9767 KOps/s $\color{#35bf28}+3.94\%$
test_compile_copy_nested[tensordict-compile] 65.2920μs 25.7893μs 38.7758 KOps/s 39.0551 KOps/s $\color{#d91a1a}-0.72\%$
test_compile_copy_nested[tensordict-eager] 0.1277ms 53.2593μs 18.7761 KOps/s 18.6206 KOps/s $\color{#35bf28}+0.83\%$
test_compile_copy_nested[pytree-compile] 0.1708ms 80.9688μs 12.3504 KOps/s 12.6003 KOps/s $\color{#d91a1a}-1.98\%$
test_compile_copy_nested[pytree-eager] 0.1332ms 68.7428μs 14.5470 KOps/s 14.8719 KOps/s $\color{#d91a1a}-2.19\%$
test_compile_add_one_flat[tensordict-compile] 0.1979ms 0.1056ms 9.4694 KOps/s 9.5712 KOps/s $\color{#d91a1a}-1.06\%$
test_compile_add_one_flat[tensordict-eager] 0.3470ms 0.2034ms 4.9171 KOps/s 4.9523 KOps/s $\color{#d91a1a}-0.71\%$
test_compile_add_one_flat[tensorclass-compile] 0.1106ms 45.8949μs 21.7889 KOps/s 22.2308 KOps/s $\color{#d91a1a}-1.99\%$
test_compile_add_one_flat[tensorclass-eager] 0.4848ms 63.2218μs 15.8173 KOps/s 16.2907 KOps/s $\color{#d91a1a}-2.91\%$
test_compile_add_one_flat[pytree-compile] 0.1842ms 0.1041ms 9.6075 KOps/s 9.7473 KOps/s $\color{#d91a1a}-1.43\%$
test_compile_add_one_flat[pytree-eager] 0.3968ms 0.2038ms 4.9072 KOps/s 5.0046 KOps/s $\color{#d91a1a}-1.95\%$
test_compile_add_self_flat[tensordict-eager] 0.3674ms 0.2147ms 4.6584 KOps/s 4.6894 KOps/s $\color{#d91a1a}-0.66\%$
test_compile_add_self_flat[tensordict-compile] 0.2389ms 0.1112ms 8.9901 KOps/s 9.6015 KOps/s $\textbf{\color{#d91a1a}-6.37\%}$
test_compile_add_self_flat[tensorclass-eager] 0.3053ms 55.7501μs 17.9372 KOps/s 18.9056 KOps/s $\textbf{\color{#d91a1a}-5.12\%}$
test_compile_add_self_flat[tensorclass-compile] 0.1562ms 46.9720μs 21.2893 KOps/s 21.8918 KOps/s $\color{#d91a1a}-2.75\%$
test_compile_add_self_flat[pytree-eager] 1.3103ms 0.1617ms 6.1856 KOps/s 6.3188 KOps/s $\color{#d91a1a}-2.11\%$
test_compile_add_self_flat[pytree-compile] 0.1968ms 0.1037ms 9.6417 KOps/s 9.8434 KOps/s $\color{#d91a1a}-2.05\%$
test_compile_copy_flat[tensordict-compile] 60.2530μs 22.2509μs 44.9420 KOps/s 46.9586 KOps/s $\color{#d91a1a}-4.29\%$
test_compile_copy_flat[tensordict-eager] 0.1369ms 59.4219μs 16.8288 KOps/s 17.0519 KOps/s $\color{#d91a1a}-1.31\%$
test_compile_copy_flat[pytree-compile] 0.2130ms 81.9176μs 12.2074 KOps/s 12.4947 KOps/s $\color{#d91a1a}-2.30\%$
test_compile_copy_flat[pytree-eager] 0.1053ms 69.3795μs 14.4135 KOps/s 14.8563 KOps/s $\color{#d91a1a}-2.98\%$
test_compile_assign_and_add[tensordict-compile] 0.4556ms 0.2088ms 4.7884 KOps/s 4.8225 KOps/s $\color{#d91a1a}-0.71\%$
test_compile_assign_and_add[tensordict-eager] 2.3028ms 1.3057ms 765.8573 Ops/s 788.1997 Ops/s $\color{#d91a1a}-2.83\%$
test_compile_assign_and_add[pytree-compile] 0.4359ms 0.2074ms 4.8224 KOps/s 4.9781 KOps/s $\color{#d91a1a}-3.13\%$
test_compile_assign_and_add[pytree-eager] 1.3222ms 0.7891ms 1.2673 KOps/s 1.2996 KOps/s $\color{#d91a1a}-2.48\%$
test_compile_assign_and_add_stack[compile] 0.8182ms 0.4644ms 2.1532 KOps/s 2.2328 KOps/s $\color{#d91a1a}-3.57\%$
test_compile_assign_and_add_stack[eager] 4.0396ms 2.6825ms 372.7835 Ops/s 395.7572 Ops/s $\textbf{\color{#d91a1a}-5.81\%}$
test_compile_indexing[tensor-tensordict-compile] 95.6390μs 36.2389μs 27.5946 KOps/s 27.5010 KOps/s $\color{#35bf28}+0.34\%$
test_compile_indexing[tensor-tensordict-eager] 0.4674ms 33.9015μs 29.4972 KOps/s 30.0841 KOps/s $\color{#d91a1a}-1.95\%$
test_compile_indexing[tensor-tensorclass-compile] 85.8970μs 29.8345μs 33.5182 KOps/s 34.9232 KOps/s $\color{#d91a1a}-4.02\%$
test_compile_indexing[tensor-tensorclass-eager] 70.6570μs 23.7773μs 42.0570 KOps/s 42.2616 KOps/s $\color{#d91a1a}-0.48\%$
test_compile_indexing[tensor-pytree-compile] 89.5890μs 30.8022μs 32.4653 KOps/s 33.5459 KOps/s $\color{#d91a1a}-3.22\%$
test_compile_indexing[tensor-pytree-eager] 83.8480μs 24.0165μs 41.6380 KOps/s 43.4372 KOps/s $\color{#d91a1a}-4.14\%$
test_compile_indexing[slice-tensordict-compile] 0.1051ms 52.5357μs 19.0347 KOps/s 19.9559 KOps/s $\color{#d91a1a}-4.62\%$
test_compile_indexing[slice-tensordict-eager] 0.5777ms 19.8656μs 50.3382 KOps/s 52.4258 KOps/s $\color{#d91a1a}-3.98\%$
test_compile_indexing[slice-tensorclass-compile] 0.1107ms 45.1225μs 22.1619 KOps/s 23.2562 KOps/s $\color{#d91a1a}-4.71\%$
test_compile_indexing[slice-tensorclass-eager] 67.2560μs 19.2437μs 51.9650 KOps/s 54.2391 KOps/s $\color{#d91a1a}-4.19\%$
test_compile_indexing[slice-pytree-compile] 0.1123ms 45.8314μs 21.8191 KOps/s 22.6755 KOps/s $\color{#d91a1a}-3.78\%$
test_compile_indexing[slice-pytree-eager] 82.4590μs 18.8207μs 53.1330 KOps/s 54.2135 KOps/s $\color{#d91a1a}-1.99\%$
test_compile_indexing[int-tensordict-compile] 0.1306ms 53.7903μs 18.5907 KOps/s 19.1207 KOps/s $\color{#d91a1a}-2.77\%$
test_compile_indexing[int-tensordict-eager] 0.9403ms 19.7245μs 50.6984 KOps/s 52.1660 KOps/s $\color{#d91a1a}-2.81\%$
test_compile_indexing[int-tensorclass-compile] 0.1203ms 46.2151μs 21.6380 KOps/s 22.8509 KOps/s $\textbf{\color{#d91a1a}-5.31\%}$
test_compile_indexing[int-tensorclass-eager] 64.0010μs 18.9126μs 52.8749 KOps/s 53.6867 KOps/s $\color{#d91a1a}-1.51\%$
test_compile_indexing[int-pytree-compile] 0.1315ms 46.1258μs 21.6798 KOps/s 22.7404 KOps/s $\color{#d91a1a}-4.66\%$
test_compile_indexing[int-pytree-eager] 70.1320μs 19.0443μs 52.5093 KOps/s 54.5637 KOps/s $\color{#d91a1a}-3.77\%$
test_mod_add[eager] 73.4880μs 26.5046μs 37.7293 KOps/s 38.6442 KOps/s $\color{#d91a1a}-2.37\%$
test_mod_add[compile] 0.1003ms 47.1180μs 21.2233 KOps/s 22.5871 KOps/s $\textbf{\color{#d91a1a}-6.04\%}$
test_mod_add[compile-overhead] 0.1460ms 46.9661μs 21.2920 KOps/s 22.6658 KOps/s $\textbf{\color{#d91a1a}-6.06\%}$
test_mod_wrap[eager] 0.4212ms 0.2223ms 4.4982 KOps/s 4.8116 KOps/s $\textbf{\color{#d91a1a}-6.51\%}$
test_mod_wrap[compile] 1.8326ms 0.2110ms 4.7389 KOps/s 4.8109 KOps/s $\color{#d91a1a}-1.50\%$
test_mod_wrap[compile-overhead] 1.8385ms 0.2075ms 4.8202 KOps/s 4.9490 KOps/s $\color{#d91a1a}-2.60\%$
test_mod_wrap_and_backward[eager] 13.8486ms 11.8630ms 84.2957 Ops/s 93.4573 Ops/s $\textbf{\color{#d91a1a}-9.80\%}$
test_mod_wrap_and_backward[compile] 19.2123ms 12.7674ms 78.3244 Ops/s 86.4500 Ops/s $\textbf{\color{#d91a1a}-9.40\%}$
test_mod_wrap_and_backward[compile-overhead] 16.1457ms 13.5113ms 74.0121 Ops/s 83.7396 Ops/s $\textbf{\color{#d91a1a}-11.62\%}$
test_seq_add[eager] 0.1667ms 94.1373μs 10.6228 KOps/s 11.2343 KOps/s $\textbf{\color{#d91a1a}-5.44\%}$
test_seq_add[compile] 0.1683ms 63.4870μs 15.7513 KOps/s 16.7482 KOps/s $\textbf{\color{#d91a1a}-5.95\%}$
test_seq_add[compile-overhead] 0.1633ms 60.5125μs 16.5255 KOps/s 17.1960 KOps/s $\color{#d91a1a}-3.90\%$
test_seq_wrap[eager] 0.6039ms 0.4004ms 2.4973 KOps/s 2.6055 KOps/s $\color{#d91a1a}-4.15\%$
test_seq_wrap[compile] 0.4194ms 0.2338ms 4.2771 KOps/s 4.3910 KOps/s $\color{#d91a1a}-2.59\%$
test_seq_wrap[compile-overhead] 0.4071ms 0.2330ms 4.2913 KOps/s 4.4039 KOps/s $\color{#d91a1a}-2.56\%$
test_func_call_runtime[False-eager] 0.9416ms 0.5736ms 1.7432 KOps/s 1.8588 KOps/s $\textbf{\color{#d91a1a}-6.22\%}$
test_func_call_runtime[False-compile] 0.6063ms 0.4376ms 2.2849 KOps/s 2.3516 KOps/s $\color{#d91a1a}-2.84\%$
test_func_call_runtime[False-compile-overhead] 0.7163ms 0.4330ms 2.3097 KOps/s 2.3444 KOps/s $\color{#d91a1a}-1.48\%$
test_func_call_runtime[True-eager] 1.3061ms 0.7886ms 1.2680 KOps/s 1.3507 KOps/s $\textbf{\color{#d91a1a}-6.12\%}$
test_func_call_runtime[True-compile] 0.5947ms 0.4711ms 2.1227 KOps/s 2.0936 KOps/s $\color{#35bf28}+1.39\%$
test_func_call_runtime[True-compile-overhead] 0.6321ms 0.4681ms 2.1363 KOps/s 2.1016 KOps/s $\color{#35bf28}+1.65\%$
test_func_call_cm_runtime[False-eager] 0.7288ms 0.5617ms 1.7802 KOps/s 1.8422 KOps/s $\color{#d91a1a}-3.37\%$
test_func_call_cm_runtime[False-compile] 0.5849ms 0.4349ms 2.2992 KOps/s 2.3107 KOps/s $\color{#d91a1a}-0.50\%$
test_func_call_cm_runtime[False-compile-overhead] 0.5593ms 0.4322ms 2.3135 KOps/s 2.3200 KOps/s $\color{#d91a1a}-0.28\%$
test_func_call_cm_runtime[True-eager] 1.1136ms 0.9139ms 1.0942 KOps/s 1.1134 KOps/s $\color{#d91a1a}-1.73\%$
test_func_call_cm_runtime[True-compile] 0.6064ms 0.4976ms 2.0095 KOps/s 1.9992 KOps/s $\color{#35bf28}+0.52\%$
test_func_call_cm_runtime[True-compile-overhead] 0.6023ms 0.4992ms 2.0032 KOps/s 2.0013 KOps/s $\color{#35bf28}+0.09\%$
test_vmap_func_call_cm_runtime[eager] 2.4801ms 1.9317ms 517.6917 Ops/s 525.9307 Ops/s $\color{#d91a1a}-1.57\%$
test_vmap_func_call_cm_runtime[compile] 0.9709ms 0.5326ms 1.8774 KOps/s 1.9029 KOps/s $\color{#d91a1a}-1.34\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.8413ms 0.5238ms 1.9090 KOps/s 1.9055 KOps/s $\color{#35bf28}+0.18\%$
test_distributed 0.2840ms 0.1274ms 7.8520 KOps/s 7.7980 KOps/s $\color{#35bf28}+0.69\%$
test_tdmodule 36.6590μs 18.6679μs 53.5679 KOps/s 55.2453 KOps/s $\color{#d91a1a}-3.04\%$
test_tdmodule_dispatch 68.1080μs 38.4518μs 26.0066 KOps/s 26.6889 KOps/s $\color{#d91a1a}-2.56\%$
test_tdseq 48.9620μs 21.4415μs 46.6386 KOps/s 49.2967 KOps/s $\textbf{\color{#d91a1a}-5.39\%}$
test_tdseq_dispatch 73.2780μs 44.2060μs 22.6214 KOps/s 23.3866 KOps/s $\color{#d91a1a}-3.27\%$
test_instantiation_functorch 2.0094ms 1.6101ms 621.0951 Ops/s 647.4610 Ops/s $\color{#d91a1a}-4.07\%$
test_exec_functorch 0.3780ms 0.1848ms 5.4107 KOps/s 5.5104 KOps/s $\color{#d91a1a}-1.81\%$
test_exec_functional_call 0.2813ms 0.1761ms 5.6773 KOps/s 5.7332 KOps/s $\color{#d91a1a}-0.98\%$
test_exec_td_decorator 0.5201ms 0.2318ms 4.3140 KOps/s 4.3846 KOps/s $\color{#d91a1a}-1.61\%$
test_vmap_mlp_speed_decorator[True-True] 0.8412ms 0.6517ms 1.5345 KOps/s 1.5963 KOps/s $\color{#d91a1a}-3.87\%$
test_vmap_mlp_speed_decorator[True-False] 0.9370ms 0.6526ms 1.5323 KOps/s 1.5915 KOps/s $\color{#d91a1a}-3.71\%$
test_vmap_mlp_speed_decorator[False-True] 0.7494ms 0.5322ms 1.8791 KOps/s 1.9389 KOps/s $\color{#d91a1a}-3.08\%$
test_vmap_mlp_speed_decorator[False-False] 0.8495ms 0.5342ms 1.8721 KOps/s 1.9251 KOps/s $\color{#d91a1a}-2.75\%$
test_to_module_speed[True] 1.7420ms 1.3255ms 754.4572 Ops/s 785.5193 Ops/s $\color{#d91a1a}-3.95\%$
test_to_module_speed[False] 1.5354ms 1.2760ms 783.7237 Ops/s 796.9698 Ops/s $\color{#d91a1a}-1.66\%$
test_tc_init 79.5000μs 43.5695μs 22.9519 KOps/s 22.4661 KOps/s $\color{#35bf28}+2.16\%$
test_tc_init_nested 0.1623ms 88.2609μs 11.3300 KOps/s 10.9646 KOps/s $\color{#35bf28}+3.33\%$
test_tc_first_layer_tensor 20.0770μs 1.5065μs 663.7948 KOps/s 672.3221 KOps/s $\color{#d91a1a}-1.27\%$
test_tc_first_layer_nontensor 51.4800μs 4.8307μs 207.0084 KOps/s 210.1796 KOps/s $\color{#d91a1a}-1.51\%$
test_tc_second_layer_tensor 28.4840μs 2.7911μs 358.2876 KOps/s 364.4651 KOps/s $\color{#d91a1a}-1.69\%$
test_tc_second_layer_nontensor 43.9830μs 6.1958μs 161.3991 KOps/s 164.8824 KOps/s $\color{#d91a1a}-2.11\%$
test_unbind 0.2281s 13.5853ms 73.6091 Ops/s 83.9181 Ops/s $\textbf{\color{#d91a1a}-12.28\%}$
test_full_like 8.6645ms 7.5581ms 132.3076 Ops/s 80.5155 Ops/s $\textbf{\color{#35bf28}+64.33\%}$
test_zeros_like 3.7804ms 2.9350ms 340.7098 Ops/s 136.7721 Ops/s $\textbf{\color{#35bf28}+149.11\%}$
test_ones_like 4.3630ms 3.3143ms 301.7216 Ops/s 132.6607 Ops/s $\textbf{\color{#35bf28}+127.44\%}$
test_clone 6.7077ms 5.1552ms 193.9782 Ops/s 108.1048 Ops/s $\textbf{\color{#35bf28}+79.44\%}$
test_squeeze 77.4750μs 12.0818μs 82.7688 KOps/s 85.9653 KOps/s $\color{#d91a1a}-3.72\%$
test_unsqueeze 0.1668ms 92.1394μs 10.8531 KOps/s 11.6731 KOps/s $\textbf{\color{#d91a1a}-7.02\%}$
test_split 0.3390ms 0.1942ms 5.1492 KOps/s 5.4477 KOps/s $\textbf{\color{#d91a1a}-5.48\%}$
test_permute 0.3492ms 0.2203ms 4.5396 KOps/s 4.6289 KOps/s $\color{#d91a1a}-1.93\%$
test_stack 29.5379ms 25.1070ms 39.8296 Ops/s 40.1225 Ops/s $\color{#d91a1a}-0.73\%$
test_cat 25.8227ms 24.9296ms 40.1130 Ops/s 39.1932 Ops/s $\color{#35bf28}+2.35\%$
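
For anyone checking the table by hand: the Change column appears to be the relative difference between Ops and Ops on Repo HEAD. The bold rows look like those beyond roughly ±5%, and counting them in the CPU table gives the 5 improved and 34 worsened reported in the summary. The sketch below reproduces that arithmetic on three rows copied from the table above. The function names and the 5% threshold are assumptions inferred from the numbers, not taken from the benchmark bot's source.

```python
def relative_change(ops: float, ops_head: float) -> float:
    """Percentage change in throughput relative to the repo HEAD."""
    return (ops - ops_head) / ops_head * 100.0


def classify(change_pct: float, threshold: float = 5.0) -> str:
    """Label a change; the 5% threshold is an assumption inferred from the bold rows."""
    if change_pct > threshold:
        return "improved"
    if change_pct < -threshold:
        return "worsened"
    return "within noise"


# (name, Ops, Ops on Repo HEAD), copied from the CPU table.
# Both values in a row share the same unit, so the unit cancels out.
rows = [
    ("test_plain_set_nested", 57.6440, 57.1023),  # KOps/s
    ("test_membership", 1.3048, 1.4025),          # MOps/s
    ("test_zeros_like", 340.7098, 136.7721),      # Ops/s
]

for name, ops, ops_head in rows:
    change = relative_change(ops, ops_head)
    print(f"{name}: {change:+.2f}% ({classify(change)})")
```

Under these assumptions the three rows come out at +0.95%, -6.97% and +149.11%, matching the values the bot reported.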

github-actions bot commented Nov 22, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 229. Improved: $\large\color{#35bf28}10$. Worsened: $\large\color{#d91a1a}13$.

Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 25.2600μs 10.4431μs 95.7566 KOps/s 95.4027 KOps/s $\color{#35bf28}+0.37\%$
test_plain_set_stack_nested 31.9910μs 10.4606μs 95.5964 KOps/s 95.1474 KOps/s $\color{#35bf28}+0.47\%$
test_plain_set_nested_inplace 92.8220μs 11.3136μs 88.3890 KOps/s 88.5546 KOps/s $\color{#d91a1a}-0.19\%$
test_plain_set_stack_nested_inplace 39.4400μs 11.2414μs 88.9567 KOps/s 88.8756 KOps/s $\color{#35bf28}+0.09\%$
test_items 48.6310μs 2.8915μs 345.8467 KOps/s 344.3042 KOps/s $\color{#35bf28}+0.45\%$
test_items_nested 0.3743ms 0.3186ms 3.1390 KOps/s 3.1827 KOps/s $\color{#d91a1a}-1.37\%$
test_items_nested_locked 0.3711ms 0.3202ms 3.1231 KOps/s 3.1286 KOps/s $\color{#d91a1a}-0.18\%$
test_items_nested_leaf 81.3210μs 58.4374μs 17.1123 KOps/s 17.1276 KOps/s $\color{#d91a1a}-0.09\%$
test_items_stack_nested 0.3753ms 0.3192ms 3.1329 KOps/s 3.1397 KOps/s $\color{#d91a1a}-0.22\%$
test_items_stack_nested_leaf 0.1035ms 59.7207μs 16.7446 KOps/s 16.9064 KOps/s $\color{#d91a1a}-0.96\%$
test_items_stack_nested_locked 0.3722ms 0.3226ms 3.1000 KOps/s 3.1331 KOps/s $\color{#d91a1a}-1.05\%$
test_keys 29.3900μs 3.4665μs 288.4750 KOps/s 289.5451 KOps/s $\color{#d91a1a}-0.37\%$
test_keys_nested 0.1032ms 69.5825μs 14.3714 KOps/s 14.2364 KOps/s $\color{#35bf28}+0.95\%$
test_keys_nested_locked 0.6693ms 75.2019μs 13.2975 KOps/s 13.1545 KOps/s $\color{#35bf28}+1.09\%$
test_keys_nested_leaf 0.2561ms 60.8786μs 16.4261 KOps/s 16.3428 KOps/s $\color{#35bf28}+0.51\%$
test_keys_stack_nested 0.2766ms 70.3166μs 14.2214 KOps/s 14.1315 KOps/s $\color{#35bf28}+0.64\%$
test_keys_stack_nested_leaf 0.2546ms 61.2323μs 16.3312 KOps/s 16.0695 KOps/s $\color{#35bf28}+1.63\%$
test_keys_stack_nested_locked 0.2626ms 75.2262μs 13.2932 KOps/s 13.1865 KOps/s $\color{#35bf28}+0.81\%$
test_values 32.3187μs 0.8474μs 1.1801 MOps/s 1.1872 MOps/s $\color{#d91a1a}-0.60\%$
test_values_nested 60.0910μs 31.2241μs 32.0266 KOps/s 32.1851 KOps/s $\color{#d91a1a}-0.49\%$
test_values_nested_locked 0.2124ms 32.7604μs 30.5247 KOps/s 30.7875 KOps/s $\color{#d91a1a}-0.85\%$
test_values_nested_leaf 63.4210μs 33.5935μs 29.7676 KOps/s 29.9320 KOps/s $\color{#d91a1a}-0.55\%$
test_values_stack_nested 62.7510μs 31.4987μs 31.7473 KOps/s 31.7761 KOps/s $\color{#d91a1a}-0.09\%$
test_values_stack_nested_leaf 63.1610μs 33.9651μs 29.4420 KOps/s 29.3530 KOps/s $\color{#35bf28}+0.30\%$
test_values_stack_nested_locked 0.1015ms 33.1762μs 30.1421 KOps/s 30.3358 KOps/s $\color{#d91a1a}-0.64\%$
test_membership 2.5095μs 0.5095μs 1.9627 MOps/s 1.9660 MOps/s $\color{#d91a1a}-0.17\%$
test_membership_nested 13.9705μs 1.8410μs 543.1832 KOps/s 531.9150 KOps/s $\color{#35bf28}+2.12\%$
test_membership_nested_leaf 12.9267μs 1.8078μs 553.1564 KOps/s 529.8595 KOps/s $\color{#35bf28}+4.40\%$
test_membership_stacked_nested 44.3100μs 1.9273μs 518.8661 KOps/s 521.5446 KOps/s $\color{#d91a1a}-0.51\%$
test_membership_stacked_nested_leaf 31.3510μs 1.8960μs 527.4312 KOps/s 521.1797 KOps/s $\color{#35bf28}+1.20\%$
test_membership_nested_last 26.5400μs 2.8595μs 349.7147 KOps/s 358.3806 KOps/s $\color{#d91a1a}-2.42\%$
test_membership_nested_leaf_last 32.5610μs 2.8780μs 347.4673 KOps/s 355.9653 KOps/s $\color{#d91a1a}-2.39\%$
test_membership_stacked_nested_last 47.4610μs 4.5114μs 221.6600 KOps/s 265.6981 KOps/s $\textbf{\color{#d91a1a}-16.57\%}$
test_membership_stacked_nested_leaf_last 19.1600μs 4.5099μs 221.7347 KOps/s 268.6838 KOps/s $\textbf{\color{#d91a1a}-17.47\%}$
test_nested_getleaf 34.3600μs 5.9583μs 167.8321 KOps/s 167.7987 KOps/s $\color{#35bf28}+0.02\%$
test_nested_get 35.9700μs 5.7004μs 175.4250 KOps/s 175.2508 KOps/s $\color{#35bf28}+0.10\%$
test_stacked_getleaf 42.2410μs 5.9448μs 168.2147 KOps/s 167.0028 KOps/s $\color{#35bf28}+0.73\%$
test_stacked_get 32.5800μs 5.6998μs 175.4459 KOps/s 176.4689 KOps/s $\color{#d91a1a}-0.58\%$
test_nested_getitemleaf 27.3700μs 6.0732μs 164.6587 KOps/s 165.5792 KOps/s $\color{#d91a1a}-0.56\%$
test_nested_getitem 32.5310μs 5.7465μs 174.0184 KOps/s 174.0991 KOps/s $\color{#d91a1a}-0.05\%$
test_stacked_getitemleaf 62.7110μs 6.0301μs 165.8359 KOps/s 164.7669 KOps/s $\color{#35bf28}+0.65\%$
test_stacked_getitem 40.5900μs 5.7261μs 174.6384 KOps/s 174.9400 KOps/s $\color{#d91a1a}-0.17\%$
test_lock_nested 9.0931ms 0.3612ms 2.7684 KOps/s 2.7570 KOps/s $\color{#35bf28}+0.41\%$
test_lock_stack_nested 0.4522ms 0.3233ms 3.0932 KOps/s 3.0259 KOps/s $\color{#35bf28}+2.22\%$
test_unlock_nested 0.7128ms 0.2939ms 3.4025 KOps/s 3.3063 KOps/s $\color{#35bf28}+2.91\%$
test_unlock_stack_nested 0.3831ms 0.2617ms 3.8217 KOps/s 3.7034 KOps/s $\color{#35bf28}+3.19\%$
test_flatten_speed 0.1127ms 71.9303μs 13.9023 KOps/s 13.6666 KOps/s $\color{#35bf28}+1.72\%$
test_unflatten_speed 0.4150ms 0.2845ms 3.5153 KOps/s 3.4313 KOps/s $\color{#35bf28}+2.45\%$
test_common_ops 1.6621ms 0.5523ms 1.8106 KOps/s 1.7716 KOps/s $\color{#35bf28}+2.20\%$
test_creation 89.1010μs 1.4179μs 705.2907 KOps/s 705.7674 KOps/s $\color{#d91a1a}-0.07\%$
test_creation_empty 32.8500μs 7.0614μs 141.6156 KOps/s 141.0323 KOps/s $\color{#35bf28}+0.41\%$
test_creation_nested_1 79.6110μs 8.5943μs 116.3559 KOps/s 115.1995 KOps/s $\color{#35bf28}+1.00\%$
test_creation_nested_2 40.3510μs 10.9645μs 91.2037 KOps/s 89.4064 KOps/s $\color{#35bf28}+2.01\%$
test_clone 80.6010μs 9.5258μs 104.9776 KOps/s 101.2994 KOps/s $\color{#35bf28}+3.63\%$
test_getitem[int] 1.3696ms 10.2091μs 97.9519 KOps/s 94.3976 KOps/s $\color{#35bf28}+3.77\%$
test_getitem[slice_int] 0.1073ms 19.1965μs 52.0929 KOps/s 50.0475 KOps/s $\color{#35bf28}+4.09\%$
test_getitem[range] 0.1301ms 34.9024μs 28.6514 KOps/s 28.2398 KOps/s $\color{#35bf28}+1.46\%$
test_getitem[tuple] 0.1089ms 17.3158μs 57.7508 KOps/s 56.1865 KOps/s $\color{#35bf28}+2.78\%$
test_getitem[list] 0.2100ms 31.1754μs 32.0765 KOps/s 31.7344 KOps/s $\color{#35bf28}+1.08\%$
test_setitem_dim[int] 37.3410μs 17.0829μs 58.5380 KOps/s 58.4215 KOps/s $\color{#35bf28}+0.20\%$
test_setitem_dim[slice_int] 55.6710μs 34.4654μs 29.0146 KOps/s 28.7845 KOps/s $\color{#35bf28}+0.80\%$
test_setitem_dim[range] 80.5510μs 49.7460μs 20.1021 KOps/s 19.9912 KOps/s $\color{#35bf28}+0.55\%$
test_setitem_dim[tuple] 51.3810μs 29.7304μs 33.6356 KOps/s 33.4548 KOps/s $\color{#35bf28}+0.54\%$
test_setitem 91.0610μs 13.1734μs 75.9108 KOps/s 72.8154 KOps/s $\color{#35bf28}+4.25\%$
test_set 0.1502ms 12.7596μs 78.3722 KOps/s 75.0484 KOps/s $\color{#35bf28}+4.43\%$
test_set_shared 1.6236ms 0.1422ms 7.0331 KOps/s 6.9458 KOps/s $\color{#35bf28}+1.26\%$
test_update 0.2936ms 15.0968μs 66.2391 KOps/s 64.3253 KOps/s $\color{#35bf28}+2.98\%$
test_update_nested 0.1076ms 19.4317μs 51.4622 KOps/s 49.3626 KOps/s $\color{#35bf28}+4.25\%$
test_update__nested 1.2064ms 22.5622μs 44.3219 KOps/s 43.3114 KOps/s $\color{#35bf28}+2.33\%$
test_set_nested 79.8110μs 13.5521μs 73.7891 KOps/s 69.8318 KOps/s $\textbf{\color{#35bf28}+5.67\%}$
test_set_nested_new 75.4810μs 15.6733μs 63.8028 KOps/s 60.8578 KOps/s $\color{#35bf28}+4.84\%$
test_select 0.1498ms 27.7301μs 36.0619 KOps/s 35.0304 KOps/s $\color{#35bf28}+2.94\%$
test_select_nested 70.2000μs 41.2167μs 24.2620 KOps/s 23.7863 KOps/s $\color{#35bf28}+2.00\%$
test_exclude_nested 0.1053ms 57.9133μs 17.2672 KOps/s 17.1402 KOps/s $\color{#35bf28}+0.74\%$
test_empty[True] 0.3332ms 0.2558ms 3.9098 KOps/s 3.9341 KOps/s $\color{#d91a1a}-0.62\%$
test_empty[False] 3.5360μs 0.7486μs 1.3359 MOps/s 1.3392 MOps/s $\color{#d91a1a}-0.25\%$
test_to 81.3610μs 53.9249μs 18.5443 KOps/s 19.1165 KOps/s $\color{#d91a1a}-2.99\%$
test_to_nonblocking 0.2396ms 44.6055μs 22.4188 KOps/s 22.5474 KOps/s $\color{#d91a1a}-0.57\%$
test_unbind_speed 1.9662ms 0.2237ms 4.4710 KOps/s 4.3636 KOps/s $\color{#35bf28}+2.46\%$
test_unbind_speed_stack0 0.4118ms 0.2255ms 4.4346 KOps/s 4.3002 KOps/s $\color{#35bf28}+3.13\%$
test_unbind_speed_stack1 93.1453ms 0.6266ms 1.5960 KOps/s 1.5671 KOps/s $\color{#35bf28}+1.85\%$
test_split 94.7678ms 1.6524ms 605.1943 Ops/s 581.8528 Ops/s $\color{#35bf28}+4.01\%$
test_chunk 1.5910ms 1.4018ms 713.3447 Ops/s 695.1956 Ops/s $\color{#35bf28}+2.61\%$
test_consolidate[False-None] 97.3763ms 2.7642ms 361.7625 Ops/s 353.8992 Ops/s $\color{#35bf28}+2.22\%$
test_consolidate[default-None] 1.7495ms 1.6093ms 621.3731 Ops/s 600.3787 Ops/s $\color{#35bf28}+3.50\%$
test_consolidate[reduce-overhead-None] 1.8546ms 1.6576ms 603.2891 Ops/s 584.6911 Ops/s $\color{#35bf28}+3.18\%$
test_consolidate_njt[False-None] 6.5325ms 6.3600ms 157.2323 Ops/s 156.9122 Ops/s $\color{#35bf28}+0.20\%$
test_to[False-False-None] 1.8098ms 1.6416ms 609.1770 Ops/s 607.3954 Ops/s $\color{#35bf28}+0.29\%$
test_to[True-False-None] 1.4839ms 1.2354ms 809.4637 Ops/s 771.7661 Ops/s $\color{#35bf28}+4.88\%$
test_to[within-False-None] 4.0901ms 3.9046ms 256.1056 Ops/s 255.7307 Ops/s $\color{#35bf28}+0.15\%$
test_to[True-default-None] 5.4416ms 5.1652ms 193.6039 Ops/s 194.6325 Ops/s $\color{#d91a1a}-0.53\%$
test_to_njt[False-False-None] 7.0803ms 6.8321ms 146.3688 Ops/s 147.1729 Ops/s $\color{#d91a1a}-0.55\%$
test_to_njt[True-False-None] 5.7593ms 5.3809ms 185.8426 Ops/s 185.1195 Ops/s $\color{#35bf28}+0.39\%$
test_to_njt[within-False-None] 12.0254ms 11.8419ms 84.4456 Ops/s 84.3405 Ops/s $\color{#35bf28}+0.12\%$
test_creation[device0] 0.4609ms 77.1473μs 12.9622 KOps/s 13.0128 KOps/s $\color{#d91a1a}-0.39\%$
test_creation_from_tensor 0.5182ms 81.3523μs 12.2922 KOps/s 12.5585 KOps/s $\color{#d91a1a}-2.12\%$
test_add_one[memmap_tensor0] 0.4354ms 5.9874μs 167.0188 KOps/s 159.5478 KOps/s $\color{#35bf28}+4.68\%$
test_contiguous[memmap_tensor0] 3.7206μs 0.3952μs 2.5303 MOps/s 2.5945 MOps/s $\color{#d91a1a}-2.47\%$
test_stack[memmap_tensor0] 40.6210μs 4.1886μs 238.7414 KOps/s 228.2482 KOps/s $\color{#35bf28}+4.60\%$
test_memmaptd_index 1.6425ms 0.2404ms 4.1604 KOps/s 4.1177 KOps/s $\color{#35bf28}+1.04\%$
test_memmaptd_index_astensor 0.5775ms 0.2971ms 3.3658 KOps/s 3.3731 KOps/s $\color{#d91a1a}-0.22\%$
test_memmaptd_index_op 1.1080ms 0.5376ms 1.8603 KOps/s 1.8118 KOps/s $\color{#35bf28}+2.68\%$
test_serialize_model 0.4269s 0.1723s 5.8042 Ops/s 7.6771 Ops/s $\textbf{\color{#d91a1a}-24.40\%}$
test_serialize_model_pickle 1.3713s 1.2208s 0.8192 Ops/s 0.8246 Ops/s $\color{#d91a1a}-0.67\%$
test_serialize_weights 0.1311s 0.1299s 7.6978 Ops/s 7.6969 Ops/s $\color{#35bf28}+0.01\%$
test_serialize_weights_returnearly 0.3177s 52.8056ms 18.9374 Ops/s 13.0855 Ops/s $\textbf{\color{#35bf28}+44.72\%}$
test_serialize_weights_pickle 1.3585s 1.2173s 0.8215 Ops/s 0.8123 Ops/s $\color{#35bf28}+1.14\%$
test_reshape_pytree 61.8410μs 21.6897μs 46.1047 KOps/s 45.4510 KOps/s $\color{#35bf28}+1.44\%$
test_reshape_td 62.5010μs 25.5561μs 39.1296 KOps/s 37.9597 KOps/s $\color{#35bf28}+3.08\%$
test_view_pytree 0.1815ms 21.6581μs 46.1721 KOps/s 46.7514 KOps/s $\color{#d91a1a}-1.24\%$
test_view_td 0.1624ms 27.9226μs 35.8133 KOps/s 33.1264 KOps/s $\textbf{\color{#35bf28}+8.11\%}$
test_unbind_pytree 0.1199ms 26.9105μs 37.1603 KOps/s 36.1457 KOps/s $\color{#35bf28}+2.81\%$
test_unbind_td 0.7678ms 34.3593μs 29.1042 KOps/s 28.3659 KOps/s $\color{#35bf28}+2.60\%$
test_split_pytree 0.1606ms 28.8629μs 34.6465 KOps/s 33.4576 KOps/s $\color{#35bf28}+3.55\%$
test_split_td 1.0248ms 37.1137μs 26.9443 KOps/s 26.0545 KOps/s $\color{#35bf28}+3.41\%$
test_add_pytree 0.1749ms 31.5999μs 31.6457 KOps/s 30.2634 KOps/s $\color{#35bf28}+4.57\%$
test_add_td 0.1485ms 41.8262μs 23.9085 KOps/s 23.2105 KOps/s $\color{#35bf28}+3.01\%$
test_compile_add_one_nested[tensordict-compile] 0.2856ms 0.1207ms 8.2828 KOps/s 8.1732 KOps/s $\color{#35bf28}+1.34\%$
test_compile_add_one_nested[tensordict-eager] 0.3046ms 0.1237ms 8.0845 KOps/s 8.0123 KOps/s $\color{#35bf28}+0.90\%$
test_compile_add_one_nested[pytree-compile] 0.2780ms 95.1959μs 10.5046 KOps/s 10.3356 KOps/s $\color{#35bf28}+1.64\%$
test_compile_add_one_nested[pytree-eager] 1.5619ms 0.1470ms 6.8024 KOps/s 6.7119 KOps/s $\color{#35bf28}+1.35\%$
test_compile_copy_nested[tensordict-compile] 0.2041ms 22.4930μs 44.4582 KOps/s 43.8068 KOps/s $\color{#35bf28}+1.49\%$
test_compile_copy_nested[tensordict-eager] 0.1509ms 26.4346μs 37.8292 KOps/s 37.3404 KOps/s $\color{#35bf28}+1.31\%$
test_compile_copy_nested[pytree-compile] 0.1563ms 63.8323μs 15.6661 KOps/s 15.4685 KOps/s $\color{#35bf28}+1.28\%$
test_compile_copy_nested[pytree-eager] 0.1813ms 48.7852μs 20.4980 KOps/s 20.0453 KOps/s $\color{#35bf28}+2.26\%$
test_compile_add_one_flat[tensordict-compile] 0.2930ms 0.1413ms 7.0790 KOps/s 6.9156 KOps/s $\color{#35bf28}+2.36\%$
test_compile_add_one_flat[tensordict-eager] 0.3566ms 0.2081ms 4.8053 KOps/s 4.8524 KOps/s $\color{#d91a1a}-0.97\%$
test_compile_add_one_flat[tensorclass-compile] 0.2951ms 98.6340μs 10.1385 KOps/s 10.2806 KOps/s $\color{#d91a1a}-1.38\%$
test_compile_add_one_flat[tensorclass-eager] 0.2373ms 53.1053μs 18.8305 KOps/s 19.1002 KOps/s $\color{#d91a1a}-1.41\%$
test_compile_add_one_flat[pytree-compile] 0.2970ms 0.1375ms 7.2715 KOps/s 7.2526 KOps/s $\color{#35bf28}+0.26\%$
test_compile_add_one_flat[pytree-eager] 0.6319ms 0.4660ms 2.1460 KOps/s 2.0694 KOps/s $\color{#35bf28}+3.70\%$
test_compile_add_self_flat[tensordict-eager] 0.3889ms 0.2466ms 4.0549 KOps/s 4.0314 KOps/s $\color{#35bf28}+0.58\%$
test_compile_add_self_flat[tensordict-compile] 0.2894ms 0.1431ms 6.9882 KOps/s 6.8936 KOps/s $\color{#35bf28}+1.37\%$
test_compile_add_self_flat[tensorclass-eager] 0.2231ms 60.7875μs 16.4507 KOps/s 16.1596 KOps/s $\color{#35bf28}+1.80\%$
test_compile_add_self_flat[tensorclass-compile] 0.2493ms 99.0240μs 10.0986 KOps/s 10.1790 KOps/s $\color{#d91a1a}-0.79\%$
test_compile_add_self_flat[pytree-eager] 0.5566ms 0.4032ms 2.4804 KOps/s 2.4603 KOps/s $\color{#35bf28}+0.82\%$
test_compile_add_self_flat[pytree-compile] 0.2799ms 0.1370ms 7.3008 KOps/s 7.3875 KOps/s $\color{#d91a1a}-1.17\%$
test_compile_copy_flat[tensordict-compile] 0.1800ms 19.6943μs 50.7761 KOps/s 53.7948 KOps/s $\textbf{\color{#d91a1a}-5.61\%}$
test_compile_copy_flat[tensordict-eager] 53.2410μs 27.1542μs 36.8267 KOps/s 36.2621 KOps/s $\color{#35bf28}+1.56\%$
test_compile_copy_flat[pytree-compile] 0.1765ms 69.0184μs 14.4889 KOps/s 14.3638 KOps/s $\color{#35bf28}+0.87\%$
test_compile_copy_flat[pytree-eager] 91.3610μs 51.1190μs 19.5622 KOps/s 19.5059 KOps/s $\color{#35bf28}+0.29\%$
test_compile_assign_and_add[tensordict-compile] 1.6013ms 0.3866ms 2.5864 KOps/s 2.1900 KOps/s $\textbf{\color{#35bf28}+18.10\%}$
test_compile_assign_and_add[tensordict-eager] 2.8103ms 2.5596ms 390.6877 Ops/s 395.8013 Ops/s $\color{#d91a1a}-1.29\%$
test_compile_assign_and_add[pytree-compile] 1.5773ms 0.3780ms 2.6452 KOps/s 2.2899 KOps/s $\textbf{\color{#35bf28}+15.52\%}$
test_compile_assign_and_add[pytree-eager] 2.7785ms 2.5849ms 386.8670 Ops/s 382.3897 Ops/s $\color{#35bf28}+1.17\%$
test_compile_indexing[tensor-tensordict-compile] 0.3271ms 0.1166ms 8.5791 KOps/s 8.6410 KOps/s $\color{#d91a1a}-0.72\%$
test_compile_indexing[tensor-tensordict-eager] 0.5734ms 81.6573μs 12.2463 KOps/s 12.9931 KOps/s $\textbf{\color{#d91a1a}-5.75\%}$
test_compile_indexing[tensor-tensorclass-compile] 0.3010ms 0.1097ms 9.1191 KOps/s 9.7166 KOps/s $\textbf{\color{#d91a1a}-6.15\%}$
test_compile_indexing[tensor-tensorclass-eager] 0.2477ms 69.5748μs 14.3730 KOps/s 15.0044 KOps/s $\color{#d91a1a}-4.21\%$
test_compile_indexing[tensor-pytree-compile] 0.3014ms 0.1108ms 9.0288 KOps/s 9.6655 KOps/s $\textbf{\color{#d91a1a}-6.59\%}$
test_compile_indexing[tensor-pytree-eager] 0.2702ms 70.1304μs 14.2592 KOps/s 14.9735 KOps/s $\color{#d91a1a}-4.77\%$
test_compile_indexing[slice-tensordict-compile] 0.2432ms 99.6477μs 10.0354 KOps/s 10.0732 KOps/s $\color{#d91a1a}-0.38\%$
test_compile_indexing[slice-tensordict-eager] 0.2336ms 17.0131μs 58.7782 KOps/s 59.2528 KOps/s $\color{#d91a1a}-0.80\%$
test_compile_indexing[slice-tensorclass-compile] 0.2620ms 95.9526μs 10.4218 KOps/s 10.4888 KOps/s $\color{#d91a1a}-0.64\%$
test_compile_indexing[slice-tensorclass-eager] 0.1517ms 15.1828μs 65.8638 KOps/s 63.7376 KOps/s $\color{#35bf28}+3.34\%$
test_compile_indexing[slice-pytree-compile] 0.2823ms 95.2578μs 10.4978 KOps/s 9.9765 KOps/s $\textbf{\color{#35bf28}+5.23\%}$
test_compile_indexing[slice-pytree-eager] 0.2205ms 15.3649μs 65.0833 KOps/s 64.6706 KOps/s $\color{#35bf28}+0.64\%$
test_compile_indexing[int-tensordict-compile] 0.2969ms 0.1027ms 9.7358 KOps/s 9.7082 KOps/s $\color{#35bf28}+0.28\%$
test_compile_indexing[int-tensordict-eager] 0.6072ms 16.0110μs 62.4572 KOps/s 55.1714 KOps/s $\textbf{\color{#35bf28}+13.21\%}$
test_compile_indexing[int-tensorclass-compile] 0.2393ms 95.0587μs 10.5198 KOps/s 10.3117 KOps/s $\color{#35bf28}+2.02\%$
test_compile_indexing[int-tensorclass-eager] 0.1505ms 15.2315μs 65.6533 KOps/s 63.1555 KOps/s $\color{#35bf28}+3.96\%$
test_compile_indexing[int-pytree-compile] 0.2456ms 95.0686μs 10.5187 KOps/s 10.3516 KOps/s $\color{#35bf28}+1.61\%$
test_compile_indexing[int-pytree-eager] 0.1241ms 15.3142μs 65.2988 KOps/s 64.6312 KOps/s $\color{#35bf28}+1.03\%$
test_mod_add[eager] 0.1796ms 29.7847μs 33.5742 KOps/s 33.5075 KOps/s $\color{#35bf28}+0.20\%$
test_mod_add[compile] 0.2489ms 77.0884μs 12.9721 KOps/s 12.9482 KOps/s $\color{#35bf28}+0.18\%$
test_mod_add[compile-overhead] 0.3137ms 0.1623ms 6.1622 KOps/s 5.9494 KOps/s $\color{#35bf28}+3.58\%$
test_mod_wrap[eager] 0.3798ms 0.2331ms 4.2905 KOps/s 4.2644 KOps/s $\color{#35bf28}+0.61\%$
test_mod_wrap[compile] 1.5996ms 0.2885ms 3.4659 KOps/s 3.5815 KOps/s $\color{#d91a1a}-3.23\%$
test_mod_wrap[compile-overhead] 7.5129ms 3.8147ms 262.1442 Ops/s 259.4481 Ops/s $\color{#35bf28}+1.04\%$
test_mod_wrap_and_backward[eager] 1.5571ms 1.3069ms 765.1928 Ops/s 700.0868 Ops/s $\textbf{\color{#35bf28}+9.30\%}$
test_mod_wrap_and_backward[compile] 1.3996ms 1.2490ms 800.6573 Ops/s 737.8164 Ops/s $\textbf{\color{#35bf28}+8.52\%}$
test_mod_wrap_and_backward[compile-overhead] 1.3420ms 0.8955ms 1.1166 KOps/s 928.2526 Ops/s $\textbf{\color{#35bf28}+20.29\%}$
test_seq_add[eager] 0.2681ms 94.5156μs 10.5803 KOps/s 10.6626 KOps/s $\color{#d91a1a}-0.77\%$
test_seq_add[compile] 0.2579ms 89.6176μs 11.1585 KOps/s 11.7538 KOps/s $\textbf{\color{#d91a1a}-5.06\%}$
test_seq_add[compile-overhead] 0.3009ms 0.1338ms 7.4749 KOps/s 7.9454 KOps/s $\textbf{\color{#d91a1a}-5.92\%}$
test_seq_wrap[eager] 0.5832ms 0.3801ms 2.6312 KOps/s 2.7177 KOps/s $\color{#d91a1a}-3.18\%$
test_seq_wrap[compile] 1.6381ms 0.2977ms 3.3588 KOps/s 3.3801 KOps/s $\color{#d91a1a}-0.63\%$
test_seq_wrap[compile-overhead] 0.4050ms 0.2228ms 4.4884 KOps/s 4.5711 KOps/s $\color{#d91a1a}-1.81\%$
test_func_call_runtime[False-eager] 0.9598ms 0.7497ms 1.3338 KOps/s 1.3913 KOps/s $\color{#d91a1a}-4.13\%$
test_func_call_runtime[False-compile] 0.9762ms 0.7644ms 1.3081 KOps/s 1.3597 KOps/s $\color{#d91a1a}-3.79\%$
test_func_call_runtime[False-compile-overhead] 0.5071ms 0.3525ms 2.8370 KOps/s 2.8115 KOps/s $\color{#35bf28}+0.91\%$
test_func_call_runtime[True-eager] 1.1907ms 0.8813ms 1.1347 KOps/s 1.1353 KOps/s $\color{#d91a1a}-0.05\%$
test_func_call_runtime[True-compile] 0.9331ms 0.7553ms 1.3239 KOps/s 1.3251 KOps/s $\color{#d91a1a}-0.09\%$
test_func_call_runtime[True-compile-overhead] 0.5262ms 0.3745ms 2.6703 KOps/s 2.6642 KOps/s $\color{#35bf28}+0.23\%$
test_func_call_cm_runtime[False-eager] 0.8966ms 0.7025ms 1.4235 KOps/s 1.4038 KOps/s $\color{#35bf28}+1.40\%$
test_func_call_cm_runtime[False-compile] 0.9602ms 0.7378ms 1.3553 KOps/s 1.3571 KOps/s $\color{#d91a1a}-0.13\%$
test_func_call_cm_runtime[False-compile-overhead] 0.5717ms 0.3641ms 2.7468 KOps/s 2.7997 KOps/s $\color{#d91a1a}-1.89\%$
test_func_call_cm_runtime[True-eager] 1.2126ms 1.0104ms 989.7386 Ops/s 1.0226 KOps/s $\color{#d91a1a}-3.21\%$
test_func_call_cm_runtime[True-compile] 1.1072ms 0.8264ms 1.2101 KOps/s 1.2745 KOps/s $\textbf{\color{#d91a1a}-5.06\%}$
test_func_call_cm_runtime[True-compile-overhead] 0.5918ms 0.4077ms 2.4527 KOps/s 2.4816 KOps/s $\color{#d91a1a}-1.17\%$
test_vmap_func_call_cm_runtime[eager] 2.4692ms 1.9768ms 505.8798 Ops/s 499.1610 Ops/s $\color{#35bf28}+1.35\%$
test_vmap_func_call_cm_runtime[compile] 0.9650ms 0.7972ms 1.2544 KOps/s 1.2467 KOps/s $\color{#35bf28}+0.61\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.5306ms 0.4017ms 2.4894 KOps/s 2.4721 KOps/s $\color{#35bf28}+0.70\%$
test_distributed 2.8786ms 0.2918ms 3.4268 KOps/s 8.8836 KOps/s $\textbf{\color{#d91a1a}-61.43\%}$
test_tdmodule 0.1127ms 13.6452μs 73.2859 KOps/s 77.0764 KOps/s $\color{#d91a1a}-4.92\%$
test_tdmodule_dispatch 56.2710μs 28.5149μs 35.0694 KOps/s 36.8755 KOps/s $\color{#d91a1a}-4.90\%$
test_tdseq 35.4310μs 15.0260μs 66.5513 KOps/s 68.3434 KOps/s $\color{#d91a1a}-2.62\%$
test_tdseq_dispatch 0.1271ms 31.5139μs 31.7320 KOps/s 33.0181 KOps/s $\color{#d91a1a}-3.90\%$
test_instantiation_functorch 1.6813ms 1.5015ms 666.0142 Ops/s 661.3632 Ops/s $\color{#35bf28}+0.70\%$
test_exec_functorch 0.2563ms 0.1388ms 7.2035 KOps/s 7.1712 KOps/s $\color{#35bf28}+0.45\%$
test_exec_functional_call 0.2605ms 0.1300ms 7.6928 KOps/s 7.6504 KOps/s $\color{#35bf28}+0.55\%$
test_exec_td_decorator 0.4065ms 0.1718ms 5.8216 KOps/s 5.7377 KOps/s $\color{#35bf28}+1.46\%$
test_vmap_mlp_speed_decorator[True-True] 0.8416ms 0.6590ms 1.5174 KOps/s 1.5325 KOps/s $\color{#d91a1a}-0.98\%$
test_vmap_mlp_speed_decorator[True-False] 0.8569ms 0.6544ms 1.5282 KOps/s 1.5304 KOps/s $\color{#d91a1a}-0.15\%$
test_vmap_mlp_speed_decorator[False-True] 0.7982ms 0.5872ms 1.7031 KOps/s 1.6823 KOps/s $\color{#35bf28}+1.23\%$
test_vmap_mlp_speed_decorator[False-False] 0.7537ms 0.5697ms 1.7555 KOps/s 1.7426 KOps/s $\color{#35bf28}+0.74\%$
test_vmap_transformer_speed_decorator[True-True] 19.1063ms 18.4692ms 54.1441 Ops/s 54.0438 Ops/s $\color{#35bf28}+0.19\%$
test_vmap_transformer_speed_decorator[True-False] 19.2378ms 18.5196ms 53.9968 Ops/s 53.9754 Ops/s $\color{#35bf28}+0.04\%$
test_vmap_transformer_speed_decorator[False-True] 19.1170ms 18.3971ms 54.3563 Ops/s 54.4659 Ops/s $\color{#d91a1a}-0.20\%$
test_vmap_transformer_speed_decorator[False-False] 19.0936ms 18.3173ms 54.5933 Ops/s 54.2469 Ops/s $\color{#35bf28}+0.64\%$
test_to_module_speed[True] 1.1569ms 0.9168ms 1.0907 KOps/s 1.0846 KOps/s $\color{#35bf28}+0.57\%$
test_to_module_speed[False] 1.4096ms 0.9008ms 1.1101 KOps/s 1.1059 KOps/s $\color{#35bf28}+0.37\%$
test_tc_init 71.9600μs 34.6601μs 28.8516 KOps/s 29.1429 KOps/s $\color{#d91a1a}-1.00\%$
test_tc_init_nested 0.2059ms 70.0073μs 14.2842 KOps/s 13.9422 KOps/s $\color{#35bf28}+2.45\%$
test_tc_first_layer_tensor 5.9600μs 0.6827μs 1.4648 MOps/s 1.4500 MOps/s $\color{#35bf28}+1.02\%$
test_tc_first_layer_nontensor 27.9900μs 2.2695μs 440.6278 KOps/s 438.0590 KOps/s $\color{#35bf28}+0.59\%$
test_tc_second_layer_tensor 23.8328μs 1.4060μs 711.2436 KOps/s 710.0264 KOps/s $\color{#35bf28}+0.17\%$
test_tc_second_layer_nontensor 28.3000μs 3.0302μs 330.0136 KOps/s 326.0231 KOps/s $\color{#35bf28}+1.22\%$
test_unbind 0.2387s 9.7897ms 102.1478 Ops/s 154.1525 Ops/s $\textbf{\color{#d91a1a}-33.74\%}$
test_full_like 9.6928ms 9.2747ms 107.8206 Ops/s 106.4979 Ops/s $\color{#35bf28}+1.24\%$
test_zeros_like 5.2714ms 4.3368ms 230.5822 Ops/s 230.1625 Ops/s $\color{#35bf28}+0.18\%$
test_ones_like 4.9970ms 4.3492ms 229.9285 Ops/s 230.0691 Ops/s $\color{#d91a1a}-0.06\%$
test_clone 11.7366ms 9.2937ms 107.5998 Ops/s 151.8730 Ops/s $\textbf{\color{#d91a1a}-29.15\%}$
test_squeeze 55.3510μs 9.2672μs 107.9074 KOps/s 109.5122 KOps/s $\color{#d91a1a}-1.47\%$
test_unsqueeze 0.2094ms 71.3555μs 14.0143 KOps/s 14.1322 KOps/s $\color{#d91a1a}-0.83\%$
test_split 0.3846ms 0.1568ms 6.3793 KOps/s 6.4795 KOps/s $\color{#d91a1a}-1.55\%$
test_permute 0.3904ms 0.1806ms 5.5385 KOps/s 5.7813 KOps/s $\color{#d91a1a}-4.20\%$
test_stack 51.7984ms 51.2395ms 19.5162 Ops/s 19.6076 Ops/s $\color{#d91a1a}-0.47\%$
test_cat 51.4460ms 50.8419ms 19.6688 Ops/s 19.7915 Ops/s $\color{#d91a1a}-0.62\%$

[ghstack-poisoned]
vmoens added a commit that referenced this pull request Nov 22, 2024
ghstack-source-id: 6d6a10c52d5558017497e4e3cf1995820350a952
Pull Request resolved: #1103
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Nov 22, 2024
ghstack-source-id: e7267e4d756b8ac3ea0727c72fed4ffac9a74e4e
Pull Request resolved: #1103
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Nov 23, 2024
ghstack-source-id: 9d15fa52028fb9b055e4b562bf740e4640b61cfd
Pull Request resolved: #1103
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Nov 23, 2024
ghstack-source-id: 336a1ba18ff51b2a2fcb041cd18cc4c198ec211e
Pull Request resolved: #1103
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Nov 23, 2024
ghstack-source-id: d4a264a77251a71964c05834daeaa029ed142417
Pull Request resolved: #1103
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Nov 23, 2024
ghstack-source-id: a566ae225c54f07a680b4bf380b16d8e797f62ea
Pull Request resolved: #1103
@vmoens vmoens merged commit f223205 into gh/vmoens/38/base Nov 23, 2024
42 of 44 checks passed
vmoens added a commit that referenced this pull request Nov 23, 2024
ghstack-source-id: a566ae225c54f07a680b4bf380b16d8e797f62ea
Pull Request resolved: #1103
@vmoens vmoens deleted the gh/vmoens/38/head branch November 23, 2024 19:33