Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Test] Skip compile tests that require 2.5 for stable #996

Merged
merged 1 commit into from
Sep 17, 2024

Conversation

vmoens
Copy link
Contributor

@vmoens vmoens commented Sep 17, 2024

[ghstack-poisoned]
vmoens added a commit that referenced this pull request Sep 17, 2024
ghstack-source-id: 531f17478756b54eacc70b1f4c9be319a6335a37
Pull Request resolved: #996
@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Sep 17, 2024
@vmoens vmoens merged commit a1d41dc into gh/vmoens/22/base Sep 17, 2024
12 of 28 checks passed
vmoens added a commit that referenced this pull request Sep 17, 2024
ghstack-source-id: 531f17478756b54eacc70b1f4c9be319a6335a37
Pull Request resolved: #996
@vmoens vmoens deleted the gh/vmoens/22/head branch September 17, 2024 02:51
Copy link

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 222. Improved: $\large\color{#35bf28}7$. Worsened: $\large\color{#d91a1a}36$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 45.4360μs 21.1071μs 47.3775 KOps/s 52.7448 KOps/s $\textbf{\color{#d91a1a}-10.18\%}$
test_plain_set_stack_nested 49.6930μs 21.4057μs 46.7165 KOps/s 52.5680 KOps/s $\textbf{\color{#d91a1a}-11.13\%}$
test_plain_set_nested_inplace 74.8100μs 22.7879μs 43.8830 KOps/s 48.0164 KOps/s $\textbf{\color{#d91a1a}-8.61\%}$
test_plain_set_stack_nested_inplace 57.0270μs 22.8166μs 43.8278 KOps/s 47.9525 KOps/s $\textbf{\color{#d91a1a}-8.60\%}$
test_items 20.0570μs 4.3374μs 230.5536 KOps/s 240.5228 KOps/s $\color{#d91a1a}-4.14\%$
test_items_nested 0.6000ms 0.3638ms 2.7486 KOps/s 2.7319 KOps/s $\color{#35bf28}+0.61\%$
test_items_nested_locked 0.6260ms 0.3627ms 2.7570 KOps/s 2.7471 KOps/s $\color{#35bf28}+0.36\%$
test_items_nested_leaf 0.1256ms 67.8537μs 14.7376 KOps/s 14.6690 KOps/s $\color{#35bf28}+0.47\%$
test_items_stack_nested 0.4904ms 0.3697ms 2.7048 KOps/s 2.6873 KOps/s $\color{#35bf28}+0.65\%$
test_items_stack_nested_leaf 0.1280ms 70.3729μs 14.2100 KOps/s 14.1051 KOps/s $\color{#35bf28}+0.74\%$
test_items_stack_nested_locked 0.5754ms 0.3707ms 2.6977 KOps/s 2.6759 KOps/s $\color{#35bf28}+0.82\%$
test_keys 20.2070μs 3.7788μs 264.6354 KOps/s 269.9298 KOps/s $\color{#d91a1a}-1.96\%$
test_keys_nested 0.1683ms 0.1005ms 9.9530 KOps/s 9.7856 KOps/s $\color{#35bf28}+1.71\%$
test_keys_nested_locked 1.6825ms 0.1071ms 9.3342 KOps/s 9.4524 KOps/s $\color{#d91a1a}-1.25\%$
test_keys_nested_leaf 0.1514ms 86.2090μs 11.5997 KOps/s 11.9323 KOps/s $\color{#d91a1a}-2.79\%$
test_keys_stack_nested 0.1718ms 0.1005ms 9.9458 KOps/s 9.8978 KOps/s $\color{#35bf28}+0.49\%$
test_keys_stack_nested_leaf 0.1445ms 83.3967μs 11.9909 KOps/s 11.9299 KOps/s $\color{#35bf28}+0.51\%$
test_keys_stack_nested_locked 0.1752ms 0.1061ms 9.4256 KOps/s 9.3091 KOps/s $\color{#35bf28}+1.25\%$
test_values 5.9592μs 1.0691μs 935.3570 KOps/s 949.1950 KOps/s $\color{#d91a1a}-1.46\%$
test_values_nested 0.3411ms 76.7045μs 13.0370 KOps/s 13.6391 KOps/s $\color{#d91a1a}-4.41\%$
test_values_nested_locked 0.1304ms 73.3731μs 13.6290 KOps/s 13.6341 KOps/s $\color{#d91a1a}-0.04\%$
test_values_nested_leaf 0.3848ms 62.6979μs 15.9495 KOps/s 15.9367 KOps/s $\color{#35bf28}+0.08\%$
test_values_stack_nested 0.1330ms 74.5782μs 13.4087 KOps/s 13.3730 KOps/s $\color{#35bf28}+0.27\%$
test_values_stack_nested_leaf 0.2338ms 60.0617μs 16.6496 KOps/s 15.9171 KOps/s $\color{#35bf28}+4.60\%$
test_values_stack_nested_locked 0.1635ms 74.5886μs 13.4069 KOps/s 13.4691 KOps/s $\color{#d91a1a}-0.46\%$
test_membership 5.2108μs 0.6978μs 1.4330 MOps/s 1.0897 MOps/s $\textbf{\color{#35bf28}+31.51\%}$
test_membership_nested 0.1093ms 2.8610μs 349.5339 KOps/s 366.8371 KOps/s $\color{#d91a1a}-4.72\%$
test_membership_nested_leaf 54.8430μs 2.8557μs 350.1778 KOps/s 367.7709 KOps/s $\color{#d91a1a}-4.78\%$
test_membership_stacked_nested 23.5850μs 2.7811μs 359.5739 KOps/s 372.9649 KOps/s $\color{#d91a1a}-3.59\%$
test_membership_stacked_nested_leaf 23.4740μs 2.8320μs 353.1062 KOps/s 366.3796 KOps/s $\color{#d91a1a}-3.62\%$
test_membership_nested_last 24.0160μs 4.0424μs 247.3780 KOps/s 252.1458 KOps/s $\color{#d91a1a}-1.89\%$
test_membership_nested_leaf_last 25.3180μs 4.0664μs 245.9205 KOps/s 253.9772 KOps/s $\color{#d91a1a}-3.17\%$
test_membership_stacked_nested_last 33.7140μs 5.5474μs 180.2646 KOps/s 179.0521 KOps/s $\color{#35bf28}+0.68\%$
test_membership_stacked_nested_leaf_last 30.1570μs 5.5266μs 180.9421 KOps/s 180.2578 KOps/s $\color{#35bf28}+0.38\%$
test_nested_getleaf 43.9030μs 11.0340μs 90.6291 KOps/s 94.4306 KOps/s $\color{#d91a1a}-4.03\%$
test_nested_get 45.3650μs 10.5838μs 94.4840 KOps/s 98.8217 KOps/s $\color{#d91a1a}-4.39\%$
test_stacked_getleaf 34.1340μs 11.0523μs 90.4786 KOps/s 94.5207 KOps/s $\color{#d91a1a}-4.28\%$
test_stacked_get 34.7360μs 10.5709μs 94.5995 KOps/s 99.7441 KOps/s $\textbf{\color{#d91a1a}-5.16\%}$
test_nested_getitemleaf 44.4740μs 11.2535μs 88.8616 KOps/s 89.9628 KOps/s $\color{#d91a1a}-1.22\%$
test_nested_getitem 47.4190μs 10.6346μs 94.0327 KOps/s 95.9945 KOps/s $\color{#d91a1a}-2.04\%$
test_stacked_getitemleaf 34.7960μs 11.2830μs 88.6292 KOps/s 92.4000 KOps/s $\color{#d91a1a}-4.08\%$
test_stacked_getitem 38.0320μs 10.6701μs 93.7200 KOps/s 98.5988 KOps/s $\color{#d91a1a}-4.95\%$
test_lock_nested 92.4819ms 0.6023ms 1.6603 KOps/s 2.0237 KOps/s $\textbf{\color{#d91a1a}-17.96\%}$
test_lock_stack_nested 0.8037ms 0.4457ms 2.2439 KOps/s 2.2198 KOps/s $\color{#35bf28}+1.08\%$
test_unlock_nested 94.1709ms 0.5184ms 1.9289 KOps/s 2.4466 KOps/s $\textbf{\color{#d91a1a}-21.16\%}$
test_unlock_stack_nested 0.5851ms 0.3644ms 2.7442 KOps/s 2.6954 KOps/s $\color{#35bf28}+1.81\%$
test_flatten_speed 0.1712ms 88.3837μs 11.3143 KOps/s 11.2108 KOps/s $\color{#35bf28}+0.92\%$
test_unflatten_speed 0.9848ms 0.4761ms 2.1002 KOps/s 2.1641 KOps/s $\color{#d91a1a}-2.95\%$
test_common_ops 4.6756ms 1.1801ms 847.4082 Ops/s 922.1649 Ops/s $\textbf{\color{#d91a1a}-8.11\%}$
test_creation 19.9270μs 2.0738μs 482.1981 KOps/s 484.7066 KOps/s $\color{#d91a1a}-0.52\%$
test_creation_empty 70.2610μs 19.1198μs 52.3017 KOps/s 64.3983 KOps/s $\textbf{\color{#d91a1a}-18.78\%}$
test_creation_nested_1 59.0710μs 22.1323μs 45.1828 KOps/s 54.6995 KOps/s $\textbf{\color{#d91a1a}-17.40\%}$
test_creation_nested_2 63.8190μs 26.3863μs 37.8984 KOps/s 43.4417 KOps/s $\textbf{\color{#d91a1a}-12.76\%}$
test_clone 1.4067ms 17.8627μs 55.9825 KOps/s 59.8058 KOps/s $\textbf{\color{#d91a1a}-6.39\%}$
test_getitem[int] 0.8102ms 16.8907μs 59.2041 KOps/s 60.8786 KOps/s $\color{#d91a1a}-2.75\%$
test_getitem[slice_int] 0.1505ms 31.1441μs 32.1088 KOps/s 32.8248 KOps/s $\color{#d91a1a}-2.18\%$
test_getitem[range] 0.3809ms 60.8297μs 16.4393 KOps/s 16.9929 KOps/s $\color{#d91a1a}-3.26\%$
test_getitem[tuple] 0.1534ms 25.5371μs 39.1588 KOps/s 40.5420 KOps/s $\color{#d91a1a}-3.41\%$
test_getitem[list] 0.3402ms 56.0888μs 17.8289 KOps/s 18.8536 KOps/s $\textbf{\color{#d91a1a}-5.44\%}$
test_setitem_dim[int] 73.3070μs 34.3543μs 29.1085 KOps/s 30.7244 KOps/s $\textbf{\color{#d91a1a}-5.26\%}$
test_setitem_dim[slice_int] 0.1067ms 63.0031μs 15.8722 KOps/s 16.4398 KOps/s $\color{#d91a1a}-3.45\%$
test_setitem_dim[range] 0.1411ms 86.2763μs 11.5907 KOps/s 11.9035 KOps/s $\color{#d91a1a}-2.63\%$
test_setitem_dim[tuple] 0.1073ms 51.0158μs 19.6018 KOps/s 20.6057 KOps/s $\color{#d91a1a}-4.87\%$
test_setitem 0.1923ms 31.6315μs 31.6141 KOps/s 34.0336 KOps/s $\textbf{\color{#d91a1a}-7.11\%}$
test_set 0.1600ms 30.3496μs 32.9493 KOps/s 34.8670 KOps/s $\textbf{\color{#d91a1a}-5.50\%}$
test_set_shared 4.0104ms 0.2178ms 4.5922 KOps/s 4.6439 KOps/s $\color{#d91a1a}-1.11\%$
test_update 0.1857ms 38.0723μs 26.2658 KOps/s 29.2061 KOps/s $\textbf{\color{#d91a1a}-10.07\%}$
test_update_nested 0.2228ms 49.8093μs 20.0766 KOps/s 22.0959 KOps/s $\textbf{\color{#d91a1a}-9.14\%}$
test_update__nested 0.1590ms 36.2348μs 27.5978 KOps/s 28.5923 KOps/s $\color{#d91a1a}-3.48\%$
test_set_nested 0.1682ms 33.7310μs 29.6463 KOps/s 32.4474 KOps/s $\textbf{\color{#d91a1a}-8.63\%}$
test_set_nested_new 0.2037ms 38.9592μs 25.6679 KOps/s 27.9705 KOps/s $\textbf{\color{#d91a1a}-8.23\%}$
test_select 0.2186ms 56.3738μs 17.7387 KOps/s 19.0938 KOps/s $\textbf{\color{#d91a1a}-7.10\%}$
test_select_nested 0.1340ms 60.1118μs 16.6357 KOps/s 17.0705 KOps/s $\color{#d91a1a}-2.55\%$
test_exclude_nested 0.1422ms 75.2593μs 13.2874 KOps/s 13.3502 KOps/s $\color{#d91a1a}-0.47\%$
test_empty[True] 0.5222ms 0.3201ms 3.1237 KOps/s 3.1165 KOps/s $\color{#35bf28}+0.23\%$
test_empty[False] 10.5012μs 1.1707μs 854.2146 KOps/s 829.2823 KOps/s $\color{#35bf28}+3.01\%$
test_unbind_speed 0.5138ms 0.3040ms 3.2893 KOps/s 3.3123 KOps/s $\color{#d91a1a}-0.70\%$
test_unbind_speed_stack0 0.4285ms 0.2866ms 3.4888 KOps/s 3.4142 KOps/s $\color{#35bf28}+2.18\%$
test_unbind_speed_stack1 96.5653ms 0.7891ms 1.2672 KOps/s 1.3615 KOps/s $\textbf{\color{#d91a1a}-6.93\%}$
test_split 2.1060ms 1.9909ms 502.2866 Ops/s 457.2237 Ops/s $\textbf{\color{#35bf28}+9.86\%}$
test_chunk 94.7928ms 2.1823ms 458.2302 Ops/s 458.1919 Ops/s $+0.01\%$
test_creation[device0] 0.2327ms 0.1159ms 8.6247 KOps/s 8.2204 KOps/s $\color{#35bf28}+4.92\%$
test_creation_from_tensor 3.5644ms 0.1172ms 8.5340 KOps/s 8.5719 KOps/s $\color{#d91a1a}-0.44\%$
test_add_one[memmap_tensor0] 0.2419ms 7.5582μs 132.3064 KOps/s 134.8884 KOps/s $\color{#d91a1a}-1.91\%$
test_contiguous[memmap_tensor0] 21.3600μs 1.9214μs 520.4497 KOps/s 514.1760 KOps/s $\color{#35bf28}+1.22\%$
test_stack[memmap_tensor0] 43.6820μs 5.6785μs 176.1033 KOps/s 174.5776 KOps/s $\color{#35bf28}+0.87\%$
test_memmaptd_index 1.1602ms 0.4086ms 2.4474 KOps/s 2.5148 KOps/s $\color{#d91a1a}-2.68\%$
test_memmaptd_index_astensor 1.2060ms 0.4864ms 2.0560 KOps/s 2.1128 KOps/s $\color{#d91a1a}-2.69\%$
test_memmaptd_index_op 1.7791ms 1.0462ms 955.8608 Ops/s 1.0574 KOps/s $\textbf{\color{#d91a1a}-9.60\%}$
test_serialize_model 0.2021s 0.1317s 7.5933 Ops/s 8.4686 Ops/s $\textbf{\color{#d91a1a}-10.34\%}$
test_serialize_model_pickle 0.4563s 0.3897s 2.5659 Ops/s 2.5641 Ops/s $\color{#35bf28}+0.07\%$
test_serialize_weights 0.1242s 0.1163s 8.5985 Ops/s 7.4539 Ops/s $\textbf{\color{#35bf28}+15.36\%}$
test_serialize_weights_returnearly 0.1580s 0.1532s 6.5284 Ops/s 6.4311 Ops/s $\color{#35bf28}+1.51\%$
test_serialize_weights_pickle 0.4493s 0.3984s 2.5103 Ops/s 2.4333 Ops/s $\color{#35bf28}+3.17\%$
test_serialize_weights_filesystem 0.1461s 0.1427s 7.0056 Ops/s 6.9600 Ops/s $\color{#35bf28}+0.66\%$
test_serialize_model_filesystem 0.1589s 0.1519s 6.5818 Ops/s 6.5472 Ops/s $\color{#35bf28}+0.53\%$
test_reshape_pytree 87.0540μs 40.3338μs 24.7931 KOps/s 25.3988 KOps/s $\color{#d91a1a}-2.38\%$
test_reshape_td 0.1555ms 45.4208μs 22.0164 KOps/s 21.9225 KOps/s $\color{#35bf28}+0.43\%$
test_view_pytree 97.0420μs 39.2288μs 25.4915 KOps/s 25.8846 KOps/s $\color{#d91a1a}-1.52\%$
test_view_td 0.1361ms 52.5163μs 19.0417 KOps/s 19.1283 KOps/s $\color{#d91a1a}-0.45\%$
test_unbind_pytree 76.2130μs 36.6866μs 27.2579 KOps/s 27.7037 KOps/s $\color{#d91a1a}-1.61\%$
test_unbind_td 0.3132ms 45.6760μs 21.8933 KOps/s 20.8676 KOps/s $\color{#35bf28}+4.92\%$
test_split_pytree 83.8170μs 38.7662μs 25.7956 KOps/s 26.1442 KOps/s $\color{#d91a1a}-1.33\%$
test_split_td 0.1993ms 58.6200μs 17.0590 KOps/s 17.4603 KOps/s $\color{#d91a1a}-2.30\%$
test_add_pytree 0.1187ms 46.0517μs 21.7147 KOps/s 22.1533 KOps/s $\color{#d91a1a}-1.98\%$
test_add_td 0.1674ms 84.0575μs 11.8966 KOps/s 13.1326 KOps/s $\textbf{\color{#d91a1a}-9.41\%}$
test_compile_add_one_nested[tensordict-compile] 0.1481ms 58.3809μs 17.1289 KOps/s 17.2941 KOps/s $\color{#d91a1a}-0.96\%$
test_compile_add_one_nested[tensordict-eager] 0.3905ms 0.1789ms 5.5911 KOps/s 5.7267 KOps/s $\color{#d91a1a}-2.37\%$
test_compile_add_one_nested[pytree-compile] 0.1228ms 58.5316μs 17.0848 KOps/s 17.5362 KOps/s $\color{#d91a1a}-2.57\%$
test_compile_add_one_nested[pytree-eager] 0.3218ms 0.1443ms 6.9318 KOps/s 7.2444 KOps/s $\color{#d91a1a}-4.32\%$
test_compile_copy_nested[tensordict-compile] 77.2850μs 21.0865μs 47.4238 KOps/s 46.4339 KOps/s $\color{#35bf28}+2.13\%$
test_compile_copy_nested[tensordict-eager] 0.1486ms 67.7824μs 14.7531 KOps/s 15.0297 KOps/s $\color{#d91a1a}-1.84\%$
test_compile_copy_nested[pytree-compile] 0.1590ms 77.1224μs 12.9664 KOps/s 13.1056 KOps/s $\color{#d91a1a}-1.06\%$
test_compile_copy_nested[pytree-eager] 0.1281ms 69.2424μs 14.4420 KOps/s 14.4277 KOps/s $\color{#35bf28}+0.10\%$
test_compile_add_one_flat[tensordict-compile] 0.3781ms 0.1747ms 5.7251 KOps/s 5.8380 KOps/s $\color{#d91a1a}-1.93\%$
test_compile_add_one_flat[tensordict-eager] 0.3562ms 0.1898ms 5.2698 KOps/s 5.3808 KOps/s $\color{#d91a1a}-2.06\%$
test_compile_add_one_flat[tensorclass-compile] 0.1054ms 45.6623μs 21.8999 KOps/s 20.9072 KOps/s $\color{#35bf28}+4.75\%$
test_compile_add_one_flat[tensorclass-eager] 0.1515ms 68.7241μs 14.5509 KOps/s 14.6228 KOps/s $\color{#d91a1a}-0.49\%$
test_compile_add_one_flat[pytree-compile] 0.2826ms 0.1783ms 5.6099 KOps/s 5.7548 KOps/s $\color{#d91a1a}-2.52\%$
test_compile_add_one_flat[pytree-eager] 0.5379ms 0.2895ms 3.4538 KOps/s 3.4354 KOps/s $\color{#35bf28}+0.54\%$
test_compile_add_self_flat[tensordict-eager] 0.4609ms 0.2011ms 4.9730 KOps/s 4.9412 KOps/s $\color{#35bf28}+0.64\%$
test_compile_add_self_flat[tensordict-compile] 0.4427ms 0.1819ms 5.4966 KOps/s 5.7712 KOps/s $\color{#d91a1a}-4.76\%$
test_compile_add_self_flat[tensorclass-eager] 0.1567ms 63.5334μs 15.7398 KOps/s 15.9350 KOps/s $\color{#d91a1a}-1.23\%$
test_compile_add_self_flat[tensorclass-compile] 0.1229ms 47.1818μs 21.1946 KOps/s 21.0051 KOps/s $\color{#35bf28}+0.90\%$
test_compile_add_self_flat[pytree-eager] 0.3080ms 0.2344ms 4.2669 KOps/s 4.3748 KOps/s $\color{#d91a1a}-2.47\%$
test_compile_add_self_flat[pytree-compile] 0.3350ms 0.1764ms 5.6694 KOps/s 5.6712 KOps/s $\color{#d91a1a}-0.03\%$
test_compile_copy_flat[tensordict-compile] 0.5520ms 0.1078ms 9.2786 KOps/s 9.7006 KOps/s $\color{#d91a1a}-4.35\%$
test_compile_copy_flat[tensordict-eager] 0.2116ms 62.3004μs 16.0513 KOps/s 17.5354 KOps/s $\textbf{\color{#d91a1a}-8.46\%}$
test_compile_copy_flat[pytree-compile] 0.1569ms 77.4595μs 12.9100 KOps/s 12.6742 KOps/s $\color{#35bf28}+1.86\%$
test_compile_copy_flat[pytree-eager] 0.1459ms 68.8458μs 14.5252 KOps/s 14.0073 KOps/s $\color{#35bf28}+3.70\%$
test_compile_assign_and_add[tensordict-compile] 0.3776ms 0.1911ms 5.2323 KOps/s 5.1242 KOps/s $\color{#35bf28}+2.11\%$
test_compile_assign_and_add[tensordict-eager] 1.8864ms 1.6324ms 612.6060 Ops/s 619.1677 Ops/s $\color{#d91a1a}-1.06\%$
test_compile_assign_and_add[pytree-compile] 0.2857ms 0.1933ms 5.1725 KOps/s 5.2033 KOps/s $\color{#d91a1a}-0.59\%$
test_compile_assign_and_add[pytree-eager] 1.5107ms 1.1104ms 900.5431 Ops/s 935.4301 Ops/s $\color{#d91a1a}-3.73\%$
test_compile_assign_and_add_stack[compile] 0.8110ms 0.4173ms 2.3964 KOps/s 2.3280 KOps/s $\color{#35bf28}+2.94\%$
test_compile_assign_and_add_stack[eager] 5.6426ms 3.8853ms 257.3773 Ops/s 279.3335 Ops/s $\textbf{\color{#d91a1a}-7.86\%}$
test_compile_indexing[tensor-tensordict-compile] 84.7390μs 33.9404μs 29.4634 KOps/s 27.9032 KOps/s $\textbf{\color{#35bf28}+5.59\%}$
test_compile_indexing[tensor-tensordict-eager] 1.0723ms 50.2563μs 19.8980 KOps/s 20.6674 KOps/s $\color{#d91a1a}-3.72\%$
test_compile_indexing[tensor-tensorclass-compile] 0.1001ms 30.7766μs 32.4922 KOps/s 33.8871 KOps/s $\color{#d91a1a}-4.12\%$
test_compile_indexing[tensor-tensorclass-eager] 95.5990μs 29.5506μs 33.8403 KOps/s 33.5357 KOps/s $\color{#35bf28}+0.91\%$
test_compile_indexing[tensor-pytree-compile] 79.1480μs 30.2465μs 33.0617 KOps/s 34.0793 KOps/s $\color{#d91a1a}-2.99\%$
test_compile_indexing[tensor-pytree-eager] 93.2550μs 29.9999μs 33.3334 KOps/s 34.1460 KOps/s $\color{#d91a1a}-2.38\%$
test_compile_indexing[slice-tensordict-compile] 0.1581ms 73.9279μs 13.5267 KOps/s 13.6682 KOps/s $\color{#d91a1a}-1.04\%$
test_compile_indexing[slice-tensordict-eager] 0.5170ms 27.6770μs 36.1311 KOps/s 37.1319 KOps/s $\color{#d91a1a}-2.70\%$
test_compile_indexing[slice-tensorclass-compile] 0.1389ms 67.0144μs 14.9222 KOps/s 14.6581 KOps/s $\color{#35bf28}+1.80\%$
test_compile_indexing[slice-tensorclass-eager] 77.5160μs 23.4359μs 42.6696 KOps/s 42.0803 KOps/s $\color{#35bf28}+1.40\%$
test_compile_indexing[slice-pytree-compile] 0.1628ms 68.3970μs 14.6205 KOps/s 14.6782 KOps/s $\color{#d91a1a}-0.39\%$
test_compile_indexing[slice-pytree-eager] 82.8450μs 23.4714μs 42.6051 KOps/s 42.2930 KOps/s $\color{#35bf28}+0.74\%$
test_compile_indexing[int-tensordict-compile] 0.1598ms 74.2371μs 13.4704 KOps/s 13.8266 KOps/s $\color{#d91a1a}-2.58\%$
test_compile_indexing[int-tensordict-eager] 0.8901ms 27.5914μs 36.2432 KOps/s 37.8094 KOps/s $\color{#d91a1a}-4.14\%$
test_compile_indexing[int-tensorclass-compile] 0.1415ms 68.2478μs 14.6525 KOps/s 14.7464 KOps/s $\color{#d91a1a}-0.64\%$
test_compile_indexing[int-tensorclass-eager] 82.7650μs 23.2224μs 43.0619 KOps/s 42.6491 KOps/s $\color{#35bf28}+0.97\%$
test_compile_indexing[int-pytree-compile] 0.1515ms 68.2072μs 14.6612 KOps/s 14.7708 KOps/s $\color{#d91a1a}-0.74\%$
test_compile_indexing[int-pytree-eager] 58.9500μs 23.3418μs 42.8416 KOps/s 42.5321 KOps/s $\color{#35bf28}+0.73\%$
test_mod_add[eager] 87.1830μs 26.1035μs 38.3091 KOps/s 41.4728 KOps/s $\textbf{\color{#d91a1a}-7.63\%}$
test_mod_add[compile] 82.1850μs 39.3576μs 25.4080 KOps/s 26.5832 KOps/s $\color{#d91a1a}-4.42\%$
test_mod_add[compile-overhead] 80.6110μs 39.7582μs 25.1520 KOps/s 26.3888 KOps/s $\color{#d91a1a}-4.69\%$
test_mod_wrap[eager] 0.4199ms 0.2127ms 4.7009 KOps/s 4.8562 KOps/s $\color{#d91a1a}-3.20\%$
test_mod_wrap[compile] 0.4100ms 0.2337ms 4.2781 KOps/s 4.3430 KOps/s $\color{#d91a1a}-1.49\%$
test_mod_wrap[compile-overhead] 0.3840ms 0.2320ms 4.3101 KOps/s 4.3914 KOps/s $\color{#d91a1a}-1.85\%$
test_mod_wrap_and_backward[eager] 17.9058ms 12.3975ms 80.6614 Ops/s 87.4740 Ops/s $\textbf{\color{#d91a1a}-7.79\%}$
test_mod_wrap_and_backward[compile] 14.1027ms 11.7904ms 84.8149 Ops/s 79.2917 Ops/s $\textbf{\color{#35bf28}+6.97\%}$
test_mod_wrap_and_backward[compile-overhead] 17.6497ms 12.0027ms 83.3149 Ops/s 85.2111 Ops/s $\color{#d91a1a}-2.23\%$
test_seq_add[eager] 0.2077ms 92.3625μs 10.8269 KOps/s 11.3372 KOps/s $\color{#d91a1a}-4.50\%$
test_seq_add[compile] 0.4562ms 72.0855μs 13.8724 KOps/s 15.5752 KOps/s $\textbf{\color{#d91a1a}-10.93\%}$
test_seq_add[compile-overhead] 0.1182ms 62.7258μs 15.9424 KOps/s 16.0528 KOps/s $\color{#d91a1a}-0.69\%$
test_seq_wrap[eager] 0.6513ms 0.3965ms 2.5223 KOps/s 2.6395 KOps/s $\color{#d91a1a}-4.44\%$
test_seq_wrap[compile] 0.4845ms 0.2642ms 3.7847 KOps/s 3.6986 KOps/s $\color{#35bf28}+2.33\%$
test_seq_wrap[compile-overhead] 0.5238ms 0.2680ms 3.7316 KOps/s 3.7563 KOps/s $\color{#d91a1a}-0.66\%$
test_func_call_runtime[False-eager] 0.8932ms 0.5197ms 1.9243 KOps/s 1.8911 KOps/s $\color{#35bf28}+1.76\%$
test_func_call_runtime[False-compile] 0.9371ms 0.5024ms 1.9905 KOps/s 2.0125 KOps/s $\color{#d91a1a}-1.09\%$
test_func_call_runtime[False-compile-overhead] 0.6260ms 0.5013ms 1.9950 KOps/s 2.0028 KOps/s $\color{#d91a1a}-0.39\%$
test_func_call_runtime[True-eager] 0.8672ms 0.7399ms 1.3515 KOps/s 1.3261 KOps/s $\color{#35bf28}+1.91\%$
test_func_call_runtime[True-compile] 0.9196ms 0.5139ms 1.9460 KOps/s 1.9543 KOps/s $\color{#d91a1a}-0.43\%$
test_func_call_runtime[True-compile-overhead] 0.9642ms 0.5152ms 1.9410 KOps/s 1.9486 KOps/s $\color{#d91a1a}-0.39\%$
test_func_call_cm_runtime[False-eager] 0.9300ms 0.5236ms 1.9098 KOps/s 1.8671 KOps/s $\color{#35bf28}+2.29\%$
test_func_call_cm_runtime[False-compile] 0.8042ms 0.5069ms 1.9729 KOps/s 1.9818 KOps/s $\color{#d91a1a}-0.45\%$
test_func_call_cm_runtime[False-compile-overhead] 0.8915ms 0.5067ms 1.9736 KOps/s 1.9895 KOps/s $\color{#d91a1a}-0.80\%$
test_func_call_cm_runtime[True-eager] 1.0911ms 0.8648ms 1.1564 KOps/s 1.1298 KOps/s $\color{#35bf28}+2.35\%$
test_func_call_cm_runtime[True-compile] 0.9628ms 0.7389ms 1.3533 KOps/s 1.3292 KOps/s $\color{#35bf28}+1.81\%$
test_func_call_cm_runtime[True-compile-overhead] 1.1818ms 0.7347ms 1.3611 KOps/s 1.3193 KOps/s $\color{#35bf28}+3.16\%$
test_vmap_func_call_cm_runtime[eager] 3.2056ms 1.8866ms 530.0577 Ops/s 539.3361 Ops/s $\color{#d91a1a}-1.72\%$
test_vmap_func_call_cm_runtime[compile] 2.9016ms 1.9577ms 510.7912 Ops/s 515.8425 Ops/s $\color{#d91a1a}-0.98\%$
test_vmap_func_call_cm_runtime[compile-overhead] 2.6631ms 1.9461ms 513.8569 Ops/s 521.9402 Ops/s $\color{#d91a1a}-1.55\%$
test_distributed 0.3192ms 0.1230ms 8.1330 KOps/s 7.7987 KOps/s $\color{#35bf28}+4.29\%$
test_tdmodule 46.0870μs 19.1543μs 52.2076 KOps/s 59.4281 KOps/s $\textbf{\color{#d91a1a}-12.15\%}$
test_tdmodule_dispatch 60.9740μs 38.3454μs 26.0787 KOps/s 29.7709 KOps/s $\textbf{\color{#d91a1a}-12.40\%}$
test_tdseq 50.2740μs 21.4116μs 46.7036 KOps/s 52.4572 KOps/s $\textbf{\color{#d91a1a}-10.97\%}$
test_tdseq_dispatch 73.1070μs 42.9825μs 23.2653 KOps/s 26.1806 KOps/s $\textbf{\color{#d91a1a}-11.14\%}$
test_instantiation_functorch 1.7443ms 1.5816ms 632.2848 Ops/s 631.6823 Ops/s $\color{#35bf28}+0.10\%$
test_instantiation_td 2.0012ms 1.1526ms 867.6160 Ops/s 848.2312 Ops/s $\color{#35bf28}+2.29\%$
test_exec_functorch 0.4010ms 0.1869ms 5.3504 KOps/s 5.4071 KOps/s $\color{#d91a1a}-1.05\%$
test_exec_functional_call 0.2744ms 0.1728ms 5.7854 KOps/s 5.8092 KOps/s $\color{#d91a1a}-0.41\%$
test_exec_td 0.2638ms 0.1705ms 5.8668 KOps/s 5.9366 KOps/s $\color{#d91a1a}-1.18\%$
test_exec_td_decorator 1.2151ms 0.2254ms 4.4367 KOps/s 4.4003 KOps/s $\color{#35bf28}+0.83\%$
test_vmap_mlp_speed[True-True] 1.0456ms 0.6583ms 1.5191 KOps/s 1.5691 KOps/s $\color{#d91a1a}-3.18\%$
test_vmap_mlp_speed[True-False] 0.8866ms 0.6536ms 1.5300 KOps/s 1.5787 KOps/s $\color{#d91a1a}-3.08\%$
test_vmap_mlp_speed[False-True] 1.4553ms 0.5144ms 1.9439 KOps/s 2.0070 KOps/s $\color{#d91a1a}-3.14\%$
test_vmap_mlp_speed[False-False] 0.7663ms 0.5063ms 1.9749 KOps/s 2.0027 KOps/s $\color{#d91a1a}-1.39\%$
test_vmap_mlp_speed_decorator[True-True] 1.3146ms 0.6401ms 1.5624 KOps/s 1.6231 KOps/s $\color{#d91a1a}-3.74\%$
test_vmap_mlp_speed_decorator[True-False] 0.9058ms 0.6372ms 1.5695 KOps/s 1.6084 KOps/s $\color{#d91a1a}-2.42\%$
test_vmap_mlp_speed_decorator[False-True] 0.6859ms 0.5224ms 1.9141 KOps/s 1.9059 KOps/s $\color{#35bf28}+0.43\%$
test_vmap_mlp_speed_decorator[False-False] 0.8001ms 0.5236ms 1.9099 KOps/s 1.9478 KOps/s $\color{#d91a1a}-1.94\%$
test_to_module_speed[True] 2.1009ms 1.2808ms 780.7726 Ops/s 776.3551 Ops/s $\color{#35bf28}+0.57\%$
test_to_module_speed[False] 2.0342ms 1.2517ms 798.9142 Ops/s 793.6217 Ops/s $\color{#35bf28}+0.67\%$
test_tc_init 0.2995ms 52.7950μs 18.9412 KOps/s 24.2522 KOps/s $\textbf{\color{#d91a1a}-21.90\%}$
test_tc_init_nested 0.3221ms 94.6046μs 10.5703 KOps/s 12.0958 KOps/s $\textbf{\color{#d91a1a}-12.61\%}$
test_tc_first_layer_tensor 19.6060μs 1.5539μs 643.5299 KOps/s 653.2868 KOps/s $\color{#d91a1a}-1.49\%$
test_tc_first_layer_nontensor 0.1016ms 4.7720μs 209.5539 KOps/s 208.3185 KOps/s $\color{#35bf28}+0.59\%$
test_tc_second_layer_tensor 0.1659ms 2.8883μs 346.2190 KOps/s 358.1946 KOps/s $\color{#d91a1a}-3.34\%$
test_tc_second_layer_nontensor 50.1740μs 5.9420μs 168.2932 KOps/s 166.5975 KOps/s $\color{#35bf28}+1.02\%$
test_unbind 0.4807s 13.1526ms 76.0304 Ops/s 75.0380 Ops/s $\color{#35bf28}+1.32\%$
test_full_like 8.4463ms 7.2449ms 138.0279 Ops/s 138.4911 Ops/s $\color{#d91a1a}-0.33\%$
test_zeros_like 3.6726ms 2.9802ms 335.5440 Ops/s 321.6460 Ops/s $\color{#35bf28}+4.32\%$
test_ones_like 3.8754ms 3.3055ms 302.5264 Ops/s 159.2632 Ops/s $\textbf{\color{#35bf28}+89.95\%}$
test_clone 6.9830ms 5.5755ms 179.3566 Ops/s 120.1433 Ops/s $\textbf{\color{#35bf28}+49.29\%}$
test_squeeze 71.7750μs 12.6212μs 79.2315 KOps/s 80.3479 KOps/s $\color{#d91a1a}-1.39\%$
test_unsqueeze 0.3482ms 93.4207μs 10.7043 KOps/s 10.7441 KOps/s $\color{#d91a1a}-0.37\%$
test_split 0.3890ms 0.2017ms 4.9573 KOps/s 5.0136 KOps/s $\color{#d91a1a}-1.12\%$
test_permute 0.3026ms 0.2220ms 4.5036 KOps/s 4.4740 KOps/s $\color{#35bf28}+0.66\%$
test_stack 29.1201ms 26.1285ms 38.2724 Ops/s 37.2066 Ops/s $\color{#35bf28}+2.86\%$
test_cat 32.1400ms 26.5840ms 37.6167 Ops/s 38.0723 Ops/s $\color{#d91a1a}-1.20\%$

Copy link

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 228. Improved: $\large\color{#35bf28}20$. Worsened: $\large\color{#d91a1a}15$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 0.6011ms 13.8308μs 72.3026 KOps/s 66.8500 KOps/s $\textbf{\color{#35bf28}+8.16\%}$
test_plain_set_stack_nested 47.9710μs 14.0341μs 71.2551 KOps/s 66.1518 KOps/s $\textbf{\color{#35bf28}+7.71\%}$
test_plain_set_nested_inplace 52.2710μs 14.9067μs 67.0840 KOps/s 62.0839 KOps/s $\textbf{\color{#35bf28}+8.05\%}$
test_plain_set_stack_nested_inplace 45.3010μs 14.8726μs 67.2379 KOps/s 63.1544 KOps/s $\textbf{\color{#35bf28}+6.47\%}$
test_items 30.3200μs 2.9063μs 344.0746 KOps/s 342.0789 KOps/s $\color{#35bf28}+0.58\%$
test_items_nested 0.3784ms 0.3298ms 3.0319 KOps/s 3.0578 KOps/s $\color{#d91a1a}-0.85\%$
test_items_nested_locked 0.5101ms 0.3311ms 3.0205 KOps/s 3.0481 KOps/s $\color{#d91a1a}-0.90\%$
test_items_nested_leaf 84.4910μs 56.0848μs 17.8302 KOps/s 17.7977 KOps/s $\color{#35bf28}+0.18\%$
test_items_stack_nested 0.3877ms 0.3352ms 2.9837 KOps/s 3.0785 KOps/s $\color{#d91a1a}-3.08\%$
test_items_stack_nested_leaf 82.7720μs 58.2817μs 17.1580 KOps/s 17.4717 KOps/s $\color{#d91a1a}-1.80\%$
test_items_stack_nested_locked 0.4066ms 0.3342ms 2.9922 KOps/s 3.0433 KOps/s $\color{#d91a1a}-1.68\%$
test_keys 25.9200μs 3.4644μs 288.6526 KOps/s 288.9242 KOps/s $\color{#d91a1a}-0.09\%$
test_keys_nested 91.5520μs 58.0816μs 17.2171 KOps/s 17.5110 KOps/s $\color{#d91a1a}-1.68\%$
test_keys_nested_locked 2.2807ms 63.9575μs 15.6354 KOps/s 16.0261 KOps/s $\color{#d91a1a}-2.44\%$
test_keys_nested_leaf 80.6520μs 49.2428μs 20.3075 KOps/s 20.7773 KOps/s $\color{#d91a1a}-2.26\%$
test_keys_stack_nested 90.2620μs 57.7822μs 17.3064 KOps/s 17.5621 KOps/s $\color{#d91a1a}-1.46\%$
test_keys_stack_nested_leaf 84.3320μs 49.4731μs 20.2130 KOps/s 20.4370 KOps/s $\color{#d91a1a}-1.10\%$
test_keys_stack_nested_locked 91.7410μs 62.7451μs 15.9375 KOps/s 16.1824 KOps/s $\color{#d91a1a}-1.51\%$
test_values 5.3468μs 0.8832μs 1.1322 MOps/s 1.1534 MOps/s $\color{#d91a1a}-1.84\%$
test_values_nested 96.2020μs 41.3024μs 24.2117 KOps/s 24.6496 KOps/s $\color{#d91a1a}-1.78\%$
test_values_nested_locked 74.8010μs 42.8217μs 23.3526 KOps/s 23.4504 KOps/s $\color{#d91a1a}-0.42\%$
test_values_nested_leaf 79.6610μs 35.4964μs 28.1719 KOps/s 28.4414 KOps/s $\color{#d91a1a}-0.95\%$
test_values_stack_nested 76.7120μs 42.0405μs 23.7866 KOps/s 24.1615 KOps/s $\color{#d91a1a}-1.55\%$
test_values_stack_nested_leaf 61.9210μs 36.0656μs 27.7272 KOps/s 28.0525 KOps/s $\color{#d91a1a}-1.16\%$
test_values_stack_nested_locked 75.1320μs 43.8582μs 22.8008 KOps/s 23.0919 KOps/s $\color{#d91a1a}-1.26\%$
test_membership 4.5673μs 0.5107μs 1.9581 MOps/s 1.9814 MOps/s $\color{#d91a1a}-1.18\%$
test_membership_nested 25.0605μs 1.9162μs 521.8665 KOps/s 523.7167 KOps/s $\color{#d91a1a}-0.35\%$
test_membership_nested_leaf 18.8855μs 1.9149μs 522.2222 KOps/s 526.7061 KOps/s $\color{#d91a1a}-0.85\%$
test_membership_stacked_nested 40.6410μs 1.9700μs 507.6233 KOps/s 511.9163 KOps/s $\color{#d91a1a}-0.84\%$
test_membership_stacked_nested_leaf 28.7400μs 1.9967μs 500.8264 KOps/s 503.9144 KOps/s $\color{#d91a1a}-0.61\%$
test_membership_nested_last 41.1600μs 2.8068μs 356.2787 KOps/s 352.5149 KOps/s $\color{#35bf28}+1.07\%$
test_membership_nested_leaf_last 35.8000μs 2.8371μs 352.4692 KOps/s 357.7844 KOps/s $\color{#d91a1a}-1.49\%$
test_membership_stacked_nested_last 45.2810μs 5.3067μs 188.4424 KOps/s 269.6665 KOps/s $\textbf{\color{#d91a1a}-30.12\%}$
test_membership_stacked_nested_leaf_last 30.8500μs 5.3076μs 188.4078 KOps/s 269.3840 KOps/s $\textbf{\color{#d91a1a}-30.06\%}$
test_nested_getleaf 33.8910μs 6.1948μs 161.4244 KOps/s 161.8956 KOps/s $\color{#d91a1a}-0.29\%$
test_nested_get 34.8610μs 5.7642μs 173.4832 KOps/s 175.3526 KOps/s $\color{#d91a1a}-1.07\%$
test_stacked_getleaf 34.7910μs 6.0525μs 165.2221 KOps/s 163.9870 KOps/s $\color{#35bf28}+0.75\%$
test_stacked_get 43.8010μs 5.6673μs 176.4507 KOps/s 178.2584 KOps/s $\color{#d91a1a}-1.01\%$
test_nested_getitemleaf 45.5700μs 6.1565μs 162.4309 KOps/s 160.7351 KOps/s $\color{#35bf28}+1.05\%$
test_nested_getitem 33.3310μs 5.8621μs 170.5876 KOps/s 171.6982 KOps/s $\color{#d91a1a}-0.65\%$
test_stacked_getitemleaf 36.7310μs 6.0947μs 164.0767 KOps/s 161.7987 KOps/s $\color{#35bf28}+1.41\%$
test_stacked_getitem 33.1010μs 5.7663μs 173.4208 KOps/s 174.4298 KOps/s $\color{#d91a1a}-0.58\%$
test_lock_nested 3.0154ms 0.4239ms 2.3588 KOps/s 2.3779 KOps/s $\color{#d91a1a}-0.80\%$
test_lock_stack_nested 0.4288ms 0.3812ms 2.6236 KOps/s 2.6266 KOps/s $\color{#d91a1a}-0.12\%$
test_unlock_nested 0.7474ms 0.3590ms 2.7855 KOps/s 2.8116 KOps/s $\color{#d91a1a}-0.93\%$
test_unlock_stack_nested 0.3496ms 0.3181ms 3.1436 KOps/s 3.1411 KOps/s $\color{#35bf28}+0.08\%$
test_flatten_speed 0.1079ms 69.4292μs 14.4032 KOps/s 14.4688 KOps/s $\color{#d91a1a}-0.45\%$
test_unflatten_speed 0.3605ms 0.2841ms 3.5201 KOps/s 3.5490 KOps/s $\color{#d91a1a}-0.82\%$
test_common_ops 1.5203ms 1.2582ms 794.7620 Ops/s 761.1505 Ops/s $\color{#35bf28}+4.42\%$
test_creation 39.2900μs 1.5414μs 648.7661 KOps/s 658.2968 KOps/s $\color{#d91a1a}-1.45\%$
test_creation_empty 71.9020μs 15.4242μs 64.8330 KOps/s 57.6934 KOps/s $\textbf{\color{#35bf28}+12.38\%}$
test_creation_nested_1 48.4410μs 17.1587μs 58.2793 KOps/s 53.6452 KOps/s $\textbf{\color{#35bf28}+8.64\%}$
test_creation_nested_2 1.3810ms 20.3870μs 49.0509 KOps/s 46.5746 KOps/s $\textbf{\color{#35bf28}+5.32\%}$
test_clone 66.4220μs 29.6121μs 33.7700 KOps/s 34.9372 KOps/s $\color{#d91a1a}-3.34\%$
test_getitem[int] 93.3077ms 23.9943μs 41.6766 KOps/s 60.3779 KOps/s $\textbf{\color{#d91a1a}-30.97\%}$
test_getitem[slice_int] 0.1817ms 28.1843μs 35.4807 KOps/s 36.1451 KOps/s $\color{#d91a1a}-1.84\%$
test_getitem[range] 0.2409ms 0.1113ms 8.9819 KOps/s 9.0074 KOps/s $\color{#d91a1a}-0.28\%$
test_getitem[tuple] 0.1164ms 24.0271μs 41.6196 KOps/s 42.1826 KOps/s $\color{#d91a1a}-1.33\%$
test_getitem[list] 0.2481ms 99.2910μs 10.0714 KOps/s 10.0387 KOps/s $\color{#35bf28}+0.33\%$
test_setitem_dim[int] 71.8520μs 45.6828μs 21.8901 KOps/s 21.3095 KOps/s $\color{#35bf28}+2.72\%$
test_setitem_dim[slice_int] 0.1031ms 68.9617μs 14.5008 KOps/s 14.5864 KOps/s $\color{#d91a1a}-0.59\%$
test_setitem_dim[range] 0.1620ms 0.1300ms 7.6904 KOps/s 7.7604 KOps/s $\color{#d91a1a}-0.90\%$
test_setitem_dim[tuple] 88.3010μs 62.2328μs 16.0687 KOps/s 16.4240 KOps/s $\color{#d91a1a}-2.16\%$
test_setitem 85.5110μs 44.7539μs 22.3444 KOps/s 23.4218 KOps/s $\color{#d91a1a}-4.60\%$
test_set 76.3520μs 43.4821μs 22.9979 KOps/s 23.3526 KOps/s $\color{#d91a1a}-1.52\%$
test_set_shared 0.3397ms 51.7519μs 19.3230 KOps/s 19.8273 KOps/s $\color{#d91a1a}-2.54\%$
test_update 0.1095ms 50.7977μs 19.6859 KOps/s 18.6930 KOps/s $\textbf{\color{#35bf28}+5.31\%}$
test_update_nested 0.2026ms 58.6591μs 17.0477 KOps/s 16.9198 KOps/s $\color{#35bf28}+0.76\%$
test_update__nested 0.2111ms 60.5596μs 16.5127 KOps/s 17.1204 KOps/s $\color{#d91a1a}-3.55\%$
test_set_nested 0.1739ms 44.4377μs 22.5034 KOps/s 22.3827 KOps/s $\color{#35bf28}+0.54\%$
test_set_nested_new 84.6710μs 47.8801μs 20.8855 KOps/s 21.0489 KOps/s $\color{#d91a1a}-0.78\%$
test_select 0.1021ms 60.8819μs 16.4252 KOps/s 16.4293 KOps/s $\color{#d91a1a}-0.02\%$
test_select_nested 0.4501ms 43.4305μs 23.0253 KOps/s 23.6350 KOps/s $\color{#d91a1a}-2.58\%$
test_exclude_nested 0.1248ms 59.2919μs 16.8657 KOps/s 16.6970 KOps/s $\color{#35bf28}+1.01\%$
test_empty[True] 0.3673ms 0.2445ms 4.0898 KOps/s 4.1187 KOps/s $\color{#d91a1a}-0.70\%$
test_empty[False] 3.5930μs 0.7423μs 1.3471 MOps/s 1.3647 MOps/s $\color{#d91a1a}-1.29\%$
test_to 62.0210μs 26.0213μs 38.4300 KOps/s 40.2871 KOps/s $\color{#d91a1a}-4.61\%$
test_to_nonblocking 53.0710μs 26.3802μs 37.9071 KOps/s 43.0663 KOps/s $\textbf{\color{#d91a1a}-11.98\%}$
test_unbind_speed 1.7698ms 0.2974ms 3.3630 KOps/s 3.5503 KOps/s $\textbf{\color{#d91a1a}-5.27\%}$
test_unbind_speed_stack0 0.3252ms 0.2763ms 3.6197 KOps/s 3.6389 KOps/s $\color{#d91a1a}-0.53\%$
test_unbind_speed_stack1 94.1723ms 0.7041ms 1.4203 KOps/s 1.4039 KOps/s $\color{#35bf28}+1.17\%$
test_split 96.7237ms 2.2542ms 443.6112 Ops/s 465.8515 Ops/s $\color{#d91a1a}-4.77\%$
test_chunk 97.4661ms 2.2430ms 445.8298 Ops/s 462.0457 Ops/s $\color{#d91a1a}-3.51\%$
test_creation[device0] 0.3409ms 0.1273ms 7.8574 KOps/s 7.9188 KOps/s $\color{#d91a1a}-0.77\%$
test_creation_from_tensor 0.3826ms 0.1337ms 7.4779 KOps/s 7.5516 KOps/s $\color{#d91a1a}-0.98\%$
test_add_one[memmap_tensor0] 0.2327ms 9.8677μs 101.3407 KOps/s 111.2682 KOps/s $\textbf{\color{#d91a1a}-8.92\%}$
test_contiguous[memmap_tensor0] 23.2100μs 2.2040μs 453.7240 KOps/s 455.4151 KOps/s $\color{#d91a1a}-0.37\%$
test_stack[memmap_tensor0] 34.7610μs 7.0582μs 141.6783 KOps/s 144.7248 KOps/s $\color{#d91a1a}-2.10\%$
test_memmaptd_index 1.3276ms 0.4384ms 2.2809 KOps/s 2.3274 KOps/s $\color{#d91a1a}-2.00\%$
test_memmaptd_index_astensor 0.7465ms 0.4927ms 2.0296 KOps/s 2.0587 KOps/s $\color{#d91a1a}-1.41\%$
test_memmaptd_index_op 1.4481ms 1.0527ms 949.9037 Ops/s 950.0023 Ops/s $\color{#d91a1a}-0.01\%$
test_serialize_model 0.1311s 0.1294s 7.7299 Ops/s 7.7315 Ops/s $\color{#d91a1a}-0.02\%$
test_serialize_model_pickle 1.3481s 1.2118s 0.8252 Ops/s 0.8231 Ops/s $\color{#35bf28}+0.25\%$
test_serialize_weights 0.1297s 0.1283s 7.7959 Ops/s 7.7626 Ops/s $\color{#35bf28}+0.43\%$
test_serialize_weights_returnearly 0.2206s 64.3938ms 15.5294 Ops/s 16.0887 Ops/s $\color{#d91a1a}-3.48\%$
test_serialize_weights_pickle 1.3737s 1.2170s 0.8217 Ops/s 0.8213 Ops/s $\color{#35bf28}+0.05\%$
test_reshape_pytree 69.7510μs 37.1764μs 26.8988 KOps/s 27.2029 KOps/s $\color{#d91a1a}-1.12\%$
test_reshape_td 0.1616ms 43.0762μs 23.2147 KOps/s 23.6890 KOps/s $\color{#d91a1a}-2.00\%$
test_view_pytree 82.2220μs 36.4968μs 27.3996 KOps/s 27.8826 KOps/s $\color{#d91a1a}-1.73\%$
test_view_td 87.4820μs 47.2080μs 21.1828 KOps/s 21.1943 KOps/s $\color{#d91a1a}-0.05\%$
test_unbind_pytree 61.3310μs 35.3626μs 28.2785 KOps/s 28.7127 KOps/s $\color{#d91a1a}-1.51\%$
test_unbind_td 0.5218ms 43.1619μs 23.1686 KOps/s 23.1658 KOps/s $\color{#35bf28}+0.01\%$
test_split_pytree 76.1210μs 48.2659μs 20.7186 KOps/s 21.0860 KOps/s $\color{#d91a1a}-1.74\%$
test_split_td 0.6473ms 59.1935μs 16.8937 KOps/s 17.6866 KOps/s $\color{#d91a1a}-4.48\%$
test_add_pytree 0.2049ms 60.6455μs 16.4893 KOps/s 17.7031 KOps/s $\textbf{\color{#d91a1a}-6.86\%}$
test_add_td 0.2411ms 97.2584μs 10.2819 KOps/s 10.7858 KOps/s $\color{#d91a1a}-4.67\%$
test_compile_add_one_nested[tensordict-compile] 0.4148ms 0.2138ms 4.6762 KOps/s 4.6556 KOps/s $\color{#35bf28}+0.44\%$
test_compile_add_one_nested[tensordict-eager] 0.2986ms 0.1507ms 6.6339 KOps/s 6.5397 KOps/s $\color{#35bf28}+1.44\%$
test_compile_add_one_nested[pytree-compile] 0.1977ms 0.1546ms 6.4673 KOps/s 6.8193 KOps/s $\textbf{\color{#d91a1a}-5.16\%}$
test_compile_add_one_nested[pytree-eager] 0.2846ms 0.2043ms 4.8953 KOps/s 5.5116 KOps/s $\textbf{\color{#d91a1a}-11.18\%}$
test_compile_copy_nested[tensordict-compile] 68.0110μs 20.9033μs 47.8393 KOps/s 45.0902 KOps/s $\textbf{\color{#35bf28}+6.10\%}$
test_compile_copy_nested[tensordict-eager] 84.6220μs 44.4552μs 22.4945 KOps/s 22.6616 KOps/s $\color{#d91a1a}-0.74\%$
test_compile_copy_nested[pytree-compile] 0.1175ms 65.2214μs 15.3324 KOps/s 15.7068 KOps/s $\color{#d91a1a}-2.38\%$
test_compile_copy_nested[pytree-eager] 90.7720μs 49.6231μs 20.1519 KOps/s 20.0575 KOps/s $\color{#35bf28}+0.47\%$
test_compile_add_one_flat[tensordict-compile] 0.4328ms 0.3223ms 3.1032 KOps/s 3.0978 KOps/s $\color{#35bf28}+0.17\%$
test_compile_add_one_flat[tensordict-eager] 0.5906ms 0.2094ms 4.7748 KOps/s 4.8104 KOps/s $\color{#d91a1a}-0.74\%$
test_compile_add_one_flat[tensorclass-compile] 0.2785ms 0.1298ms 7.7027 KOps/s 7.7021 KOps/s $+0.01\%$
test_compile_add_one_flat[tensorclass-eager] 0.2440ms 61.7355μs 16.1981 KOps/s 16.5967 KOps/s $\color{#d91a1a}-2.40\%$
test_compile_add_one_flat[pytree-compile] 0.4492ms 0.3211ms 3.1140 KOps/s 3.0978 KOps/s $\color{#35bf28}+0.52\%$
test_compile_add_one_flat[pytree-eager] 0.9017ms 0.6765ms 1.4781 KOps/s 1.6232 KOps/s $\textbf{\color{#d91a1a}-8.94\%}$
test_compile_add_self_flat[tensordict-eager] 0.3903ms 0.2494ms 4.0104 KOps/s 4.0123 KOps/s $\color{#d91a1a}-0.05\%$
test_compile_add_self_flat[tensordict-compile] 0.4692ms 0.3230ms 3.0961 KOps/s 3.0762 KOps/s $\color{#35bf28}+0.65\%$
test_compile_add_self_flat[tensorclass-eager] 0.4765ms 72.9181μs 13.7140 KOps/s 13.5597 KOps/s $\color{#35bf28}+1.14\%$
test_compile_add_self_flat[tensorclass-compile] 0.5581ms 0.1365ms 7.3241 KOps/s 7.6072 KOps/s $\color{#d91a1a}-3.72\%$
test_compile_add_self_flat[pytree-eager] 0.9360ms 0.5473ms 1.8271 KOps/s 1.8809 KOps/s $\color{#d91a1a}-2.86\%$
test_compile_add_self_flat[pytree-compile] 0.4697ms 0.3235ms 3.0915 KOps/s 3.0892 KOps/s $\color{#35bf28}+0.08\%$
test_compile_copy_flat[tensordict-compile] 57.7910μs 18.7646μs 53.2918 KOps/s 56.4568 KOps/s $\textbf{\color{#d91a1a}-5.61\%}$
test_compile_copy_flat[tensordict-eager] 69.1820μs 26.9468μs 37.1102 KOps/s 36.4072 KOps/s $\color{#35bf28}+1.93\%$
test_compile_copy_flat[pytree-compile] 0.1063ms 69.2121μs 14.4483 KOps/s 14.3336 KOps/s $\color{#35bf28}+0.80\%$
test_compile_copy_flat[pytree-eager] 81.6320μs 51.0908μs 19.5730 KOps/s 19.4631 KOps/s $\color{#35bf28}+0.56\%$
test_compile_assign_and_add[tensordict-compile] 2.3416ms 0.8204ms 1.2189 KOps/s 1.1295 KOps/s $\textbf{\color{#35bf28}+7.92\%}$
test_compile_assign_and_add[tensordict-eager] 3.3832ms 3.2032ms 312.1835 Ops/s 314.9462 Ops/s $\color{#d91a1a}-0.88\%$
test_compile_assign_and_add[pytree-compile] 2.3014ms 0.8060ms 1.2407 KOps/s 1.1420 KOps/s $\textbf{\color{#35bf28}+8.65\%}$
test_compile_assign_and_add[pytree-eager] 3.4686ms 3.2579ms 306.9438 Ops/s 311.7310 Ops/s $\color{#d91a1a}-1.54\%$
test_compile_indexing[tensor-tensordict-compile] 0.1612ms 0.1127ms 8.8737 KOps/s 9.0106 KOps/s $\color{#d91a1a}-1.52\%$
test_compile_indexing[tensor-tensordict-eager] 0.1889ms 61.6349μs 16.2246 KOps/s 15.5886 KOps/s $\color{#35bf28}+4.08\%$
test_compile_indexing[tensor-tensorclass-compile] 0.2488ms 0.1054ms 9.4883 KOps/s 9.1436 KOps/s $\color{#35bf28}+3.77\%$
test_compile_indexing[tensor-tensorclass-eager] 93.9620μs 44.3301μs 22.5580 KOps/s 21.7480 KOps/s $\color{#35bf28}+3.72\%$
test_compile_indexing[tensor-pytree-compile] 0.2571ms 0.1096ms 9.1261 KOps/s 9.0749 KOps/s $\color{#35bf28}+0.56\%$
test_compile_indexing[tensor-pytree-eager] 78.9620μs 43.8334μs 22.8136 KOps/s 21.4394 KOps/s $\textbf{\color{#35bf28}+6.41\%}$
test_compile_indexing[slice-tensordict-compile] 0.2912ms 0.1403ms 7.1285 KOps/s 7.2080 KOps/s $\color{#d91a1a}-1.10\%$
test_compile_indexing[slice-tensordict-eager] 0.1581ms 26.2214μs 38.1368 KOps/s 38.5504 KOps/s $\color{#d91a1a}-1.07\%$
test_compile_indexing[slice-tensorclass-compile] 0.1778ms 0.1331ms 7.5107 KOps/s 7.5282 KOps/s $\color{#d91a1a}-0.23\%$
test_compile_indexing[slice-tensorclass-eager] 59.6210μs 21.4153μs 46.6957 KOps/s 48.4160 KOps/s $\color{#d91a1a}-3.55\%$
test_compile_indexing[slice-pytree-compile] 0.2633ms 0.1335ms 7.4921 KOps/s 7.3643 KOps/s $\color{#35bf28}+1.74\%$
test_compile_indexing[slice-pytree-eager] 55.2810μs 21.5304μs 46.4459 KOps/s 47.0998 KOps/s $\color{#d91a1a}-1.39\%$
test_compile_indexing[int-tensordict-compile] 0.2799ms 0.1409ms 7.0984 KOps/s 6.8085 KOps/s $\color{#35bf28}+4.26\%$
test_compile_indexing[int-tensordict-eager] 0.5081ms 25.6873μs 38.9298 KOps/s 37.9182 KOps/s $\color{#35bf28}+2.67\%$
test_compile_indexing[int-tensorclass-compile] 0.2848ms 0.1346ms 7.4305 KOps/s 7.2342 KOps/s $\color{#35bf28}+2.71\%$
test_compile_indexing[int-tensorclass-eager] 0.1941ms 22.2432μs 44.9576 KOps/s 48.8155 KOps/s $\textbf{\color{#d91a1a}-7.90\%}$
test_compile_indexing[int-pytree-compile] 0.1796ms 0.1351ms 7.4042 KOps/s 7.4600 KOps/s $\color{#d91a1a}-0.75\%$
test_compile_indexing[int-pytree-eager] 59.0810μs 22.4547μs 44.5342 KOps/s 48.9108 KOps/s $\textbf{\color{#d91a1a}-8.95\%}$
test_mod_add[eager] 70.5210μs 32.9043μs 30.3911 KOps/s 30.5484 KOps/s $\color{#d91a1a}-0.51\%$
test_mod_add[compile] 0.3747ms 72.2202μs 13.8465 KOps/s 14.1149 KOps/s $\color{#d91a1a}-1.90\%$
test_mod_add[compile-overhead] 0.2628ms 0.1375ms 7.2702 KOps/s 6.5803 KOps/s $\textbf{\color{#35bf28}+10.49\%}$
test_mod_wrap[eager] 0.4070ms 0.2456ms 4.0723 KOps/s 4.0561 KOps/s $\color{#35bf28}+0.40\%$
test_mod_wrap[compile] 0.8695ms 0.2970ms 3.3672 KOps/s 3.3427 KOps/s $\color{#35bf28}+0.73\%$
test_mod_wrap[compile-overhead] 7.4717ms 4.0630ms 246.1245 Ops/s 248.0468 Ops/s $\color{#d91a1a}-0.77\%$
test_mod_wrap_and_backward[eager] 1.6141ms 1.4520ms 688.6942 Ops/s 689.0922 Ops/s $\color{#d91a1a}-0.06\%$
test_mod_wrap_and_backward[compile] 1.7540ms 1.4411ms 693.9028 Ops/s 699.1749 Ops/s $\color{#d91a1a}-0.75\%$
test_mod_wrap_and_backward[compile-overhead] 1.6645ms 1.0456ms 956.3751 Ops/s 981.1245 Ops/s $\color{#d91a1a}-2.52\%$
test_seq_add[eager] 0.2476ms 98.2997μs 10.1730 KOps/s 10.0829 KOps/s $\color{#35bf28}+0.89\%$
test_seq_add[compile] 0.2221ms 82.7330μs 12.0871 KOps/s 12.2765 KOps/s $\color{#d91a1a}-1.54\%$
test_seq_add[compile-overhead] 0.1594ms 0.1159ms 8.6296 KOps/s 8.5799 KOps/s $\color{#35bf28}+0.58\%$
test_seq_wrap[eager] 0.5509ms 0.3955ms 2.5287 KOps/s 2.5665 KOps/s $\color{#d91a1a}-1.47\%$
test_seq_wrap[compile] 0.4181ms 0.3239ms 3.0875 KOps/s 3.1566 KOps/s $\color{#d91a1a}-2.19\%$
test_seq_wrap[compile-overhead] 0.3693ms 0.2238ms 4.4689 KOps/s 4.5033 KOps/s $\color{#d91a1a}-0.76\%$
test_func_call_runtime[False-eager] 0.8636ms 0.7473ms 1.3382 KOps/s 1.3276 KOps/s $\color{#35bf28}+0.80\%$
test_func_call_runtime[False-compile] 0.8905ms 0.8028ms 1.2456 KOps/s 1.2636 KOps/s $\color{#d91a1a}-1.42\%$
test_func_call_runtime[False-compile-overhead] 0.4295ms 0.3650ms 2.7396 KOps/s 2.7526 KOps/s $\color{#d91a1a}-0.47\%$
test_func_call_runtime[True-eager] 1.0574ms 0.9200ms 1.0870 KOps/s 1.1054 KOps/s $\color{#d91a1a}-1.67\%$
test_func_call_runtime[True-compile] 1.0381ms 0.8449ms 1.1836 KOps/s 1.2053 KOps/s $\color{#d91a1a}-1.80\%$
test_func_call_runtime[True-compile-overhead] 0.4772ms 0.4027ms 2.4835 KOps/s 2.5127 KOps/s $\color{#d91a1a}-1.16\%$
test_func_call_cm_runtime[False-eager] 0.8229ms 0.7418ms 1.3481 KOps/s 1.2712 KOps/s $\textbf{\color{#35bf28}+6.05\%}$
test_func_call_cm_runtime[False-compile] 1.1151ms 0.8046ms 1.2428 KOps/s 1.2577 KOps/s $\color{#d91a1a}-1.19\%$
test_func_call_cm_runtime[False-compile-overhead] 0.4818ms 0.3679ms 2.7185 KOps/s 2.7549 KOps/s $\color{#d91a1a}-1.32\%$
test_func_call_cm_runtime[True-eager] 1.1552ms 1.0082ms 991.8465 Ops/s 987.9953 Ops/s $\color{#35bf28}+0.39\%$
test_func_call_cm_runtime[True-compile] 1.0247ms 0.8687ms 1.1511 KOps/s 1.1705 KOps/s $\color{#d91a1a}-1.66\%$
test_func_call_cm_runtime[True-compile-overhead] 0.5545ms 0.4254ms 2.3506 KOps/s 2.3663 KOps/s $\color{#d91a1a}-0.67\%$
test_vmap_func_call_cm_runtime[eager] 2.5585ms 2.1027ms 475.5772 Ops/s 478.8541 Ops/s $\color{#d91a1a}-0.68\%$
test_vmap_func_call_cm_runtime[compile] 1.0714ms 0.8879ms 1.1262 KOps/s 1.1293 KOps/s $\color{#d91a1a}-0.27\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.4738ms 0.4300ms 2.3256 KOps/s 2.3304 KOps/s $\color{#d91a1a}-0.21\%$
test_distributed 3.5515ms 0.2154ms 4.6433 KOps/s 8.2768 KOps/s $\textbf{\color{#d91a1a}-43.90\%}$
test_tdmodule 52.1210μs 15.2162μs 65.7196 KOps/s 59.5100 KOps/s $\textbf{\color{#35bf28}+10.43\%}$
test_tdmodule_dispatch 51.4510μs 29.2003μs 34.2462 KOps/s 31.3541 KOps/s $\textbf{\color{#35bf28}+9.22\%}$
test_tdseq 24.4200μs 15.7007μs 63.6916 KOps/s 59.1204 KOps/s $\textbf{\color{#35bf28}+7.73\%}$
test_tdseq_dispatch 54.8810μs 31.4402μs 31.8064 KOps/s 28.9895 KOps/s $\textbf{\color{#35bf28}+9.72\%}$
test_instantiation_functorch 1.9888ms 1.8889ms 529.4212 Ops/s 527.4011 Ops/s $\color{#35bf28}+0.38\%$
test_instantiation_td 1.8058ms 1.2044ms 830.2943 Ops/s 824.7579 Ops/s $\color{#35bf28}+0.67\%$
test_exec_functorch 0.2410ms 0.2132ms 4.6898 KOps/s 4.6567 KOps/s $\color{#35bf28}+0.71\%$
test_exec_functional_call 0.2830ms 0.2128ms 4.6984 KOps/s 4.7788 KOps/s $\color{#d91a1a}-1.68\%$
test_exec_td 0.2718ms 0.2202ms 4.5419 KOps/s 4.6411 KOps/s $\color{#d91a1a}-2.14\%$
test_exec_td_decorator 1.0285ms 0.2598ms 3.8498 KOps/s 3.8711 KOps/s $\color{#d91a1a}-0.55\%$
test_vmap_mlp_speed[True-True] 0.8197ms 0.6866ms 1.4565 KOps/s 1.4387 KOps/s $\color{#35bf28}+1.24\%$
test_vmap_mlp_speed[True-False] 0.8615ms 0.6958ms 1.4371 KOps/s 1.4534 KOps/s $\color{#d91a1a}-1.12\%$
test_vmap_mlp_speed[False-True] 0.7456ms 0.6058ms 1.6507 KOps/s 1.7342 KOps/s $\color{#d91a1a}-4.82\%$
test_vmap_mlp_speed[False-False] 0.7137ms 0.5933ms 1.6854 KOps/s 1.7308 KOps/s $\color{#d91a1a}-2.63\%$
test_vmap_mlp_speed_decorator[True-True] 0.7988ms 0.6754ms 1.4806 KOps/s 1.4752 KOps/s $\color{#35bf28}+0.37\%$
test_vmap_mlp_speed_decorator[True-False] 1.1112ms 0.6726ms 1.4869 KOps/s 1.4715 KOps/s $\color{#35bf28}+1.04\%$
test_vmap_mlp_speed_decorator[False-True] 0.7674ms 0.5979ms 1.6724 KOps/s 1.6899 KOps/s $\color{#d91a1a}-1.04\%$
test_vmap_mlp_speed_decorator[False-False] 0.7207ms 0.6040ms 1.6556 KOps/s 1.6943 KOps/s $\color{#d91a1a}-2.28\%$
test_vmap_transformer_speed[True-True] 8.6012ms 8.4465ms 118.3928 Ops/s 117.9423 Ops/s $\color{#35bf28}+0.38\%$
test_vmap_transformer_speed[True-False] 8.7240ms 8.4323ms 118.5911 Ops/s 118.0640 Ops/s $\color{#35bf28}+0.45\%$
test_vmap_transformer_speed[False-True] 8.3257ms 8.2074ms 121.8419 Ops/s 120.3886 Ops/s $\color{#35bf28}+1.21\%$
test_vmap_transformer_speed[False-False] 8.5697ms 8.2636ms 121.0130 Ops/s 121.4458 Ops/s $\color{#d91a1a}-0.36\%$
test_vmap_transformer_speed_decorator[True-True] 20.4734ms 19.8176ms 50.4603 Ops/s 50.5923 Ops/s $\color{#d91a1a}-0.26\%$
test_vmap_transformer_speed_decorator[True-False] 19.7923ms 19.7156ms 50.7212 Ops/s 50.7216 Ops/s $-0.00\%$
test_vmap_transformer_speed_decorator[False-True] 19.7281ms 19.5747ms 51.0863 Ops/s 51.2233 Ops/s $\color{#d91a1a}-0.27\%$
test_vmap_transformer_speed_decorator[False-False] 20.3832ms 19.6111ms 50.9914 Ops/s 51.0252 Ops/s $\color{#d91a1a}-0.07\%$
test_to_module_speed[True] 2.0264ms 0.9530ms 1.0493 KOps/s 1.0417 KOps/s $\color{#35bf28}+0.72\%$
test_to_module_speed[False] 1.0449ms 0.9362ms 1.0681 KOps/s 1.0753 KOps/s $\color{#d91a1a}-0.66\%$
test_tc_init 63.4510μs 34.4191μs 29.0536 KOps/s 26.8758 KOps/s $\textbf{\color{#35bf28}+8.10\%}$
test_tc_init_nested 0.1112ms 68.6681μs 14.5628 KOps/s 13.4346 KOps/s $\textbf{\color{#35bf28}+8.40\%}$
test_tc_first_layer_tensor 3.7043μs 0.6898μs 1.4498 MOps/s 1.4466 MOps/s $\color{#35bf28}+0.22\%$
test_tc_first_layer_nontensor 20.3300μs 2.2984μs 435.0936 KOps/s 438.2595 KOps/s $\color{#d91a1a}-0.72\%$
test_tc_second_layer_tensor 30.2180μs 1.4068μs 710.8160 KOps/s 710.1403 KOps/s $\color{#35bf28}+0.10\%$
test_tc_second_layer_nontensor 31.9610μs 2.9607μs 337.7533 KOps/s 332.1934 KOps/s $\color{#35bf28}+1.67\%$
test_unbind 0.1973s 12.4247ms 80.4849 Ops/s 90.2867 Ops/s $\textbf{\color{#d91a1a}-10.86\%}$
test_full_like 0.7253ms 0.5725ms 1.7468 KOps/s 1.7382 KOps/s $\color{#35bf28}+0.49\%$
test_zeros_like 0.2860ms 0.1978ms 5.0563 KOps/s 5.0479 KOps/s $\color{#35bf28}+0.17\%$
test_ones_like 0.2337ms 0.1976ms 5.0604 KOps/s 5.0567 KOps/s $\color{#35bf28}+0.07\%$
test_clone 0.5645ms 0.4142ms 2.4140 KOps/s 2.4095 KOps/s $\color{#35bf28}+0.19\%$
test_squeeze 0.1394ms 9.9153μs 100.8546 KOps/s 101.4435 KOps/s $\color{#d91a1a}-0.58\%$
test_unsqueeze 0.2243ms 74.8127μs 13.3667 KOps/s 13.5164 KOps/s $\color{#d91a1a}-1.11\%$
test_split 0.4250ms 0.1591ms 6.2845 KOps/s 6.4024 KOps/s $\color{#d91a1a}-1.84\%$
test_permute 0.2688ms 0.1846ms 5.4157 KOps/s 5.6204 KOps/s $\color{#d91a1a}-3.64\%$
test_stack 1.2759ms 0.8576ms 1.1660 KOps/s 1.1773 KOps/s $\color{#d91a1a}-0.96\%$
test_cat 1.2700ms 1.2317ms 811.9148 Ops/s 811.7354 Ops/s $\color{#35bf28}+0.02\%$

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants