Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BugFix] Softly revert get changes #950

Merged
merged 1 commit into from
Aug 5, 2024
Merged

Conversation

vmoens
Copy link
Contributor

@vmoens vmoens commented Aug 5, 2024

Description

Describe your changes in detail.

Motivation and Context

Why is this change required? What problem does it solve?
If it fixes an open issue, please link to the issue here.
You can use the syntax close #15213 if this solves the issue #15213

  • I have raised an issue to propose this change (required for new features and bug fixes)

Types of changes

What types of changes does your code introduce? Remove all that do not apply:

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds core functionality)
  • Breaking change (fix or feature that would cause existing functionality to change)
  • Documentation (update in the documentation)
  • Example (update in the folder of examples)

Checklist

Go over all the following points, and put an x in all the boxes that apply.
If you are unsure about any of these, don't hesitate to ask. We are here to help!

  • I have read the CONTRIBUTION guide (required)
  • My change requires a change to the documentation.
  • I have updated the tests accordingly (required for a bug fix or a new feature).
  • I have updated the documentation accordingly.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Aug 5, 2024
Copy link

github-actions bot commented Aug 5, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 219. Improved: $\large\color{#35bf28}24$. Worsened: $\large\color{#d91a1a}24$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 54.2620μs 22.2430μs 44.9580 KOps/s 49.6402 KOps/s $\textbf{\color{#d91a1a}-9.43\%}$
test_plain_set_stack_nested 0.1458ms 22.9661μs 43.5424 KOps/s 49.1414 KOps/s $\textbf{\color{#d91a1a}-11.39\%}$
test_plain_set_nested_inplace 0.1156ms 24.2887μs 41.1714 KOps/s 44.5578 KOps/s $\textbf{\color{#d91a1a}-7.60\%}$
test_plain_set_stack_nested_inplace 59.8220μs 24.3796μs 41.0180 KOps/s 45.0048 KOps/s $\textbf{\color{#d91a1a}-8.86\%}$
test_items 29.6060μs 2.6544μs 376.7313 KOps/s 377.6866 KOps/s $\color{#d91a1a}-0.25\%$
test_items_nested 0.6188ms 0.3384ms 2.9555 KOps/s 2.9813 KOps/s $\color{#d91a1a}-0.87\%$
test_items_nested_locked 2.8456ms 0.3384ms 2.9548 KOps/s 2.9734 KOps/s $\color{#d91a1a}-0.63\%$
test_items_nested_leaf 0.1704ms 86.6794μs 11.5368 KOps/s 11.9114 KOps/s $\color{#d91a1a}-3.14\%$
test_items_stack_nested 0.6599ms 0.3367ms 2.9702 KOps/s 2.9832 KOps/s $\color{#d91a1a}-0.44\%$
test_items_stack_nested_leaf 0.1691ms 87.7462μs 11.3965 KOps/s 12.2899 KOps/s $\textbf{\color{#d91a1a}-7.27\%}$
test_items_stack_nested_locked 0.4274ms 0.3369ms 2.9679 KOps/s 2.9743 KOps/s $\color{#d91a1a}-0.22\%$
test_keys 34.1740μs 3.9819μs 251.1374 KOps/s 257.5556 KOps/s $\color{#d91a1a}-2.49\%$
test_keys_nested 0.3843ms 0.1485ms 6.7347 KOps/s 7.0358 KOps/s $\color{#d91a1a}-4.28\%$
test_keys_nested_locked 0.7614ms 0.1496ms 6.6846 KOps/s 6.7509 KOps/s $\color{#d91a1a}-0.98\%$
test_keys_nested_leaf 0.3823ms 0.1265ms 7.9036 KOps/s 8.2417 KOps/s $\color{#d91a1a}-4.10\%$
test_keys_stack_nested 0.2421ms 0.1434ms 6.9718 KOps/s 7.1047 KOps/s $\color{#d91a1a}-1.87\%$
test_keys_stack_nested_leaf 0.2421ms 0.1221ms 8.1931 KOps/s 8.2569 KOps/s $\color{#d91a1a}-0.77\%$
test_keys_stack_nested_locked 0.3057ms 0.1472ms 6.7956 KOps/s 6.8347 KOps/s $\color{#d91a1a}-0.57\%$
test_values 28.8915μs 1.1939μs 837.6168 KOps/s 818.7262 KOps/s $\color{#35bf28}+2.31\%$
test_values_nested 0.2236ms 50.0388μs 19.9845 KOps/s 19.8245 KOps/s $\color{#35bf28}+0.81\%$
test_values_nested_locked 0.1021ms 49.7042μs 20.1190 KOps/s 19.6680 KOps/s $\color{#35bf28}+2.29\%$
test_values_nested_leaf 83.9070μs 44.8037μs 22.3196 KOps/s 22.0196 KOps/s $\color{#35bf28}+1.36\%$
test_values_stack_nested 0.1227ms 50.5051μs 19.8000 KOps/s 18.7741 KOps/s $\textbf{\color{#35bf28}+5.46\%}$
test_values_stack_nested_leaf 0.1269ms 44.5803μs 22.4314 KOps/s 22.0439 KOps/s $\color{#35bf28}+1.76\%$
test_values_stack_nested_locked 0.1276ms 50.6037μs 19.7614 KOps/s 19.3764 KOps/s $\color{#35bf28}+1.99\%$
test_membership 16.5810μs 0.8969μs 1.1150 MOps/s 1.1112 MOps/s $\color{#35bf28}+0.34\%$
test_membership_nested 24.5060μs 2.6141μs 382.5442 KOps/s 381.5341 KOps/s $\color{#35bf28}+0.26\%$
test_membership_nested_leaf 37.1200μs 2.6211μs 381.5189 KOps/s 366.1926 KOps/s $\color{#35bf28}+4.19\%$
test_membership_stacked_nested 24.3450μs 2.6200μs 381.6792 KOps/s 374.6617 KOps/s $\color{#35bf28}+1.87\%$
test_membership_stacked_nested_leaf 52.4760μs 2.6337μs 379.6868 KOps/s 376.6879 KOps/s $\color{#35bf28}+0.80\%$
test_membership_nested_last 31.6290μs 3.8735μs 258.1664 KOps/s 253.7737 KOps/s $\color{#35bf28}+1.73\%$
test_membership_nested_leaf_last 61.7780μs 3.8427μs 260.2349 KOps/s 252.5206 KOps/s $\color{#35bf28}+3.05\%$
test_membership_stacked_nested_last 38.6020μs 3.8622μs 258.9211 KOps/s 77.7265 KOps/s $\textbf{\color{#35bf28}+233.12\%}$
test_membership_stacked_nested_leaf_last 28.0720μs 3.8392μs 260.4681 KOps/s 77.5213 KOps/s $\textbf{\color{#35bf28}+236.00\%}$
test_nested_getleaf 78.9970μs 10.5688μs 94.6185 KOps/s 94.0290 KOps/s $\color{#35bf28}+0.63\%$
test_nested_get 37.2900μs 10.0842μs 99.1650 KOps/s 102.1309 KOps/s $\color{#d91a1a}-2.90\%$
test_stacked_getleaf 34.7750μs 10.5103μs 95.1445 KOps/s 96.7563 KOps/s $\color{#d91a1a}-1.67\%$
test_stacked_get 0.2593ms 10.5553μs 94.7392 KOps/s 103.1889 KOps/s $\textbf{\color{#d91a1a}-8.19\%}$
test_nested_getitemleaf 54.3810μs 11.0780μs 90.2687 KOps/s 91.5389 KOps/s $\color{#d91a1a}-1.39\%$
test_nested_getitem 38.9630μs 10.1397μs 98.6224 KOps/s 100.6004 KOps/s $\color{#d91a1a}-1.97\%$
test_stacked_getitemleaf 61.2550μs 10.8357μs 92.2874 KOps/s 92.9886 KOps/s $\color{#d91a1a}-0.75\%$
test_stacked_getitem 34.2640μs 10.0694μs 99.3109 KOps/s 101.7217 KOps/s $\color{#d91a1a}-2.37\%$
test_lock_nested 92.1199ms 0.6063ms 1.6494 KOps/s 1.9728 KOps/s $\textbf{\color{#d91a1a}-16.39\%}$
test_lock_stack_nested 0.8608ms 0.4671ms 2.1409 KOps/s 2.2211 KOps/s $\color{#d91a1a}-3.61\%$
test_unlock_nested 89.8652ms 0.5125ms 1.9512 KOps/s 2.3590 KOps/s $\textbf{\color{#d91a1a}-17.29\%}$
test_unlock_stack_nested 0.5672ms 0.3817ms 2.6196 KOps/s 2.7211 KOps/s $\color{#d91a1a}-3.73\%$
test_flatten_speed 0.5263ms 0.1061ms 9.4252 KOps/s 9.7582 KOps/s $\color{#d91a1a}-3.41\%$
test_unflatten_speed 0.6739ms 0.4600ms 2.1738 KOps/s 2.1900 KOps/s $\color{#d91a1a}-0.74\%$
test_common_ops 1.7532ms 1.1125ms 898.9014 Ops/s 929.9893 Ops/s $\color{#d91a1a}-3.34\%$
test_creation 21.1290μs 2.0788μs 481.0551 KOps/s 485.2517 KOps/s $\color{#d91a1a}-0.86\%$
test_creation_empty 85.6780μs 18.9661μs 52.7256 KOps/s 61.2964 KOps/s $\textbf{\color{#d91a1a}-13.98\%}$
test_creation_nested_1 55.2930μs 22.2072μs 45.0304 KOps/s 49.0192 KOps/s $\textbf{\color{#d91a1a}-8.14\%}$
test_creation_nested_2 67.4460μs 26.2509μs 38.0940 KOps/s 39.6791 KOps/s $\color{#d91a1a}-3.99\%$
test_clone 62.5770μs 16.4717μs 60.7104 KOps/s 59.6678 KOps/s $\color{#35bf28}+1.75\%$
test_getitem[int] 1.4798ms 16.7292μs 59.7756 KOps/s 57.5865 KOps/s $\color{#35bf28}+3.80\%$
test_getitem[slice_int] 0.1259ms 31.2639μs 31.9857 KOps/s 31.3306 KOps/s $\color{#35bf28}+2.09\%$
test_getitem[range] 0.1651ms 56.3077μs 17.7595 KOps/s 17.3350 KOps/s $\color{#35bf28}+2.45\%$
test_getitem[tuple] 0.1426ms 25.4553μs 39.2845 KOps/s 38.3090 KOps/s $\color{#35bf28}+2.55\%$
test_getitem[list] 0.2221ms 51.4906μs 19.4210 KOps/s 18.5535 KOps/s $\color{#35bf28}+4.68\%$
test_setitem_dim[int] 65.2220μs 42.6378μs 23.4534 KOps/s 23.4878 KOps/s $\color{#d91a1a}-0.15\%$
test_setitem_dim[slice_int] 0.1142ms 73.4676μs 13.6114 KOps/s 13.8513 KOps/s $\color{#d91a1a}-1.73\%$
test_setitem_dim[range] 0.1712ms 95.6585μs 10.4539 KOps/s 10.8255 KOps/s $\color{#d91a1a}-3.43\%$
test_setitem_dim[tuple] 0.1200ms 60.1630μs 16.6215 KOps/s 17.3979 KOps/s $\color{#d91a1a}-4.46\%$
test_setitem 0.1386ms 30.2686μs 33.0375 KOps/s 36.0688 KOps/s $\textbf{\color{#d91a1a}-8.40\%}$
test_set 0.1140ms 29.2293μs 34.2122 KOps/s 36.6911 KOps/s $\textbf{\color{#d91a1a}-6.76\%}$
test_set_shared 4.1298ms 0.2165ms 4.6190 KOps/s 4.4940 KOps/s $\color{#35bf28}+2.78\%$
test_update 0.1410ms 36.5816μs 27.3361 KOps/s 29.6886 KOps/s $\textbf{\color{#d91a1a}-7.92\%}$
test_update_nested 0.1320ms 45.9571μs 21.7594 KOps/s 22.7864 KOps/s $\color{#d91a1a}-4.51\%$
test_update__nested 0.1911ms 34.4792μs 29.0030 KOps/s 29.3842 KOps/s $\color{#d91a1a}-1.30\%$
test_set_nested 0.1674ms 31.1794μs 32.0725 KOps/s 33.4946 KOps/s $\color{#d91a1a}-4.25\%$
test_set_nested_new 0.1446ms 35.9070μs 27.8497 KOps/s 28.8953 KOps/s $\color{#d91a1a}-3.62\%$
test_select 0.1267ms 51.9101μs 19.2641 KOps/s 19.4145 KOps/s $\color{#d91a1a}-0.78\%$
test_select_nested 0.1241ms 58.8276μs 16.9988 KOps/s 17.1133 KOps/s $\color{#d91a1a}-0.67\%$
test_exclude_nested 0.1415ms 77.4747μs 12.9074 KOps/s 12.9384 KOps/s $\color{#d91a1a}-0.24\%$
test_empty[True] 0.4389ms 0.3242ms 3.0848 KOps/s 3.1403 KOps/s $\color{#d91a1a}-1.77\%$
test_empty[False] 11.5040μs 1.1483μs 870.8409 KOps/s 863.5897 KOps/s $\color{#35bf28}+0.84\%$
test_unbind_speed 0.6359ms 0.3098ms 3.2276 KOps/s 3.1497 KOps/s $\color{#35bf28}+2.47\%$
test_unbind_speed_stack0 0.5264ms 0.3025ms 3.3059 KOps/s 3.4018 KOps/s $\color{#d91a1a}-2.82\%$
test_unbind_speed_stack1 94.0431ms 0.8074ms 1.2385 KOps/s 1.4358 KOps/s $\textbf{\color{#d91a1a}-13.74\%}$
test_split 84.4260ms 2.1653ms 461.8256 Ops/s 460.3768 Ops/s $\color{#35bf28}+0.31\%$
test_chunk 90.3007ms 2.2337ms 447.6976 Ops/s 459.3073 Ops/s $\color{#d91a1a}-2.53\%$
test_creation[device0] 0.4338ms 0.1207ms 8.2868 KOps/s 8.2253 KOps/s $\color{#35bf28}+0.75\%$
test_creation_from_tensor 4.1233ms 0.1217ms 8.2173 KOps/s 8.2586 KOps/s $\color{#d91a1a}-0.50\%$
test_add_one[memmap_tensor0] 0.4937ms 7.7870μs 128.4187 KOps/s 122.8018 KOps/s $\color{#35bf28}+4.57\%$
test_contiguous[memmap_tensor0] 19.0860μs 2.0107μs 497.3389 KOps/s 488.6262 KOps/s $\color{#35bf28}+1.78\%$
test_stack[memmap_tensor0] 43.7810μs 5.8490μs 170.9685 KOps/s 168.5574 KOps/s $\color{#35bf28}+1.43\%$
test_memmaptd_index 0.9700ms 0.4088ms 2.4460 KOps/s 2.3756 KOps/s $\color{#35bf28}+2.97\%$
test_memmaptd_index_astensor 0.9378ms 0.4915ms 2.0344 KOps/s 1.9886 KOps/s $\color{#35bf28}+2.30\%$
test_memmaptd_index_op 1.6720ms 1.0526ms 949.9862 Ops/s 972.0745 Ops/s $\color{#d91a1a}-2.27\%$
test_serialize_model 0.1278s 0.1181s 8.4684 Ops/s 7.6924 Ops/s $\textbf{\color{#35bf28}+10.09\%}$
test_serialize_model_pickle 0.4468s 0.3984s 2.5101 Ops/s 2.4496 Ops/s $\color{#35bf28}+2.47\%$
test_serialize_weights 0.2079s 0.1305s 7.6655 Ops/s 8.5431 Ops/s $\textbf{\color{#d91a1a}-10.27\%}$
test_serialize_weights_returnearly 0.1833s 0.1633s 6.1225 Ops/s 6.1667 Ops/s $\color{#d91a1a}-0.72\%$
test_serialize_weights_pickle 0.5024s 0.4527s 2.2091 Ops/s 1.1869 Ops/s $\textbf{\color{#35bf28}+86.12\%}$
test_serialize_weights_filesystem 0.1479s 0.1437s 6.9573 Ops/s 6.5753 Ops/s $\textbf{\color{#35bf28}+5.81\%}$
test_serialize_model_filesystem 0.2358s 0.1607s 6.2238 Ops/s 6.9578 Ops/s $\textbf{\color{#d91a1a}-10.55\%}$
test_reshape_pytree 94.4070μs 40.0640μs 24.9600 KOps/s 24.9945 KOps/s $\color{#d91a1a}-0.14\%$
test_reshape_td 98.6850μs 47.3145μs 21.1352 KOps/s 21.4608 KOps/s $\color{#d91a1a}-1.52\%$
test_view_pytree 86.4020μs 40.0969μs 24.9396 KOps/s 24.9181 KOps/s $\color{#35bf28}+0.09\%$
test_view_td 0.1145ms 54.4001μs 18.3823 KOps/s 19.1476 KOps/s $\color{#d91a1a}-4.00\%$
test_unbind_pytree 91.8820μs 37.6546μs 26.5572 KOps/s 26.5440 KOps/s $\color{#35bf28}+0.05\%$
test_unbind_td 0.4287ms 46.0020μs 21.7382 KOps/s 21.4104 KOps/s $\color{#35bf28}+1.53\%$
test_split_pytree 80.6300μs 40.5102μs 24.6851 KOps/s 24.6575 KOps/s $\color{#35bf28}+0.11\%$
test_split_td 0.4672ms 58.1202μs 17.2057 KOps/s 16.5500 KOps/s $\color{#35bf28}+3.96\%$
test_add_pytree 91.6620μs 46.3288μs 21.5848 KOps/s 21.1204 KOps/s $\color{#35bf28}+2.20\%$
test_add_td 0.2437ms 85.9922μs 11.6290 KOps/s 12.3158 KOps/s $\textbf{\color{#d91a1a}-5.58\%}$
test_compile_add_one_nested[tensordict-compile] 0.1199ms 54.7079μs 18.2789 KOps/s 18.3417 KOps/s $\color{#d91a1a}-0.34\%$
test_compile_add_one_nested[tensordict-eager] 0.3871ms 0.1902ms 5.2571 KOps/s 4.9929 KOps/s $\textbf{\color{#35bf28}+5.29\%}$
test_compile_add_one_nested[pytree-compile] 0.2273ms 54.7952μs 18.2498 KOps/s 18.2300 KOps/s $\color{#35bf28}+0.11\%$
test_compile_add_one_nested[pytree-eager] 0.2912ms 0.1445ms 6.9218 KOps/s 6.8240 KOps/s $\color{#35bf28}+1.43\%$
test_compile_copy_nested[tensordict-compile] 54.5210μs 20.1628μs 49.5963 KOps/s 47.9031 KOps/s $\color{#35bf28}+3.53\%$
test_compile_copy_nested[tensordict-eager] 0.1344ms 63.6929μs 15.7003 KOps/s 15.3231 KOps/s $\color{#35bf28}+2.46\%$
test_compile_copy_nested[pytree-compile] 4.7918ms 79.5637μs 12.5685 KOps/s 12.5963 KOps/s $\color{#d91a1a}-0.22\%$
test_compile_copy_nested[pytree-eager] 0.1659ms 72.3604μs 13.8197 KOps/s 13.7490 KOps/s $\color{#35bf28}+0.51\%$
test_compile_add_one_flat[tensordict-compile] 0.2989ms 0.1743ms 5.7357 KOps/s 5.6872 KOps/s $\color{#35bf28}+0.85\%$
test_compile_add_one_flat[tensordict-eager] 0.2993ms 0.1934ms 5.1705 KOps/s 5.2014 KOps/s $\color{#d91a1a}-0.60\%$
test_compile_add_one_flat[tensorclass-compile] 83.6760μs 38.1400μs 26.2192 KOps/s 25.7356 KOps/s $\color{#35bf28}+1.88\%$
test_compile_add_one_flat[tensorclass-eager] 0.4770ms 70.5264μs 14.1791 KOps/s 13.8351 KOps/s $\color{#35bf28}+2.49\%$
test_compile_add_one_flat[pytree-compile] 0.2595ms 0.1710ms 5.8485 KOps/s 5.6957 KOps/s $\color{#35bf28}+2.68\%$
test_compile_add_one_flat[pytree-eager] 0.4348ms 0.2944ms 3.3969 KOps/s 3.4036 KOps/s $\color{#d91a1a}-0.20\%$
test_compile_add_self_flat[tensordict-eager] 0.3045ms 0.2048ms 4.8838 KOps/s 4.7867 KOps/s $\color{#35bf28}+2.03\%$
test_compile_add_self_flat[tensordict-compile] 0.5409ms 0.1789ms 5.5903 KOps/s 5.6156 KOps/s $\color{#d91a1a}-0.45\%$
test_compile_add_self_flat[tensorclass-eager] 0.4384ms 62.2848μs 16.0553 KOps/s 15.4692 KOps/s $\color{#35bf28}+3.79\%$
test_compile_add_self_flat[tensorclass-compile] 0.1023ms 39.8779μs 25.0766 KOps/s 25.4008 KOps/s $\color{#d91a1a}-1.28\%$
test_compile_add_self_flat[pytree-eager] 0.6095ms 0.2436ms 4.1047 KOps/s 4.1975 KOps/s $\color{#d91a1a}-2.21\%$
test_compile_add_self_flat[pytree-compile] 0.2888ms 0.1725ms 5.7982 KOps/s 5.7259 KOps/s $\color{#35bf28}+1.26\%$
test_compile_copy_flat[tensordict-compile] 0.2078ms 0.1074ms 9.3153 KOps/s 9.0736 KOps/s $\color{#35bf28}+2.66\%$
test_compile_copy_flat[tensordict-eager] 0.1186ms 55.9972μs 17.8580 KOps/s 17.2769 KOps/s $\color{#35bf28}+3.36\%$
test_compile_copy_flat[pytree-compile] 0.1871ms 81.1089μs 12.3291 KOps/s 12.3584 KOps/s $\color{#d91a1a}-0.24\%$
test_compile_copy_flat[pytree-eager] 0.1712ms 71.9178μs 13.9048 KOps/s 13.7935 KOps/s $\color{#35bf28}+0.81\%$
test_compile_assign_and_add[tensordict-compile] 0.2838ms 0.1884ms 5.3089 KOps/s 5.2750 KOps/s $\color{#35bf28}+0.64\%$
test_compile_assign_and_add[tensordict-eager] 3.2730ms 1.6461ms 607.5031 Ops/s 601.8823 Ops/s $\color{#35bf28}+0.93\%$
test_compile_assign_and_add[pytree-compile] 0.6026ms 0.1907ms 5.2436 KOps/s 5.2720 KOps/s $\color{#d91a1a}-0.54\%$
test_compile_assign_and_add[pytree-eager] 1.3878ms 1.0955ms 912.8399 Ops/s 914.9189 Ops/s $\color{#d91a1a}-0.23\%$
test_compile_assign_and_add_stack[compile] 0.7034ms 0.4146ms 2.4117 KOps/s 2.3938 KOps/s $\color{#35bf28}+0.75\%$
test_compile_assign_and_add_stack[eager] 6.9768ms 3.9288ms 254.5314 Ops/s 267.2659 Ops/s $\color{#d91a1a}-4.76\%$
test_compile_indexing[tensor-tensordict-compile] 0.2336ms 33.5990μs 29.7628 KOps/s 30.2766 KOps/s $\color{#d91a1a}-1.70\%$
test_compile_indexing[tensor-tensordict-eager] 1.0547ms 46.4911μs 21.5095 KOps/s 19.9726 KOps/s $\textbf{\color{#35bf28}+7.70\%}$
test_compile_indexing[tensor-tensorclass-compile] 81.8030μs 27.8988μs 35.8438 KOps/s 35.0547 KOps/s $\color{#35bf28}+2.25\%$
test_compile_indexing[tensor-tensorclass-eager] 94.9880μs 29.4477μs 33.9585 KOps/s 32.1031 KOps/s $\textbf{\color{#35bf28}+5.78\%}$
test_compile_indexing[tensor-pytree-compile] 0.1160ms 27.9172μs 35.8202 KOps/s 34.7225 KOps/s $\color{#35bf28}+3.16\%$
test_compile_indexing[tensor-pytree-eager] 0.1095ms 29.7575μs 33.6049 KOps/s 31.3476 KOps/s $\textbf{\color{#35bf28}+7.20\%}$
test_compile_indexing[slice-tensordict-compile] 0.3476ms 71.6317μs 13.9603 KOps/s 13.2134 KOps/s $\textbf{\color{#35bf28}+5.65\%}$
test_compile_indexing[slice-tensordict-eager] 3.4526ms 27.8312μs 35.9310 KOps/s 34.7684 KOps/s $\color{#35bf28}+3.34\%$
test_compile_indexing[slice-tensorclass-compile] 0.1386ms 66.8645μs 14.9556 KOps/s 14.4161 KOps/s $\color{#35bf28}+3.74\%$
test_compile_indexing[slice-tensorclass-eager] 84.6380μs 24.1127μs 41.4719 KOps/s 40.0408 KOps/s $\color{#35bf28}+3.57\%$
test_compile_indexing[slice-pytree-compile] 0.1350ms 67.8474μs 14.7390 KOps/s 14.4883 KOps/s $\color{#35bf28}+1.73\%$
test_compile_indexing[slice-pytree-eager] 64.5810μs 24.4231μs 40.9448 KOps/s 40.2830 KOps/s $\color{#35bf28}+1.64\%$
test_compile_indexing[int-tensordict-compile] 1.9929ms 72.3103μs 13.8293 KOps/s 13.3506 KOps/s $\color{#35bf28}+3.59\%$
test_compile_indexing[int-tensordict-eager] 0.7627ms 27.3875μs 36.5130 KOps/s 35.1414 KOps/s $\color{#35bf28}+3.90\%$
test_compile_indexing[int-tensorclass-compile] 0.1259ms 66.4828μs 15.0415 KOps/s 14.8175 KOps/s $\color{#35bf28}+1.51\%$
test_compile_indexing[int-tensorclass-eager] 65.8230μs 24.2341μs 41.2642 KOps/s 40.9161 KOps/s $\color{#35bf28}+0.85\%$
test_compile_indexing[int-pytree-compile] 0.3355ms 68.9884μs 14.4952 KOps/s 14.6295 KOps/s $\color{#d91a1a}-0.92\%$
test_compile_indexing[int-pytree-eager] 63.4790μs 24.0126μs 41.6448 KOps/s 40.7984 KOps/s $\color{#35bf28}+2.07\%$
test_mod_add[eager] 0.1563ms 24.9050μs 40.1525 KOps/s 41.9347 KOps/s $\color{#d91a1a}-4.25\%$
test_mod_add[compile] 94.1760μs 36.1708μs 27.6466 KOps/s 27.4501 KOps/s $\color{#35bf28}+0.72\%$
test_mod_add[compile-overhead] 0.1273ms 37.8390μs 26.4277 KOps/s 27.3483 KOps/s $\color{#d91a1a}-3.37\%$
test_mod_wrap[eager] 0.4413ms 0.2130ms 4.6950 KOps/s 4.6680 KOps/s $\color{#35bf28}+0.58\%$
test_mod_wrap[compile] 1.4565ms 0.2330ms 4.2925 KOps/s 4.2095 KOps/s $\color{#35bf28}+1.97\%$
test_mod_wrap[compile-overhead] 0.4904ms 0.2257ms 4.4303 KOps/s 4.3402 KOps/s $\color{#35bf28}+2.07\%$
test_mod_wrap_and_backward[eager] 11.8448ms 10.8749ms 91.9549 Ops/s 87.5404 Ops/s $\textbf{\color{#35bf28}+5.04\%}$
test_mod_wrap_and_backward[compile] 13.2671ms 11.0813ms 90.2418 Ops/s 85.9913 Ops/s $\color{#35bf28}+4.94\%$
test_mod_wrap_and_backward[compile-overhead] 15.0551ms 11.5224ms 86.7875 Ops/s 90.2115 Ops/s $\color{#d91a1a}-3.80\%$
test_seq_add[eager] 0.2633ms 89.6255μs 11.1575 KOps/s 11.7325 KOps/s $\color{#d91a1a}-4.90\%$
test_seq_add[compile] 0.1866ms 60.5190μs 16.5237 KOps/s 16.1344 KOps/s $\color{#35bf28}+2.41\%$
test_seq_add[compile-overhead] 0.1665ms 60.1710μs 16.6193 KOps/s 16.9235 KOps/s $\color{#d91a1a}-1.80\%$
test_seq_wrap[eager] 0.6475ms 0.3871ms 2.5830 KOps/s 2.7113 KOps/s $\color{#d91a1a}-4.73\%$
test_seq_wrap[compile] 0.7321ms 0.2672ms 3.7428 KOps/s 3.7714 KOps/s $\color{#d91a1a}-0.76\%$
test_seq_wrap[compile-overhead] 0.4725ms 0.2640ms 3.7873 KOps/s 3.7702 KOps/s $\color{#35bf28}+0.45\%$
test_func_call_runtime[False-eager] 0.9266ms 0.5329ms 1.8764 KOps/s 1.8412 KOps/s $\color{#35bf28}+1.91\%$
test_func_call_runtime[False-compile] 0.6785ms 0.4935ms 2.0263 KOps/s 1.9895 KOps/s $\color{#35bf28}+1.85\%$
test_func_call_runtime[False-compile-overhead] 1.0359ms 0.4976ms 2.0097 KOps/s 1.9886 KOps/s $\color{#35bf28}+1.06\%$
test_func_call_runtime[True-eager] 1.0171ms 0.7564ms 1.3221 KOps/s 1.2889 KOps/s $\color{#35bf28}+2.58\%$
test_func_call_runtime[True-compile] 0.7114ms 0.5072ms 1.9717 KOps/s 1.8793 KOps/s $\color{#35bf28}+4.92\%$
test_func_call_runtime[True-compile-overhead] 0.8825ms 0.5079ms 1.9687 KOps/s 1.8711 KOps/s $\textbf{\color{#35bf28}+5.22\%}$
test_func_call_cm_runtime[False-eager] 0.9018ms 0.5346ms 1.8704 KOps/s 1.8165 KOps/s $\color{#35bf28}+2.97\%$
test_func_call_cm_runtime[False-compile] 0.6501ms 0.4940ms 2.0245 KOps/s 1.9337 KOps/s $\color{#35bf28}+4.69\%$
test_func_call_cm_runtime[False-compile-overhead] 0.6153ms 0.4957ms 2.0173 KOps/s 1.9414 KOps/s $\color{#35bf28}+3.91\%$
test_func_call_cm_runtime[True-eager] 1.2608ms 0.8760ms 1.1416 KOps/s 1.0783 KOps/s $\textbf{\color{#35bf28}+5.87\%}$
test_func_call_cm_runtime[True-compile] 1.0910ms 0.8373ms 1.1943 KOps/s 1.1241 KOps/s $\textbf{\color{#35bf28}+6.24\%}$
test_func_call_cm_runtime[True-compile-overhead] 0.9634ms 0.8420ms 1.1877 KOps/s 1.1528 KOps/s $\color{#35bf28}+3.03\%$
test_distributed 0.3563ms 0.1317ms 7.5943 KOps/s 7.5024 KOps/s $\color{#35bf28}+1.22\%$
test_tdmodule 0.1174ms 17.9491μs 55.7131 KOps/s 64.1415 KOps/s $\textbf{\color{#d91a1a}-13.14\%}$
test_tdmodule_dispatch 64.0290μs 37.5747μs 26.6137 KOps/s 29.6721 KOps/s $\textbf{\color{#d91a1a}-10.31\%}$
test_tdseq 51.0750μs 19.6425μs 50.9100 KOps/s 58.7712 KOps/s $\textbf{\color{#d91a1a}-13.38\%}$
test_tdseq_dispatch 71.7340μs 41.1089μs 24.3257 KOps/s 27.1711 KOps/s $\textbf{\color{#d91a1a}-10.47\%}$
test_instantiation_functorch 1.8407ms 1.6278ms 614.3311 Ops/s 592.5961 Ops/s $\color{#35bf28}+3.67\%$
test_instantiation_td 1.7732ms 1.1714ms 853.6841 Ops/s 853.3606 Ops/s $\color{#35bf28}+0.04\%$
test_exec_functorch 0.3251ms 0.1809ms 5.5267 KOps/s 5.5247 KOps/s $\color{#35bf28}+0.04\%$
test_exec_functional_call 0.3328ms 0.1713ms 5.8383 KOps/s 5.7752 KOps/s $\color{#35bf28}+1.09\%$
test_exec_td 0.2719ms 0.1757ms 5.6913 KOps/s 5.7117 KOps/s $\color{#d91a1a}-0.36\%$
test_exec_td_decorator 1.0723ms 0.2401ms 4.1655 KOps/s 4.3940 KOps/s $\textbf{\color{#d91a1a}-5.20\%}$
test_vmap_mlp_speed[True-True] 1.0519ms 0.5981ms 1.6720 KOps/s 1.7134 KOps/s $\color{#d91a1a}-2.42\%$
test_vmap_mlp_speed[True-False] 0.7843ms 0.5763ms 1.7351 KOps/s 1.7236 KOps/s $\color{#35bf28}+0.67\%$
test_vmap_mlp_speed[False-True] 0.7459ms 0.4757ms 2.1021 KOps/s 2.0542 KOps/s $\color{#35bf28}+2.33\%$
test_vmap_mlp_speed[False-False] 0.8482ms 0.4767ms 2.0977 KOps/s 2.0616 KOps/s $\color{#35bf28}+1.75\%$
test_vmap_mlp_speed_decorator[True-True] 1.2988ms 0.6335ms 1.5786 KOps/s 1.5403 KOps/s $\color{#35bf28}+2.49\%$
test_vmap_mlp_speed_decorator[True-False] 0.9399ms 0.6353ms 1.5741 KOps/s 1.5325 KOps/s $\color{#35bf28}+2.71\%$
test_vmap_mlp_speed_decorator[False-True] 0.7336ms 0.5225ms 1.9137 KOps/s 1.8363 KOps/s $\color{#35bf28}+4.22\%$
test_vmap_mlp_speed_decorator[False-False] 1.0050ms 0.5235ms 1.9101 KOps/s 1.8285 KOps/s $\color{#35bf28}+4.46\%$
test_to_module_speed[True] 2.1485ms 1.3380ms 747.3790 Ops/s 733.1241 Ops/s $\color{#35bf28}+1.94\%$
test_to_module_speed[False] 1.4173ms 1.3003ms 769.0478 Ops/s 776.5428 Ops/s $\color{#d91a1a}-0.97\%$
test_tc_init 83.9670μs 45.5164μs 21.9701 KOps/s 23.3398 KOps/s $\textbf{\color{#d91a1a}-5.87\%}$
test_tc_init_nested 0.2110ms 94.8433μs 10.5437 KOps/s 11.6640 KOps/s $\textbf{\color{#d91a1a}-9.61\%}$
test_tc_first_layer_tensor 25.7180μs 1.4303μs 699.1619 KOps/s 671.3518 KOps/s $\color{#35bf28}+4.14\%$
test_tc_first_layer_nontensor 30.7870μs 4.1903μs 238.6442 KOps/s 225.9259 KOps/s $\textbf{\color{#35bf28}+5.63\%}$
test_tc_second_layer_tensor 25.4780μs 2.6706μs 374.4407 KOps/s 372.0306 KOps/s $\color{#35bf28}+0.65\%$
test_tc_second_layer_nontensor 32.5810μs 5.4208μs 184.4747 KOps/s 183.5538 KOps/s $\color{#35bf28}+0.50\%$
test_unbind 0.4571s 13.8122ms 72.3996 Ops/s 63.8742 Ops/s $\textbf{\color{#35bf28}+13.35\%}$
test_full_like 9.2468ms 7.2404ms 138.1140 Ops/s 102.3873 Ops/s $\textbf{\color{#35bf28}+34.89\%}$
test_zeros_like 12.3081ms 6.6363ms 150.6873 Ops/s 128.2236 Ops/s $\textbf{\color{#35bf28}+17.52\%}$
test_ones_like 12.8032ms 7.3603ms 135.8632 Ops/s 118.1992 Ops/s $\textbf{\color{#35bf28}+14.94\%}$
test_clone 14.6502ms 8.9758ms 111.4109 Ops/s 93.6129 Ops/s $\textbf{\color{#35bf28}+19.01\%}$
test_squeeze 63.5990μs 12.5608μs 79.6127 KOps/s 77.1291 KOps/s $\color{#35bf28}+3.22\%$
test_unsqueeze 0.1732ms 92.0450μs 10.8642 KOps/s 10.2745 KOps/s $\textbf{\color{#35bf28}+5.74\%}$
test_split 0.4909ms 0.1983ms 5.0439 KOps/s 4.8053 KOps/s $\color{#35bf28}+4.97\%$
test_permute 0.3807ms 0.2185ms 4.5760 KOps/s 4.4670 KOps/s $\color{#35bf28}+2.44\%$
test_stack 32.7312ms 25.5840ms 39.0869 Ops/s 34.8550 Ops/s $\textbf{\color{#35bf28}+12.14\%}$
test_cat 35.0363ms 25.3951ms 39.3777 Ops/s 35.3363 Ops/s $\textbf{\color{#35bf28}+11.44\%}$

Copy link

github-actions bot commented Aug 5, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 225. Improved: $\large\color{#35bf28}15$. Worsened: $\large\color{#d91a1a}24$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 0.1520ms 16.7182μs 59.8151 KOps/s 63.2210 KOps/s $\textbf{\color{#d91a1a}-5.39\%}$
test_plain_set_stack_nested 35.3310μs 16.8412μs 59.3783 KOps/s 63.0315 KOps/s $\textbf{\color{#d91a1a}-5.80\%}$
test_plain_set_nested_inplace 53.7110μs 17.8906μs 55.8954 KOps/s 59.3114 KOps/s $\textbf{\color{#d91a1a}-5.76\%}$
test_plain_set_stack_nested_inplace 41.6310μs 17.8598μs 55.9918 KOps/s 59.1493 KOps/s $\textbf{\color{#d91a1a}-5.34\%}$
test_items 18.6710μs 4.7019μs 212.6781 KOps/s 210.7114 KOps/s $\color{#35bf28}+0.93\%$
test_items_nested 0.4006ms 0.3645ms 2.7434 KOps/s 2.6952 KOps/s $\color{#35bf28}+1.79\%$
test_items_nested_locked 0.4005ms 0.3666ms 2.7277 KOps/s 2.6380 KOps/s $\color{#35bf28}+3.40\%$
test_items_nested_leaf 0.1126ms 83.4886μs 11.9777 KOps/s 11.9713 KOps/s $\color{#35bf28}+0.05\%$
test_items_stack_nested 0.4374ms 0.3663ms 2.7301 KOps/s 2.6724 KOps/s $\color{#35bf28}+2.16\%$
test_items_stack_nested_leaf 0.2677ms 84.5307μs 11.8300 KOps/s 11.7402 KOps/s $\color{#35bf28}+0.77\%$
test_items_stack_nested_locked 0.5797ms 0.3722ms 2.6871 KOps/s 2.7061 KOps/s $\color{#d91a1a}-0.70\%$
test_keys 18.4200μs 4.3723μs 228.7141 KOps/s 227.1713 KOps/s $\color{#35bf28}+0.68\%$
test_keys_nested 0.1070ms 67.4549μs 14.8247 KOps/s 14.9675 KOps/s $\color{#d91a1a}-0.95\%$
test_keys_nested_locked 0.6482ms 73.0549μs 13.6883 KOps/s 13.6780 KOps/s $\color{#35bf28}+0.08\%$
test_keys_nested_leaf 90.8420μs 57.9577μs 17.2540 KOps/s 17.4631 KOps/s $\color{#d91a1a}-1.20\%$
test_keys_stack_nested 95.0220μs 67.9018μs 14.7272 KOps/s 14.6920 KOps/s $\color{#35bf28}+0.24\%$
test_keys_stack_nested_leaf 94.5120μs 58.1485μs 17.1974 KOps/s 17.0572 KOps/s $\color{#35bf28}+0.82\%$
test_keys_stack_nested_locked 0.1168ms 73.2853μs 13.6453 KOps/s 13.5667 KOps/s $\color{#35bf28}+0.58\%$
test_values 9.4500μs 1.7703μs 564.8636 KOps/s 568.7677 KOps/s $\color{#d91a1a}-0.69\%$
test_values_nested 0.1077ms 33.5186μs 29.8342 KOps/s 29.8287 KOps/s $\color{#35bf28}+0.02\%$
test_values_nested_locked 91.2620μs 35.6280μs 28.0678 KOps/s 28.1341 KOps/s $\color{#d91a1a}-0.24\%$
test_values_nested_leaf 53.8910μs 29.7205μs 33.6468 KOps/s 33.6459 KOps/s $+0.00\%$
test_values_stack_nested 73.4120μs 34.4495μs 29.0280 KOps/s 29.0576 KOps/s $\color{#d91a1a}-0.10\%$
test_values_stack_nested_leaf 54.7720μs 30.4142μs 32.8793 KOps/s 32.7249 KOps/s $\color{#35bf28}+0.47\%$
test_values_stack_nested_locked 90.2020μs 36.2351μs 27.5976 KOps/s 27.5467 KOps/s $\color{#35bf28}+0.18\%$
test_membership 1.3330μs 0.5353μs 1.8682 MOps/s 1.8275 MOps/s $\color{#35bf28}+2.23\%$
test_membership_nested 9.2050μs 1.9806μs 504.9014 KOps/s 485.3293 KOps/s $\color{#35bf28}+4.03\%$
test_membership_nested_leaf 14.2750μs 1.9998μs 500.0440 KOps/s 512.0470 KOps/s $\color{#d91a1a}-2.34\%$
test_membership_stacked_nested 20.9600μs 2.0204μs 494.9551 KOps/s 497.7068 KOps/s $\color{#d91a1a}-0.55\%$
test_membership_stacked_nested_leaf 19.9610μs 2.0201μs 495.0172 KOps/s 498.2423 KOps/s $\color{#d91a1a}-0.65\%$
test_membership_nested_last 21.9410μs 2.9468μs 339.3543 KOps/s 339.9525 KOps/s $\color{#d91a1a}-0.18\%$
test_membership_nested_leaf_last 15.8400μs 2.9623μs 337.5739 KOps/s 337.7384 KOps/s $\color{#d91a1a}-0.05\%$
test_membership_stacked_nested_last 24.4410μs 2.9653μs 337.2375 KOps/s 340.4713 KOps/s $\color{#d91a1a}-0.95\%$
test_membership_stacked_nested_leaf_last 22.6810μs 2.9293μs 341.3741 KOps/s 339.8723 KOps/s $\color{#35bf28}+0.44\%$
test_nested_getleaf 27.7300μs 7.8766μs 126.9590 KOps/s 128.8636 KOps/s $\color{#d91a1a}-1.48\%$
test_nested_get 45.0620μs 7.3785μs 135.5281 KOps/s 136.6831 KOps/s $\color{#d91a1a}-0.84\%$
test_stacked_getleaf 27.4810μs 7.9717μs 125.4436 KOps/s 127.9245 KOps/s $\color{#d91a1a}-1.94\%$
test_stacked_get 66.3620μs 7.4105μs 134.9443 KOps/s 137.4962 KOps/s $\color{#d91a1a}-1.86\%$
test_nested_getitemleaf 21.6000μs 8.1373μs 122.8915 KOps/s 123.0272 KOps/s $\color{#d91a1a}-0.11\%$
test_nested_getitem 26.8710μs 7.6614μs 130.5238 KOps/s 130.4013 KOps/s $\color{#35bf28}+0.09\%$
test_stacked_getitemleaf 26.3710μs 8.1663μs 122.4551 KOps/s 122.9005 KOps/s $\color{#d91a1a}-0.36\%$
test_stacked_getitem 21.2700μs 7.7029μs 129.8213 KOps/s 130.5585 KOps/s $\color{#d91a1a}-0.56\%$
test_lock_nested 0.9220ms 0.4682ms 2.1359 KOps/s 2.1064 KOps/s $\color{#35bf28}+1.40\%$
test_lock_stack_nested 0.5665ms 0.4358ms 2.2945 KOps/s 2.2752 KOps/s $\color{#35bf28}+0.85\%$
test_unlock_nested 0.8221ms 0.3887ms 2.5726 KOps/s 2.5195 KOps/s $\color{#35bf28}+2.11\%$
test_unlock_stack_nested 0.4916ms 0.3560ms 2.8091 KOps/s 2.7832 KOps/s $\color{#35bf28}+0.93\%$
test_flatten_speed 94.7136ms 0.1166ms 8.5748 KOps/s 9.6622 KOps/s $\textbf{\color{#d91a1a}-11.25\%}$
test_unflatten_speed 0.3730ms 0.3143ms 3.1817 KOps/s 3.1985 KOps/s $\color{#d91a1a}-0.53\%$
test_common_ops 1.6142ms 1.3858ms 721.5999 Ops/s 754.2480 Ops/s $\color{#d91a1a}-4.33\%$
test_creation 15.9400μs 1.6823μs 594.4358 KOps/s 606.7345 KOps/s $\color{#d91a1a}-2.03\%$
test_creation_empty 0.1522ms 16.7539μs 59.6878 KOps/s 66.6796 KOps/s $\textbf{\color{#d91a1a}-10.49\%}$
test_creation_nested_1 0.1227ms 18.6656μs 53.5746 KOps/s 59.1377 KOps/s $\textbf{\color{#d91a1a}-9.41\%}$
test_creation_nested_2 43.7710μs 21.4322μs 46.6587 KOps/s 50.6193 KOps/s $\textbf{\color{#d91a1a}-7.82\%}$
test_clone 0.1843ms 30.5742μs 32.7074 KOps/s 31.1291 KOps/s $\textbf{\color{#35bf28}+5.07\%}$
test_getitem[int] 1.1452ms 18.2971μs 54.6535 KOps/s 57.2756 KOps/s $\color{#d91a1a}-4.58\%$
test_getitem[slice_int] 0.1657ms 29.0626μs 34.4085 KOps/s 32.0008 KOps/s $\textbf{\color{#35bf28}+7.52\%}$
test_getitem[range] 0.2483ms 0.1173ms 8.5239 KOps/s 8.3945 KOps/s $\color{#35bf28}+1.54\%$
test_getitem[tuple] 0.1403ms 25.1228μs 39.8045 KOps/s 38.9600 KOps/s $\color{#35bf28}+2.17\%$
test_getitem[list] 0.3313ms 0.1074ms 9.3142 KOps/s 9.3414 KOps/s $\color{#d91a1a}-0.29\%$
test_setitem_dim[int] 0.1886ms 56.4243μs 17.7229 KOps/s 18.3092 KOps/s $\color{#d91a1a}-3.20\%$
test_setitem_dim[slice_int] 0.1222ms 83.6051μs 11.9610 KOps/s 12.9284 KOps/s $\textbf{\color{#d91a1a}-7.48\%}$
test_setitem_dim[range] 0.2882ms 0.1538ms 6.5036 KOps/s 7.1129 KOps/s $\textbf{\color{#d91a1a}-8.57\%}$
test_setitem_dim[tuple] 0.2219ms 78.7870μs 12.6924 KOps/s 14.1073 KOps/s $\textbf{\color{#d91a1a}-10.03\%}$
test_setitem 0.2225ms 47.5003μs 21.0525 KOps/s 21.3771 KOps/s $\color{#d91a1a}-1.52\%$
test_set 0.2212ms 46.8459μs 21.3466 KOps/s 23.0017 KOps/s $\textbf{\color{#d91a1a}-7.20\%}$
test_set_shared 0.3809ms 55.2698μs 18.0931 KOps/s 17.6869 KOps/s $\color{#35bf28}+2.30\%$
test_update 0.1997ms 52.9035μs 18.9023 KOps/s 19.2854 KOps/s $\color{#d91a1a}-1.99\%$
test_update_nested 0.2164ms 64.5432μs 15.4935 KOps/s 16.4802 KOps/s $\textbf{\color{#d91a1a}-5.99\%}$
test_update__nested 0.2498ms 68.5947μs 14.5784 KOps/s 14.8238 KOps/s $\color{#d91a1a}-1.66\%$
test_set_nested 0.2018ms 48.6185μs 20.5683 KOps/s 21.5903 KOps/s $\color{#d91a1a}-4.73\%$
test_set_nested_new 0.2033ms 53.1300μs 18.8218 KOps/s 19.8630 KOps/s $\textbf{\color{#d91a1a}-5.24\%}$
test_select 0.2264ms 69.0563μs 14.4809 KOps/s 15.1323 KOps/s $\color{#d91a1a}-4.30\%$
test_select_nested 76.2320μs 51.1973μs 19.5323 KOps/s 19.5541 KOps/s $\color{#d91a1a}-0.11\%$
test_exclude_nested 95.4830μs 69.7362μs 14.3397 KOps/s 14.3816 KOps/s $\color{#d91a1a}-0.29\%$
test_empty[True] 0.3489ms 0.2831ms 3.5322 KOps/s 3.4527 KOps/s $\color{#35bf28}+2.30\%$
test_empty[False] 1.8520μs 0.8589μs 1.1643 MOps/s 1.1606 MOps/s $\color{#35bf28}+0.32\%$
test_to 64.0210μs 25.6985μs 38.9128 KOps/s 36.5339 KOps/s $\textbf{\color{#35bf28}+6.51\%}$
test_to_nonblocking 47.5610μs 25.1218μs 39.8060 KOps/s 36.3296 KOps/s $\textbf{\color{#35bf28}+9.57\%}$
test_unbind_speed 0.4483ms 0.3027ms 3.3032 KOps/s 3.3020 KOps/s $\color{#35bf28}+0.04\%$
test_unbind_speed_stack0 0.3504ms 0.3023ms 3.3084 KOps/s 3.2492 KOps/s $\color{#35bf28}+1.82\%$
test_unbind_speed_stack1 91.5653ms 0.7703ms 1.2982 KOps/s 1.2655 KOps/s $\color{#35bf28}+2.58\%$
test_split 93.7822ms 2.4013ms 416.4395 Ops/s 418.0717 Ops/s $\color{#d91a1a}-0.39\%$
test_chunk 2.3164ms 2.1866ms 457.3348 Ops/s 419.9192 Ops/s $\textbf{\color{#35bf28}+8.91\%}$
test_creation[device0] 0.2519ms 0.1064ms 9.4014 KOps/s 9.3549 KOps/s $\color{#35bf28}+0.50\%$
test_creation_from_tensor 0.2762ms 0.1092ms 9.1593 KOps/s 9.6149 KOps/s $\color{#d91a1a}-4.74\%$
test_add_one[memmap_tensor0] 0.1567ms 9.3393μs 107.0739 KOps/s 110.5579 KOps/s $\color{#d91a1a}-3.15\%$
test_contiguous[memmap_tensor0] 0.1706ms 2.2186μs 450.7359 KOps/s 444.7646 KOps/s $\color{#35bf28}+1.34\%$
test_stack[memmap_tensor0] 0.2021ms 7.0617μs 141.6099 KOps/s 145.1317 KOps/s $\color{#d91a1a}-2.43\%$
test_memmaptd_index 1.3299ms 0.4385ms 2.2807 KOps/s 2.3179 KOps/s $\color{#d91a1a}-1.61\%$
test_memmaptd_index_astensor 99.1266ms 0.5555ms 1.8002 KOps/s 2.0240 KOps/s $\textbf{\color{#d91a1a}-11.05\%}$
test_memmaptd_index_op 1.6459ms 1.0705ms 934.1620 Ops/s 960.9092 Ops/s $\color{#d91a1a}-2.78\%$
test_serialize_model 94.9624ms 90.5848ms 11.0394 Ops/s 10.8415 Ops/s $\color{#35bf28}+1.83\%$
test_serialize_model_pickle 1.3511s 1.2363s 0.8089 Ops/s 0.8084 Ops/s $\color{#35bf28}+0.06\%$
test_serialize_weights 87.8333ms 86.3346ms 11.5828 Ops/s 9.6327 Ops/s $\textbf{\color{#35bf28}+20.24\%}$
test_serialize_weights_returnearly 55.8990ms 51.8285ms 19.2944 Ops/s 14.7924 Ops/s $\textbf{\color{#35bf28}+30.43\%}$
test_serialize_weights_pickle 1.3525s 1.2371s 0.8083 Ops/s 0.8036 Ops/s $\color{#35bf28}+0.59\%$
test_reshape_pytree 0.2375ms 38.5326μs 25.9521 KOps/s 25.3404 KOps/s $\color{#35bf28}+2.41\%$
test_reshape_td 0.2139ms 44.3874μs 22.5289 KOps/s 21.3552 KOps/s $\textbf{\color{#35bf28}+5.50\%}$
test_view_pytree 0.1412ms 38.1601μs 26.2054 KOps/s 25.0454 KOps/s $\color{#35bf28}+4.63\%$
test_view_td 0.2166ms 49.8220μs 20.0715 KOps/s 19.0823 KOps/s $\textbf{\color{#35bf28}+5.18\%}$
test_unbind_pytree 0.2543ms 37.0495μs 26.9910 KOps/s 26.6588 KOps/s $\color{#35bf28}+1.25\%$
test_unbind_td 0.3940ms 45.1729μs 22.1372 KOps/s 21.7790 KOps/s $\color{#35bf28}+1.64\%$
test_split_pytree 0.3451ms 50.5006μs 19.8017 KOps/s 19.7961 KOps/s $\color{#35bf28}+0.03\%$
test_split_td 0.1986ms 62.1371μs 16.0935 KOps/s 15.4912 KOps/s $\color{#35bf28}+3.89\%$
test_add_pytree 0.2030ms 60.3920μs 16.5585 KOps/s 14.8743 KOps/s $\textbf{\color{#35bf28}+11.32\%}$
test_add_td 0.2464ms 97.2783μs 10.2798 KOps/s 9.9439 KOps/s $\color{#35bf28}+3.38\%$
test_compile_add_one_nested[tensordict-compile] 0.4137ms 0.2146ms 4.6608 KOps/s 4.5778 KOps/s $\color{#35bf28}+1.81\%$
test_compile_add_one_nested[tensordict-eager] 0.3216ms 0.1757ms 5.6907 KOps/s 5.6836 KOps/s $\color{#35bf28}+0.13\%$
test_compile_add_one_nested[pytree-compile] 0.2995ms 0.1495ms 6.6887 KOps/s 6.6468 KOps/s $\color{#35bf28}+0.63\%$
test_compile_add_one_nested[pytree-eager] 0.3503ms 0.1991ms 5.0227 KOps/s 4.9636 KOps/s $\color{#35bf28}+1.19\%$
test_compile_copy_nested[tensordict-compile] 0.1441ms 22.2131μs 45.0185 KOps/s 44.4145 KOps/s $\color{#35bf28}+1.36\%$
test_compile_copy_nested[tensordict-eager] 0.1160ms 48.1909μs 20.7508 KOps/s 20.7220 KOps/s $\color{#35bf28}+0.14\%$
test_compile_copy_nested[pytree-compile] 0.1534ms 74.4271μs 13.4360 KOps/s 13.5859 KOps/s $\color{#d91a1a}-1.10\%$
test_compile_copy_nested[pytree-eager] 86.3020μs 60.0953μs 16.6402 KOps/s 16.5738 KOps/s $\color{#35bf28}+0.40\%$
test_compile_add_one_flat[tensordict-compile] 0.5142ms 0.3357ms 2.9785 KOps/s 2.9739 KOps/s $\color{#35bf28}+0.15\%$
test_compile_add_one_flat[tensordict-eager] 0.3737ms 0.2223ms 4.4986 KOps/s 4.4006 KOps/s $\color{#35bf28}+2.23\%$
test_compile_add_one_flat[tensorclass-compile] 0.2789ms 0.1339ms 7.4708 KOps/s 7.4368 KOps/s $\color{#35bf28}+0.46\%$
test_compile_add_one_flat[tensorclass-eager] 0.2103ms 64.3834μs 15.5320 KOps/s 15.0760 KOps/s $\color{#35bf28}+3.02\%$
test_compile_add_one_flat[pytree-compile] 0.5322ms 0.3360ms 2.9762 KOps/s 2.9927 KOps/s $\color{#d91a1a}-0.55\%$
test_compile_add_one_flat[pytree-eager] 0.8650ms 0.6599ms 1.5153 KOps/s 1.5186 KOps/s $\color{#d91a1a}-0.22\%$
test_compile_add_self_flat[tensordict-eager] 0.3855ms 0.2692ms 3.7150 KOps/s 3.6404 KOps/s $\color{#35bf28}+2.05\%$
test_compile_add_self_flat[tensordict-compile] 0.4806ms 0.3369ms 2.9683 KOps/s 2.9596 KOps/s $\color{#35bf28}+0.29\%$
test_compile_add_self_flat[tensorclass-eager] 0.2266ms 76.2062μs 13.1223 KOps/s 12.8944 KOps/s $\color{#35bf28}+1.77\%$
test_compile_add_self_flat[tensorclass-compile] 0.3094ms 0.1351ms 7.4046 KOps/s 7.3650 KOps/s $\color{#35bf28}+0.54\%$
test_compile_add_self_flat[pytree-eager] 0.7225ms 0.5641ms 1.7727 KOps/s 1.7731 KOps/s $\color{#d91a1a}-0.02\%$
test_compile_add_self_flat[pytree-compile] 0.5186ms 0.3343ms 2.9913 KOps/s 2.9878 KOps/s $\color{#35bf28}+0.12\%$
test_compile_copy_flat[tensordict-compile] 0.2280ms 19.1853μs 52.1231 KOps/s 51.1114 KOps/s $\color{#35bf28}+1.98\%$
test_compile_copy_flat[tensordict-eager] 65.8110μs 31.4363μs 31.8104 KOps/s 30.3017 KOps/s $\color{#35bf28}+4.98\%$
test_compile_copy_flat[pytree-compile] 0.2738ms 76.7027μs 13.0374 KOps/s 12.9762 KOps/s $\color{#35bf28}+0.47\%$
test_compile_copy_flat[pytree-eager] 0.2456ms 60.8345μs 16.4380 KOps/s 16.4059 KOps/s $\color{#35bf28}+0.20\%$
test_compile_assign_and_add[tensordict-compile] 2.4685ms 0.8578ms 1.1658 KOps/s 1.0650 KOps/s $\textbf{\color{#35bf28}+9.47\%}$
test_compile_assign_and_add[tensordict-eager] 3.5343ms 3.3746ms 296.3301 Ops/s 288.6321 Ops/s $\color{#35bf28}+2.67\%$
test_compile_assign_and_add[pytree-compile] 2.4560ms 0.8498ms 1.1767 KOps/s 1.0780 KOps/s $\textbf{\color{#35bf28}+9.16\%}$
test_compile_assign_and_add[pytree-eager] 3.6320ms 3.4249ms 291.9808 Ops/s 293.0317 Ops/s $\color{#d91a1a}-0.36\%$
test_compile_indexing[tensor-tensordict-compile] 0.2544ms 0.1139ms 8.7814 KOps/s 8.7196 KOps/s $\color{#35bf28}+0.71\%$
test_compile_indexing[tensor-tensordict-eager] 0.2490ms 67.2042μs 14.8800 KOps/s 15.2632 KOps/s $\color{#d91a1a}-2.51\%$
test_compile_indexing[tensor-tensorclass-compile] 0.2588ms 0.1061ms 9.4253 KOps/s 9.2449 KOps/s $\color{#35bf28}+1.95\%$
test_compile_indexing[tensor-tensorclass-eager] 0.2240ms 50.8688μs 19.6584 KOps/s 21.2943 KOps/s $\textbf{\color{#d91a1a}-7.68\%}$
test_compile_indexing[tensor-pytree-compile] 0.2796ms 0.1102ms 9.0724 KOps/s 9.2951 KOps/s $\color{#d91a1a}-2.40\%$
test_compile_indexing[tensor-pytree-eager] 0.2309ms 50.9432μs 19.6297 KOps/s 21.1715 KOps/s $\textbf{\color{#d91a1a}-7.28\%}$
test_compile_indexing[slice-tensordict-compile] 0.2714ms 0.1429ms 6.9998 KOps/s 6.8128 KOps/s $\color{#35bf28}+2.75\%$
test_compile_indexing[slice-tensordict-eager] 0.1774ms 26.7036μs 37.4481 KOps/s 36.5091 KOps/s $\color{#35bf28}+2.57\%$
test_compile_indexing[slice-tensorclass-compile] 0.2838ms 0.1342ms 7.4494 KOps/s 7.3311 KOps/s $\color{#35bf28}+1.61\%$
test_compile_indexing[slice-tensorclass-eager] 0.1248ms 23.0287μs 43.4242 KOps/s 43.1354 KOps/s $\color{#35bf28}+0.67\%$
test_compile_indexing[slice-pytree-compile] 0.3075ms 0.1336ms 7.4857 KOps/s 7.2590 KOps/s $\color{#35bf28}+3.12\%$
test_compile_indexing[slice-pytree-eager] 61.7120μs 22.8365μs 43.7896 KOps/s 42.6197 KOps/s $\color{#35bf28}+2.74\%$
test_compile_indexing[int-tensordict-compile] 0.3053ms 0.1419ms 7.0458 KOps/s 6.9395 KOps/s $\color{#35bf28}+1.53\%$
test_compile_indexing[int-tensordict-eager] 0.4484ms 26.8689μs 37.2178 KOps/s 37.0497 KOps/s $\color{#35bf28}+0.45\%$
test_compile_indexing[int-tensorclass-compile] 0.3274ms 0.1343ms 7.4433 KOps/s 7.3541 KOps/s $\color{#35bf28}+1.21\%$
test_compile_indexing[int-tensorclass-eager] 80.7520μs 23.0704μs 43.3455 KOps/s 43.4090 KOps/s $\color{#d91a1a}-0.15\%$
test_compile_indexing[int-pytree-compile] 0.3459ms 0.1339ms 7.4689 KOps/s 7.3553 KOps/s $\color{#35bf28}+1.54\%$
test_compile_indexing[int-pytree-eager] 0.3255ms 23.0272μs 43.4270 KOps/s 43.2019 KOps/s $\color{#35bf28}+0.52\%$
test_mod_add[eager] 0.2512ms 33.6670μs 29.7027 KOps/s 30.0902 KOps/s $\color{#d91a1a}-1.29\%$
test_mod_add[compile] 0.2090ms 70.3575μs 14.2131 KOps/s 13.5692 KOps/s $\color{#35bf28}+4.75\%$
test_mod_add[compile-overhead] 0.2642ms 0.1378ms 7.2558 KOps/s 6.2185 KOps/s $\textbf{\color{#35bf28}+16.68\%}$
test_mod_wrap[eager] 0.4415ms 0.2720ms 3.6760 KOps/s 3.7922 KOps/s $\color{#d91a1a}-3.06\%$
test_mod_wrap[compile] 1.2215ms 0.3065ms 3.2623 KOps/s 3.2158 KOps/s $\color{#35bf28}+1.45\%$
test_mod_wrap[compile-overhead] 8.2738ms 4.3381ms 230.5172 Ops/s 222.6593 Ops/s $\color{#35bf28}+3.53\%$
test_mod_wrap_and_backward[eager] 1.5811ms 1.3812ms 724.0173 Ops/s 719.0035 Ops/s $\color{#35bf28}+0.70\%$
test_mod_wrap_and_backward[compile] 2.7295ms 1.3681ms 730.9550 Ops/s 718.4106 Ops/s $\color{#35bf28}+1.75\%$
test_mod_wrap_and_backward[compile-overhead] 1.3813ms 0.9240ms 1.0822 KOps/s 1.0733 KOps/s $\color{#35bf28}+0.83\%$
test_seq_add[eager] 0.2597ms 0.1031ms 9.6994 KOps/s 9.7898 KOps/s $\color{#d91a1a}-0.92\%$
test_seq_add[compile] 0.2280ms 84.1553μs 11.8828 KOps/s 11.7829 KOps/s $\color{#35bf28}+0.85\%$
test_seq_add[compile-overhead] 0.2712ms 0.1198ms 8.3458 KOps/s 8.3008 KOps/s $\color{#35bf28}+0.54\%$
test_seq_wrap[eager] 0.5794ms 0.4010ms 2.4935 KOps/s 2.4348 KOps/s $\color{#35bf28}+2.41\%$
test_seq_wrap[compile] 0.4907ms 0.3195ms 3.1295 KOps/s 3.0490 KOps/s $\color{#35bf28}+2.64\%$
test_seq_wrap[compile-overhead] 0.4041ms 0.2293ms 4.3607 KOps/s 4.3043 KOps/s $\color{#35bf28}+1.31\%$
test_func_call_runtime[False-eager] 1.0438ms 0.8093ms 1.2356 KOps/s 1.3050 KOps/s $\textbf{\color{#d91a1a}-5.31\%}$
test_func_call_runtime[False-compile] 1.0184ms 0.8095ms 1.2353 KOps/s 1.2231 KOps/s $\color{#35bf28}+1.00\%$
test_func_call_runtime[False-compile-overhead] 0.5797ms 0.3738ms 2.6751 KOps/s 2.6247 KOps/s $\color{#35bf28}+1.92\%$
test_func_call_runtime[True-eager] 1.1571ms 0.9533ms 1.0489 KOps/s 1.0381 KOps/s $\color{#35bf28}+1.05\%$
test_func_call_runtime[True-compile] 1.1090ms 0.8584ms 1.1649 KOps/s 1.1464 KOps/s $\color{#35bf28}+1.61\%$
test_func_call_runtime[True-compile-overhead] 0.5587ms 0.4180ms 2.3925 KOps/s 2.3489 KOps/s $\color{#35bf28}+1.86\%$
test_func_call_cm_runtime[False-eager] 0.9431ms 0.7545ms 1.3254 KOps/s 1.2347 KOps/s $\textbf{\color{#35bf28}+7.35\%}$
test_func_call_cm_runtime[False-compile] 0.9828ms 0.8185ms 1.2218 KOps/s 1.2091 KOps/s $\color{#35bf28}+1.05\%$
test_func_call_cm_runtime[False-compile-overhead] 0.5182ms 0.3738ms 2.6755 KOps/s 2.6232 KOps/s $\color{#35bf28}+2.00\%$
test_func_call_cm_runtime[True-eager] 1.2072ms 1.0538ms 948.9175 Ops/s 900.8351 Ops/s $\textbf{\color{#35bf28}+5.34\%}$
test_func_call_cm_runtime[True-compile] 1.2034ms 1.0398ms 961.7157 Ops/s 949.0728 Ops/s $\color{#35bf28}+1.33\%$
test_func_call_cm_runtime[True-compile-overhead] 1.2925ms 1.0554ms 947.4691 Ops/s 950.7106 Ops/s $\color{#d91a1a}-0.34\%$
test_distributed 1.1018ms 73.2294μs 13.6557 KOps/s 13.8871 KOps/s $\color{#d91a1a}-1.67\%$
test_tdmodule 54.9610μs 15.9612μs 62.6521 KOps/s 68.1826 KOps/s $\textbf{\color{#d91a1a}-8.11\%}$
test_tdmodule_dispatch 48.2110μs 32.6764μs 30.6031 KOps/s 33.0504 KOps/s $\textbf{\color{#d91a1a}-7.40\%}$
test_tdseq 31.7210μs 16.5848μs 60.2960 KOps/s 63.0168 KOps/s $\color{#d91a1a}-4.32\%$
test_tdseq_dispatch 50.9010μs 34.1535μs 29.2796 KOps/s 30.2842 KOps/s $\color{#d91a1a}-3.32\%$
test_instantiation_functorch 2.1906ms 2.0357ms 491.2337 Ops/s 476.0856 Ops/s $\color{#35bf28}+3.18\%$
test_instantiation_td 2.1257ms 1.3451ms 743.4496 Ops/s 737.9527 Ops/s $\color{#35bf28}+0.74\%$
test_exec_functorch 0.4097ms 0.2309ms 4.3305 KOps/s 4.3305 KOps/s $+0.00\%$
test_exec_functional_call 0.4045ms 0.2267ms 4.4120 KOps/s 4.4128 KOps/s $\color{#d91a1a}-0.02\%$
test_exec_td 0.4033ms 0.2264ms 4.4174 KOps/s 4.2906 KOps/s $\color{#35bf28}+2.96\%$
test_exec_td_decorator 0.5897ms 0.2788ms 3.5863 KOps/s 3.4989 KOps/s $\color{#35bf28}+2.50\%$
test_vmap_mlp_speed[True-True] 0.8321ms 0.6550ms 1.5267 KOps/s 1.5121 KOps/s $\color{#35bf28}+0.96\%$
test_vmap_mlp_speed[True-False] 0.7938ms 0.6533ms 1.5307 KOps/s 1.5300 KOps/s $\color{#35bf28}+0.05\%$
test_vmap_mlp_speed[False-True] 0.8137ms 0.6028ms 1.6590 KOps/s 1.7292 KOps/s $\color{#d91a1a}-4.06\%$
test_vmap_mlp_speed[False-False] 0.8017ms 0.5927ms 1.6871 KOps/s 1.7351 KOps/s $\color{#d91a1a}-2.77\%$
test_vmap_mlp_speed_decorator[True-True] 1.5948ms 0.7174ms 1.3940 KOps/s 1.4220 KOps/s $\color{#d91a1a}-1.97\%$
test_vmap_mlp_speed_decorator[True-False] 0.9241ms 0.7144ms 1.3997 KOps/s 1.4190 KOps/s $\color{#d91a1a}-1.36\%$
test_vmap_mlp_speed_decorator[False-True] 0.8577ms 0.6260ms 1.5975 KOps/s 1.6139 KOps/s $\color{#d91a1a}-1.01\%$
test_vmap_mlp_speed_decorator[False-False] 0.8485ms 0.6330ms 1.5798 KOps/s 1.6097 KOps/s $\color{#d91a1a}-1.86\%$
test_vmap_transformer_speed[True-True] 9.0158ms 8.8393ms 113.1305 Ops/s 113.2016 Ops/s $\color{#d91a1a}-0.06\%$
test_vmap_transformer_speed[True-False] 9.2702ms 8.7970ms 113.6756 Ops/s 113.4200 Ops/s $\color{#35bf28}+0.23\%$
test_vmap_transformer_speed[False-True] 8.9875ms 8.7366ms 114.4615 Ops/s 114.7378 Ops/s $\color{#d91a1a}-0.24\%$
test_vmap_transformer_speed[False-False] 8.8626ms 8.7075ms 114.8432 Ops/s 114.7891 Ops/s $\color{#35bf28}+0.05\%$
test_vmap_transformer_speed_decorator[True-True] 21.0927ms 20.8231ms 48.0235 Ops/s 48.1073 Ops/s $\color{#d91a1a}-0.17\%$
test_vmap_transformer_speed_decorator[True-False] 21.0641ms 20.8835ms 47.8848 Ops/s 47.9258 Ops/s $\color{#d91a1a}-0.09\%$
test_vmap_transformer_speed_decorator[False-True] 21.5130ms 20.7393ms 48.2177 Ops/s 48.4467 Ops/s $\color{#d91a1a}-0.47\%$
test_vmap_transformer_speed_decorator[False-False] 20.9555ms 20.7089ms 48.2885 Ops/s 48.3938 Ops/s $\color{#d91a1a}-0.22\%$
test_to_module_speed[True] 2.4273ms 1.1579ms 863.5981 Ops/s 866.8960 Ops/s $\color{#d91a1a}-0.38\%$
test_to_module_speed[False] 1.6255ms 1.1366ms 879.8518 Ops/s 881.2403 Ops/s $\color{#d91a1a}-0.16\%$
test_tc_init 82.1120μs 38.2978μs 26.1111 KOps/s 27.7478 KOps/s $\textbf{\color{#d91a1a}-5.90\%}$
test_tc_init_nested 0.2037ms 76.5708μs 13.0598 KOps/s 13.6811 KOps/s $\color{#d91a1a}-4.54\%$
test_tc_first_layer_tensor 16.8700μs 0.9272μs 1.0786 MOps/s 1.2783 MOps/s $\textbf{\color{#d91a1a}-15.62\%}$
test_tc_first_layer_nontensor 21.4510μs 2.5531μs 391.6814 KOps/s 392.0741 KOps/s $\color{#d91a1a}-0.10\%$
test_tc_second_layer_tensor 23.8500μs 1.7318μs 577.4196 KOps/s 619.6230 KOps/s $\textbf{\color{#d91a1a}-6.81\%}$
test_tc_second_layer_nontensor 18.7710μs 3.4240μs 292.0579 KOps/s 296.5386 KOps/s $\color{#d91a1a}-1.51\%$
test_unbind 0.1883s 13.1743ms 75.9052 Ops/s 81.1499 Ops/s $\textbf{\color{#d91a1a}-6.46\%}$
test_full_like 0.7615ms 0.5789ms 1.7273 KOps/s 1.7313 KOps/s $\color{#d91a1a}-0.23\%$
test_zeros_like 0.3482ms 0.1979ms 5.0534 KOps/s 5.0546 KOps/s $\color{#d91a1a}-0.02\%$
test_ones_like 0.3485ms 0.1978ms 5.0565 KOps/s 5.0564 KOps/s $+0.00\%$
test_clone 0.5906ms 0.4150ms 2.4095 KOps/s 2.4069 KOps/s $\color{#35bf28}+0.11\%$
test_squeeze 29.0110μs 10.7612μs 92.9266 KOps/s 91.8588 KOps/s $\color{#35bf28}+1.16\%$
test_unsqueeze 0.2439ms 79.4173μs 12.5917 KOps/s 12.2763 KOps/s $\color{#35bf28}+2.57\%$
test_split 0.4228ms 0.1723ms 5.8030 KOps/s 5.5362 KOps/s $\color{#35bf28}+4.82\%$
test_permute 0.2646ms 0.1882ms 5.3124 KOps/s 5.2233 KOps/s $\color{#35bf28}+1.71\%$
test_stack 1.3779ms 0.9133ms 1.0949 KOps/s 1.1380 KOps/s $\color{#d91a1a}-3.79\%$
test_cat 1.3827ms 1.2320ms 811.7191 Ops/s 811.4115 Ops/s $\color{#35bf28}+0.04\%$

@vmoens vmoens added the bug Something isn't working label Aug 5, 2024
@vmoens vmoens merged commit 6f09c43 into main Aug 5, 2024
41 of 47 checks passed
@vmoens vmoens deleted the partially-revert-get-changes branch October 21, 2024 14:01
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants