Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[NOMERG] test 0.7 builds #1210

Open
wants to merge 1 commit into
base: main
Choose a base branch
from
Open

[NOMERG] test 0.7 builds #1210

wants to merge 1 commit into from

Conversation

vmoens
Copy link
Contributor

@vmoens vmoens commented Feb 5, 2025

Description

Describe your changes in detail.

Motivation and Context

Why is this change required? What problem does it solve?
If it fixes an open issue, please link to the issue here.
You can use the syntax close #15213 if this solves the issue #15213

  • I have raised an issue to propose this change (required for new features and bug fixes)

Types of changes

What types of changes does your code introduce? Remove all that do not apply:

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds core functionality)
  • Breaking change (fix or feature that would cause existing functionality to change)
  • Documentation (update in the documentation)
  • Example (update in the folder of examples)

Checklist

Go over all the following points, and put an x in all the boxes that apply.
If you are unsure about any of these, don't hesitate to ask. We are here to help!

  • I have read the CONTRIBUTION guide (required)
  • My change requires a change to the documentation.
  • I have updated the tests accordingly (required for a bug fix or a new feature).
  • I have updated the documentation accordingly.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Feb 5, 2025
@vmoens vmoens added the ciflow/binaries/all Build all wheels label Feb 5, 2025
@vmoens vmoens force-pushed the release/0.7.0 branch 2 times, most recently from 27fffbb to 2c2d48d Compare February 5, 2025 13:36
Copy link

github-actions bot commented Feb 5, 2025

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 217. Improved: $\large\color{#35bf28}14$. Worsened: $\large\color{#d91a1a}13$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 68.3180μs 20.7607μs 48.1678 KOps/s 48.8808 KOps/s $\color{#d91a1a}-1.46\%$
test_plain_set_stack_nested 58.8110μs 20.8511μs 47.9591 KOps/s 48.6217 KOps/s $\color{#d91a1a}-1.36\%$
test_plain_set_nested_inplace 73.5480μs 22.4026μs 44.6377 KOps/s 43.7923 KOps/s $\color{#35bf28}+1.93\%$
test_plain_set_stack_nested_inplace 73.9090μs 22.4815μs 44.4809 KOps/s 44.2334 KOps/s $\color{#35bf28}+0.56\%$
test_items 22.7420μs 4.1145μs 243.0409 KOps/s 237.3754 KOps/s $\color{#35bf28}+2.39\%$
test_items_nested 0.7283ms 0.4054ms 2.4665 KOps/s 2.4649 KOps/s $\color{#35bf28}+0.06\%$
test_items_nested_locked 0.7896ms 0.4057ms 2.4648 KOps/s 2.4671 KOps/s $\color{#d91a1a}-0.09\%$
test_items_nested_leaf 0.1301ms 77.4228μs 12.9161 KOps/s 12.9086 KOps/s $\color{#35bf28}+0.06\%$
test_items_stack_nested 0.6552ms 0.4102ms 2.4379 KOps/s 2.4388 KOps/s $\color{#d91a1a}-0.04\%$
test_items_stack_nested_leaf 0.1416ms 79.9614μs 12.5060 KOps/s 12.4258 KOps/s $\color{#35bf28}+0.65\%$
test_items_stack_nested_locked 0.4698ms 0.4071ms 2.4562 KOps/s 2.4481 KOps/s $\color{#35bf28}+0.33\%$
test_keys 20.2890μs 3.4434μs 290.4148 KOps/s 284.3201 KOps/s $\color{#35bf28}+2.14\%$
test_keys_nested 0.2157ms 0.1630ms 6.1356 KOps/s 6.0963 KOps/s $\color{#35bf28}+0.64\%$
test_keys_nested_locked 1.7870ms 0.1697ms 5.8915 KOps/s 5.8904 KOps/s $\color{#35bf28}+0.02\%$
test_keys_nested_leaf 0.2070ms 0.1421ms 7.0370 KOps/s 6.9291 KOps/s $\color{#35bf28}+1.56\%$
test_keys_stack_nested 0.2519ms 0.1613ms 6.1990 KOps/s 6.0925 KOps/s $\color{#35bf28}+1.75\%$
test_keys_stack_nested_leaf 0.2328ms 0.1401ms 7.1357 KOps/s 6.9849 KOps/s $\color{#35bf28}+2.16\%$
test_keys_stack_nested_locked 0.2662ms 0.1684ms 5.9394 KOps/s 5.8805 KOps/s $\color{#35bf28}+1.00\%$
test_values 9.5600μs 1.0489μs 953.3935 KOps/s 958.4062 KOps/s $\color{#d91a1a}-0.52\%$
test_values_nested 0.1183ms 61.8490μs 16.1684 KOps/s 15.8582 KOps/s $\color{#35bf28}+1.96\%$
test_values_nested_locked 0.1062ms 62.1381μs 16.0932 KOps/s 15.7027 KOps/s $\color{#35bf28}+2.49\%$
test_values_nested_leaf 0.1348ms 71.5242μs 13.9813 KOps/s 12.4870 KOps/s $\textbf{\color{#35bf28}+11.97\%}$
test_values_stack_nested 0.1182ms 63.1706μs 15.8302 KOps/s 15.4663 KOps/s $\color{#35bf28}+2.35\%$
test_values_stack_nested_leaf 0.1569ms 70.7869μs 14.1269 KOps/s 13.5215 KOps/s $\color{#35bf28}+4.48\%$
test_values_stack_nested_locked 0.1096ms 62.9413μs 15.8878 KOps/s 15.3283 KOps/s $\color{#35bf28}+3.65\%$
test_membership 18.6350μs 0.8662μs 1.1545 MOps/s 1.3757 MOps/s $\textbf{\color{#d91a1a}-16.08\%}$
test_membership_nested 36.9990μs 2.8995μs 344.8914 KOps/s 342.1402 KOps/s $\color{#35bf28}+0.80\%$
test_membership_nested_leaf 34.3340μs 2.9265μs 341.7023 KOps/s 336.9218 KOps/s $\color{#35bf28}+1.42\%$
test_membership_stacked_nested 18.1740μs 2.8794μs 347.2896 KOps/s 328.9782 KOps/s $\textbf{\color{#35bf28}+5.57\%}$
test_membership_stacked_nested_leaf 27.3210μs 2.9042μs 344.3256 KOps/s 344.7176 KOps/s $\color{#d91a1a}-0.11\%$
test_membership_nested_last 22.5220μs 4.3372μs 230.5635 KOps/s 225.0014 KOps/s $\color{#35bf28}+2.47\%$
test_membership_nested_leaf_last 32.2800μs 4.6690μs 214.1800 KOps/s 226.5983 KOps/s $\textbf{\color{#d91a1a}-5.48\%}$
test_membership_stacked_nested_last 33.5630μs 8.0239μs 124.6274 KOps/s 227.9682 KOps/s $\textbf{\color{#d91a1a}-45.33\%}$
test_membership_stacked_nested_leaf_last 37.0300μs 8.0417μs 124.3522 KOps/s 221.2522 KOps/s $\textbf{\color{#d91a1a}-43.80\%}$
test_nested_getleaf 34.7060μs 10.5091μs 95.1559 KOps/s 94.0586 KOps/s $\color{#35bf28}+1.17\%$
test_nested_get 44.3530μs 9.9109μs 100.8992 KOps/s 99.8992 KOps/s $\color{#35bf28}+1.00\%$
test_stacked_getleaf 38.5720μs 10.4317μs 95.8617 KOps/s 93.9210 KOps/s $\color{#35bf28}+2.07\%$
test_stacked_get 39.9840μs 10.0033μs 99.9665 KOps/s 99.3616 KOps/s $\color{#35bf28}+0.61\%$
test_nested_getitemleaf 46.9480μs 11.2408μs 88.9613 KOps/s 87.7825 KOps/s $\color{#35bf28}+1.34\%$
test_nested_getitem 51.4360μs 10.6894μs 93.5503 KOps/s 93.0789 KOps/s $\color{#35bf28}+0.51\%$
test_stacked_getitemleaf 37.5010μs 11.2932μs 88.5487 KOps/s 88.6464 KOps/s $\color{#d91a1a}-0.11\%$
test_stacked_getitem 47.7790μs 10.5314μs 94.9537 KOps/s 91.4801 KOps/s $\color{#35bf28}+3.80\%$
test_lock_nested 0.8510ms 0.4054ms 2.4666 KOps/s 2.4347 KOps/s $\color{#35bf28}+1.31\%$
test_lock_stack_nested 0.5032ms 0.4115ms 2.4303 KOps/s 2.3268 KOps/s $\color{#35bf28}+4.45\%$
test_unlock_nested 0.7981ms 0.3332ms 3.0009 KOps/s 2.9999 KOps/s $\color{#35bf28}+0.03\%$
test_unlock_stack_nested 0.4397ms 0.3316ms 3.0160 KOps/s 2.9331 KOps/s $\color{#35bf28}+2.83\%$
test_flatten_speed 0.1626ms 99.4511μs 10.0552 KOps/s 9.7054 KOps/s $\color{#35bf28}+3.60\%$
test_unflatten_speed 0.6183ms 0.5169ms 1.9345 KOps/s 1.9117 KOps/s $\color{#35bf28}+1.20\%$
test_common_ops 4.1663ms 0.8115ms 1.2323 KOps/s 1.2457 KOps/s $\color{#d91a1a}-1.07\%$
test_creation 28.9240μs 2.5018μs 399.7109 KOps/s 395.8424 KOps/s $\color{#35bf28}+0.98\%$
test_creation_empty 51.1460μs 11.7543μs 85.0750 KOps/s 89.2753 KOps/s $\color{#d91a1a}-4.70\%$
test_creation_nested_1 43.7920μs 14.6921μs 68.0638 KOps/s 71.2037 KOps/s $\color{#d91a1a}-4.41\%$
test_creation_nested_2 75.4510μs 19.2381μs 51.9801 KOps/s 53.3938 KOps/s $\color{#d91a1a}-2.65\%$
test_clone 0.1800ms 14.8378μs 67.3957 KOps/s 72.6787 KOps/s $\textbf{\color{#d91a1a}-7.27\%}$
test_getitem[int] 0.8672ms 12.8225μs 77.9879 KOps/s 77.7220 KOps/s $\color{#35bf28}+0.34\%$
test_getitem[slice_int] 0.1320ms 24.1068μs 41.4821 KOps/s 39.8627 KOps/s $\color{#35bf28}+4.06\%$
test_getitem[range] 0.1633ms 49.7698μs 20.0925 KOps/s 19.4528 KOps/s $\color{#35bf28}+3.29\%$
test_getitem[tuple] 0.1361ms 20.2115μs 49.4767 KOps/s 49.7958 KOps/s $\color{#d91a1a}-0.64\%$
test_getitem[list] 0.3208ms 45.3941μs 22.0293 KOps/s 21.2902 KOps/s $\color{#35bf28}+3.47\%$
test_setitem_dim[int] 62.5370μs 26.1379μs 38.2586 KOps/s 38.2549 KOps/s $+0.01\%$
test_setitem_dim[slice_int] 83.2560μs 52.1006μs 19.1936 KOps/s 19.0098 KOps/s $\color{#35bf28}+0.97\%$
test_setitem_dim[range] 0.1162ms 76.0597μs 13.1476 KOps/s 12.7749 KOps/s $\color{#35bf28}+2.92\%$
test_setitem_dim[tuple] 87.5140μs 40.8795μs 24.4621 KOps/s 24.2454 KOps/s $\color{#35bf28}+0.89\%$
test_setitem 0.1440ms 21.0239μs 47.5649 KOps/s 47.9775 KOps/s $\color{#d91a1a}-0.86\%$
test_set 75.0830μs 20.3125μs 49.2308 KOps/s 48.8532 KOps/s $\color{#35bf28}+0.77\%$
test_set_shared 4.4356ms 0.1857ms 5.3855 KOps/s 5.3735 KOps/s $\color{#35bf28}+0.22\%$
test_update 0.2809ms 23.1115μs 43.2686 KOps/s 43.2132 KOps/s $\color{#35bf28}+0.13\%$
test_update_nested 0.4849ms 33.0769μs 30.2325 KOps/s 30.5185 KOps/s $\color{#d91a1a}-0.94\%$
test_update__nested 0.1395ms 34.0196μs 29.3949 KOps/s 29.4522 KOps/s $\color{#d91a1a}-0.19\%$
test_set_nested 0.1311ms 22.6825μs 44.0868 KOps/s 44.7598 KOps/s $\color{#d91a1a}-1.50\%$
test_set_nested_new 0.1557ms 27.6009μs 36.2307 KOps/s 37.4192 KOps/s $\color{#d91a1a}-3.18\%$
test_select 98.4650μs 42.8520μs 23.3361 KOps/s 23.1553 KOps/s $\color{#35bf28}+0.78\%$
test_select_nested 0.1608ms 62.6662μs 15.9576 KOps/s 15.7317 KOps/s $\color{#35bf28}+1.44\%$
test_exclude_nested 0.1713ms 80.4747μs 12.4263 KOps/s 12.1749 KOps/s $\color{#35bf28}+2.06\%$
test_empty[True] 0.5375ms 0.4030ms 2.4814 KOps/s 2.4329 KOps/s $\color{#35bf28}+1.99\%$
test_empty[False] 7.9625μs 1.4019μs 713.3186 KOps/s 734.8090 KOps/s $\color{#d91a1a}-2.92\%$
test_unbind_speed 0.3367ms 0.2680ms 3.7310 KOps/s 3.6767 KOps/s $\color{#35bf28}+1.48\%$
test_unbind_speed_stack0 0.4337ms 0.2608ms 3.8338 KOps/s 3.7261 KOps/s $\color{#35bf28}+2.89\%$
test_unbind_speed_stack1 0.1121s 0.7216ms 1.3858 KOps/s 1.2069 KOps/s $\textbf{\color{#35bf28}+14.82\%}$
test_split 0.1127s 1.7425ms 573.8876 Ops/s 632.1916 Ops/s $\textbf{\color{#d91a1a}-9.22\%}$
test_chunk 0.1312s 1.7609ms 567.8959 Ops/s 566.3323 Ops/s $\color{#35bf28}+0.28\%$
test_consolidate_njt[False-None] 10.0245ms 8.4300ms 118.6234 Ops/s 121.2037 Ops/s $\color{#d91a1a}-2.13\%$
test_creation[device0] 0.3003ms 92.0444μs 10.8643 KOps/s 10.7455 KOps/s $\color{#35bf28}+1.11\%$
test_creation_from_tensor 4.1407ms 97.2553μs 10.2822 KOps/s 10.2052 KOps/s $\color{#35bf28}+0.75\%$
test_add_one[memmap_tensor0] 79.8500μs 4.6365μs 215.6777 KOps/s 190.0964 KOps/s $\textbf{\color{#35bf28}+13.46\%}$
test_contiguous[memmap_tensor0] 29.4160μs 0.5058μs 1.9772 MOps/s 1.9471 MOps/s $\color{#35bf28}+1.54\%$
test_stack[memmap_tensor0] 19.8670μs 3.2784μs 305.0267 KOps/s 286.8480 KOps/s $\textbf{\color{#35bf28}+6.34\%}$
test_memmaptd_index 1.3981ms 0.2298ms 4.3514 KOps/s 4.2793 KOps/s $\color{#35bf28}+1.69\%$
test_memmaptd_index_astensor 0.5428ms 0.3154ms 3.1703 KOps/s 3.0868 KOps/s $\color{#35bf28}+2.71\%$
test_memmaptd_index_op 0.8208ms 0.5828ms 1.7158 KOps/s 1.6526 KOps/s $\color{#35bf28}+3.82\%$
test_serialize_model 0.2383s 0.1352s 7.3967 Ops/s 8.4639 Ops/s $\textbf{\color{#d91a1a}-12.61\%}$
test_serialize_model_pickle 0.4476s 0.3869s 2.5845 Ops/s 2.5624 Ops/s $\color{#35bf28}+0.86\%$
test_serialize_weights 0.1219s 0.1176s 8.5027 Ops/s 8.6417 Ops/s $\color{#d91a1a}-1.61\%$
test_serialize_weights_returnearly 0.1751s 0.1618s 6.1821 Ops/s 6.0658 Ops/s $\color{#35bf28}+1.92\%$
test_serialize_weights_pickle 0.4895s 0.4181s 2.3920 Ops/s 2.2687 Ops/s $\textbf{\color{#35bf28}+5.44\%}$
test_serialize_weights_filesystem 0.1539s 0.1478s 6.7645 Ops/s 6.6609 Ops/s $\color{#35bf28}+1.56\%$
test_serialize_model_filesystem 0.2738s 0.1680s 5.9534 Ops/s 6.4296 Ops/s $\textbf{\color{#d91a1a}-7.41\%}$
test_reshape_pytree 71.7940μs 26.1139μs 38.2938 KOps/s 37.2055 KOps/s $\color{#35bf28}+2.92\%$
test_reshape_td 89.2070μs 33.8915μs 29.5059 KOps/s 30.3055 KOps/s $\color{#d91a1a}-2.64\%$
test_view_pytree 0.1180ms 25.9890μs 38.4778 KOps/s 36.9990 KOps/s $\color{#35bf28}+4.00\%$
test_view_td 90.9410μs 38.9879μs 25.6490 KOps/s 25.3368 KOps/s $\color{#35bf28}+1.23\%$
test_unbind_pytree 67.7070μs 29.0056μs 34.4761 KOps/s 33.8234 KOps/s $\color{#35bf28}+1.93\%$
test_unbind_td 0.3508ms 40.0216μs 24.9865 KOps/s 25.2591 KOps/s $\color{#d91a1a}-1.08\%$
test_split_pytree 74.2890μs 28.5856μs 34.9827 KOps/s 34.0862 KOps/s $\color{#35bf28}+2.63\%$
test_split_td 0.2503ms 44.6279μs 22.4075 KOps/s 21.6321 KOps/s $\color{#35bf28}+3.58\%$
test_add_pytree 93.3350μs 36.0953μs 27.7045 KOps/s 26.9423 KOps/s $\color{#35bf28}+2.83\%$
test_add_td 0.1750ms 62.2129μs 16.0738 KOps/s 17.4330 KOps/s $\textbf{\color{#d91a1a}-7.80\%}$
test_compile_add_one_nested[tensordict-compile] 0.1760ms 68.2550μs 14.6509 KOps/s 14.6718 KOps/s $\color{#d91a1a}-0.14\%$
test_compile_add_one_nested[tensordict-eager] 0.3457ms 0.1701ms 5.8794 KOps/s 5.7935 KOps/s $\color{#35bf28}+1.48\%$
test_compile_add_one_nested[pytree-compile] 0.1202ms 47.1841μs 21.1936 KOps/s 21.2941 KOps/s $\color{#d91a1a}-0.47\%$
test_compile_add_one_nested[pytree-eager] 0.2094ms 0.1177ms 8.4970 KOps/s 8.2035 KOps/s $\color{#35bf28}+3.58\%$
test_compile_copy_nested[tensordict-compile] 90.9700μs 29.2903μs 34.1410 KOps/s 35.1499 KOps/s $\color{#d91a1a}-2.87\%$
test_compile_copy_nested[tensordict-eager] 0.1108ms 58.7023μs 17.0351 KOps/s 17.0656 KOps/s $\color{#d91a1a}-0.18\%$
test_compile_copy_nested[pytree-compile] 0.1492ms 79.5672μs 12.5680 KOps/s 12.2452 KOps/s $\color{#35bf28}+2.64\%$
test_compile_copy_nested[pytree-eager] 0.1295ms 67.3275μs 14.8528 KOps/s 14.7270 KOps/s $\color{#35bf28}+0.85\%$
test_compile_add_one_flat[tensordict-compile] 0.2174ms 0.1090ms 9.1773 KOps/s 9.3024 KOps/s $\color{#d91a1a}-1.35\%$
test_compile_add_one_flat[tensordict-eager] 0.4297ms 0.2145ms 4.6612 KOps/s 4.6163 KOps/s $\color{#35bf28}+0.97\%$
test_compile_add_one_flat[tensorclass-compile] 0.1126ms 47.8129μs 20.9148 KOps/s 21.0515 KOps/s $\color{#d91a1a}-0.65\%$
test_compile_add_one_flat[tensorclass-eager] 0.1479ms 65.8012μs 15.1973 KOps/s 14.7386 KOps/s $\color{#35bf28}+3.11\%$
test_compile_add_one_flat[pytree-compile] 0.1943ms 0.1029ms 9.7218 KOps/s 9.8389 KOps/s $\color{#d91a1a}-1.19\%$
test_compile_add_one_flat[pytree-eager] 0.4336ms 0.2022ms 4.9456 KOps/s 4.8481 KOps/s $\color{#35bf28}+2.01\%$
test_compile_add_self_flat[tensordict-eager] 0.4908ms 0.2318ms 4.3149 KOps/s 4.2609 KOps/s $\color{#35bf28}+1.27\%$
test_compile_add_self_flat[tensordict-compile] 0.1974ms 0.1087ms 9.2004 KOps/s 9.1553 KOps/s $\color{#35bf28}+0.49\%$
test_compile_add_self_flat[tensorclass-eager] 0.1584ms 62.8566μs 15.9092 KOps/s 15.7538 KOps/s $\color{#35bf28}+0.99\%$
test_compile_add_self_flat[tensorclass-compile] 0.3623ms 50.6911μs 19.7273 KOps/s 20.5330 KOps/s $\color{#d91a1a}-3.92\%$
test_compile_add_self_flat[pytree-eager] 0.3049ms 0.1564ms 6.3923 KOps/s 6.1954 KOps/s $\color{#35bf28}+3.18\%$
test_compile_add_self_flat[pytree-compile] 0.1917ms 0.1025ms 9.7588 KOps/s 9.6607 KOps/s $\color{#35bf28}+1.01\%$
test_compile_copy_flat[tensordict-compile] 78.3160μs 22.6153μs 44.2178 KOps/s 45.9102 KOps/s $\color{#d91a1a}-3.69\%$
test_compile_copy_flat[tensordict-eager] 0.1512ms 66.6848μs 14.9959 KOps/s 14.9319 KOps/s $\color{#35bf28}+0.43\%$
test_compile_copy_flat[pytree-compile] 0.1579ms 84.9730μs 11.7684 KOps/s 12.2511 KOps/s $\color{#d91a1a}-3.94\%$
test_compile_copy_flat[pytree-eager] 0.1206ms 66.6910μs 14.9945 KOps/s 14.6050 KOps/s $\color{#35bf28}+2.67\%$
test_compile_assign_and_add[tensordict-compile] 0.3621ms 0.2145ms 4.6620 KOps/s 4.4595 KOps/s $\color{#35bf28}+4.54\%$
test_compile_assign_and_add[tensordict-eager] 1.6866ms 1.3607ms 734.9247 Ops/s 704.1142 Ops/s $\color{#35bf28}+4.38\%$
test_compile_assign_and_add[pytree-compile] 0.3242ms 0.2105ms 4.7504 KOps/s 4.6756 KOps/s $\color{#35bf28}+1.60\%$
test_compile_assign_and_add[pytree-eager] 1.0379ms 0.8317ms 1.2024 KOps/s 1.1853 KOps/s $\color{#35bf28}+1.44\%$
test_compile_assign_and_add_stack[compile] 0.6766ms 0.4573ms 2.1869 KOps/s 2.1149 KOps/s $\color{#35bf28}+3.41\%$
test_compile_assign_and_add_stack[eager] 2.9403ms 2.6841ms 372.5586 Ops/s 363.9606 Ops/s $\color{#35bf28}+2.36\%$
test_compile_indexing[tensor-tensordict-compile] 0.1022ms 40.6230μs 24.6166 KOps/s 25.0240 KOps/s $\color{#d91a1a}-1.63\%$
test_compile_indexing[tensor-tensordict-eager] 0.6601ms 33.2339μs 30.0898 KOps/s 28.9262 KOps/s $\color{#35bf28}+4.02\%$
test_compile_indexing[tensor-tensorclass-compile] 90.6700μs 31.7622μs 31.4839 KOps/s 30.9264 KOps/s $\color{#35bf28}+1.80\%$
test_compile_indexing[tensor-tensorclass-eager] 77.2750μs 22.9976μs 43.4828 KOps/s 42.5851 KOps/s $\color{#35bf28}+2.11\%$
test_compile_indexing[tensor-pytree-compile] 0.1277ms 32.7483μs 30.5359 KOps/s 30.0482 KOps/s $\color{#35bf28}+1.62\%$
test_compile_indexing[tensor-pytree-eager] 0.1275ms 23.2593μs 42.9935 KOps/s 42.2334 KOps/s $\color{#35bf28}+1.80\%$
test_compile_indexing[slice-tensordict-compile] 0.1412ms 54.4894μs 18.3522 KOps/s 18.2553 KOps/s $\color{#35bf28}+0.53\%$
test_compile_indexing[slice-tensordict-eager] 0.4046ms 19.7103μs 50.7349 KOps/s 47.0406 KOps/s $\textbf{\color{#35bf28}+7.85\%}$
test_compile_indexing[slice-tensorclass-compile] 0.1092ms 47.1315μs 21.2172 KOps/s 20.9316 KOps/s $\color{#35bf28}+1.36\%$
test_compile_indexing[slice-tensorclass-eager] 0.1032ms 18.2292μs 54.8571 KOps/s 51.4319 KOps/s $\textbf{\color{#35bf28}+6.66\%}$
test_compile_indexing[slice-pytree-compile] 0.1366ms 48.5409μs 20.6012 KOps/s 20.5130 KOps/s $\color{#35bf28}+0.43\%$
test_compile_indexing[slice-pytree-eager] 80.0300μs 18.2093μs 54.9170 KOps/s 52.1634 KOps/s $\textbf{\color{#35bf28}+5.28\%}$
test_compile_indexing[int-tensordict-compile] 0.1777ms 58.6123μs 17.0613 KOps/s 17.9276 KOps/s $\color{#d91a1a}-4.83\%$
test_compile_indexing[int-tensordict-eager] 0.8574ms 19.4959μs 51.2927 KOps/s 49.4108 KOps/s $\color{#35bf28}+3.81\%$
test_compile_indexing[int-tensorclass-compile] 0.1375ms 47.6155μs 21.0016 KOps/s 20.7572 KOps/s $\color{#35bf28}+1.18\%$
test_compile_indexing[int-tensorclass-eager] 69.4710μs 18.1432μs 55.1169 KOps/s 52.6073 KOps/s $\color{#35bf28}+4.77\%$
test_compile_indexing[int-pytree-compile] 0.1064ms 47.4432μs 21.0778 KOps/s 20.4715 KOps/s $\color{#35bf28}+2.96\%$
test_compile_indexing[int-pytree-eager] 60.3330μs 18.1399μs 55.1271 KOps/s 52.3424 KOps/s $\textbf{\color{#35bf28}+5.32\%}$
test_mod_add[eager] 0.1548ms 36.0131μs 27.7676 KOps/s 28.2603 KOps/s $\color{#d91a1a}-1.74\%$
test_mod_add[compile] 0.1244ms 67.5819μs 14.7969 KOps/s 14.7735 KOps/s $\color{#35bf28}+0.16\%$
test_mod_add[compile-overhead] 0.1776ms 65.3609μs 15.2997 KOps/s 14.7390 KOps/s $\color{#35bf28}+3.80\%$
test_mod_wrap[eager] 0.4310ms 0.2261ms 4.4234 KOps/s 4.2572 KOps/s $\color{#35bf28}+3.90\%$
test_mod_wrap[compile] 2.5423ms 0.2344ms 4.2655 KOps/s 4.1811 KOps/s $\color{#35bf28}+2.02\%$
test_mod_wrap[compile-overhead] 0.4385ms 0.2305ms 4.3386 KOps/s 4.2995 KOps/s $\color{#35bf28}+0.91\%$
test_mod_wrap_and_backward[eager] 15.5057ms 13.3388ms 74.9690 Ops/s 88.4577 Ops/s $\textbf{\color{#d91a1a}-15.25\%}$
test_mod_wrap_and_backward[compile] 15.5652ms 11.9431ms 83.7305 Ops/s 88.7047 Ops/s $\textbf{\color{#d91a1a}-5.61\%}$
test_mod_wrap_and_backward[compile-overhead] 14.2549ms 11.9716ms 83.5311 Ops/s 87.8987 Ops/s $\color{#d91a1a}-4.97\%$
test_seq_add[eager] 0.1950ms 0.1169ms 8.5577 KOps/s 8.4139 KOps/s $\color{#35bf28}+1.71\%$
test_seq_add[compile] 0.1801ms 81.3320μs 12.2953 KOps/s 12.4617 KOps/s $\color{#d91a1a}-1.34\%$
test_seq_add[compile-overhead] 0.1780ms 79.2797μs 12.6136 KOps/s 12.7998 KOps/s $\color{#d91a1a}-1.45\%$
test_seq_wrap[eager] 0.7524ms 0.4566ms 2.1900 KOps/s 2.1728 KOps/s $\color{#35bf28}+0.79\%$
test_seq_wrap[compile] 0.4753ms 0.2496ms 4.0060 KOps/s 3.9874 KOps/s $\color{#35bf28}+0.46\%$
test_seq_wrap[compile-overhead] 0.4436ms 0.2483ms 4.0277 KOps/s 4.0443 KOps/s $\color{#d91a1a}-0.41\%$
test_func_call_runtime[False-eager] 0.9844ms 0.5474ms 1.8267 KOps/s 1.7805 KOps/s $\color{#35bf28}+2.59\%$
test_func_call_runtime[False-compile] 0.6755ms 0.4443ms 2.2505 KOps/s 2.1880 KOps/s $\color{#35bf28}+2.86\%$
test_func_call_runtime[False-compile-overhead] 0.8045ms 0.4438ms 2.2532 KOps/s 2.2353 KOps/s $\color{#35bf28}+0.80\%$
test_func_call_runtime[True-eager] 1.1218ms 0.7639ms 1.3091 KOps/s 1.2788 KOps/s $\color{#35bf28}+2.37\%$
test_func_call_runtime[True-compile] 0.6009ms 0.4663ms 2.1447 KOps/s 2.1273 KOps/s $\color{#35bf28}+0.82\%$
test_func_call_runtime[True-compile-overhead] 0.9704ms 0.4678ms 2.1376 KOps/s 2.0746 KOps/s $\color{#35bf28}+3.04\%$
test_func_call_cm_runtime[False-eager] 0.9049ms 0.5514ms 1.8137 KOps/s 1.7843 KOps/s $\color{#35bf28}+1.64\%$
test_func_call_cm_runtime[False-compile] 0.9413ms 0.4540ms 2.2028 KOps/s 2.1899 KOps/s $\color{#35bf28}+0.59\%$
test_func_call_cm_runtime[False-compile-overhead] 0.6123ms 0.4420ms 2.2625 KOps/s 2.1933 KOps/s $\color{#35bf28}+3.15\%$
test_func_call_cm_runtime[True-eager] 1.7277ms 0.9153ms 1.0925 KOps/s 1.0820 KOps/s $\color{#35bf28}+0.97\%$
test_func_call_cm_runtime[True-compile] 1.2262ms 0.8093ms 1.2356 KOps/s 1.2054 KOps/s $\color{#35bf28}+2.50\%$
test_func_call_cm_runtime[True-compile-overhead] 1.0295ms 0.8147ms 1.2275 KOps/s 1.1933 KOps/s $\color{#35bf28}+2.87\%$
test_vmap_func_call_cm_runtime[eager] 4.7234ms 1.9425ms 514.8073 Ops/s 513.4718 Ops/s $\color{#35bf28}+0.26\%$
test_vmap_func_call_cm_runtime[compile] 0.8358ms 0.5410ms 1.8484 KOps/s 1.8055 KOps/s $\color{#35bf28}+2.38\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.9976ms 0.5379ms 1.8590 KOps/s 1.8401 KOps/s $\color{#35bf28}+1.03\%$
test_distributed 0.3154ms 0.1252ms 7.9899 KOps/s 7.6749 KOps/s $\color{#35bf28}+4.10\%$
test_tdmodule 47.9100μs 26.1095μs 38.3002 KOps/s 36.1711 KOps/s $\textbf{\color{#35bf28}+5.89\%}$
test_tdmodule_dispatch 0.6446ms 50.0802μs 19.9680 KOps/s 20.3767 KOps/s $\color{#d91a1a}-2.01\%$
test_tdseq 57.7180μs 30.1591μs 33.1575 KOps/s 33.6895 KOps/s $\color{#d91a1a}-1.58\%$
test_tdseq_dispatch 81.4430μs 54.0759μs 18.4925 KOps/s 18.3317 KOps/s $\color{#35bf28}+0.88\%$
test_instantiation_functorch 1.7574ms 1.5389ms 649.8153 Ops/s 633.2390 Ops/s $\color{#35bf28}+2.62\%$
test_exec_functorch 0.4791ms 0.1770ms 5.6501 KOps/s 5.5236 KOps/s $\color{#35bf28}+2.29\%$
test_exec_functional_call 0.3053ms 0.1697ms 5.8931 KOps/s 5.6932 KOps/s $\color{#35bf28}+3.51\%$
test_exec_td_decorator 0.5510ms 0.2313ms 4.3239 KOps/s 4.0291 KOps/s $\textbf{\color{#35bf28}+7.32\%}$
test_vmap_mlp_speed_decorator[True-True] 0.8623ms 0.6578ms 1.5203 KOps/s 1.4848 KOps/s $\color{#35bf28}+2.39\%$
test_vmap_mlp_speed_decorator[True-False] 0.8625ms 0.6537ms 1.5298 KOps/s 1.4914 KOps/s $\color{#35bf28}+2.57\%$
test_vmap_mlp_speed_decorator[False-True] 0.9026ms 0.5373ms 1.8612 KOps/s 1.8233 KOps/s $\color{#35bf28}+2.07\%$
test_vmap_mlp_speed_decorator[False-False] 0.7974ms 0.5331ms 1.8758 KOps/s 1.8344 KOps/s $\color{#35bf28}+2.26\%$
test_to_module_speed[True] 2.0051ms 1.3253ms 754.5405 Ops/s 751.2093 Ops/s $\color{#35bf28}+0.44\%$
test_to_module_speed[False] 1.8267ms 1.2917ms 774.1497 Ops/s 766.0816 Ops/s $\color{#35bf28}+1.05\%$
test_tc_init 0.1027ms 45.9788μs 21.7491 KOps/s 21.8747 KOps/s $\color{#d91a1a}-0.57\%$
test_tc_init_nested 0.1797ms 91.6949μs 10.9057 KOps/s 10.9467 KOps/s $\color{#d91a1a}-0.37\%$
test_tc_first_layer_tensor 20.0770μs 1.5450μs 647.2460 KOps/s 659.0469 KOps/s $\color{#d91a1a}-1.79\%$
test_tc_first_layer_nontensor 20.9200μs 4.7271μs 211.5470 KOps/s 215.6199 KOps/s $\color{#d91a1a}-1.89\%$
test_tc_second_layer_tensor 0.4516ms 2.9018μs 344.6192 KOps/s 356.0657 KOps/s $\color{#d91a1a}-3.21\%$
test_tc_second_layer_nontensor 35.4270μs 6.0585μs 165.0582 KOps/s 167.7901 KOps/s $\color{#d91a1a}-1.63\%$
test_unbind 0.2486s 14.5403ms 68.7745 Ops/s 73.4630 Ops/s $\textbf{\color{#d91a1a}-6.38\%}$
test_full_like 9.8351ms 8.2395ms 121.3660 Ops/s 115.2991 Ops/s $\textbf{\color{#35bf28}+5.26\%}$
test_zeros_like 5.4665ms 3.3320ms 300.1229 Ops/s 318.1976 Ops/s $\textbf{\color{#d91a1a}-5.68\%}$
test_ones_like 6.6831ms 3.4984ms 285.8443 Ops/s 276.7719 Ops/s $\color{#35bf28}+3.28\%$
test_clone 8.1410ms 5.8116ms 172.0690 Ops/s 142.2553 Ops/s $\textbf{\color{#35bf28}+20.96\%}$
test_squeeze 61.8160μs 12.2143μs 81.8713 KOps/s 78.9595 KOps/s $\color{#35bf28}+3.69\%$
test_unsqueeze 0.1626ms 91.9010μs 10.8813 KOps/s 10.6692 KOps/s $\color{#35bf28}+1.99\%$
test_split 0.4782ms 0.1945ms 5.1411 KOps/s 4.9907 KOps/s $\color{#35bf28}+3.01\%$
test_permute 0.3434ms 0.2009ms 4.9788 KOps/s 5.0210 KOps/s $\color{#d91a1a}-0.84\%$
test_stack 37.1513ms 27.4955ms 36.3696 Ops/s 35.8817 Ops/s $\color{#35bf28}+1.36\%$
test_cat 31.9557ms 26.4014ms 37.8769 Ops/s 37.2746 Ops/s $\color{#35bf28}+1.62\%$

Copy link

github-actions bot commented Feb 5, 2025

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 229. Improved: $\large\color{#35bf28}15$. Worsened: $\large\color{#d91a1a}13$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 56.8510μs 13.3695μs 74.7974 KOps/s 78.4634 KOps/s $\color{#d91a1a}-4.67\%$
test_plain_set_stack_nested 53.7200μs 13.5858μs 73.6065 KOps/s 78.1798 KOps/s $\textbf{\color{#d91a1a}-5.85\%}$
test_plain_set_nested_inplace 71.7700μs 14.4750μs 69.0847 KOps/s 72.3143 KOps/s $\color{#d91a1a}-4.47\%$
test_plain_set_stack_nested_inplace 66.9500μs 14.3425μs 69.7231 KOps/s 71.7474 KOps/s $\color{#d91a1a}-2.82\%$
test_items 27.8500μs 2.9068μs 344.0221 KOps/s 344.9230 KOps/s $\color{#d91a1a}-0.26\%$
test_items_nested 0.4231ms 0.3775ms 2.6489 KOps/s 2.6579 KOps/s $\color{#d91a1a}-0.34\%$
test_items_nested_locked 0.4407ms 0.3822ms 2.6161 KOps/s 2.6406 KOps/s $\color{#d91a1a}-0.93\%$
test_items_nested_leaf 0.1862ms 59.3379μs 16.8526 KOps/s 17.1354 KOps/s $\color{#d91a1a}-1.65\%$
test_items_stack_nested 0.4772ms 0.3778ms 2.6471 KOps/s 2.6588 KOps/s $\color{#d91a1a}-0.44\%$
test_items_stack_nested_leaf 0.1641ms 59.3510μs 16.8489 KOps/s 17.1872 KOps/s $\color{#d91a1a}-1.97\%$
test_items_stack_nested_locked 0.5239ms 0.3788ms 2.6399 KOps/s 2.6546 KOps/s $\color{#d91a1a}-0.55\%$
test_keys 35.0100μs 3.4295μs 291.5843 KOps/s 289.9347 KOps/s $\color{#35bf28}+0.57\%$
test_keys_nested 0.2062ms 88.9477μs 11.2426 KOps/s 11.5101 KOps/s $\color{#d91a1a}-2.32\%$
test_keys_nested_locked 0.7776ms 95.5637μs 10.4642 KOps/s 10.7304 KOps/s $\color{#d91a1a}-2.48\%$
test_keys_nested_leaf 0.1140ms 79.8506μs 12.5234 KOps/s 12.8162 KOps/s $\color{#d91a1a}-2.28\%$
test_keys_stack_nested 0.1287ms 90.3225μs 11.0714 KOps/s 11.2836 KOps/s $\color{#d91a1a}-1.88\%$
test_keys_stack_nested_leaf 0.1187ms 81.5261μs 12.2660 KOps/s 12.7285 KOps/s $\color{#d91a1a}-3.63\%$
test_keys_stack_nested_locked 0.1527ms 96.0726μs 10.4088 KOps/s 10.8035 KOps/s $\color{#d91a1a}-3.65\%$
test_values 7.6975μs 0.8604μs 1.1623 MOps/s 1.1734 MOps/s $\color{#d91a1a}-0.95\%$
test_values_nested 68.1210μs 37.9390μs 26.3581 KOps/s 26.7063 KOps/s $\color{#d91a1a}-1.30\%$
test_values_nested_locked 81.3700μs 39.3318μs 25.4247 KOps/s 25.6854 KOps/s $\color{#d91a1a}-1.01\%$
test_values_nested_leaf 79.1110μs 42.0339μs 23.7903 KOps/s 23.9497 KOps/s $\color{#d91a1a}-0.67\%$
test_values_stack_nested 0.2318ms 38.0477μs 26.2828 KOps/s 26.5307 KOps/s $\color{#d91a1a}-0.93\%$
test_values_stack_nested_leaf 82.5110μs 42.7469μs 23.3935 KOps/s 23.8183 KOps/s $\color{#d91a1a}-1.78\%$
test_values_stack_nested_locked 79.0000μs 39.6261μs 25.2359 KOps/s 25.6038 KOps/s $\color{#d91a1a}-1.44\%$
test_membership 2.2635μs 0.5131μs 1.9488 MOps/s 1.9007 MOps/s $\color{#35bf28}+2.53\%$
test_membership_nested 23.8755μs 1.9954μs 501.1476 KOps/s 464.6231 KOps/s $\textbf{\color{#35bf28}+7.86\%}$
test_membership_nested_leaf 15.5155μs 1.9697μs 507.6793 KOps/s 494.2725 KOps/s $\color{#35bf28}+2.71\%$
test_membership_stacked_nested 53.0610μs 2.0642μs 484.4507 KOps/s 469.7361 KOps/s $\color{#35bf28}+3.13\%$
test_membership_stacked_nested_leaf 20.8100μs 2.0416μs 489.8056 KOps/s 457.8555 KOps/s $\textbf{\color{#35bf28}+6.98\%}$
test_membership_nested_last 31.2700μs 3.0334μs 329.6615 KOps/s 316.3037 KOps/s $\color{#35bf28}+4.22\%$
test_membership_nested_leaf_last 40.4300μs 3.0191μs 331.2288 KOps/s 315.0670 KOps/s $\textbf{\color{#35bf28}+5.13\%}$
test_membership_stacked_nested_last 28.4500μs 3.5469μs 281.9358 KOps/s 320.0805 KOps/s $\textbf{\color{#d91a1a}-11.92\%}$
test_membership_stacked_nested_leaf_last 39.7100μs 3.5620μs 280.7435 KOps/s 319.8031 KOps/s $\textbf{\color{#d91a1a}-12.21\%}$
test_nested_getleaf 37.1100μs 6.1632μs 162.2531 KOps/s 160.9345 KOps/s $\color{#35bf28}+0.82\%$
test_nested_get 49.2610μs 5.8398μs 171.2387 KOps/s 171.5687 KOps/s $\color{#d91a1a}-0.19\%$
test_stacked_getleaf 46.6400μs 6.1688μs 162.1063 KOps/s 160.8645 KOps/s $\color{#35bf28}+0.77\%$
test_stacked_get 48.6600μs 5.8312μs 171.4915 KOps/s 168.9526 KOps/s $\color{#35bf28}+1.50\%$
test_nested_getitemleaf 37.3300μs 6.3859μs 156.5960 KOps/s 153.3916 KOps/s $\color{#35bf28}+2.09\%$
test_nested_getitem 51.5300μs 6.1248μs 163.2713 KOps/s 162.3338 KOps/s $\color{#35bf28}+0.58\%$
test_stacked_getitemleaf 47.7800μs 6.4268μs 155.5992 KOps/s 154.3952 KOps/s $\color{#35bf28}+0.78\%$
test_stacked_getitem 43.8710μs 6.1192μs 163.4188 KOps/s 162.9849 KOps/s $\color{#35bf28}+0.27\%$
test_lock_nested 8.9732ms 0.3485ms 2.8695 KOps/s 2.8484 KOps/s $\color{#35bf28}+0.74\%$
test_lock_stack_nested 0.4791ms 0.3430ms 2.9157 KOps/s 2.8725 KOps/s $\color{#35bf28}+1.51\%$
test_unlock_nested 0.3729ms 0.2832ms 3.5308 KOps/s 3.4746 KOps/s $\color{#35bf28}+1.62\%$
test_unlock_stack_nested 0.3160ms 0.2822ms 3.5431 KOps/s 3.4749 KOps/s $\color{#35bf28}+1.96\%$
test_flatten_speed 0.4784ms 75.9008μs 13.1751 KOps/s 13.1173 KOps/s $\color{#35bf28}+0.44\%$
test_unflatten_speed 0.7301ms 0.3243ms 3.0840 KOps/s 3.0637 KOps/s $\color{#35bf28}+0.66\%$
test_common_ops 0.8396ms 0.6562ms 1.5239 KOps/s 1.5695 KOps/s $\color{#d91a1a}-2.91\%$
test_creation 0.1150ms 1.7456μs 572.8747 KOps/s 562.6189 KOps/s $\color{#35bf28}+1.82\%$
test_creation_empty 0.1457ms 10.7140μs 93.3356 KOps/s 108.3412 KOps/s $\textbf{\color{#d91a1a}-13.85\%}$
test_creation_nested_1 0.4097ms 12.4242μs 80.4879 KOps/s 90.7067 KOps/s $\textbf{\color{#d91a1a}-11.27\%}$
test_creation_nested_2 47.8000μs 15.1245μs 66.1179 KOps/s 73.6250 KOps/s $\textbf{\color{#d91a1a}-10.20\%}$
test_clone 39.4300μs 9.8876μs 101.1370 KOps/s 96.5657 KOps/s $\color{#35bf28}+4.73\%$
test_getitem[int] 1.2157ms 10.8926μs 91.8054 KOps/s 91.8871 KOps/s $\color{#d91a1a}-0.09\%$
test_getitem[slice_int] 0.4304ms 20.8301μs 48.0074 KOps/s 47.1580 KOps/s $\color{#35bf28}+1.80\%$
test_getitem[range] 0.1781ms 36.8610μs 27.1289 KOps/s 26.8051 KOps/s $\color{#35bf28}+1.21\%$
test_getitem[tuple] 0.1182ms 18.4203μs 54.2879 KOps/s 53.4566 KOps/s $\color{#35bf28}+1.56\%$
test_getitem[list] 0.4331ms 32.3713μs 30.8915 KOps/s 30.0209 KOps/s $\color{#35bf28}+2.90\%$
test_setitem_dim[int] 47.7510μs 18.9490μs 52.7731 KOps/s 51.0883 KOps/s $\color{#35bf28}+3.30\%$
test_setitem_dim[slice_int] 0.1228ms 37.6049μs 26.5923 KOps/s 25.9149 KOps/s $\color{#35bf28}+2.61\%$
test_setitem_dim[range] 0.1007ms 52.4532μs 19.0646 KOps/s 18.5503 KOps/s $\color{#35bf28}+2.77\%$
test_setitem_dim[tuple] 52.9100μs 32.0361μs 31.2148 KOps/s 30.0145 KOps/s $\color{#35bf28}+4.00\%$
test_setitem 48.0510μs 15.2452μs 65.5942 KOps/s 63.6892 KOps/s $\color{#35bf28}+2.99\%$
test_set 52.6710μs 15.2938μs 65.3858 KOps/s 65.6776 KOps/s $\color{#d91a1a}-0.44\%$
test_set_shared 0.5995ms 0.1555ms 6.4321 KOps/s 6.3327 KOps/s $\color{#35bf28}+1.57\%$
test_update 0.5212ms 19.2294μs 52.0037 KOps/s 54.5357 KOps/s $\color{#d91a1a}-4.64\%$
test_update_nested 0.4615ms 24.3213μs 41.1162 KOps/s 40.5105 KOps/s $\color{#35bf28}+1.50\%$
test_update__nested 0.5031ms 24.2449μs 41.2458 KOps/s 39.7781 KOps/s $\color{#35bf28}+3.69\%$
test_set_nested 0.1161ms 16.6510μs 60.0565 KOps/s 61.1536 KOps/s $\color{#d91a1a}-1.79\%$
test_set_nested_new 0.4233ms 18.8901μs 52.9378 KOps/s 52.7131 KOps/s $\color{#35bf28}+0.43\%$
test_select 92.1410μs 31.2571μs 31.9927 KOps/s 31.3501 KOps/s $\color{#35bf28}+2.05\%$
test_select_nested 0.4354ms 44.6827μs 22.3801 KOps/s 22.6840 KOps/s $\color{#d91a1a}-1.34\%$
test_exclude_nested 0.4606ms 64.0828μs 15.6048 KOps/s 15.7751 KOps/s $\color{#d91a1a}-1.08\%$
test_empty[True] 0.6888ms 0.2983ms 3.3519 KOps/s 3.3776 KOps/s $\color{#d91a1a}-0.76\%$
test_empty[False] 40.5452μs 0.8235μs 1.2143 MOps/s 1.1947 MOps/s $\color{#35bf28}+1.64\%$
test_to 86.8500μs 55.2241μs 18.1080 KOps/s 17.6482 KOps/s $\color{#35bf28}+2.61\%$
test_to_nonblocking 0.2430ms 47.6631μs 20.9806 KOps/s 20.7530 KOps/s $\color{#35bf28}+1.10\%$
test_unbind_speed 0.2920ms 0.2418ms 4.1360 KOps/s 4.0795 KOps/s $\color{#35bf28}+1.38\%$
test_unbind_speed_stack0 0.6777ms 0.2396ms 4.1735 KOps/s 4.0850 KOps/s $\color{#35bf28}+2.17\%$
test_unbind_speed_stack1 92.5288ms 0.7337ms 1.3630 KOps/s 1.3389 KOps/s $\color{#35bf28}+1.80\%$
test_split 93.7672ms 1.5946ms 627.1187 Ops/s 610.0595 Ops/s $\color{#35bf28}+2.80\%$
test_chunk 95.4159ms 1.6229ms 616.1855 Ops/s 608.3204 Ops/s $\color{#35bf28}+1.29\%$
test_consolidate[False-None] 3.3448ms 2.6924ms 371.4219 Ops/s 363.9142 Ops/s $\color{#35bf28}+2.06\%$
test_consolidate[default-None] 1.8760ms 1.7361ms 575.9939 Ops/s 575.2678 Ops/s $\color{#35bf28}+0.13\%$
test_consolidate[reduce-overhead-None] 2.0703ms 1.7796ms 561.9094 Ops/s 559.9303 Ops/s $\color{#35bf28}+0.35\%$
test_consolidate_njt[False-None] 6.8626ms 6.6516ms 150.3406 Ops/s 148.9390 Ops/s $\color{#35bf28}+0.94\%$
test_to[False-False-None] 1.9377ms 1.7187ms 581.8404 Ops/s 577.2099 Ops/s $\color{#35bf28}+0.80\%$
test_to[True-False-None] 1.6017ms 1.3630ms 733.6945 Ops/s 714.6849 Ops/s $\color{#35bf28}+2.66\%$
test_to[within-False-None] 4.4227ms 4.1756ms 239.4847 Ops/s 235.0509 Ops/s $\color{#35bf28}+1.89\%$
test_to[True-default-None] 5.6513ms 5.3166ms 188.0893 Ops/s 190.3569 Ops/s $\color{#d91a1a}-1.19\%$
test_to_njt[False-False-None] 7.1204ms 6.9372ms 144.1509 Ops/s 143.8521 Ops/s $\color{#35bf28}+0.21\%$
test_to_njt[True-False-None] 6.0226ms 5.4884ms 182.2019 Ops/s 178.4613 Ops/s $\color{#35bf28}+2.10\%$
test_to_njt[within-False-None] 12.6194ms 12.3929ms 80.6913 Ops/s 80.6314 Ops/s $\color{#35bf28}+0.07\%$
test_creation[device0] 0.6357ms 79.8396μs 12.5251 KOps/s 12.5914 KOps/s $\color{#d91a1a}-0.53\%$
test_creation_from_tensor 0.5758ms 82.9353μs 12.0576 KOps/s 12.0837 KOps/s $\color{#d91a1a}-0.22\%$
test_add_one[memmap_tensor0] 0.2245ms 6.2523μs 159.9413 KOps/s 155.8023 KOps/s $\color{#35bf28}+2.66\%$
test_contiguous[memmap_tensor0] 2.1440μs 0.4221μs 2.3690 MOps/s 2.3805 MOps/s $\color{#d91a1a}-0.48\%$
test_stack[memmap_tensor0] 0.1402ms 4.5827μs 218.2103 KOps/s 213.4702 KOps/s $\color{#35bf28}+2.22\%$
test_memmaptd_index 1.6385ms 0.2425ms 4.1229 KOps/s 3.9871 KOps/s $\color{#35bf28}+3.40\%$
test_memmaptd_index_astensor 0.4482ms 0.3023ms 3.3082 KOps/s 3.2270 KOps/s $\color{#35bf28}+2.52\%$
test_memmaptd_index_op 0.8034ms 0.5917ms 1.6901 KOps/s 1.7016 KOps/s $\color{#d91a1a}-0.67\%$
test_serialize_model 0.1319s 0.1304s 7.6710 Ops/s 7.6801 Ops/s $\color{#d91a1a}-0.12\%$
test_serialize_model_pickle 1.3479s 1.2161s 0.8223 Ops/s 0.8215 Ops/s $\color{#35bf28}+0.10\%$
test_serialize_weights 0.1298s 0.1292s 7.7423 Ops/s 7.7438 Ops/s $\color{#d91a1a}-0.02\%$
test_serialize_weights_returnearly 0.4925s 72.6517ms 13.7643 Ops/s 15.6750 Ops/s $\textbf{\color{#d91a1a}-12.19\%}$
test_serialize_weights_pickle 1.3708s 1.2185s 0.8207 Ops/s 0.8365 Ops/s $\color{#d91a1a}-1.89\%$
test_reshape_pytree 0.1268ms 22.6959μs 44.0609 KOps/s 44.1565 KOps/s $\color{#d91a1a}-0.22\%$
test_reshape_td 58.2210μs 26.9805μs 37.0638 KOps/s 34.7326 KOps/s $\textbf{\color{#35bf28}+6.71\%}$
test_view_pytree 0.1635ms 22.3478μs 44.7470 KOps/s 45.0059 KOps/s $\color{#d91a1a}-0.58\%$
test_view_td 0.1518ms 31.6749μs 31.5707 KOps/s 29.4285 KOps/s $\textbf{\color{#35bf28}+7.28\%}$
test_unbind_pytree 0.1634ms 27.8687μs 35.8825 KOps/s 34.7440 KOps/s $\color{#35bf28}+3.28\%$
test_unbind_td 1.1137ms 36.9610μs 27.0555 KOps/s 26.4463 KOps/s $\color{#35bf28}+2.30\%$
test_split_pytree 0.1486ms 29.9078μs 33.4361 KOps/s 32.5558 KOps/s $\color{#35bf28}+2.70\%$
test_split_td 0.1807ms 39.3329μs 25.4240 KOps/s 25.1951 KOps/s $\color{#35bf28}+0.91\%$
test_add_pytree 0.1391ms 32.8615μs 30.4307 KOps/s 29.3439 KOps/s $\color{#35bf28}+3.70\%$
test_add_td 0.2015ms 51.5955μs 19.3815 KOps/s 20.2737 KOps/s $\color{#d91a1a}-4.40\%$
test_compile_add_one_nested[tensordict-compile] 0.2818ms 0.1255ms 7.9701 KOps/s 7.7573 KOps/s $\color{#35bf28}+2.74\%$
test_compile_add_one_nested[tensordict-eager] 0.2781ms 0.1336ms 7.4833 KOps/s 7.4480 KOps/s $\color{#35bf28}+0.47\%$
test_compile_add_one_nested[pytree-compile] 0.2427ms 97.3131μs 10.2761 KOps/s 10.0970 KOps/s $\color{#35bf28}+1.77\%$
test_compile_add_one_nested[pytree-eager] 0.2969ms 0.1467ms 6.8170 KOps/s 6.6823 KOps/s $\color{#35bf28}+2.02\%$
test_compile_copy_nested[tensordict-compile] 0.1489ms 24.9724μs 40.0443 KOps/s 40.7507 KOps/s $\color{#d91a1a}-1.73\%$
test_compile_copy_nested[tensordict-eager] 0.2039ms 29.5036μs 33.8941 KOps/s 32.7784 KOps/s $\color{#35bf28}+3.40\%$
test_compile_copy_nested[pytree-compile] 0.3824ms 64.5624μs 15.4889 KOps/s 15.0724 KOps/s $\color{#35bf28}+2.76\%$
test_compile_copy_nested[pytree-eager] 81.7100μs 48.9238μs 20.4400 KOps/s 19.7569 KOps/s $\color{#35bf28}+3.46\%$
test_compile_add_one_flat[tensordict-compile] 0.3290ms 0.1432ms 6.9850 KOps/s 6.9623 KOps/s $\color{#35bf28}+0.33\%$
test_compile_add_one_flat[tensordict-eager] 0.3964ms 0.2166ms 4.6179 KOps/s 4.6345 KOps/s $\color{#d91a1a}-0.36\%$
test_compile_add_one_flat[tensorclass-compile] 0.2464ms 0.1006ms 9.9422 KOps/s 10.0676 KOps/s $\color{#d91a1a}-1.25\%$
test_compile_add_one_flat[tensorclass-eager] 0.2329ms 57.4611μs 17.4031 KOps/s 18.0694 KOps/s $\color{#d91a1a}-3.69\%$
test_compile_add_one_flat[pytree-compile] 0.2816ms 0.1376ms 7.2683 KOps/s 7.3291 KOps/s $\color{#d91a1a}-0.83\%$
test_compile_add_one_flat[pytree-eager] 0.6517ms 0.4760ms 2.1010 KOps/s 2.0815 KOps/s $\color{#35bf28}+0.94\%$
test_compile_add_self_flat[tensordict-eager] 0.4168ms 0.2640ms 3.7885 KOps/s 3.8399 KOps/s $\color{#d91a1a}-1.34\%$
test_compile_add_self_flat[tensordict-compile] 0.2937ms 0.1458ms 6.8604 KOps/s 6.9627 KOps/s $\color{#d91a1a}-1.47\%$
test_compile_add_self_flat[tensorclass-eager] 0.2236ms 68.6284μs 14.5712 KOps/s 14.6494 KOps/s $\color{#d91a1a}-0.53\%$
test_compile_add_self_flat[tensorclass-compile] 0.2748ms 0.1023ms 9.7709 KOps/s 9.9889 KOps/s $\color{#d91a1a}-2.18\%$
test_compile_add_self_flat[pytree-eager] 0.5563ms 0.4076ms 2.4532 KOps/s 2.4516 KOps/s $\color{#35bf28}+0.06\%$
test_compile_add_self_flat[pytree-compile] 0.3063ms 0.1433ms 6.9798 KOps/s 7.3359 KOps/s $\color{#d91a1a}-4.85\%$
test_compile_copy_flat[tensordict-compile] 0.2071ms 21.9108μs 45.6396 KOps/s 54.1621 KOps/s $\textbf{\color{#d91a1a}-15.74\%}$
test_compile_copy_flat[tensordict-eager] 0.1321ms 30.9620μs 32.2977 KOps/s 31.2613 KOps/s $\color{#35bf28}+3.32\%$
test_compile_copy_flat[pytree-compile] 0.1063ms 69.9916μs 14.2874 KOps/s 13.9911 KOps/s $\color{#35bf28}+2.12\%$
test_compile_copy_flat[pytree-eager] 0.1199ms 51.0419μs 19.5918 KOps/s 19.4892 KOps/s $\color{#35bf28}+0.53\%$
test_compile_assign_and_add[tensordict-compile] 1.6334ms 0.3995ms 2.5030 KOps/s 2.1793 KOps/s $\textbf{\color{#35bf28}+14.85\%}$
test_compile_assign_and_add[tensordict-eager] 2.9971ms 2.6644ms 375.3259 Ops/s 383.5191 Ops/s $\color{#d91a1a}-2.14\%$
test_compile_assign_and_add[pytree-compile] 1.6080ms 0.4342ms 2.3029 KOps/s 2.2163 KOps/s $\color{#35bf28}+3.91\%$
test_compile_assign_and_add[pytree-eager] 2.9280ms 2.6670ms 374.9535 Ops/s 374.3194 Ops/s $\color{#35bf28}+0.17\%$
test_compile_indexing[tensor-tensordict-compile] 0.6010ms 0.1192ms 8.3908 KOps/s 8.4388 KOps/s $\color{#d91a1a}-0.57\%$
test_compile_indexing[tensor-tensordict-eager] 0.5698ms 80.8752μs 12.3647 KOps/s 12.2304 KOps/s $\color{#35bf28}+1.10\%$
test_compile_indexing[tensor-tensorclass-compile] 0.4719ms 0.1098ms 9.1067 KOps/s 9.1414 KOps/s $\color{#d91a1a}-0.38\%$
test_compile_indexing[tensor-tensorclass-eager] 0.2185ms 67.0001μs 14.9254 KOps/s 14.2463 KOps/s $\color{#35bf28}+4.77\%$
test_compile_indexing[tensor-pytree-compile] 0.2833ms 0.1099ms 9.1024 KOps/s 9.0127 KOps/s $\color{#35bf28}+1.00\%$
test_compile_indexing[tensor-pytree-eager] 0.2465ms 67.4677μs 14.8219 KOps/s 14.0514 KOps/s $\textbf{\color{#35bf28}+5.48\%}$
test_compile_indexing[slice-tensordict-compile] 0.2934ms 0.1056ms 9.4707 KOps/s 9.7820 KOps/s $\color{#d91a1a}-3.18\%$
test_compile_indexing[slice-tensordict-eager] 0.2253ms 17.3105μs 57.7683 KOps/s 54.9357 KOps/s $\textbf{\color{#35bf28}+5.16\%}$
test_compile_indexing[slice-tensorclass-compile] 0.2696ms 97.4494μs 10.2617 KOps/s 10.2824 KOps/s $\color{#d91a1a}-0.20\%$
test_compile_indexing[slice-tensorclass-eager] 0.1579ms 15.8436μs 63.1170 KOps/s 47.2418 KOps/s $\textbf{\color{#35bf28}+33.60\%}$
test_compile_indexing[slice-pytree-compile] 0.2948ms 0.1001ms 9.9929 KOps/s 10.1628 KOps/s $\color{#d91a1a}-1.67\%$
test_compile_indexing[slice-pytree-eager] 0.1586ms 15.9096μs 62.8550 KOps/s 63.2399 KOps/s $\color{#d91a1a}-0.61\%$
test_compile_indexing[int-tensordict-compile] 0.2541ms 0.1017ms 9.8298 KOps/s 9.7501 KOps/s $\color{#35bf28}+0.82\%$
test_compile_indexing[int-tensordict-eager] 0.5671ms 17.1694μs 58.2430 KOps/s 57.8019 KOps/s $\color{#35bf28}+0.76\%$
test_compile_indexing[int-tensorclass-compile] 0.2724ms 97.7006μs 10.2354 KOps/s 10.1662 KOps/s $\color{#35bf28}+0.68\%$
test_compile_indexing[int-tensorclass-eager] 0.1866ms 17.0511μs 58.6471 KOps/s 62.6757 KOps/s $\textbf{\color{#d91a1a}-6.43\%}$
test_compile_indexing[int-pytree-compile] 0.2545ms 97.8693μs 10.2177 KOps/s 10.1894 KOps/s $\color{#35bf28}+0.28\%$
test_compile_indexing[int-pytree-eager] 0.1338ms 15.7763μs 63.3861 KOps/s 63.0936 KOps/s $\color{#35bf28}+0.46\%$
test_mod_add[eager] 0.1927ms 40.1486μs 24.9075 KOps/s 25.1022 KOps/s $\color{#d91a1a}-0.78\%$
test_mod_add[compile] 0.2222ms 82.1256μs 12.1765 KOps/s 12.2210 KOps/s $\color{#d91a1a}-0.36\%$
test_mod_add[compile-overhead] 0.3310ms 0.1692ms 5.9098 KOps/s 5.6407 KOps/s $\color{#35bf28}+4.77\%$
test_mod_wrap[eager] 0.3984ms 0.2502ms 3.9961 KOps/s 3.9664 KOps/s $\color{#35bf28}+0.75\%$
test_mod_wrap[compile] 0.4802ms 0.2955ms 3.3838 KOps/s 3.4368 KOps/s $\color{#d91a1a}-1.54\%$
test_mod_wrap[compile-overhead] 6.9400ms 3.7111ms 269.4625 Ops/s 273.2943 Ops/s $\color{#d91a1a}-1.40\%$
test_mod_wrap_and_backward[eager] 1.5572ms 1.3511ms 740.1573 Ops/s 689.3594 Ops/s $\textbf{\color{#35bf28}+7.37\%}$
test_mod_wrap_and_backward[compile] 1.4880ms 1.2834ms 779.1924 Ops/s 713.6000 Ops/s $\textbf{\color{#35bf28}+9.19\%}$
test_mod_wrap_and_backward[compile-overhead] 1.3715ms 0.9277ms 1.0780 KOps/s 953.5552 Ops/s $\textbf{\color{#35bf28}+13.05\%}$
test_seq_add[eager] 0.2713ms 0.1183ms 8.4523 KOps/s 8.3982 KOps/s $\color{#35bf28}+0.64\%$
test_seq_add[compile] 0.2421ms 90.8471μs 11.0075 KOps/s 10.5049 KOps/s $\color{#35bf28}+4.78\%$
test_seq_add[compile-overhead] 0.2909ms 0.1314ms 7.6097 KOps/s 7.4801 KOps/s $\color{#35bf28}+1.73\%$
test_seq_wrap[eager] 0.5832ms 0.4287ms 2.3325 KOps/s 2.3304 KOps/s $\color{#35bf28}+0.09\%$
test_seq_wrap[compile] 0.5107ms 0.3182ms 3.1424 KOps/s 3.2432 KOps/s $\color{#d91a1a}-3.11\%$
test_seq_wrap[compile-overhead] 0.3738ms 0.2265ms 4.4157 KOps/s 4.3243 KOps/s $\color{#35bf28}+2.11\%$
test_func_call_runtime[False-eager] 0.8678ms 0.7247ms 1.3798 KOps/s 1.3651 KOps/s $\color{#35bf28}+1.08\%$
test_func_call_runtime[False-compile] 0.9227ms 0.7503ms 1.3328 KOps/s 1.3019 KOps/s $\color{#35bf28}+2.38\%$
test_func_call_runtime[False-compile-overhead] 0.5014ms 0.3647ms 2.7419 KOps/s 2.6839 KOps/s $\color{#35bf28}+2.16\%$
test_func_call_runtime[True-eager] 1.0756ms 0.8912ms 1.1221 KOps/s 1.1084 KOps/s $\color{#35bf28}+1.24\%$
test_func_call_runtime[True-compile] 0.9426ms 0.7689ms 1.3006 KOps/s 1.2653 KOps/s $\color{#35bf28}+2.79\%$
test_func_call_runtime[True-compile-overhead] 0.5099ms 0.3863ms 2.5888 KOps/s 2.5560 KOps/s $\color{#35bf28}+1.28\%$
test_func_call_cm_runtime[False-eager] 0.8951ms 0.7271ms 1.3754 KOps/s 1.3750 KOps/s $\color{#35bf28}+0.03\%$
test_func_call_cm_runtime[False-compile] 0.9286ms 0.7515ms 1.3307 KOps/s 1.2927 KOps/s $\color{#35bf28}+2.94\%$
test_func_call_cm_runtime[False-compile-overhead] 0.5132ms 0.3667ms 2.7270 KOps/s 2.6819 KOps/s $\color{#35bf28}+1.68\%$
test_func_call_cm_runtime[True-eager] 1.1609ms 0.9966ms 1.0034 KOps/s 997.7598 Ops/s $\color{#35bf28}+0.57\%$
test_func_call_cm_runtime[True-compile] 1.1667ms 0.9824ms 1.0180 KOps/s 1.0202 KOps/s $\color{#d91a1a}-0.22\%$
test_func_call_cm_runtime[True-compile-overhead] 1.1858ms 0.9784ms 1.0221 KOps/s 1.0101 KOps/s $\color{#35bf28}+1.19\%$
test_vmap_func_call_cm_runtime[eager] 2.4655ms 2.0369ms 490.9440 Ops/s 482.9462 Ops/s $\color{#35bf28}+1.66\%$
test_vmap_func_call_cm_runtime[compile] 0.9608ms 0.8175ms 1.2233 KOps/s 1.1928 KOps/s $\color{#35bf28}+2.56\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.5416ms 0.4180ms 2.3925 KOps/s 2.3593 KOps/s $\color{#35bf28}+1.40\%$
test_distributed 3.9099ms 0.2809ms 3.5599 KOps/s 8.0921 KOps/s $\textbf{\color{#d91a1a}-56.01\%}$
test_tdmodule 0.3166ms 22.2187μs 45.0071 KOps/s 44.0824 KOps/s $\color{#35bf28}+2.10\%$
test_tdmodule_dispatch 61.5300μs 38.6467μs 25.8754 KOps/s 24.8649 KOps/s $\color{#35bf28}+4.06\%$
test_tdseq 0.1461ms 22.3128μs 44.8174 KOps/s 42.3152 KOps/s $\textbf{\color{#35bf28}+5.91\%}$
test_tdseq_dispatch 78.0310μs 41.7446μs 23.9552 KOps/s 22.8351 KOps/s $\color{#35bf28}+4.91\%$
test_instantiation_functorch 1.7137ms 1.5525ms 644.1100 Ops/s 636.0005 Ops/s $\color{#35bf28}+1.28\%$
test_exec_functorch 0.3381ms 0.1436ms 6.9653 KOps/s 7.0189 KOps/s $\color{#d91a1a}-0.76\%$
test_exec_functional_call 0.3136ms 0.1339ms 7.4657 KOps/s 7.4130 KOps/s $\color{#35bf28}+0.71\%$
test_exec_td_decorator 0.3872ms 0.1878ms 5.3244 KOps/s 5.3346 KOps/s $\color{#d91a1a}-0.19\%$
test_vmap_mlp_speed_decorator[True-True] 0.9302ms 0.6921ms 1.4449 KOps/s 1.4647 KOps/s $\color{#d91a1a}-1.35\%$
test_vmap_mlp_speed_decorator[True-False] 0.8922ms 0.6795ms 1.4717 KOps/s 1.4370 KOps/s $\color{#35bf28}+2.42\%$
test_vmap_mlp_speed_decorator[False-True] 0.8072ms 0.5961ms 1.6775 KOps/s 1.6053 KOps/s $\color{#35bf28}+4.50\%$
test_vmap_mlp_speed_decorator[False-False] 0.7876ms 0.5859ms 1.7067 KOps/s 1.6091 KOps/s $\textbf{\color{#35bf28}+6.07\%}$
test_vmap_transformer_speed_decorator[True-True] 19.0028ms 18.7534ms 53.3237 Ops/s 52.6524 Ops/s $\color{#35bf28}+1.28\%$
test_vmap_transformer_speed_decorator[True-False] 19.2914ms 18.7708ms 53.2742 Ops/s 51.7023 Ops/s $\color{#35bf28}+3.04\%$
test_vmap_transformer_speed_decorator[False-True] 18.7387ms 18.5871ms 53.8007 Ops/s 53.0041 Ops/s $\color{#35bf28}+1.50\%$
test_vmap_transformer_speed_decorator[False-False] 18.7533ms 18.6146ms 53.7212 Ops/s 52.8395 Ops/s $\color{#35bf28}+1.67\%$
test_to_module_speed[True] 1.1238ms 0.9580ms 1.0438 KOps/s 1.0274 KOps/s $\color{#35bf28}+1.60\%$
test_to_module_speed[False] 1.3464ms 0.9441ms 1.0592 KOps/s 1.0437 KOps/s $\color{#35bf28}+1.49\%$
test_tc_init 0.2270ms 39.3232μs 25.4303 KOps/s 26.1139 KOps/s $\color{#d91a1a}-2.62\%$
test_tc_init_nested 0.1714ms 80.7971μs 12.3767 KOps/s 13.5449 KOps/s $\textbf{\color{#d91a1a}-8.62\%}$
test_tc_first_layer_tensor 4.1644μs 0.7092μs 1.4101 MOps/s 1.2609 MOps/s $\textbf{\color{#35bf28}+11.83\%}$
test_tc_first_layer_nontensor 20.2400μs 2.2820μs 438.2094 KOps/s 450.1265 KOps/s $\color{#d91a1a}-2.65\%$
test_tc_second_layer_tensor 40.4003μs 1.4422μs 693.3818 KOps/s 709.5448 KOps/s $\color{#d91a1a}-2.28\%$
test_tc_second_layer_nontensor 32.9200μs 3.0468μs 328.2086 KOps/s 333.4055 KOps/s $\color{#d91a1a}-1.56\%$
test_unbind 0.2218s 10.2859ms 97.2203 Ops/s 143.5235 Ops/s $\textbf{\color{#d91a1a}-32.26\%}$
test_full_like 11.1722ms 9.1759ms 108.9816 Ops/s 108.1625 Ops/s $\color{#35bf28}+0.76\%$
test_zeros_like 4.9387ms 4.3233ms 231.3060 Ops/s 230.9596 Ops/s $\color{#35bf28}+0.15\%$
test_ones_like 9.2456ms 7.1415ms 140.0274 Ops/s 230.6614 Ops/s $\textbf{\color{#d91a1a}-39.29\%}$
test_clone 6.6523ms 6.3820ms 156.6912 Ops/s 156.4158 Ops/s $\color{#35bf28}+0.18\%$
test_squeeze 59.5300μs 10.0429μs 99.5732 KOps/s 102.0313 KOps/s $\color{#d91a1a}-2.41\%$
test_unsqueeze 0.2277ms 73.7627μs 13.5570 KOps/s 13.5461 KOps/s $\color{#35bf28}+0.08\%$
test_split 0.2780ms 0.1589ms 6.2946 KOps/s 6.1628 KOps/s $\color{#35bf28}+2.14\%$
test_permute 0.2940ms 0.1765ms 5.6672 KOps/s 5.7249 KOps/s $\color{#d91a1a}-1.01\%$
test_stack 50.9696ms 50.3698ms 19.8532 Ops/s 19.9221 Ops/s $\color{#d91a1a}-0.35\%$
test_cat 50.8702ms 50.2362ms 19.9060 Ops/s 19.9669 Ops/s $\color{#d91a1a}-0.31\%$

@vmoens vmoens force-pushed the release/0.7.0 branch 3 times, most recently from be87fde to d8e27f3 Compare February 5, 2025 15:39
@@ -33,7 +33,7 @@ jobs:
include:
- repository: pytorch/tensordict
smoke-test-script: test/smoke_test.py
post-script: .github/scripts/linux-post-script.sh
pre-script: .github/scripts/linux-pre-script.sh
package-name: tensordict
name: pytorch/tensordict
uses: pytorch/test-infra/.github/workflows/build_wheels_linux.yml@main
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
uses: pytorch/test-infra/.github/workflows/build_wheels_linux.yml@main
uses: pytorch/test-infra/.github/workflows/build_wheels_linux.yml@release/2.6

Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please also add test-infra-ref: release/2.6 below

@@ -19,7 +19,7 @@ permissions:

jobs:
generate-matrix:
uses: pytorch/test-infra/.github/workflows/generate_binary_build_matrix.yml@main
uses: pytorch/test-infra/.github/workflows/generate_binary_build_matrix.yml@release/2.6
with:
package-type: wheel
os: linux
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pleas add test-infra-ref: release/2.6

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
ciflow/binaries/all Build all wheels CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants