-
Notifications
You must be signed in to change notification settings - Fork 79
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[NOMERG] test 0.7 builds #1210
base: main
Are you sure you want to change the base?
[NOMERG] test 0.7 builds #1210
Conversation
27fffbb
to
2c2d48d
Compare
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 68.3180μs | 20.7607μs | 48.1678 KOps/s | 48.8808 KOps/s | |
test_plain_set_stack_nested | 58.8110μs | 20.8511μs | 47.9591 KOps/s | 48.6217 KOps/s | |
test_plain_set_nested_inplace | 73.5480μs | 22.4026μs | 44.6377 KOps/s | 43.7923 KOps/s | |
test_plain_set_stack_nested_inplace | 73.9090μs | 22.4815μs | 44.4809 KOps/s | 44.2334 KOps/s | |
test_items | 22.7420μs | 4.1145μs | 243.0409 KOps/s | 237.3754 KOps/s | |
test_items_nested | 0.7283ms | 0.4054ms | 2.4665 KOps/s | 2.4649 KOps/s | |
test_items_nested_locked | 0.7896ms | 0.4057ms | 2.4648 KOps/s | 2.4671 KOps/s | |
test_items_nested_leaf | 0.1301ms | 77.4228μs | 12.9161 KOps/s | 12.9086 KOps/s | |
test_items_stack_nested | 0.6552ms | 0.4102ms | 2.4379 KOps/s | 2.4388 KOps/s | |
test_items_stack_nested_leaf | 0.1416ms | 79.9614μs | 12.5060 KOps/s | 12.4258 KOps/s | |
test_items_stack_nested_locked | 0.4698ms | 0.4071ms | 2.4562 KOps/s | 2.4481 KOps/s | |
test_keys | 20.2890μs | 3.4434μs | 290.4148 KOps/s | 284.3201 KOps/s | |
test_keys_nested | 0.2157ms | 0.1630ms | 6.1356 KOps/s | 6.0963 KOps/s | |
test_keys_nested_locked | 1.7870ms | 0.1697ms | 5.8915 KOps/s | 5.8904 KOps/s | |
test_keys_nested_leaf | 0.2070ms | 0.1421ms | 7.0370 KOps/s | 6.9291 KOps/s | |
test_keys_stack_nested | 0.2519ms | 0.1613ms | 6.1990 KOps/s | 6.0925 KOps/s | |
test_keys_stack_nested_leaf | 0.2328ms | 0.1401ms | 7.1357 KOps/s | 6.9849 KOps/s | |
test_keys_stack_nested_locked | 0.2662ms | 0.1684ms | 5.9394 KOps/s | 5.8805 KOps/s | |
test_values | 9.5600μs | 1.0489μs | 953.3935 KOps/s | 958.4062 KOps/s | |
test_values_nested | 0.1183ms | 61.8490μs | 16.1684 KOps/s | 15.8582 KOps/s | |
test_values_nested_locked | 0.1062ms | 62.1381μs | 16.0932 KOps/s | 15.7027 KOps/s | |
test_values_nested_leaf | 0.1348ms | 71.5242μs | 13.9813 KOps/s | 12.4870 KOps/s | |
test_values_stack_nested | 0.1182ms | 63.1706μs | 15.8302 KOps/s | 15.4663 KOps/s | |
test_values_stack_nested_leaf | 0.1569ms | 70.7869μs | 14.1269 KOps/s | 13.5215 KOps/s | |
test_values_stack_nested_locked | 0.1096ms | 62.9413μs | 15.8878 KOps/s | 15.3283 KOps/s | |
test_membership | 18.6350μs | 0.8662μs | 1.1545 MOps/s | 1.3757 MOps/s | |
test_membership_nested | 36.9990μs | 2.8995μs | 344.8914 KOps/s | 342.1402 KOps/s | |
test_membership_nested_leaf | 34.3340μs | 2.9265μs | 341.7023 KOps/s | 336.9218 KOps/s | |
test_membership_stacked_nested | 18.1740μs | 2.8794μs | 347.2896 KOps/s | 328.9782 KOps/s | |
test_membership_stacked_nested_leaf | 27.3210μs | 2.9042μs | 344.3256 KOps/s | 344.7176 KOps/s | |
test_membership_nested_last | 22.5220μs | 4.3372μs | 230.5635 KOps/s | 225.0014 KOps/s | |
test_membership_nested_leaf_last | 32.2800μs | 4.6690μs | 214.1800 KOps/s | 226.5983 KOps/s | |
test_membership_stacked_nested_last | 33.5630μs | 8.0239μs | 124.6274 KOps/s | 227.9682 KOps/s | |
test_membership_stacked_nested_leaf_last | 37.0300μs | 8.0417μs | 124.3522 KOps/s | 221.2522 KOps/s | |
test_nested_getleaf | 34.7060μs | 10.5091μs | 95.1559 KOps/s | 94.0586 KOps/s | |
test_nested_get | 44.3530μs | 9.9109μs | 100.8992 KOps/s | 99.8992 KOps/s | |
test_stacked_getleaf | 38.5720μs | 10.4317μs | 95.8617 KOps/s | 93.9210 KOps/s | |
test_stacked_get | 39.9840μs | 10.0033μs | 99.9665 KOps/s | 99.3616 KOps/s | |
test_nested_getitemleaf | 46.9480μs | 11.2408μs | 88.9613 KOps/s | 87.7825 KOps/s | |
test_nested_getitem | 51.4360μs | 10.6894μs | 93.5503 KOps/s | 93.0789 KOps/s | |
test_stacked_getitemleaf | 37.5010μs | 11.2932μs | 88.5487 KOps/s | 88.6464 KOps/s | |
test_stacked_getitem | 47.7790μs | 10.5314μs | 94.9537 KOps/s | 91.4801 KOps/s | |
test_lock_nested | 0.8510ms | 0.4054ms | 2.4666 KOps/s | 2.4347 KOps/s | |
test_lock_stack_nested | 0.5032ms | 0.4115ms | 2.4303 KOps/s | 2.3268 KOps/s | |
test_unlock_nested | 0.7981ms | 0.3332ms | 3.0009 KOps/s | 2.9999 KOps/s | |
test_unlock_stack_nested | 0.4397ms | 0.3316ms | 3.0160 KOps/s | 2.9331 KOps/s | |
test_flatten_speed | 0.1626ms | 99.4511μs | 10.0552 KOps/s | 9.7054 KOps/s | |
test_unflatten_speed | 0.6183ms | 0.5169ms | 1.9345 KOps/s | 1.9117 KOps/s | |
test_common_ops | 4.1663ms | 0.8115ms | 1.2323 KOps/s | 1.2457 KOps/s | |
test_creation | 28.9240μs | 2.5018μs | 399.7109 KOps/s | 395.8424 KOps/s | |
test_creation_empty | 51.1460μs | 11.7543μs | 85.0750 KOps/s | 89.2753 KOps/s | |
test_creation_nested_1 | 43.7920μs | 14.6921μs | 68.0638 KOps/s | 71.2037 KOps/s | |
test_creation_nested_2 | 75.4510μs | 19.2381μs | 51.9801 KOps/s | 53.3938 KOps/s | |
test_clone | 0.1800ms | 14.8378μs | 67.3957 KOps/s | 72.6787 KOps/s | |
test_getitem[int] | 0.8672ms | 12.8225μs | 77.9879 KOps/s | 77.7220 KOps/s | |
test_getitem[slice_int] | 0.1320ms | 24.1068μs | 41.4821 KOps/s | 39.8627 KOps/s | |
test_getitem[range] | 0.1633ms | 49.7698μs | 20.0925 KOps/s | 19.4528 KOps/s | |
test_getitem[tuple] | 0.1361ms | 20.2115μs | 49.4767 KOps/s | 49.7958 KOps/s | |
test_getitem[list] | 0.3208ms | 45.3941μs | 22.0293 KOps/s | 21.2902 KOps/s | |
test_setitem_dim[int] | 62.5370μs | 26.1379μs | 38.2586 KOps/s | 38.2549 KOps/s | |
test_setitem_dim[slice_int] | 83.2560μs | 52.1006μs | 19.1936 KOps/s | 19.0098 KOps/s | |
test_setitem_dim[range] | 0.1162ms | 76.0597μs | 13.1476 KOps/s | 12.7749 KOps/s | |
test_setitem_dim[tuple] | 87.5140μs | 40.8795μs | 24.4621 KOps/s | 24.2454 KOps/s | |
test_setitem | 0.1440ms | 21.0239μs | 47.5649 KOps/s | 47.9775 KOps/s | |
test_set | 75.0830μs | 20.3125μs | 49.2308 KOps/s | 48.8532 KOps/s | |
test_set_shared | 4.4356ms | 0.1857ms | 5.3855 KOps/s | 5.3735 KOps/s | |
test_update | 0.2809ms | 23.1115μs | 43.2686 KOps/s | 43.2132 KOps/s | |
test_update_nested | 0.4849ms | 33.0769μs | 30.2325 KOps/s | 30.5185 KOps/s | |
test_update__nested | 0.1395ms | 34.0196μs | 29.3949 KOps/s | 29.4522 KOps/s | |
test_set_nested | 0.1311ms | 22.6825μs | 44.0868 KOps/s | 44.7598 KOps/s | |
test_set_nested_new | 0.1557ms | 27.6009μs | 36.2307 KOps/s | 37.4192 KOps/s | |
test_select | 98.4650μs | 42.8520μs | 23.3361 KOps/s | 23.1553 KOps/s | |
test_select_nested | 0.1608ms | 62.6662μs | 15.9576 KOps/s | 15.7317 KOps/s | |
test_exclude_nested | 0.1713ms | 80.4747μs | 12.4263 KOps/s | 12.1749 KOps/s | |
test_empty[True] | 0.5375ms | 0.4030ms | 2.4814 KOps/s | 2.4329 KOps/s | |
test_empty[False] | 7.9625μs | 1.4019μs | 713.3186 KOps/s | 734.8090 KOps/s | |
test_unbind_speed | 0.3367ms | 0.2680ms | 3.7310 KOps/s | 3.6767 KOps/s | |
test_unbind_speed_stack0 | 0.4337ms | 0.2608ms | 3.8338 KOps/s | 3.7261 KOps/s | |
test_unbind_speed_stack1 | 0.1121s | 0.7216ms | 1.3858 KOps/s | 1.2069 KOps/s | |
test_split | 0.1127s | 1.7425ms | 573.8876 Ops/s | 632.1916 Ops/s | |
test_chunk | 0.1312s | 1.7609ms | 567.8959 Ops/s | 566.3323 Ops/s | |
test_consolidate_njt[False-None] | 10.0245ms | 8.4300ms | 118.6234 Ops/s | 121.2037 Ops/s | |
test_creation[device0] | 0.3003ms | 92.0444μs | 10.8643 KOps/s | 10.7455 KOps/s | |
test_creation_from_tensor | 4.1407ms | 97.2553μs | 10.2822 KOps/s | 10.2052 KOps/s | |
test_add_one[memmap_tensor0] | 79.8500μs | 4.6365μs | 215.6777 KOps/s | 190.0964 KOps/s | |
test_contiguous[memmap_tensor0] | 29.4160μs | 0.5058μs | 1.9772 MOps/s | 1.9471 MOps/s | |
test_stack[memmap_tensor0] | 19.8670μs | 3.2784μs | 305.0267 KOps/s | 286.8480 KOps/s | |
test_memmaptd_index | 1.3981ms | 0.2298ms | 4.3514 KOps/s | 4.2793 KOps/s | |
test_memmaptd_index_astensor | 0.5428ms | 0.3154ms | 3.1703 KOps/s | 3.0868 KOps/s | |
test_memmaptd_index_op | 0.8208ms | 0.5828ms | 1.7158 KOps/s | 1.6526 KOps/s | |
test_serialize_model | 0.2383s | 0.1352s | 7.3967 Ops/s | 8.4639 Ops/s | |
test_serialize_model_pickle | 0.4476s | 0.3869s | 2.5845 Ops/s | 2.5624 Ops/s | |
test_serialize_weights | 0.1219s | 0.1176s | 8.5027 Ops/s | 8.6417 Ops/s | |
test_serialize_weights_returnearly | 0.1751s | 0.1618s | 6.1821 Ops/s | 6.0658 Ops/s | |
test_serialize_weights_pickle | 0.4895s | 0.4181s | 2.3920 Ops/s | 2.2687 Ops/s | |
test_serialize_weights_filesystem | 0.1539s | 0.1478s | 6.7645 Ops/s | 6.6609 Ops/s | |
test_serialize_model_filesystem | 0.2738s | 0.1680s | 5.9534 Ops/s | 6.4296 Ops/s | |
test_reshape_pytree | 71.7940μs | 26.1139μs | 38.2938 KOps/s | 37.2055 KOps/s | |
test_reshape_td | 89.2070μs | 33.8915μs | 29.5059 KOps/s | 30.3055 KOps/s | |
test_view_pytree | 0.1180ms | 25.9890μs | 38.4778 KOps/s | 36.9990 KOps/s | |
test_view_td | 90.9410μs | 38.9879μs | 25.6490 KOps/s | 25.3368 KOps/s | |
test_unbind_pytree | 67.7070μs | 29.0056μs | 34.4761 KOps/s | 33.8234 KOps/s | |
test_unbind_td | 0.3508ms | 40.0216μs | 24.9865 KOps/s | 25.2591 KOps/s | |
test_split_pytree | 74.2890μs | 28.5856μs | 34.9827 KOps/s | 34.0862 KOps/s | |
test_split_td | 0.2503ms | 44.6279μs | 22.4075 KOps/s | 21.6321 KOps/s | |
test_add_pytree | 93.3350μs | 36.0953μs | 27.7045 KOps/s | 26.9423 KOps/s | |
test_add_td | 0.1750ms | 62.2129μs | 16.0738 KOps/s | 17.4330 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1760ms | 68.2550μs | 14.6509 KOps/s | 14.6718 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.3457ms | 0.1701ms | 5.8794 KOps/s | 5.7935 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1202ms | 47.1841μs | 21.1936 KOps/s | 21.2941 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2094ms | 0.1177ms | 8.4970 KOps/s | 8.2035 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 90.9700μs | 29.2903μs | 34.1410 KOps/s | 35.1499 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1108ms | 58.7023μs | 17.0351 KOps/s | 17.0656 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1492ms | 79.5672μs | 12.5680 KOps/s | 12.2452 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1295ms | 67.3275μs | 14.8528 KOps/s | 14.7270 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.2174ms | 0.1090ms | 9.1773 KOps/s | 9.3024 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.4297ms | 0.2145ms | 4.6612 KOps/s | 4.6163 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1126ms | 47.8129μs | 20.9148 KOps/s | 21.0515 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1479ms | 65.8012μs | 15.1973 KOps/s | 14.7386 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.1943ms | 0.1029ms | 9.7218 KOps/s | 9.8389 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.4336ms | 0.2022ms | 4.9456 KOps/s | 4.8481 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.4908ms | 0.2318ms | 4.3149 KOps/s | 4.2609 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.1974ms | 0.1087ms | 9.2004 KOps/s | 9.1553 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1584ms | 62.8566μs | 15.9092 KOps/s | 15.7538 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.3623ms | 50.6911μs | 19.7273 KOps/s | 20.5330 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.3049ms | 0.1564ms | 6.3923 KOps/s | 6.1954 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.1917ms | 0.1025ms | 9.7588 KOps/s | 9.6607 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 78.3160μs | 22.6153μs | 44.2178 KOps/s | 45.9102 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1512ms | 66.6848μs | 14.9959 KOps/s | 14.9319 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1579ms | 84.9730μs | 11.7684 KOps/s | 12.2511 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1206ms | 66.6910μs | 14.9945 KOps/s | 14.6050 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.3621ms | 0.2145ms | 4.6620 KOps/s | 4.4595 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 1.6866ms | 1.3607ms | 734.9247 Ops/s | 704.1142 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.3242ms | 0.2105ms | 4.7504 KOps/s | 4.6756 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 1.0379ms | 0.8317ms | 1.2024 KOps/s | 1.1853 KOps/s | |
test_compile_assign_and_add_stack[compile] | 0.6766ms | 0.4573ms | 2.1869 KOps/s | 2.1149 KOps/s | |
test_compile_assign_and_add_stack[eager] | 2.9403ms | 2.6841ms | 372.5586 Ops/s | 363.9606 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.1022ms | 40.6230μs | 24.6166 KOps/s | 25.0240 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.6601ms | 33.2339μs | 30.0898 KOps/s | 28.9262 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 90.6700μs | 31.7622μs | 31.4839 KOps/s | 30.9264 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 77.2750μs | 22.9976μs | 43.4828 KOps/s | 42.5851 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1277ms | 32.7483μs | 30.5359 KOps/s | 30.0482 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.1275ms | 23.2593μs | 42.9935 KOps/s | 42.2334 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1412ms | 54.4894μs | 18.3522 KOps/s | 18.2553 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.4046ms | 19.7103μs | 50.7349 KOps/s | 47.0406 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1092ms | 47.1315μs | 21.2172 KOps/s | 20.9316 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 0.1032ms | 18.2292μs | 54.8571 KOps/s | 51.4319 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1366ms | 48.5409μs | 20.6012 KOps/s | 20.5130 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 80.0300μs | 18.2093μs | 54.9170 KOps/s | 52.1634 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1777ms | 58.6123μs | 17.0613 KOps/s | 17.9276 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.8574ms | 19.4959μs | 51.2927 KOps/s | 49.4108 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1375ms | 47.6155μs | 21.0016 KOps/s | 20.7572 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 69.4710μs | 18.1432μs | 55.1169 KOps/s | 52.6073 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1064ms | 47.4432μs | 21.0778 KOps/s | 20.4715 KOps/s | |
test_compile_indexing[int-pytree-eager] | 60.3330μs | 18.1399μs | 55.1271 KOps/s | 52.3424 KOps/s | |
test_mod_add[eager] | 0.1548ms | 36.0131μs | 27.7676 KOps/s | 28.2603 KOps/s | |
test_mod_add[compile] | 0.1244ms | 67.5819μs | 14.7969 KOps/s | 14.7735 KOps/s | |
test_mod_add[compile-overhead] | 0.1776ms | 65.3609μs | 15.2997 KOps/s | 14.7390 KOps/s | |
test_mod_wrap[eager] | 0.4310ms | 0.2261ms | 4.4234 KOps/s | 4.2572 KOps/s | |
test_mod_wrap[compile] | 2.5423ms | 0.2344ms | 4.2655 KOps/s | 4.1811 KOps/s | |
test_mod_wrap[compile-overhead] | 0.4385ms | 0.2305ms | 4.3386 KOps/s | 4.2995 KOps/s | |
test_mod_wrap_and_backward[eager] | 15.5057ms | 13.3388ms | 74.9690 Ops/s | 88.4577 Ops/s | |
test_mod_wrap_and_backward[compile] | 15.5652ms | 11.9431ms | 83.7305 Ops/s | 88.7047 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 14.2549ms | 11.9716ms | 83.5311 Ops/s | 87.8987 Ops/s | |
test_seq_add[eager] | 0.1950ms | 0.1169ms | 8.5577 KOps/s | 8.4139 KOps/s | |
test_seq_add[compile] | 0.1801ms | 81.3320μs | 12.2953 KOps/s | 12.4617 KOps/s | |
test_seq_add[compile-overhead] | 0.1780ms | 79.2797μs | 12.6136 KOps/s | 12.7998 KOps/s | |
test_seq_wrap[eager] | 0.7524ms | 0.4566ms | 2.1900 KOps/s | 2.1728 KOps/s | |
test_seq_wrap[compile] | 0.4753ms | 0.2496ms | 4.0060 KOps/s | 3.9874 KOps/s | |
test_seq_wrap[compile-overhead] | 0.4436ms | 0.2483ms | 4.0277 KOps/s | 4.0443 KOps/s | |
test_func_call_runtime[False-eager] | 0.9844ms | 0.5474ms | 1.8267 KOps/s | 1.7805 KOps/s | |
test_func_call_runtime[False-compile] | 0.6755ms | 0.4443ms | 2.2505 KOps/s | 2.1880 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.8045ms | 0.4438ms | 2.2532 KOps/s | 2.2353 KOps/s | |
test_func_call_runtime[True-eager] | 1.1218ms | 0.7639ms | 1.3091 KOps/s | 1.2788 KOps/s | |
test_func_call_runtime[True-compile] | 0.6009ms | 0.4663ms | 2.1447 KOps/s | 2.1273 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.9704ms | 0.4678ms | 2.1376 KOps/s | 2.0746 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.9049ms | 0.5514ms | 1.8137 KOps/s | 1.7843 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.9413ms | 0.4540ms | 2.2028 KOps/s | 2.1899 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.6123ms | 0.4420ms | 2.2625 KOps/s | 2.1933 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.7277ms | 0.9153ms | 1.0925 KOps/s | 1.0820 KOps/s | |
test_func_call_cm_runtime[True-compile] | 1.2262ms | 0.8093ms | 1.2356 KOps/s | 1.2054 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 1.0295ms | 0.8147ms | 1.2275 KOps/s | 1.1933 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 4.7234ms | 1.9425ms | 514.8073 Ops/s | 513.4718 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.8358ms | 0.5410ms | 1.8484 KOps/s | 1.8055 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.9976ms | 0.5379ms | 1.8590 KOps/s | 1.8401 KOps/s | |
test_distributed | 0.3154ms | 0.1252ms | 7.9899 KOps/s | 7.6749 KOps/s | |
test_tdmodule | 47.9100μs | 26.1095μs | 38.3002 KOps/s | 36.1711 KOps/s | |
test_tdmodule_dispatch | 0.6446ms | 50.0802μs | 19.9680 KOps/s | 20.3767 KOps/s | |
test_tdseq | 57.7180μs | 30.1591μs | 33.1575 KOps/s | 33.6895 KOps/s | |
test_tdseq_dispatch | 81.4430μs | 54.0759μs | 18.4925 KOps/s | 18.3317 KOps/s | |
test_instantiation_functorch | 1.7574ms | 1.5389ms | 649.8153 Ops/s | 633.2390 Ops/s | |
test_exec_functorch | 0.4791ms | 0.1770ms | 5.6501 KOps/s | 5.5236 KOps/s | |
test_exec_functional_call | 0.3053ms | 0.1697ms | 5.8931 KOps/s | 5.6932 KOps/s | |
test_exec_td_decorator | 0.5510ms | 0.2313ms | 4.3239 KOps/s | 4.0291 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.8623ms | 0.6578ms | 1.5203 KOps/s | 1.4848 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.8625ms | 0.6537ms | 1.5298 KOps/s | 1.4914 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.9026ms | 0.5373ms | 1.8612 KOps/s | 1.8233 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7974ms | 0.5331ms | 1.8758 KOps/s | 1.8344 KOps/s | |
test_to_module_speed[True] | 2.0051ms | 1.3253ms | 754.5405 Ops/s | 751.2093 Ops/s | |
test_to_module_speed[False] | 1.8267ms | 1.2917ms | 774.1497 Ops/s | 766.0816 Ops/s | |
test_tc_init | 0.1027ms | 45.9788μs | 21.7491 KOps/s | 21.8747 KOps/s | |
test_tc_init_nested | 0.1797ms | 91.6949μs | 10.9057 KOps/s | 10.9467 KOps/s | |
test_tc_first_layer_tensor | 20.0770μs | 1.5450μs | 647.2460 KOps/s | 659.0469 KOps/s | |
test_tc_first_layer_nontensor | 20.9200μs | 4.7271μs | 211.5470 KOps/s | 215.6199 KOps/s | |
test_tc_second_layer_tensor | 0.4516ms | 2.9018μs | 344.6192 KOps/s | 356.0657 KOps/s | |
test_tc_second_layer_nontensor | 35.4270μs | 6.0585μs | 165.0582 KOps/s | 167.7901 KOps/s | |
test_unbind | 0.2486s | 14.5403ms | 68.7745 Ops/s | 73.4630 Ops/s | |
test_full_like | 9.8351ms | 8.2395ms | 121.3660 Ops/s | 115.2991 Ops/s | |
test_zeros_like | 5.4665ms | 3.3320ms | 300.1229 Ops/s | 318.1976 Ops/s | |
test_ones_like | 6.6831ms | 3.4984ms | 285.8443 Ops/s | 276.7719 Ops/s | |
test_clone | 8.1410ms | 5.8116ms | 172.0690 Ops/s | 142.2553 Ops/s | |
test_squeeze | 61.8160μs | 12.2143μs | 81.8713 KOps/s | 78.9595 KOps/s | |
test_unsqueeze | 0.1626ms | 91.9010μs | 10.8813 KOps/s | 10.6692 KOps/s | |
test_split | 0.4782ms | 0.1945ms | 5.1411 KOps/s | 4.9907 KOps/s | |
test_permute | 0.3434ms | 0.2009ms | 4.9788 KOps/s | 5.0210 KOps/s | |
test_stack | 37.1513ms | 27.4955ms | 36.3696 Ops/s | 35.8817 Ops/s | |
test_cat | 31.9557ms | 26.4014ms | 37.8769 Ops/s | 37.2746 Ops/s |
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 56.8510μs | 13.3695μs | 74.7974 KOps/s | 78.4634 KOps/s | |
test_plain_set_stack_nested | 53.7200μs | 13.5858μs | 73.6065 KOps/s | 78.1798 KOps/s | |
test_plain_set_nested_inplace | 71.7700μs | 14.4750μs | 69.0847 KOps/s | 72.3143 KOps/s | |
test_plain_set_stack_nested_inplace | 66.9500μs | 14.3425μs | 69.7231 KOps/s | 71.7474 KOps/s | |
test_items | 27.8500μs | 2.9068μs | 344.0221 KOps/s | 344.9230 KOps/s | |
test_items_nested | 0.4231ms | 0.3775ms | 2.6489 KOps/s | 2.6579 KOps/s | |
test_items_nested_locked | 0.4407ms | 0.3822ms | 2.6161 KOps/s | 2.6406 KOps/s | |
test_items_nested_leaf | 0.1862ms | 59.3379μs | 16.8526 KOps/s | 17.1354 KOps/s | |
test_items_stack_nested | 0.4772ms | 0.3778ms | 2.6471 KOps/s | 2.6588 KOps/s | |
test_items_stack_nested_leaf | 0.1641ms | 59.3510μs | 16.8489 KOps/s | 17.1872 KOps/s | |
test_items_stack_nested_locked | 0.5239ms | 0.3788ms | 2.6399 KOps/s | 2.6546 KOps/s | |
test_keys | 35.0100μs | 3.4295μs | 291.5843 KOps/s | 289.9347 KOps/s | |
test_keys_nested | 0.2062ms | 88.9477μs | 11.2426 KOps/s | 11.5101 KOps/s | |
test_keys_nested_locked | 0.7776ms | 95.5637μs | 10.4642 KOps/s | 10.7304 KOps/s | |
test_keys_nested_leaf | 0.1140ms | 79.8506μs | 12.5234 KOps/s | 12.8162 KOps/s | |
test_keys_stack_nested | 0.1287ms | 90.3225μs | 11.0714 KOps/s | 11.2836 KOps/s | |
test_keys_stack_nested_leaf | 0.1187ms | 81.5261μs | 12.2660 KOps/s | 12.7285 KOps/s | |
test_keys_stack_nested_locked | 0.1527ms | 96.0726μs | 10.4088 KOps/s | 10.8035 KOps/s | |
test_values | 7.6975μs | 0.8604μs | 1.1623 MOps/s | 1.1734 MOps/s | |
test_values_nested | 68.1210μs | 37.9390μs | 26.3581 KOps/s | 26.7063 KOps/s | |
test_values_nested_locked | 81.3700μs | 39.3318μs | 25.4247 KOps/s | 25.6854 KOps/s | |
test_values_nested_leaf | 79.1110μs | 42.0339μs | 23.7903 KOps/s | 23.9497 KOps/s | |
test_values_stack_nested | 0.2318ms | 38.0477μs | 26.2828 KOps/s | 26.5307 KOps/s | |
test_values_stack_nested_leaf | 82.5110μs | 42.7469μs | 23.3935 KOps/s | 23.8183 KOps/s | |
test_values_stack_nested_locked | 79.0000μs | 39.6261μs | 25.2359 KOps/s | 25.6038 KOps/s | |
test_membership | 2.2635μs | 0.5131μs | 1.9488 MOps/s | 1.9007 MOps/s | |
test_membership_nested | 23.8755μs | 1.9954μs | 501.1476 KOps/s | 464.6231 KOps/s | |
test_membership_nested_leaf | 15.5155μs | 1.9697μs | 507.6793 KOps/s | 494.2725 KOps/s | |
test_membership_stacked_nested | 53.0610μs | 2.0642μs | 484.4507 KOps/s | 469.7361 KOps/s | |
test_membership_stacked_nested_leaf | 20.8100μs | 2.0416μs | 489.8056 KOps/s | 457.8555 KOps/s | |
test_membership_nested_last | 31.2700μs | 3.0334μs | 329.6615 KOps/s | 316.3037 KOps/s | |
test_membership_nested_leaf_last | 40.4300μs | 3.0191μs | 331.2288 KOps/s | 315.0670 KOps/s | |
test_membership_stacked_nested_last | 28.4500μs | 3.5469μs | 281.9358 KOps/s | 320.0805 KOps/s | |
test_membership_stacked_nested_leaf_last | 39.7100μs | 3.5620μs | 280.7435 KOps/s | 319.8031 KOps/s | |
test_nested_getleaf | 37.1100μs | 6.1632μs | 162.2531 KOps/s | 160.9345 KOps/s | |
test_nested_get | 49.2610μs | 5.8398μs | 171.2387 KOps/s | 171.5687 KOps/s | |
test_stacked_getleaf | 46.6400μs | 6.1688μs | 162.1063 KOps/s | 160.8645 KOps/s | |
test_stacked_get | 48.6600μs | 5.8312μs | 171.4915 KOps/s | 168.9526 KOps/s | |
test_nested_getitemleaf | 37.3300μs | 6.3859μs | 156.5960 KOps/s | 153.3916 KOps/s | |
test_nested_getitem | 51.5300μs | 6.1248μs | 163.2713 KOps/s | 162.3338 KOps/s | |
test_stacked_getitemleaf | 47.7800μs | 6.4268μs | 155.5992 KOps/s | 154.3952 KOps/s | |
test_stacked_getitem | 43.8710μs | 6.1192μs | 163.4188 KOps/s | 162.9849 KOps/s | |
test_lock_nested | 8.9732ms | 0.3485ms | 2.8695 KOps/s | 2.8484 KOps/s | |
test_lock_stack_nested | 0.4791ms | 0.3430ms | 2.9157 KOps/s | 2.8725 KOps/s | |
test_unlock_nested | 0.3729ms | 0.2832ms | 3.5308 KOps/s | 3.4746 KOps/s | |
test_unlock_stack_nested | 0.3160ms | 0.2822ms | 3.5431 KOps/s | 3.4749 KOps/s | |
test_flatten_speed | 0.4784ms | 75.9008μs | 13.1751 KOps/s | 13.1173 KOps/s | |
test_unflatten_speed | 0.7301ms | 0.3243ms | 3.0840 KOps/s | 3.0637 KOps/s | |
test_common_ops | 0.8396ms | 0.6562ms | 1.5239 KOps/s | 1.5695 KOps/s | |
test_creation | 0.1150ms | 1.7456μs | 572.8747 KOps/s | 562.6189 KOps/s | |
test_creation_empty | 0.1457ms | 10.7140μs | 93.3356 KOps/s | 108.3412 KOps/s | |
test_creation_nested_1 | 0.4097ms | 12.4242μs | 80.4879 KOps/s | 90.7067 KOps/s | |
test_creation_nested_2 | 47.8000μs | 15.1245μs | 66.1179 KOps/s | 73.6250 KOps/s | |
test_clone | 39.4300μs | 9.8876μs | 101.1370 KOps/s | 96.5657 KOps/s | |
test_getitem[int] | 1.2157ms | 10.8926μs | 91.8054 KOps/s | 91.8871 KOps/s | |
test_getitem[slice_int] | 0.4304ms | 20.8301μs | 48.0074 KOps/s | 47.1580 KOps/s | |
test_getitem[range] | 0.1781ms | 36.8610μs | 27.1289 KOps/s | 26.8051 KOps/s | |
test_getitem[tuple] | 0.1182ms | 18.4203μs | 54.2879 KOps/s | 53.4566 KOps/s | |
test_getitem[list] | 0.4331ms | 32.3713μs | 30.8915 KOps/s | 30.0209 KOps/s | |
test_setitem_dim[int] | 47.7510μs | 18.9490μs | 52.7731 KOps/s | 51.0883 KOps/s | |
test_setitem_dim[slice_int] | 0.1228ms | 37.6049μs | 26.5923 KOps/s | 25.9149 KOps/s | |
test_setitem_dim[range] | 0.1007ms | 52.4532μs | 19.0646 KOps/s | 18.5503 KOps/s | |
test_setitem_dim[tuple] | 52.9100μs | 32.0361μs | 31.2148 KOps/s | 30.0145 KOps/s | |
test_setitem | 48.0510μs | 15.2452μs | 65.5942 KOps/s | 63.6892 KOps/s | |
test_set | 52.6710μs | 15.2938μs | 65.3858 KOps/s | 65.6776 KOps/s | |
test_set_shared | 0.5995ms | 0.1555ms | 6.4321 KOps/s | 6.3327 KOps/s | |
test_update | 0.5212ms | 19.2294μs | 52.0037 KOps/s | 54.5357 KOps/s | |
test_update_nested | 0.4615ms | 24.3213μs | 41.1162 KOps/s | 40.5105 KOps/s | |
test_update__nested | 0.5031ms | 24.2449μs | 41.2458 KOps/s | 39.7781 KOps/s | |
test_set_nested | 0.1161ms | 16.6510μs | 60.0565 KOps/s | 61.1536 KOps/s | |
test_set_nested_new | 0.4233ms | 18.8901μs | 52.9378 KOps/s | 52.7131 KOps/s | |
test_select | 92.1410μs | 31.2571μs | 31.9927 KOps/s | 31.3501 KOps/s | |
test_select_nested | 0.4354ms | 44.6827μs | 22.3801 KOps/s | 22.6840 KOps/s | |
test_exclude_nested | 0.4606ms | 64.0828μs | 15.6048 KOps/s | 15.7751 KOps/s | |
test_empty[True] | 0.6888ms | 0.2983ms | 3.3519 KOps/s | 3.3776 KOps/s | |
test_empty[False] | 40.5452μs | 0.8235μs | 1.2143 MOps/s | 1.1947 MOps/s | |
test_to | 86.8500μs | 55.2241μs | 18.1080 KOps/s | 17.6482 KOps/s | |
test_to_nonblocking | 0.2430ms | 47.6631μs | 20.9806 KOps/s | 20.7530 KOps/s | |
test_unbind_speed | 0.2920ms | 0.2418ms | 4.1360 KOps/s | 4.0795 KOps/s | |
test_unbind_speed_stack0 | 0.6777ms | 0.2396ms | 4.1735 KOps/s | 4.0850 KOps/s | |
test_unbind_speed_stack1 | 92.5288ms | 0.7337ms | 1.3630 KOps/s | 1.3389 KOps/s | |
test_split | 93.7672ms | 1.5946ms | 627.1187 Ops/s | 610.0595 Ops/s | |
test_chunk | 95.4159ms | 1.6229ms | 616.1855 Ops/s | 608.3204 Ops/s | |
test_consolidate[False-None] | 3.3448ms | 2.6924ms | 371.4219 Ops/s | 363.9142 Ops/s | |
test_consolidate[default-None] | 1.8760ms | 1.7361ms | 575.9939 Ops/s | 575.2678 Ops/s | |
test_consolidate[reduce-overhead-None] | 2.0703ms | 1.7796ms | 561.9094 Ops/s | 559.9303 Ops/s | |
test_consolidate_njt[False-None] | 6.8626ms | 6.6516ms | 150.3406 Ops/s | 148.9390 Ops/s | |
test_to[False-False-None] | 1.9377ms | 1.7187ms | 581.8404 Ops/s | 577.2099 Ops/s | |
test_to[True-False-None] | 1.6017ms | 1.3630ms | 733.6945 Ops/s | 714.6849 Ops/s | |
test_to[within-False-None] | 4.4227ms | 4.1756ms | 239.4847 Ops/s | 235.0509 Ops/s | |
test_to[True-default-None] | 5.6513ms | 5.3166ms | 188.0893 Ops/s | 190.3569 Ops/s | |
test_to_njt[False-False-None] | 7.1204ms | 6.9372ms | 144.1509 Ops/s | 143.8521 Ops/s | |
test_to_njt[True-False-None] | 6.0226ms | 5.4884ms | 182.2019 Ops/s | 178.4613 Ops/s | |
test_to_njt[within-False-None] | 12.6194ms | 12.3929ms | 80.6913 Ops/s | 80.6314 Ops/s | |
test_creation[device0] | 0.6357ms | 79.8396μs | 12.5251 KOps/s | 12.5914 KOps/s | |
test_creation_from_tensor | 0.5758ms | 82.9353μs | 12.0576 KOps/s | 12.0837 KOps/s | |
test_add_one[memmap_tensor0] | 0.2245ms | 6.2523μs | 159.9413 KOps/s | 155.8023 KOps/s | |
test_contiguous[memmap_tensor0] | 2.1440μs | 0.4221μs | 2.3690 MOps/s | 2.3805 MOps/s | |
test_stack[memmap_tensor0] | 0.1402ms | 4.5827μs | 218.2103 KOps/s | 213.4702 KOps/s | |
test_memmaptd_index | 1.6385ms | 0.2425ms | 4.1229 KOps/s | 3.9871 KOps/s | |
test_memmaptd_index_astensor | 0.4482ms | 0.3023ms | 3.3082 KOps/s | 3.2270 KOps/s | |
test_memmaptd_index_op | 0.8034ms | 0.5917ms | 1.6901 KOps/s | 1.7016 KOps/s | |
test_serialize_model | 0.1319s | 0.1304s | 7.6710 Ops/s | 7.6801 Ops/s | |
test_serialize_model_pickle | 1.3479s | 1.2161s | 0.8223 Ops/s | 0.8215 Ops/s | |
test_serialize_weights | 0.1298s | 0.1292s | 7.7423 Ops/s | 7.7438 Ops/s | |
test_serialize_weights_returnearly | 0.4925s | 72.6517ms | 13.7643 Ops/s | 15.6750 Ops/s | |
test_serialize_weights_pickle | 1.3708s | 1.2185s | 0.8207 Ops/s | 0.8365 Ops/s | |
test_reshape_pytree | 0.1268ms | 22.6959μs | 44.0609 KOps/s | 44.1565 KOps/s | |
test_reshape_td | 58.2210μs | 26.9805μs | 37.0638 KOps/s | 34.7326 KOps/s | |
test_view_pytree | 0.1635ms | 22.3478μs | 44.7470 KOps/s | 45.0059 KOps/s | |
test_view_td | 0.1518ms | 31.6749μs | 31.5707 KOps/s | 29.4285 KOps/s | |
test_unbind_pytree | 0.1634ms | 27.8687μs | 35.8825 KOps/s | 34.7440 KOps/s | |
test_unbind_td | 1.1137ms | 36.9610μs | 27.0555 KOps/s | 26.4463 KOps/s | |
test_split_pytree | 0.1486ms | 29.9078μs | 33.4361 KOps/s | 32.5558 KOps/s | |
test_split_td | 0.1807ms | 39.3329μs | 25.4240 KOps/s | 25.1951 KOps/s | |
test_add_pytree | 0.1391ms | 32.8615μs | 30.4307 KOps/s | 29.3439 KOps/s | |
test_add_td | 0.2015ms | 51.5955μs | 19.3815 KOps/s | 20.2737 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.2818ms | 0.1255ms | 7.9701 KOps/s | 7.7573 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.2781ms | 0.1336ms | 7.4833 KOps/s | 7.4480 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.2427ms | 97.3131μs | 10.2761 KOps/s | 10.0970 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2969ms | 0.1467ms | 6.8170 KOps/s | 6.6823 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 0.1489ms | 24.9724μs | 40.0443 KOps/s | 40.7507 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.2039ms | 29.5036μs | 33.8941 KOps/s | 32.7784 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.3824ms | 64.5624μs | 15.4889 KOps/s | 15.0724 KOps/s | |
test_compile_copy_nested[pytree-eager] | 81.7100μs | 48.9238μs | 20.4400 KOps/s | 19.7569 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.3290ms | 0.1432ms | 6.9850 KOps/s | 6.9623 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3964ms | 0.2166ms | 4.6179 KOps/s | 4.6345 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.2464ms | 0.1006ms | 9.9422 KOps/s | 10.0676 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.2329ms | 57.4611μs | 17.4031 KOps/s | 18.0694 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2816ms | 0.1376ms | 7.2683 KOps/s | 7.3291 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.6517ms | 0.4760ms | 2.1010 KOps/s | 2.0815 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.4168ms | 0.2640ms | 3.7885 KOps/s | 3.8399 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2937ms | 0.1458ms | 6.8604 KOps/s | 6.9627 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.2236ms | 68.6284μs | 14.5712 KOps/s | 14.6494 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.2748ms | 0.1023ms | 9.7709 KOps/s | 9.9889 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.5563ms | 0.4076ms | 2.4532 KOps/s | 2.4516 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.3063ms | 0.1433ms | 6.9798 KOps/s | 7.3359 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 0.2071ms | 21.9108μs | 45.6396 KOps/s | 54.1621 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1321ms | 30.9620μs | 32.2977 KOps/s | 31.2613 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1063ms | 69.9916μs | 14.2874 KOps/s | 13.9911 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1199ms | 51.0419μs | 19.5918 KOps/s | 19.4892 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 1.6334ms | 0.3995ms | 2.5030 KOps/s | 2.1793 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.9971ms | 2.6644ms | 375.3259 Ops/s | 383.5191 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 1.6080ms | 0.4342ms | 2.3029 KOps/s | 2.2163 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 2.9280ms | 2.6670ms | 374.9535 Ops/s | 374.3194 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.6010ms | 0.1192ms | 8.3908 KOps/s | 8.4388 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5698ms | 80.8752μs | 12.3647 KOps/s | 12.2304 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.4719ms | 0.1098ms | 9.1067 KOps/s | 9.1414 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.2185ms | 67.0001μs | 14.9254 KOps/s | 14.2463 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.2833ms | 0.1099ms | 9.1024 KOps/s | 9.0127 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.2465ms | 67.4677μs | 14.8219 KOps/s | 14.0514 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.2934ms | 0.1056ms | 9.4707 KOps/s | 9.7820 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.2253ms | 17.3105μs | 57.7683 KOps/s | 54.9357 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.2696ms | 97.4494μs | 10.2617 KOps/s | 10.2824 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 0.1579ms | 15.8436μs | 63.1170 KOps/s | 47.2418 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.2948ms | 0.1001ms | 9.9929 KOps/s | 10.1628 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 0.1586ms | 15.9096μs | 62.8550 KOps/s | 63.2399 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.2541ms | 0.1017ms | 9.8298 KOps/s | 9.7501 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.5671ms | 17.1694μs | 58.2430 KOps/s | 57.8019 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.2724ms | 97.7006μs | 10.2354 KOps/s | 10.1662 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 0.1866ms | 17.0511μs | 58.6471 KOps/s | 62.6757 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.2545ms | 97.8693μs | 10.2177 KOps/s | 10.1894 KOps/s | |
test_compile_indexing[int-pytree-eager] | 0.1338ms | 15.7763μs | 63.3861 KOps/s | 63.0936 KOps/s | |
test_mod_add[eager] | 0.1927ms | 40.1486μs | 24.9075 KOps/s | 25.1022 KOps/s | |
test_mod_add[compile] | 0.2222ms | 82.1256μs | 12.1765 KOps/s | 12.2210 KOps/s | |
test_mod_add[compile-overhead] | 0.3310ms | 0.1692ms | 5.9098 KOps/s | 5.6407 KOps/s | |
test_mod_wrap[eager] | 0.3984ms | 0.2502ms | 3.9961 KOps/s | 3.9664 KOps/s | |
test_mod_wrap[compile] | 0.4802ms | 0.2955ms | 3.3838 KOps/s | 3.4368 KOps/s | |
test_mod_wrap[compile-overhead] | 6.9400ms | 3.7111ms | 269.4625 Ops/s | 273.2943 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.5572ms | 1.3511ms | 740.1573 Ops/s | 689.3594 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.4880ms | 1.2834ms | 779.1924 Ops/s | 713.6000 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.3715ms | 0.9277ms | 1.0780 KOps/s | 953.5552 Ops/s | |
test_seq_add[eager] | 0.2713ms | 0.1183ms | 8.4523 KOps/s | 8.3982 KOps/s | |
test_seq_add[compile] | 0.2421ms | 90.8471μs | 11.0075 KOps/s | 10.5049 KOps/s | |
test_seq_add[compile-overhead] | 0.2909ms | 0.1314ms | 7.6097 KOps/s | 7.4801 KOps/s | |
test_seq_wrap[eager] | 0.5832ms | 0.4287ms | 2.3325 KOps/s | 2.3304 KOps/s | |
test_seq_wrap[compile] | 0.5107ms | 0.3182ms | 3.1424 KOps/s | 3.2432 KOps/s | |
test_seq_wrap[compile-overhead] | 0.3738ms | 0.2265ms | 4.4157 KOps/s | 4.3243 KOps/s | |
test_func_call_runtime[False-eager] | 0.8678ms | 0.7247ms | 1.3798 KOps/s | 1.3651 KOps/s | |
test_func_call_runtime[False-compile] | 0.9227ms | 0.7503ms | 1.3328 KOps/s | 1.3019 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.5014ms | 0.3647ms | 2.7419 KOps/s | 2.6839 KOps/s | |
test_func_call_runtime[True-eager] | 1.0756ms | 0.8912ms | 1.1221 KOps/s | 1.1084 KOps/s | |
test_func_call_runtime[True-compile] | 0.9426ms | 0.7689ms | 1.3006 KOps/s | 1.2653 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.5099ms | 0.3863ms | 2.5888 KOps/s | 2.5560 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.8951ms | 0.7271ms | 1.3754 KOps/s | 1.3750 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.9286ms | 0.7515ms | 1.3307 KOps/s | 1.2927 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.5132ms | 0.3667ms | 2.7270 KOps/s | 2.6819 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.1609ms | 0.9966ms | 1.0034 KOps/s | 997.7598 Ops/s | |
test_func_call_cm_runtime[True-compile] | 1.1667ms | 0.9824ms | 1.0180 KOps/s | 1.0202 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 1.1858ms | 0.9784ms | 1.0221 KOps/s | 1.0101 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.4655ms | 2.0369ms | 490.9440 Ops/s | 482.9462 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.9608ms | 0.8175ms | 1.2233 KOps/s | 1.1928 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.5416ms | 0.4180ms | 2.3925 KOps/s | 2.3593 KOps/s | |
test_distributed | 3.9099ms | 0.2809ms | 3.5599 KOps/s | 8.0921 KOps/s | |
test_tdmodule | 0.3166ms | 22.2187μs | 45.0071 KOps/s | 44.0824 KOps/s | |
test_tdmodule_dispatch | 61.5300μs | 38.6467μs | 25.8754 KOps/s | 24.8649 KOps/s | |
test_tdseq | 0.1461ms | 22.3128μs | 44.8174 KOps/s | 42.3152 KOps/s | |
test_tdseq_dispatch | 78.0310μs | 41.7446μs | 23.9552 KOps/s | 22.8351 KOps/s | |
test_instantiation_functorch | 1.7137ms | 1.5525ms | 644.1100 Ops/s | 636.0005 Ops/s | |
test_exec_functorch | 0.3381ms | 0.1436ms | 6.9653 KOps/s | 7.0189 KOps/s | |
test_exec_functional_call | 0.3136ms | 0.1339ms | 7.4657 KOps/s | 7.4130 KOps/s | |
test_exec_td_decorator | 0.3872ms | 0.1878ms | 5.3244 KOps/s | 5.3346 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.9302ms | 0.6921ms | 1.4449 KOps/s | 1.4647 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.8922ms | 0.6795ms | 1.4717 KOps/s | 1.4370 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.8072ms | 0.5961ms | 1.6775 KOps/s | 1.6053 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7876ms | 0.5859ms | 1.7067 KOps/s | 1.6091 KOps/s | |
test_vmap_transformer_speed_decorator[True-True] | 19.0028ms | 18.7534ms | 53.3237 Ops/s | 52.6524 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 19.2914ms | 18.7708ms | 53.2742 Ops/s | 51.7023 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 18.7387ms | 18.5871ms | 53.8007 Ops/s | 53.0041 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 18.7533ms | 18.6146ms | 53.7212 Ops/s | 52.8395 Ops/s | |
test_to_module_speed[True] | 1.1238ms | 0.9580ms | 1.0438 KOps/s | 1.0274 KOps/s | |
test_to_module_speed[False] | 1.3464ms | 0.9441ms | 1.0592 KOps/s | 1.0437 KOps/s | |
test_tc_init | 0.2270ms | 39.3232μs | 25.4303 KOps/s | 26.1139 KOps/s | |
test_tc_init_nested | 0.1714ms | 80.7971μs | 12.3767 KOps/s | 13.5449 KOps/s | |
test_tc_first_layer_tensor | 4.1644μs | 0.7092μs | 1.4101 MOps/s | 1.2609 MOps/s | |
test_tc_first_layer_nontensor | 20.2400μs | 2.2820μs | 438.2094 KOps/s | 450.1265 KOps/s | |
test_tc_second_layer_tensor | 40.4003μs | 1.4422μs | 693.3818 KOps/s | 709.5448 KOps/s | |
test_tc_second_layer_nontensor | 32.9200μs | 3.0468μs | 328.2086 KOps/s | 333.4055 KOps/s | |
test_unbind | 0.2218s | 10.2859ms | 97.2203 Ops/s | 143.5235 Ops/s | |
test_full_like | 11.1722ms | 9.1759ms | 108.9816 Ops/s | 108.1625 Ops/s | |
test_zeros_like | 4.9387ms | 4.3233ms | 231.3060 Ops/s | 230.9596 Ops/s | |
test_ones_like | 9.2456ms | 7.1415ms | 140.0274 Ops/s | 230.6614 Ops/s | |
test_clone | 6.6523ms | 6.3820ms | 156.6912 Ops/s | 156.4158 Ops/s | |
test_squeeze | 59.5300μs | 10.0429μs | 99.5732 KOps/s | 102.0313 KOps/s | |
test_unsqueeze | 0.2277ms | 73.7627μs | 13.5570 KOps/s | 13.5461 KOps/s | |
test_split | 0.2780ms | 0.1589ms | 6.2946 KOps/s | 6.1628 KOps/s | |
test_permute | 0.2940ms | 0.1765ms | 5.6672 KOps/s | 5.7249 KOps/s | |
test_stack | 50.9696ms | 50.3698ms | 19.8532 Ops/s | 19.9221 Ops/s | |
test_cat | 50.8702ms | 50.2362ms | 19.9060 Ops/s | 19.9669 Ops/s |
be87fde
to
d8e27f3
Compare
@@ -33,7 +33,7 @@ jobs: | |||
include: | |||
- repository: pytorch/tensordict | |||
smoke-test-script: test/smoke_test.py | |||
post-script: .github/scripts/linux-post-script.sh | |||
pre-script: .github/scripts/linux-pre-script.sh | |||
package-name: tensordict | |||
name: pytorch/tensordict | |||
uses: pytorch/test-infra/.github/workflows/build_wheels_linux.yml@main |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
uses: pytorch/test-infra/.github/workflows/build_wheels_linux.yml@main | |
uses: pytorch/test-infra/.github/workflows/build_wheels_linux.yml@release/2.6 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please also add test-infra-ref: release/2.6
below
@@ -19,7 +19,7 @@ permissions: | |||
|
|||
jobs: | |||
generate-matrix: | |||
uses: pytorch/test-infra/.github/workflows/generate_binary_build_matrix.yml@main | |||
uses: pytorch/test-infra/.github/workflows/generate_binary_build_matrix.yml@release/2.6 | |||
with: | |||
package-type: wheel | |||
os: linux |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Pleas add test-infra-ref: release/2.6
Description
Describe your changes in detail.
Motivation and Context
Why is this change required? What problem does it solve?
If it fixes an open issue, please link to the issue here.
You can use the syntax
close #15213
if this solves the issue #15213Types of changes
What types of changes does your code introduce? Remove all that do not apply:
Checklist
Go over all the following points, and put an
x
in all the boxes that apply.If you are unsure about any of these, don't hesitate to ask. We are here to help!