[Test] Mark MP tests as slow #946
Merged
Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_plain_set_nested | 67.5460μs | 20.8710μs | 47.9133 KOps/s | 48.5598 KOps/s | |
test_plain_set_stack_nested | 50.3630μs | 20.9440μs | 47.7463 KOps/s | 48.3902 KOps/s | |
test_plain_set_nested_inplace | 74.9710μs | 22.7758μs | 43.9063 KOps/s | 43.1664 KOps/s | |
test_plain_set_stack_nested_inplace | 76.2220μs | 22.5844μs | 44.2783 KOps/s | 43.5106 KOps/s | |
test_items | 25.1270μs | 2.6439μs | 378.2282 KOps/s | 364.5587 KOps/s | |
test_items_nested | 0.5038ms | 0.3409ms | 2.9332 KOps/s | 2.8278 KOps/s | |
test_items_nested_locked | 0.5593ms | 0.3395ms | 2.9453 KOps/s | 2.9736 KOps/s | |
test_items_nested_leaf | 0.1967ms | 87.7241μs | 11.3994 KOps/s | 11.9512 KOps/s | |
test_items_stack_nested | 0.4032ms | 0.3440ms | 2.9074 KOps/s | 2.9340 KOps/s | |
test_items_stack_nested_leaf | 0.1361ms | 89.5373μs | 11.1685 KOps/s | 11.8893 KOps/s | |
test_items_stack_nested_locked | 0.4259ms | 0.3439ms | 2.9082 KOps/s | 2.9449 KOps/s | |
test_keys | 21.6610μs | 3.9104μs | 255.7312 KOps/s | 259.6662 KOps/s | |
test_keys_nested | 0.8105ms | 0.1484ms | 6.7407 KOps/s | 6.9322 KOps/s | |
test_keys_nested_locked | 0.6753ms | 0.1546ms | 6.4670 KOps/s | 6.6481 KOps/s | |
test_keys_nested_leaf | 0.2354ms | 0.1225ms | 8.1627 KOps/s | 7.9588 KOps/s | |
test_keys_stack_nested | 0.2454ms | 0.1438ms | 6.9552 KOps/s | 6.9837 KOps/s | |
test_keys_stack_nested_leaf | 0.2380ms | 0.1229ms | 8.1335 KOps/s | 8.1074 KOps/s | |
test_keys_stack_nested_locked | 0.2675ms | 0.1499ms | 6.6693 KOps/s | 6.6440 KOps/s | |
test_values | 7.5190μs | 1.2027μs | 831.4705 KOps/s | 845.0358 KOps/s | |
test_values_nested | 98.4030μs | 50.1596μs | 19.9364 KOps/s | 19.5858 KOps/s | |
test_values_nested_locked | 90.7200μs | 50.0014μs | 19.9994 KOps/s | 19.6366 KOps/s | |
test_values_nested_leaf | 93.7850μs | 45.1808μs | 22.1333 KOps/s | 22.0583 KOps/s | |
test_values_stack_nested | 0.2997ms | 50.1223μs | 19.9512 KOps/s | 18.6421 KOps/s | |
test_values_stack_nested_leaf | 92.6630μs | 44.8851μs | 22.2791 KOps/s | 22.3441 KOps/s | |
test_values_stack_nested_locked | 93.5650μs | 50.1526μs | 19.9392 KOps/s | 19.3153 KOps/s | |
test_membership | 20.6990μs | 0.8875μs | 1.1268 MOps/s | 1.0996 MOps/s | |
test_membership_nested | 27.8210μs | 2.5654μs | 389.8026 KOps/s | 381.4835 KOps/s | |
test_membership_nested_leaf | 59.7510μs | 2.6149μs | 382.4294 KOps/s | 376.6551 KOps/s | |
test_membership_stacked_nested | 26.8500μs | 2.5770μs | 388.0461 KOps/s | 384.9669 KOps/s | |
test_membership_stacked_nested_leaf | 26.5300μs | 2.5925μs | 385.7310 KOps/s | 379.8142 KOps/s | |
test_membership_nested_last | 32.3710μs | 3.8323μs | 260.9368 KOps/s | 253.2728 KOps/s | |
test_membership_nested_leaf_last | 46.8270μs | 3.8356μs | 260.7161 KOps/s | 256.2834 KOps/s | |
test_membership_stacked_nested_last | 28.4030μs | 3.7908μs | 263.7996 KOps/s | 179.4535 KOps/s | |
test_membership_stacked_nested_leaf_last | 30.3060μs | 3.8646μs | 258.7605 KOps/s | 184.9005 KOps/s | |
test_nested_getleaf | 32.7510μs | 10.4146μs | 96.0189 KOps/s | 95.6996 KOps/s | |
test_nested_get | 42.2790μs | 9.7194μs | 102.8871 KOps/s | 101.2517 KOps/s | |
test_stacked_getleaf | 47.6170μs | 10.1558μs | 98.4658 KOps/s | 97.2350 KOps/s | |
test_stacked_get | 28.6530μs | 9.7263μs | 102.8141 KOps/s | 102.1773 KOps/s | |
test_nested_getitemleaf | 40.8760μs | 10.7343μs | 93.1597 KOps/s | 90.2511 KOps/s | |
test_nested_getitem | 44.5130μs | 9.9589μs | 100.4124 KOps/s | 98.8354 KOps/s | |
test_stacked_getitemleaf | 36.2680μs | 11.0008μs | 90.9024 KOps/s | 91.1913 KOps/s | |
test_stacked_getitem | 31.7400μs | 10.0699μs | 99.3055 KOps/s | 99.9466 KOps/s | |
test_lock_nested | 77.1954ms | 0.5729ms | 1.7454 KOps/s | 1.9938 KOps/s | |
test_lock_stack_nested | 0.8641ms | 0.4666ms | 2.1432 KOps/s | 2.1903 KOps/s | |
test_unlock_nested | 82.2836ms | 0.4942ms | 2.0236 KOps/s | 2.3908 KOps/s | |
test_unlock_stack_nested | 0.7394ms | 0.3806ms | 2.6275 KOps/s | 2.6711 KOps/s | |
test_flatten_speed | 0.2112ms | 0.1070ms | 9.3500 KOps/s | 9.6296 KOps/s | |
test_unflatten_speed | 0.7608ms | 0.4230ms | 2.3640 KOps/s | 2.3104 KOps/s | |
test_common_ops | 1.6285ms | 1.0170ms | 983.2957 Ops/s | 945.5236 Ops/s | |
test_creation | 24.5060μs | 2.0041μs | 498.9717 KOps/s | 491.9228 KOps/s | |
test_creation_empty | 41.7980μs | 15.6792μs | 63.7788 KOps/s | 62.9955 KOps/s | |
test_creation_nested_1 | 56.7260μs | 18.6995μs | 53.4775 KOps/s | 52.0746 KOps/s | |
test_creation_nested_2 | 73.9580μs | 22.8339μs | 43.7945 KOps/s | 42.4149 KOps/s | |
test_clone | 98.5950μs | 16.7561μs | 59.6799 KOps/s | 59.6802 KOps/s | |
test_getitem[int] | 1.3412ms | 16.1729μs | 61.8318 KOps/s | 60.5577 KOps/s | |
test_getitem[slice_int] | 0.1336ms | 31.1582μs | 32.0942 KOps/s | 32.6124 KOps/s | |
test_getitem[range] | 0.1987ms | 55.7263μs | 17.9448 KOps/s | 16.9874 KOps/s | |
test_getitem[tuple] | 0.1341ms | 24.5990μs | 40.6521 KOps/s | 39.9676 KOps/s | |
test_getitem[list] | 0.3987ms | 50.6314μs | 19.7506 KOps/s | 18.8947 KOps/s | |
test_setitem_dim[int] | 88.1450μs | 37.7249μs | 26.5077 KOps/s | 26.7122 KOps/s | |
test_setitem_dim[slice_int] | 0.1164ms | 65.9148μs | 15.1711 KOps/s | 14.7472 KOps/s | |
test_setitem_dim[range] | 0.1458ms | 87.0075μs | 11.4933 KOps/s | 10.9335 KOps/s | |
test_setitem_dim[tuple] | 90.5490μs | 53.7193μs | 18.6153 KOps/s | 18.3006 KOps/s | |
test_setitem | 0.1085ms | 27.7020μs | 36.0985 KOps/s | 36.2165 KOps/s | |
test_set | 88.0750μs | 26.8553μs | 37.2366 KOps/s | 36.9631 KOps/s | |
test_set_shared | 3.7968ms | 0.2121ms | 4.7147 KOps/s | 4.7200 KOps/s | |
test_update | 0.1473ms | 32.3119μs | 30.9483 KOps/s | 30.3484 KOps/s | |
test_update_nested | 0.1742ms | 43.6045μs | 22.9334 KOps/s | 22.7848 KOps/s | |
test_update__nested | 0.1188ms | 34.1975μs | 29.2419 KOps/s | 28.9996 KOps/s | |
test_set_nested | 89.4070μs | 29.0584μs | 34.4134 KOps/s | 33.1347 KOps/s | |
test_set_nested_new | 0.1155ms | 34.0958μs | 29.3291 KOps/s | 29.0207 KOps/s | |
test_select | 0.1197ms | 50.4696μs | 19.8139 KOps/s | 19.7377 KOps/s | |
test_select_nested | 0.1114ms | 58.4747μs | 17.1014 KOps/s | 16.4121 KOps/s | |
test_exclude_nested | 0.1459ms | 76.7615μs | 13.0274 KOps/s | 12.9759 KOps/s | |
test_empty[True] | 1.2407ms | 0.3217ms | 3.1086 KOps/s | 3.0971 KOps/s | |
test_empty[False] | 6.3098μs | 1.1737μs | 852.0312 KOps/s | 852.6591 KOps/s | |
test_unbind_speed | 0.3782ms | 0.3058ms | 3.2705 KOps/s | 3.1654 KOps/s | |
test_unbind_speed_stack0 | 0.3985ms | 0.2996ms | 3.3381 KOps/s | 3.3722 KOps/s | |
test_unbind_speed_stack1 | 85.2791ms | 0.7854ms | 1.2733 KOps/s | 1.4072 KOps/s | |
test_split | 83.2507ms | 2.0770ms | 481.4741 Ops/s | 475.6070 Ops/s | |
test_chunk | 82.2664ms | 2.0718ms | 482.6621 Ops/s | 476.8263 Ops/s | |
test_creation[device0] | 3.4963ms | 0.1177ms | 8.4957 KOps/s | 8.4495 KOps/s | |
test_creation_from_tensor | 0.2355ms | 0.1167ms | 8.5715 KOps/s | 8.5242 KOps/s | |
test_add_one[memmap_tensor0] | 0.2128ms | 7.1865μs | 139.1500 KOps/s | 132.6499 KOps/s | |
test_contiguous[memmap_tensor0] | 26.7500μs | 1.9812μs | 504.7418 KOps/s | 499.7396 KOps/s | |
test_stack[memmap_tensor0] | 44.9040μs | 5.5081μs | 181.5496 KOps/s | 173.9548 KOps/s | |
test_memmaptd_index | 1.0435ms | 0.3961ms | 2.5248 KOps/s | 2.5096 KOps/s | |
test_memmaptd_index_astensor | 1.0873ms | 0.4766ms | 2.0984 KOps/s | 2.0978 KOps/s | |
test_memmaptd_index_op | 1.7103ms | 0.9697ms | 1.0313 KOps/s | 964.1005 Ops/s | |
test_serialize_model | 0.1289s | 0.1176s | 8.5061 Ops/s | 7.7449 Ops/s | |
test_serialize_model_pickle | 0.4482s | 0.3904s | 2.5613 Ops/s | 2.5512 Ops/s | |
test_serialize_weights | 0.1301s | 0.1155s | 8.6584 Ops/s | 8.6923 Ops/s | |
test_serialize_weights_returnearly | 0.1704s | 0.1571s | 6.3668 Ops/s | 6.2561 Ops/s | |
test_serialize_weights_pickle | 0.5616s | 0.4405s | 2.2703 Ops/s | 2.5824 Ops/s | |
test_serialize_weights_filesystem | 0.1526s | 0.1418s | 7.0501 Ops/s | 6.6021 Ops/s | |
test_serialize_model_filesystem | 0.1549s | 0.1487s | 6.7244 Ops/s | 6.7983 Ops/s | |
test_reshape_pytree | 89.9480μs | 39.6435μs | 25.2248 KOps/s | 24.8118 KOps/s | |
test_reshape_td | 0.1204ms | 45.3666μs | 22.0427 KOps/s | 21.4182 KOps/s | |
test_view_pytree | 0.1003ms | 38.9562μs | 25.6698 KOps/s | 24.7074 KOps/s | |
test_view_td | 0.1105ms | 51.7193μs | 19.3351 KOps/s | 18.9156 KOps/s | |
test_unbind_pytree | 98.7140μs | 37.3769μs | 26.7545 KOps/s | 26.0035 KOps/s | |
test_unbind_td | 0.4382ms | 44.4153μs | 22.5148 KOps/s | 21.9380 KOps/s | |
test_split_pytree | 89.8180μs | 39.2462μs | 25.4802 KOps/s | 23.8468 KOps/s | |
test_split_td | 0.1825ms | 56.2040μs | 17.7923 KOps/s | 17.4694 KOps/s | |
test_add_pytree | 0.1117ms | 46.6606μs | 21.4313 KOps/s | 21.2597 KOps/s | |
test_add_td | 0.2064ms | 76.4243μs | 13.0848 KOps/s | 12.5490 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 99.9070μs | 53.0657μs | 18.8446 KOps/s | 18.1779 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.3260ms | 0.1896ms | 5.2748 KOps/s | 5.1242 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1662ms | 54.2997μs | 18.4163 KOps/s | 18.1519 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2590ms | 0.1442ms | 6.9328 KOps/s | 6.8458 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 68.2880μs | 20.1272μs | 49.6839 KOps/s | 48.2891 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1593ms | 64.4866μs | 15.5071 KOps/s | 15.1292 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1686ms | 78.5685μs | 12.7277 KOps/s | 12.3187 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1245ms | 71.2811μs | 14.0290 KOps/s | 13.6384 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.2537ms | 0.1720ms | 5.8138 KOps/s | 5.7704 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3823ms | 0.1925ms | 5.1956 KOps/s | 5.0742 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 88.0450μs | 37.2259μs | 26.8630 KOps/s | 25.6490 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.4672ms | 69.4472μs | 14.3994 KOps/s | 13.6684 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2430ms | 0.1707ms | 5.8570 KOps/s | 5.6904 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.4377ms | 0.2913ms | 3.4326 KOps/s | 3.4250 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.2814ms | 0.2051ms | 4.8763 KOps/s | 4.8027 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.3285ms | 0.1766ms | 5.6631 KOps/s | 5.6513 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1621ms | 63.1919μs | 15.8248 KOps/s | 15.6389 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1093ms | 37.7942μs | 26.4591 KOps/s | 25.1819 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.3982ms | 0.2395ms | 4.1761 KOps/s | 4.1870 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.3661ms | 0.1704ms | 5.8669 KOps/s | 5.7951 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 0.1727ms | 0.1080ms | 9.2565 KOps/s | 9.2121 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1218ms | 56.1295μs | 17.8159 KOps/s | 16.8994 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1466ms | 79.0888μs | 12.6440 KOps/s | 12.2679 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1514ms | 71.3917μs | 14.0072 KOps/s | 13.7507 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.3055ms | 0.1845ms | 5.4213 KOps/s | 5.2336 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 1.7655ms | 1.6299ms | 613.5180 Ops/s | 617.1004 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.2801ms | 0.1832ms | 5.4596 KOps/s | 5.2708 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 1.8660ms | 1.0881ms | 919.0664 Ops/s | 938.1581 Ops/s | |
test_compile_assign_and_add_stack[compile] | 0.5060ms | 0.4023ms | 2.4859 KOps/s | 2.4659 KOps/s | |
test_compile_assign_and_add_stack[eager] | 5.3810ms | 3.5851ms | 278.9316 Ops/s | 274.1002 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.1000ms | 31.4043μs | 31.8427 KOps/s | 30.3142 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 1.1566ms | 48.9548μs | 20.4270 KOps/s | 20.1241 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 67.3950μs | 27.7061μs | 36.0931 KOps/s | 35.1105 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 72.5150μs | 31.0743μs | 32.1809 KOps/s | 32.2322 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1355ms | 27.9791μs | 35.7410 KOps/s | 35.7068 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 77.2940μs | 30.9658μs | 32.2937 KOps/s | 32.4477 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1603ms | 71.0662μs | 14.0714 KOps/s | 13.7306 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.3507ms | 27.6736μs | 36.1355 KOps/s | 35.6970 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1559ms | 66.7589μs | 14.9793 KOps/s | 14.7960 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 88.2650μs | 24.7795μs | 40.3560 KOps/s | 39.9250 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1335ms | 65.8682μs | 15.1818 KOps/s | 14.6997 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 84.7690μs | 23.8125μs | 41.9947 KOps/s | 39.9556 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1394ms | 71.2149μs | 14.0420 KOps/s | 13.8528 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.7665ms | 27.9624μs | 35.7623 KOps/s | 35.5542 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1369ms | 65.3701μs | 15.2975 KOps/s | 14.6197 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 88.3150μs | 23.9685μs | 41.7214 KOps/s | 40.2842 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1262ms | 65.7010μs | 15.2205 KOps/s | 14.8117 KOps/s | |
test_compile_indexing[int-pytree-eager] | 80.5600μs | 24.3615μs | 41.0484 KOps/s | 40.0251 KOps/s | |
test_mod_add[eager] | 88.4750μs | 22.1961μs | 45.0529 KOps/s | 44.2224 KOps/s | |
test_mod_add[compile] | 0.1034ms | 35.5395μs | 28.1377 KOps/s | 26.9772 KOps/s | |
test_mod_add[compile-overhead] | 0.1038ms | 35.0953μs | 28.4938 KOps/s | 27.8554 KOps/s | |
test_mod_wrap[eager] | 0.3535ms | 0.1971ms | 5.0745 KOps/s | 4.6994 KOps/s | |
test_mod_wrap[compile] | 1.4530ms | 0.2201ms | 4.5438 KOps/s | 4.3438 KOps/s | |
test_mod_wrap[compile-overhead] | 0.3804ms | 0.2165ms | 4.6186 KOps/s | 4.4331 KOps/s | |
test_mod_wrap_and_backward[eager] | 12.0262ms | 10.6088ms | 94.2614 Ops/s | 95.0142 Ops/s | |
test_mod_wrap_and_backward[compile] | 12.0515ms | 10.6316ms | 94.0595 Ops/s | 88.4957 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 11.7820ms | 10.5917ms | 94.4140 Ops/s | 91.2723 Ops/s | |
test_seq_add[eager] | 0.1653ms | 82.3206μs | 12.1476 KOps/s | 12.4093 KOps/s | |
test_seq_add[compile] | 8.7799ms | 59.9414μs | 16.6830 KOps/s | 16.0402 KOps/s | |
test_seq_add[compile-overhead] | 0.1378ms | 57.7901μs | 17.3040 KOps/s | 16.2490 KOps/s | |
test_seq_wrap[eager] | 0.5991ms | 0.3535ms | 2.8288 KOps/s | 2.7744 KOps/s | |
test_seq_wrap[compile] | 0.4565ms | 0.2527ms | 3.9578 KOps/s | 3.8011 KOps/s | |
test_seq_wrap[compile-overhead] | 0.4756ms | 0.2551ms | 3.9198 KOps/s | 3.7463 KOps/s | |
test_func_call_runtime[False-eager] | 0.6574ms | 0.4994ms | 2.0025 KOps/s | 1.9086 KOps/s | |
test_func_call_runtime[False-compile] | 0.8870ms | 0.4848ms | 2.0629 KOps/s | 2.0046 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 1.0530ms | 0.4822ms | 2.0738 KOps/s | 1.9989 KOps/s | |
test_func_call_runtime[True-eager] | 0.9633ms | 0.7252ms | 1.3789 KOps/s | 1.3269 KOps/s | |
test_func_call_runtime[True-compile] | 0.6104ms | 0.4960ms | 2.0161 KOps/s | 1.9475 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.5968ms | 0.4974ms | 2.0104 KOps/s | 1.9385 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.8997ms | 0.4987ms | 2.0050 KOps/s | 1.9285 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.9149ms | 0.4851ms | 2.0613 KOps/s | 1.9962 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.6023ms | 0.4818ms | 2.0755 KOps/s | 2.0163 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.3656ms | 0.8520ms | 1.1737 KOps/s | 1.1331 KOps/s | |
test_func_call_cm_runtime[True-compile] | 0.9497ms | 0.8008ms | 1.2487 KOps/s | 1.2074 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 1.3762ms | 0.8059ms | 1.2408 KOps/s | 1.1919 KOps/s | |
test_distributed | 0.2629ms | 0.1311ms | 7.6249 KOps/s | 7.1909 KOps/s | |
test_tdmodule | 29.5650μs | 16.0508μs | 62.3022 KOps/s | 63.1840 KOps/s | |
test_tdmodule_dispatch | 70.7820μs | 32.6923μs | 30.5883 KOps/s | 30.3839 KOps/s | |
test_tdseq | 41.0460μs | 17.9284μs | 55.7773 KOps/s | 58.5732 KOps/s | |
test_tdseq_dispatch | 65.8930μs | 37.0037μs | 27.0244 KOps/s | 27.1258 KOps/s | |
test_instantiation_functorch | 2.5523ms | 1.6689ms | 599.1811 Ops/s | 607.6115 Ops/s | |
test_instantiation_td | 2.4196ms | 1.2368ms | 808.5585 Ops/s | 846.5986 Ops/s | |
test_exec_functorch | 0.3913ms | 0.1781ms | 5.6156 KOps/s | 5.6153 KOps/s | |
test_exec_functional_call | 0.3035ms | 0.1633ms | 6.1250 KOps/s | 5.8975 KOps/s | |
test_exec_td | 0.3244ms | 0.1683ms | 5.9433 KOps/s | 5.8306 KOps/s | |
test_exec_td_decorator | 0.5798ms | 0.2192ms | 4.5621 KOps/s | 4.4676 KOps/s | |
test_vmap_mlp_speed[True-True] | 1.0184ms | 0.5590ms | 1.7888 KOps/s | 1.6618 KOps/s | |
test_vmap_mlp_speed[True-False] | 0.6693ms | 0.5493ms | 1.8205 KOps/s | 1.7621 KOps/s | |
test_vmap_mlp_speed[False-True] | 0.7922ms | 0.4601ms | 2.1734 KOps/s | 2.0972 KOps/s | |
test_vmap_mlp_speed[False-False] | 0.6744ms | 0.4591ms | 2.1783 KOps/s | 2.1157 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 1.5449ms | 0.6126ms | 1.6324 KOps/s | 1.5947 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.8413ms | 0.6105ms | 1.6381 KOps/s | 1.6051 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.8216ms | 0.5076ms | 1.9700 KOps/s | 1.9153 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.6973ms | 0.5087ms | 1.9658 KOps/s | 1.9233 KOps/s | |
test_to_module_speed[True] | 2.3048ms | 1.3699ms | 730.0069 Ops/s | 747.8035 Ops/s | |
test_to_module_speed[False] | 1.6117ms | 1.2971ms | 770.9523 Ops/s | 775.1939 Ops/s | |
test_tc_init | 73.1760μs | 39.5679μs | 25.2730 KOps/s | 23.6903 KOps/s | |
test_tc_init_nested | 0.1739ms | 82.0751μs | 12.1840 KOps/s | 11.8155 KOps/s | |
test_tc_first_layer_tensor | 21.8610μs | 1.4258μs | 701.3567 KOps/s | 693.6328 KOps/s | |
test_tc_first_layer_nontensor | 17.9940μs | 4.2476μs | 235.4298 KOps/s | 232.8763 KOps/s | |
test_tc_second_layer_tensor | 39.8040μs | 2.6875μs | 372.0936 KOps/s | 365.0795 KOps/s | |
test_tc_second_layer_nontensor | 24.7860μs | 5.4036μs | 185.0631 KOps/s | 176.0348 KOps/s | |
test_unbind | 0.4279s | 13.1320ms | 76.1498 Ops/s | 74.9851 Ops/s | |
test_full_like | 9.4695ms | 6.7286ms | 148.6196 Ops/s | 146.9256 Ops/s | |
test_zeros_like | 14.9072ms | 7.1835ms | 139.2087 Ops/s | 142.0714 Ops/s | |
test_ones_like | 12.3702ms | 7.5326ms | 132.7567 Ops/s | 134.8857 Ops/s | |
test_clone | 13.9989ms | 9.1186ms | 109.6666 Ops/s | 111.3257 Ops/s | |
test_squeeze | 87.4620μs | 12.7322μs | 78.5413 KOps/s | 78.5624 KOps/s | |
test_unsqueeze | 0.1711ms | 92.8966μs | 10.7647 KOps/s | 10.5855 KOps/s | |
test_split | 0.4850ms | 0.1979ms | 5.0540 KOps/s | 5.0163 KOps/s | |
test_permute | 0.3775ms | 0.2196ms | 4.5544 KOps/s | 4.5032 KOps/s | |
test_stack | 31.4412ms | 23.6307ms | 42.3179 Ops/s | 41.6754 Ops/s | |
test_cat | 29.5500ms | 23.4369ms | 42.6678 Ops/s | 41.8984 Ops/s |
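
The Change column is empty in this capture. As a rough aid for reading the table, the sketch below computes a relative difference between the Ops and Ops on Repo HEAD columns. It assumes that "change" means `(Ops - Ops on HEAD) / Ops on HEAD`; this definition, the unit table, and the helper names `parse_ops` and `relative_change` are illustrative assumptions and are not taken from the benchmark tooling.

```python
# Hypothetical helpers: the unit scale and the definition of "change"
# are assumptions, not part of the benchmark tooling that produced the table.
UNIT_SCALE = {"Ops/s": 1.0, "KOps/s": 1e3, "MOps/s": 1e6}


def parse_ops(cell: str) -> float:
    """Convert a cell such as '263.7996 KOps/s' to operations per second."""
    value, unit = cell.split()
    return float(value) * UNIT_SCALE[unit]


def relative_change(ops: str, ops_head: str) -> float:
    """Relative difference of this run's throughput versus repo HEAD."""
    head = parse_ops(ops_head)
    return (parse_ops(ops) - head) / head


# Row copied from the table above.
change = relative_change("263.7996 KOps/s", "179.4535 KOps/s")
print(f"test_membership_stacked_nested_last: {change:+.1%}")
```

Normalising to operations per second first matters for rows whose two columns use different units, such as `test_memmaptd_index_op` (1.0313 KOps/s against 964.1005 Ops/s).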

Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_plain_set_nested | 49.7100μs | 17.6134μs | 56.7748 KOps/s | 58.4835 KOps/s | |
test_plain_set_stack_nested | 34.4600μs | 17.7584μs | 56.3113 KOps/s | 57.8404 KOps/s | |
test_plain_set_nested_inplace | 53.0720μs | 18.5769μs | 53.8303 KOps/s | 54.0856 KOps/s | |
test_plain_set_stack_nested_inplace | 39.1510μs | 18.6009μs | 53.7609 KOps/s | 53.7146 KOps/s | |
test_items | 35.2600μs | 4.7053μs | 212.5262 KOps/s | 211.5919 KOps/s | |
test_items_nested | 0.3969ms | 0.3664ms | 2.7291 KOps/s | 2.7026 KOps/s | |
test_items_nested_locked | 0.4052ms | 0.3715ms | 2.6918 KOps/s | 2.6965 KOps/s | |
test_items_nested_leaf | 0.1102ms | 84.7256μs | 11.8028 KOps/s | 11.8511 KOps/s | |
test_items_stack_nested | 0.4116ms | 0.3703ms | 2.7007 KOps/s | 2.6791 KOps/s | |
test_items_stack_nested_leaf | 0.1084ms | 84.3288μs | 11.8583 KOps/s | 11.7259 KOps/s | |
test_items_stack_nested_locked | 0.4076ms | 0.3723ms | 2.6859 KOps/s | 2.6723 KOps/s | |
test_keys | 27.0600μs | 4.3605μs | 229.3310 KOps/s | 227.1550 KOps/s | |
test_keys_nested | 91.0510μs | 66.3672μs | 15.0677 KOps/s | 14.9602 KOps/s | |
test_keys_nested_locked | 2.3574ms | 73.1142μs | 13.6772 KOps/s | 13.7203 KOps/s | |
test_keys_nested_leaf | 80.6420μs | 56.7251μs | 17.6289 KOps/s | 17.4253 KOps/s | |
test_keys_stack_nested | 93.9010μs | 66.9131μs | 14.9448 KOps/s | 14.8175 KOps/s | |
test_keys_stack_nested_leaf | 0.1410ms | 56.9860μs | 17.5482 KOps/s | 16.9710 KOps/s | |
test_keys_stack_nested_locked | 0.1007ms | 72.3425μs | 13.8231 KOps/s | 13.6098 KOps/s | |
test_values | 10.7733μs | 1.7620μs | 567.5359 KOps/s | 568.8107 KOps/s | |
test_values_nested | 57.7410μs | 33.5292μs | 29.8248 KOps/s | 29.3668 KOps/s | |
test_values_nested_locked | 56.8510μs | 35.5817μs | 28.1043 KOps/s | 27.8870 KOps/s | |
test_values_nested_leaf | 47.5210μs | 29.9009μs | 33.4438 KOps/s | 33.2544 KOps/s | |
test_values_stack_nested | 68.5610μs | 34.3737μs | 29.0920 KOps/s | 28.5667 KOps/s | |
test_values_stack_nested_leaf | 56.3010μs | 30.6156μs | 32.6631 KOps/s | 32.2892 KOps/s | |
test_values_stack_nested_locked | 62.9210μs | 36.4642μs | 27.4241 KOps/s | 27.3298 KOps/s | |
test_membership | 1.7271μs | 0.5413μs | 1.8473 MOps/s | 1.8297 MOps/s | |
test_membership_nested | 23.7110μs | 2.0771μs | 481.4356 KOps/s | 485.5881 KOps/s | |
test_membership_nested_leaf | 9.2505μs | 2.0260μs | 493.5865 KOps/s | 502.1923 KOps/s | |
test_membership_stacked_nested | 16.9410μs | 2.0445μs | 489.1246 KOps/s | 490.8972 KOps/s | |
test_membership_stacked_nested_leaf | 38.3110μs | 2.0490μs | 488.0477 KOps/s | 484.5239 KOps/s | |
test_membership_nested_last | 25.0210μs | 2.9711μs | 336.5727 KOps/s | 336.7964 KOps/s | |
test_membership_nested_leaf_last | 36.5500μs | 3.0022μs | 333.0940 KOps/s | 337.2530 KOps/s | |
test_membership_stacked_nested_last | 19.7210μs | 2.9360μs | 340.5991 KOps/s | 332.7456 KOps/s | |
test_membership_stacked_nested_leaf_last | 36.8810μs | 2.9560μs | 338.2957 KOps/s | 332.7066 KOps/s | |
test_nested_getleaf | 25.0110μs | 7.9384μs | 125.9699 KOps/s | 124.5215 KOps/s | |
test_nested_get | 34.5120μs | 7.4841μs | 133.6173 KOps/s | 132.4086 KOps/s | |
test_stacked_getleaf | 30.2210μs | 7.9443μs | 125.8771 KOps/s | 123.9239 KOps/s | |
test_stacked_get | 39.6410μs | 7.4507μs | 134.2162 KOps/s | 132.2584 KOps/s | |
test_nested_getitemleaf | 36.9410μs | 8.1208μs | 123.1412 KOps/s | 122.1327 KOps/s | |
test_nested_getitem | 22.7700μs | 7.6189μs | 131.2533 KOps/s | 130.4691 KOps/s | |
test_stacked_getitemleaf | 39.4110μs | 8.1718μs | 122.3722 KOps/s | 121.0197 KOps/s | |
test_stacked_getitem | 21.7110μs | 7.5993μs | 131.5914 KOps/s | 129.6418 KOps/s | |
test_lock_nested | 7.3934ms | 0.4879ms | 2.0496 KOps/s | 2.0463 KOps/s | |
test_lock_stack_nested | 0.5014ms | 0.4362ms | 2.2926 KOps/s | 2.2367 KOps/s | |
test_unlock_nested | 0.8852ms | 0.3966ms | 2.5212 KOps/s | 2.4479 KOps/s | |
test_unlock_stack_nested | 0.4053ms | 0.3551ms | 2.8160 KOps/s | 2.7348 KOps/s | |
test_flatten_speed | 0.4601ms | 0.1052ms | 9.5035 KOps/s | 9.5010 KOps/s | |
test_unflatten_speed | 0.3253ms | 0.2868ms | 3.4874 KOps/s | 3.4616 KOps/s | |
test_common_ops | 1.5960ms | 1.3464ms | 742.7192 Ops/s | 744.0068 Ops/s | |
test_creation | 16.6400μs | 1.6588μs | 602.8592 KOps/s | 593.8404 KOps/s | |
test_creation_empty | 38.7910μs | 18.9056μs | 52.8945 KOps/s | 57.7513 KOps/s | |
test_creation_nested_1 | 56.2610μs | 20.7347μs | 48.2283 KOps/s | 52.1354 KOps/s | |
test_creation_nested_2 | 42.9600μs | 23.6574μs | 42.2701 KOps/s | 45.6923 KOps/s | |
test_clone | 59.1320μs | 29.9127μs | 33.4307 KOps/s | 31.9996 KOps/s | |
test_getitem[int] | 1.1682ms | 17.7657μs | 56.2882 KOps/s | 55.5801 KOps/s | |
test_getitem[slice_int] | 0.1531ms | 29.8303μs | 33.5230 KOps/s | 32.9612 KOps/s | |
test_getitem[range] | 0.2714ms | 0.1157ms | 8.6419 KOps/s | 8.6129 KOps/s | |
test_getitem[tuple] | 0.1566ms | 26.1482μs | 38.2436 KOps/s | 37.4339 KOps/s | |
test_getitem[list] | 0.2508ms | 0.1099ms | 9.1006 KOps/s | 9.3547 KOps/s | |
test_setitem_dim[int] | 83.9220μs | 56.0043μs | 17.8558 KOps/s | 16.9420 KOps/s | |
test_setitem_dim[slice_int] | 0.1023ms | 82.0809μs | 12.1831 KOps/s | 12.3207 KOps/s | |
test_setitem_dim[range] | 0.1859ms | 0.1446ms | 6.9163 KOps/s | 6.9745 KOps/s | |
test_setitem_dim[tuple] | 0.1073ms | 73.6837μs | 13.5715 KOps/s | 13.6720 KOps/s | |
test_setitem | 79.6520μs | 43.7874μs | 22.8376 KOps/s | 22.8370 KOps/s | |
test_set | 70.2620μs | 43.1179μs | 23.1922 KOps/s | 22.9739 KOps/s | |
test_set_shared | 90.7948ms | 62.5191μs | 15.9951 KOps/s | 17.7927 KOps/s | |
test_update | 91.5210μs | 53.0891μs | 18.8362 KOps/s | 19.2902 KOps/s | |
test_update_nested | 96.9820μs | 61.2508μs | 16.3263 KOps/s | 16.5246 KOps/s | |
test_update__nested | 99.0820μs | 62.3525μs | 16.0379 KOps/s | 15.6046 KOps/s | |
test_set_nested | 0.1877ms | 45.7810μs | 21.8431 KOps/s | 21.8358 KOps/s | |
test_set_nested_new | 80.1610μs | 49.3054μs | 20.2818 KOps/s | 20.1779 KOps/s | |
test_select | 88.9620μs | 65.1905μs | 15.3397 KOps/s | 15.2538 KOps/s | |
test_select_nested | 97.0220μs | 51.6792μs | 19.3502 KOps/s | 19.1451 KOps/s | |
test_exclude_nested | 97.5420μs | 69.0054μs | 14.4916 KOps/s | 14.4069 KOps/s | |
test_empty[True] | 0.3277ms | 0.2848ms | 3.5111 KOps/s | 3.4838 KOps/s | |
test_empty[False] | 3.1420μs | 0.8716μs | 1.1473 MOps/s | 1.1462 MOps/s | |
test_to | 48.1410μs | 28.3424μs | 35.2828 KOps/s | 35.8930 KOps/s | |
test_to_nonblocking | 49.7310μs | 26.9171μs | 37.1511 KOps/s | 37.4972 KOps/s | |
test_unbind_speed | 1.2396ms | 0.3088ms | 3.2388 KOps/s | 3.1026 KOps/s | |
test_unbind_speed_stack0 | 0.3373ms | 0.2990ms | 3.3447 KOps/s | 3.1356 KOps/s | |
test_unbind_speed_stack1 | 89.6920ms | 0.7687ms | 1.3010 KOps/s | 1.3915 KOps/s | |
test_split | 92.7463ms | 2.3686ms | 422.1822 Ops/s | 405.1393 Ops/s | |
test_chunk | 92.5522ms | 2.3751ms | 421.0386 Ops/s | 409.1125 Ops/s | |
test_creation[device0] | 0.1886ms | 0.1063ms | 9.4030 KOps/s | 9.3537 KOps/s | |
test_creation_from_tensor | 0.1659ms | 0.1035ms | 9.6640 KOps/s | 9.5605 KOps/s | |
test_add_one[memmap_tensor0] | 0.1179ms | 9.0491μs | 110.5077 KOps/s | 95.4636 KOps/s | |
test_contiguous[memmap_tensor0] | 27.0620μs | 2.2599μs | 442.5061 KOps/s | 444.9558 KOps/s | |
test_stack[memmap_tensor0] | 35.1700μs | 6.7230μs | 148.7426 KOps/s | 138.0438 KOps/s | |
test_memmaptd_index | 1.2002ms | 0.4461ms | 2.2414 KOps/s | 2.2570 KOps/s | |
test_memmaptd_index_astensor | 0.7892ms | 0.5127ms | 1.9505 KOps/s | 1.9684 KOps/s | |
test_memmaptd_index_op | 1.5128ms | 1.1050ms | 904.9634 Ops/s | 915.6560 Ops/s | |
test_serialize_model | 93.3833ms | 88.7311ms | 11.2700 Ops/s | 10.8509 Ops/s | |
test_serialize_model_pickle | 1.3476s | 1.2365s | 0.8087 Ops/s | 0.8058 Ops/s | |
test_serialize_weights | 89.9462ms | 85.4503ms | 11.7027 Ops/s | 9.9741 Ops/s | |
test_serialize_weights_returnearly | 59.7799ms | 55.2954ms | 18.0847 Ops/s | 17.9828 Ops/s | |
test_serialize_weights_pickle | 1.3557s | 1.2372s | 0.8083 Ops/s | 0.8088 Ops/s | |
test_reshape_pytree | 76.9820μs | 39.5038μs | 25.3140 KOps/s | 25.3300 KOps/s | |
test_reshape_td | 88.8020μs | 45.5672μs | 21.9456 KOps/s | 21.7562 KOps/s | |
test_view_pytree | 0.2376ms | 39.3995μs | 25.3810 KOps/s | 26.0924 KOps/s | |
test_view_td | 79.6420μs | 50.1890μs | 19.9247 KOps/s | 19.0548 KOps/s | |
test_unbind_pytree | 0.2356ms | 37.5619μs | 26.6228 KOps/s | 25.7793 KOps/s | |
test_unbind_td | 0.4678ms | 47.2246μs | 21.1754 KOps/s | 20.3052 KOps/s | |
test_split_pytree | 96.5120μs | 51.0859μs | 19.5749 KOps/s | 18.8295 KOps/s | |
test_split_td | 0.2580ms | 60.9683μs | 16.4020 KOps/s | 15.4478 KOps/s | |
test_add_pytree | 0.1030ms | 59.3329μs | 16.8541 KOps/s | 16.1553 KOps/s | |
test_add_td | 0.3238ms | 97.0157μs | 10.3076 KOps/s | 9.6252 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.4149ms | 0.2144ms | 4.6637 KOps/s | 4.6277 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.2965ms | 0.1766ms | 5.6615 KOps/s | 5.7071 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1876ms | 0.1519ms | 6.5817 KOps/s | 6.5786 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2439ms | 0.1972ms | 5.0704 KOps/s | 4.9430 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 45.6100μs | 23.2489μs | 43.0128 KOps/s | 43.3168 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 71.2420μs | 49.6910μs | 20.1244 KOps/s | 20.3755 KOps/s | |
test_compile_copy_nested[pytree-compile] | 97.5320μs | 74.3772μs | 13.4450 KOps/s | 13.3609 KOps/s | |
test_compile_copy_nested[pytree-eager] | 89.1110μs | 59.3287μs | 16.8552 KOps/s | 16.3757 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.4654ms | 0.3417ms | 2.9267 KOps/s | 2.9258 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.2845ms | 0.2253ms | 4.4388 KOps/s | 4.3857 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1754ms | 0.1349ms | 7.4123 KOps/s | 7.1815 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1305ms | 64.3340μs | 15.5439 KOps/s | 14.8788 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.4079ms | 0.3403ms | 2.9389 KOps/s | 2.9525 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.7030ms | 0.6388ms | 1.5654 KOps/s | 1.5109 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3282ms | 0.2729ms | 3.6650 KOps/s | 3.6166 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.4181ms | 0.3428ms | 2.9169 KOps/s | 2.9080 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1611ms | 77.7005μs | 12.8699 KOps/s | 12.9676 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1892ms | 0.1409ms | 7.0949 KOps/s | 7.2914 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.7400ms | 0.5426ms | 1.8431 KOps/s | 1.7783 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.3744ms | 0.3387ms | 2.9521 KOps/s | 2.9278 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 40.3300μs | 19.5178μs | 51.2354 KOps/s | 50.7750 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 48.6900μs | 31.6491μs | 31.5964 KOps/s | 30.2214 KOps/s | |
test_compile_copy_flat[pytree-compile] | 94.5820μs | 75.9890μs | 13.1598 KOps/s | 12.2261 KOps/s | |
test_compile_copy_flat[pytree-eager] | 84.6920μs | 59.9222μs | 16.6883 KOps/s | 15.5524 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 2.4964ms | 0.8768ms | 1.1405 KOps/s | 1.0604 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 3.4464ms | 3.3953ms | 294.5241 Ops/s | 293.4456 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 2.4839ms | 0.8594ms | 1.1636 KOps/s | 1.0675 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 3.3941ms | 3.3317ms | 300.1507 Ops/s | 288.3406 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.1579ms | 0.1148ms | 8.7131 KOps/s | 8.7204 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.2121ms | 62.2699μs | 16.0591 KOps/s | 15.6559 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.1563ms | 0.1069ms | 9.3565 KOps/s | 9.2845 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.1017ms | 45.0078μs | 22.2184 KOps/s | 21.0056 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1513ms | 0.1069ms | 9.3577 KOps/s | 9.3114 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 78.8420μs | 45.0860μs | 22.1798 KOps/s | 21.1120 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.2134ms | 0.1429ms | 6.9979 KOps/s | 6.9544 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1836ms | 27.0245μs | 37.0035 KOps/s | 35.4679 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.2044ms | 0.1359ms | 7.3578 KOps/s | 7.2184 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 45.2320μs | 22.8716μs | 43.7223 KOps/s | 41.1194 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.2114ms | 0.1359ms | 7.3557 KOps/s | 7.3322 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 52.7610μs | 22.8582μs | 43.7479 KOps/s | 42.5048 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.2670ms | 0.1425ms | 7.0172 KOps/s | 6.9629 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.5185ms | 28.6783μs | 34.8695 KOps/s | 35.5197 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1879ms | 0.1356ms | 7.3771 KOps/s | 7.3520 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 53.3820μs | 22.8662μs | 43.7326 KOps/s | 42.0255 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.2067ms | 0.1356ms | 7.3736 KOps/s | 7.3648 KOps/s | |
test_compile_indexing[int-pytree-eager] | 53.3010μs | 23.1758μs | 43.1484 KOps/s | 42.2116 KOps/s | |
test_mod_add[eager] | 77.3320μs | 34.4495μs | 29.0280 KOps/s | 29.5815 KOps/s | |
test_mod_add[compile] | 0.1110ms | 72.0193μs | 13.8852 KOps/s | 13.8260 KOps/s | |
test_mod_add[compile-overhead] | 0.2652ms | 0.1442ms | 6.9363 KOps/s | 6.3391 KOps/s | |
test_mod_wrap[eager] | 0.3208ms | 0.2560ms | 3.9068 KOps/s | 3.7612 KOps/s | |
test_mod_wrap[compile] | 1.1565ms | 0.3039ms | 3.2909 KOps/s | 3.2764 KOps/s | |
test_mod_wrap[compile-overhead] | 8.4491ms | 4.4166ms | 226.4177 Ops/s | 232.3442 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.4675ms | 1.3703ms | 729.7762 Ops/s | 681.4237 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.4926ms | 1.3760ms | 726.7666 Ops/s | 724.7330 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.3728ms | 0.9219ms | 1.0847 KOps/s | 996.7464 Ops/s | |
test_seq_add[eager] | 0.1424ms | 0.1037ms | 9.6412 KOps/s | 9.7464 KOps/s | |
test_seq_add[compile] | 0.1986ms | 83.7372μs | 11.9421 KOps/s | 11.6157 KOps/s | |
test_seq_add[compile-overhead] | 0.2722ms | 0.1198ms | 8.3484 KOps/s | 8.3437 KOps/s | |
test_seq_wrap[eager] | 0.4535ms | 0.3943ms | 2.5362 KOps/s | 2.4022 KOps/s | |
test_seq_wrap[compile] | 0.3870ms | 0.3243ms | 3.0839 KOps/s | 3.0201 KOps/s | |
test_seq_wrap[compile-overhead] | 0.2717ms | 0.2308ms | 4.3328 KOps/s | 4.2198 KOps/s | |
test_func_call_runtime[False-eager] | 0.7986ms | 0.7496ms | 1.3340 KOps/s | 1.2495 KOps/s | |
test_func_call_runtime[False-compile] | 0.9291ms | 0.8175ms | 1.2232 KOps/s | 1.2076 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.4562ms | 0.3768ms | 2.6542 KOps/s | 2.6215 KOps/s | |
test_func_call_runtime[True-eager] | 0.9984ms | 0.9284ms | 1.0772 KOps/s | 1.0352 KOps/s | |
test_func_call_runtime[True-compile] | 0.9669ms | 0.8636ms | 1.1579 KOps/s | 1.1511 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.5332ms | 0.4229ms | 2.3647 KOps/s | 2.3611 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.9821ms | 0.7721ms | 1.2952 KOps/s | 1.2430 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.9060ms | 0.8166ms | 1.2246 KOps/s | 1.1755 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.4750ms | 0.3810ms | 2.6246 KOps/s | 2.6361 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.2848ms | 1.0503ms | 952.0694 Ops/s | 932.8980 Ops/s | |
test_func_call_cm_runtime[True-compile] | 1.1537ms | 1.0176ms | 982.7160 Ops/s | 960.2379 Ops/s | |
test_func_call_cm_runtime[True-compile-overhead] | 1.0629ms | 1.0197ms | 980.7133 Ops/s | 958.9533 Ops/s | |
test_distributed | 1.5480ms | 74.3406μs | 13.4516 KOps/s | 13.5083 KOps/s | |
test_tdmodule | 33.0310μs | 16.7986μs | 59.5289 KOps/s | 63.5832 KOps/s | |
test_tdmodule_dispatch | 49.7510μs | 33.8190μs | 29.5692 KOps/s | 30.9622 KOps/s | |
test_tdseq | 33.3300μs | 17.4252μs | 57.3883 KOps/s | 58.3688 KOps/s | |
test_tdseq_dispatch | 52.3610μs | 35.8820μs | 27.8691 KOps/s | 28.0871 KOps/s | |
test_instantiation_functorch | 2.0986ms | 2.0199ms | 495.0847 Ops/s | 479.8881 Ops/s | |
test_instantiation_td | 2.0372ms | 1.3076ms | 764.7759 Ops/s | 743.7518 Ops/s | |
test_exec_functorch | 0.2544ms | 0.2193ms | 4.5597 KOps/s | 4.4147 KOps/s | |
test_exec_functional_call | 0.2723ms | 0.2151ms | 4.6496 KOps/s | 4.5636 KOps/s | |
test_exec_td | 0.2608ms | 0.2238ms | 4.4682 KOps/s | 4.2596 KOps/s | |
test_exec_td_decorator | 0.8122ms | 0.2765ms | 3.6165 KOps/s | 3.5645 KOps/s | |
test_vmap_mlp_speed[True-True] | 0.7775ms | 0.6672ms | 1.4989 KOps/s | 1.4760 KOps/s | |
test_vmap_mlp_speed[True-False] | 0.7737ms | 0.6573ms | 1.5213 KOps/s | 1.4623 KOps/s | |
test_vmap_mlp_speed[False-True] | 0.7352ms | 0.5782ms | 1.7294 KOps/s | 1.7228 KOps/s | |
test_vmap_mlp_speed[False-False] | 0.7088ms | 0.5760ms | 1.7361 KOps/s | 1.7125 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.8181ms | 0.7073ms | 1.4139 KOps/s | 1.4049 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 1.1516ms | 0.7105ms | 1.4075 KOps/s | 1.4036 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7524ms | 0.6167ms | 1.6216 KOps/s | 1.6050 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7534ms | 0.6195ms | 1.6141 KOps/s | 1.6040 KOps/s | |
test_vmap_transformer_speed[True-True] | 9.0594ms | 8.8079ms | 113.5339 Ops/s | 112.5345 Ops/s | |
test_vmap_transformer_speed[True-False] | 9.2214ms | 8.8162ms | 113.4279 Ops/s | 112.7929 Ops/s | |
test_vmap_transformer_speed[False-True] | 9.0305ms | 8.7340ms | 114.4949 Ops/s | 113.5261 Ops/s | |
test_vmap_transformer_speed[False-False] | 8.7960ms | 8.6885ms | 115.0945 Ops/s | 113.4929 Ops/s | |
test_vmap_transformer_speed_decorator[True-True] | 20.8448ms | 20.7786ms | 48.1265 Ops/s | 48.1418 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 20.8494ms | 20.7611ms | 48.1671 Ops/s | 48.0184 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 21.1083ms | 20.6381ms | 48.4541 Ops/s | 48.4850 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 20.6974ms | 20.6382ms | 48.4539 Ops/s | 48.2323 Ops/s | |
test_to_module_speed[True] | 1.5297ms | 1.1339ms | 881.8920 Ops/s | 869.3480 Ops/s | |
test_to_module_speed[False] | 1.6278ms | 1.1310ms | 884.1425 Ops/s | 895.5215 Ops/s | |
test_tc_init | 67.8720μs | 41.9479μs | 23.8391 KOps/s | 24.4707 KOps/s | |
test_tc_init_nested | 0.1220ms | 86.9945μs | 11.4950 KOps/s | 12.5693 KOps/s | |
test_tc_first_layer_tensor | 4.4183μs | 0.8016μs | 1.2475 MOps/s | 1.2732 MOps/s | |
test_tc_first_layer_nontensor | 21.8400μs | 2.5777μs | 387.9452 KOps/s | 397.9064 KOps/s | |
test_tc_second_layer_tensor | 46.4173μs | 1.6288μs | 613.9370 KOps/s | 629.3522 KOps/s | |
test_tc_second_layer_nontensor | 19.4400μs | 3.4080μs | 293.4239 KOps/s | 296.8911 KOps/s | |
test_unbind | 0.1783s | 10.4769ms | 95.4477 Ops/s | 64.2190 Ops/s | |
test_full_like | 0.6580ms | 0.5785ms | 1.7287 KOps/s | 1.7334 KOps/s | |
test_zeros_like | 0.2683ms | 0.1978ms | 5.0556 KOps/s | 5.0599 KOps/s | |
test_ones_like | 0.2289ms | 0.1977ms | 5.0591 KOps/s | 5.0635 KOps/s | |
test_clone | 0.4746ms | 0.4146ms | 2.4118 KOps/s | 2.4201 KOps/s | |
test_squeeze | 31.8910μs | 11.0954μs | 90.1270 KOps/s | 90.2950 KOps/s | |
test_unsqueeze | 0.2400ms | 80.1955μs | 12.4695 KOps/s | 12.1539 KOps/s | |
test_split | 0.4543ms | 0.1766ms | 5.6615 KOps/s | 5.6086 KOps/s | |
test_permute | 0.3304ms | 0.1918ms | 5.2143 KOps/s | 5.1584 KOps/s | |
test_stack | 1.2489ms | 0.8986ms | 1.1129 KOps/s | 1.1233 KOps/s | |
test_cat | 1.2486ms | 1.2311ms | 812.3040 Ops/s | 811.8839 Ops/s |
Labels: CLA Signed, Test
No description provided.