[Test] Skip compile tests that require 2.5 for stable #996
Merged
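A version-gated skip of the kind the title describes might look like the sketch below. This is not the code from this pull request: `version_tuple`, `INSTALLED_TORCH_VERSION`, `TORCH_OLDER_THAN_2_5`, and `CompileTests` are hypothetical names, the version string is hard-coded where a real test module would presumably read `torch.__version__`, and the standard library's `unittest.skipIf` stands in for whatever skip mechanism the repository's test suite actually uses.

```python
import re
import unittest


def version_tuple(version: str) -> tuple:
    """Return the leading (major, minor) numbers of a version string."""
    match = re.match(r"(\d+)\.(\d+)", version)
    if match is None:
        raise ValueError(f"unrecognized version string: {version!r}")
    return int(match.group(1)), int(match.group(2))


# Assumption: hard-coded for illustration; a real test module would
# read the installed version instead (e.g. torch.__version__).
INSTALLED_TORCH_VERSION = "2.4.1"

# True when the installed release predates 2.5.
TORCH_OLDER_THAN_2_5 = version_tuple(INSTALLED_TORCH_VERSION) < (2, 5)


class CompileTests(unittest.TestCase):
    @unittest.skipIf(TORCH_OLDER_THAN_2_5, "requires PyTorch >= 2.5")
    def test_compile_feature(self):
        # Placeholder body; a real test would exercise compiled code here.
        self.assertTrue(True)
```

Comparing only the `(major, minor)` pair keeps nightly and local build suffixes (such as `.dev20240917+cu121`) from affecting the decision.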
vmoens added a commit that referenced this pull request on Sep 17, 2024
ghstack-source-id: 531f17478756b54eacc70b1f4c9be319a6335a37
Pull Request resolved: #996
Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_plain_set_nested | 45.4360μs | 21.1071μs | 47.3775 KOps/s | 52.7448 KOps/s | |
test_plain_set_stack_nested | 49.6930μs | 21.4057μs | 46.7165 KOps/s | 52.5680 KOps/s | |
test_plain_set_nested_inplace | 74.8100μs | 22.7879μs | 43.8830 KOps/s | 48.0164 KOps/s | |
test_plain_set_stack_nested_inplace | 57.0270μs | 22.8166μs | 43.8278 KOps/s | 47.9525 KOps/s | |
test_items | 20.0570μs | 4.3374μs | 230.5536 KOps/s | 240.5228 KOps/s | |
test_items_nested | 0.6000ms | 0.3638ms | 2.7486 KOps/s | 2.7319 KOps/s | |
test_items_nested_locked | 0.6260ms | 0.3627ms | 2.7570 KOps/s | 2.7471 KOps/s | |
test_items_nested_leaf | 0.1256ms | 67.8537μs | 14.7376 KOps/s | 14.6690 KOps/s | |
test_items_stack_nested | 0.4904ms | 0.3697ms | 2.7048 KOps/s | 2.6873 KOps/s | |
test_items_stack_nested_leaf | 0.1280ms | 70.3729μs | 14.2100 KOps/s | 14.1051 KOps/s | |
test_items_stack_nested_locked | 0.5754ms | 0.3707ms | 2.6977 KOps/s | 2.6759 KOps/s | |
test_keys | 20.2070μs | 3.7788μs | 264.6354 KOps/s | 269.9298 KOps/s | |
test_keys_nested | 0.1683ms | 0.1005ms | 9.9530 KOps/s | 9.7856 KOps/s | |
test_keys_nested_locked | 1.6825ms | 0.1071ms | 9.3342 KOps/s | 9.4524 KOps/s | |
test_keys_nested_leaf | 0.1514ms | 86.2090μs | 11.5997 KOps/s | 11.9323 KOps/s | |
test_keys_stack_nested | 0.1718ms | 0.1005ms | 9.9458 KOps/s | 9.8978 KOps/s | |
test_keys_stack_nested_leaf | 0.1445ms | 83.3967μs | 11.9909 KOps/s | 11.9299 KOps/s | |
test_keys_stack_nested_locked | 0.1752ms | 0.1061ms | 9.4256 KOps/s | 9.3091 KOps/s | |
test_values | 5.9592μs | 1.0691μs | 935.3570 KOps/s | 949.1950 KOps/s | |
test_values_nested | 0.3411ms | 76.7045μs | 13.0370 KOps/s | 13.6391 KOps/s | |
test_values_nested_locked | 0.1304ms | 73.3731μs | 13.6290 KOps/s | 13.6341 KOps/s | |
test_values_nested_leaf | 0.3848ms | 62.6979μs | 15.9495 KOps/s | 15.9367 KOps/s | |
test_values_stack_nested | 0.1330ms | 74.5782μs | 13.4087 KOps/s | 13.3730 KOps/s | |
test_values_stack_nested_leaf | 0.2338ms | 60.0617μs | 16.6496 KOps/s | 15.9171 KOps/s | |
test_values_stack_nested_locked | 0.1635ms | 74.5886μs | 13.4069 KOps/s | 13.4691 KOps/s | |
test_membership | 5.2108μs | 0.6978μs | 1.4330 MOps/s | 1.0897 MOps/s | |
test_membership_nested | 0.1093ms | 2.8610μs | 349.5339 KOps/s | 366.8371 KOps/s | |
test_membership_nested_leaf | 54.8430μs | 2.8557μs | 350.1778 KOps/s | 367.7709 KOps/s | |
test_membership_stacked_nested | 23.5850μs | 2.7811μs | 359.5739 KOps/s | 372.9649 KOps/s | |
test_membership_stacked_nested_leaf | 23.4740μs | 2.8320μs | 353.1062 KOps/s | 366.3796 KOps/s | |
test_membership_nested_last | 24.0160μs | 4.0424μs | 247.3780 KOps/s | 252.1458 KOps/s | |
test_membership_nested_leaf_last | 25.3180μs | 4.0664μs | 245.9205 KOps/s | 253.9772 KOps/s | |
test_membership_stacked_nested_last | 33.7140μs | 5.5474μs | 180.2646 KOps/s | 179.0521 KOps/s | |
test_membership_stacked_nested_leaf_last | 30.1570μs | 5.5266μs | 180.9421 KOps/s | 180.2578 KOps/s | |
test_nested_getleaf | 43.9030μs | 11.0340μs | 90.6291 KOps/s | 94.4306 KOps/s | |
test_nested_get | 45.3650μs | 10.5838μs | 94.4840 KOps/s | 98.8217 KOps/s | |
test_stacked_getleaf | 34.1340μs | 11.0523μs | 90.4786 KOps/s | 94.5207 KOps/s | |
test_stacked_get | 34.7360μs | 10.5709μs | 94.5995 KOps/s | 99.7441 KOps/s | |
test_nested_getitemleaf | 44.4740μs | 11.2535μs | 88.8616 KOps/s | 89.9628 KOps/s | |
test_nested_getitem | 47.4190μs | 10.6346μs | 94.0327 KOps/s | 95.9945 KOps/s | |
test_stacked_getitemleaf | 34.7960μs | 11.2830μs | 88.6292 KOps/s | 92.4000 KOps/s | |
test_stacked_getitem | 38.0320μs | 10.6701μs | 93.7200 KOps/s | 98.5988 KOps/s | |
test_lock_nested | 92.4819ms | 0.6023ms | 1.6603 KOps/s | 2.0237 KOps/s | |
test_lock_stack_nested | 0.8037ms | 0.4457ms | 2.2439 KOps/s | 2.2198 KOps/s | |
test_unlock_nested | 94.1709ms | 0.5184ms | 1.9289 KOps/s | 2.4466 KOps/s | |
test_unlock_stack_nested | 0.5851ms | 0.3644ms | 2.7442 KOps/s | 2.6954 KOps/s | |
test_flatten_speed | 0.1712ms | 88.3837μs | 11.3143 KOps/s | 11.2108 KOps/s | |
test_unflatten_speed | 0.9848ms | 0.4761ms | 2.1002 KOps/s | 2.1641 KOps/s | |
test_common_ops | 4.6756ms | 1.1801ms | 847.4082 Ops/s | 922.1649 Ops/s | |
test_creation | 19.9270μs | 2.0738μs | 482.1981 KOps/s | 484.7066 KOps/s | |
test_creation_empty | 70.2610μs | 19.1198μs | 52.3017 KOps/s | 64.3983 KOps/s | |
test_creation_nested_1 | 59.0710μs | 22.1323μs | 45.1828 KOps/s | 54.6995 KOps/s | |
test_creation_nested_2 | 63.8190μs | 26.3863μs | 37.8984 KOps/s | 43.4417 KOps/s | |
test_clone | 1.4067ms | 17.8627μs | 55.9825 KOps/s | 59.8058 KOps/s | |
test_getitem[int] | 0.8102ms | 16.8907μs | 59.2041 KOps/s | 60.8786 KOps/s | |
test_getitem[slice_int] | 0.1505ms | 31.1441μs | 32.1088 KOps/s | 32.8248 KOps/s | |
test_getitem[range] | 0.3809ms | 60.8297μs | 16.4393 KOps/s | 16.9929 KOps/s | |
test_getitem[tuple] | 0.1534ms | 25.5371μs | 39.1588 KOps/s | 40.5420 KOps/s | |
test_getitem[list] | 0.3402ms | 56.0888μs | 17.8289 KOps/s | 18.8536 KOps/s | |
test_setitem_dim[int] | 73.3070μs | 34.3543μs | 29.1085 KOps/s | 30.7244 KOps/s | |
test_setitem_dim[slice_int] | 0.1067ms | 63.0031μs | 15.8722 KOps/s | 16.4398 KOps/s | |
test_setitem_dim[range] | 0.1411ms | 86.2763μs | 11.5907 KOps/s | 11.9035 KOps/s | |
test_setitem_dim[tuple] | 0.1073ms | 51.0158μs | 19.6018 KOps/s | 20.6057 KOps/s | |
test_setitem | 0.1923ms | 31.6315μs | 31.6141 KOps/s | 34.0336 KOps/s | |
test_set | 0.1600ms | 30.3496μs | 32.9493 KOps/s | 34.8670 KOps/s | |
test_set_shared | 4.0104ms | 0.2178ms | 4.5922 KOps/s | 4.6439 KOps/s | |
test_update | 0.1857ms | 38.0723μs | 26.2658 KOps/s | 29.2061 KOps/s | |
test_update_nested | 0.2228ms | 49.8093μs | 20.0766 KOps/s | 22.0959 KOps/s | |
test_update__nested | 0.1590ms | 36.2348μs | 27.5978 KOps/s | 28.5923 KOps/s | |
test_set_nested | 0.1682ms | 33.7310μs | 29.6463 KOps/s | 32.4474 KOps/s | |
test_set_nested_new | 0.2037ms | 38.9592μs | 25.6679 KOps/s | 27.9705 KOps/s | |
test_select | 0.2186ms | 56.3738μs | 17.7387 KOps/s | 19.0938 KOps/s | |
test_select_nested | 0.1340ms | 60.1118μs | 16.6357 KOps/s | 17.0705 KOps/s | |
test_exclude_nested | 0.1422ms | 75.2593μs | 13.2874 KOps/s | 13.3502 KOps/s | |
test_empty[True] | 0.5222ms | 0.3201ms | 3.1237 KOps/s | 3.1165 KOps/s | |
test_empty[False] | 10.5012μs | 1.1707μs | 854.2146 KOps/s | 829.2823 KOps/s | |
test_unbind_speed | 0.5138ms | 0.3040ms | 3.2893 KOps/s | 3.3123 KOps/s | |
test_unbind_speed_stack0 | 0.4285ms | 0.2866ms | 3.4888 KOps/s | 3.4142 KOps/s | |
test_unbind_speed_stack1 | 96.5653ms | 0.7891ms | 1.2672 KOps/s | 1.3615 KOps/s | |
test_split | 2.1060ms | 1.9909ms | 502.2866 Ops/s | 457.2237 Ops/s | |
test_chunk | 94.7928ms | 2.1823ms | 458.2302 Ops/s | 458.1919 Ops/s | |
test_creation[device0] | 0.2327ms | 0.1159ms | 8.6247 KOps/s | 8.2204 KOps/s | |
test_creation_from_tensor | 3.5644ms | 0.1172ms | 8.5340 KOps/s | 8.5719 KOps/s | |
test_add_one[memmap_tensor0] | 0.2419ms | 7.5582μs | 132.3064 KOps/s | 134.8884 KOps/s | |
test_contiguous[memmap_tensor0] | 21.3600μs | 1.9214μs | 520.4497 KOps/s | 514.1760 KOps/s | |
test_stack[memmap_tensor0] | 43.6820μs | 5.6785μs | 176.1033 KOps/s | 174.5776 KOps/s | |
test_memmaptd_index | 1.1602ms | 0.4086ms | 2.4474 KOps/s | 2.5148 KOps/s | |
test_memmaptd_index_astensor | 1.2060ms | 0.4864ms | 2.0560 KOps/s | 2.1128 KOps/s | |
test_memmaptd_index_op | 1.7791ms | 1.0462ms | 955.8608 Ops/s | 1.0574 KOps/s | |
test_serialize_model | 0.2021s | 0.1317s | 7.5933 Ops/s | 8.4686 Ops/s | |
test_serialize_model_pickle | 0.4563s | 0.3897s | 2.5659 Ops/s | 2.5641 Ops/s | |
test_serialize_weights | 0.1242s | 0.1163s | 8.5985 Ops/s | 7.4539 Ops/s | |
test_serialize_weights_returnearly | 0.1580s | 0.1532s | 6.5284 Ops/s | 6.4311 Ops/s | |
test_serialize_weights_pickle | 0.4493s | 0.3984s | 2.5103 Ops/s | 2.4333 Ops/s | |
test_serialize_weights_filesystem | 0.1461s | 0.1427s | 7.0056 Ops/s | 6.9600 Ops/s | |
test_serialize_model_filesystem | 0.1589s | 0.1519s | 6.5818 Ops/s | 6.5472 Ops/s | |
test_reshape_pytree | 87.0540μs | 40.3338μs | 24.7931 KOps/s | 25.3988 KOps/s | |
test_reshape_td | 0.1555ms | 45.4208μs | 22.0164 KOps/s | 21.9225 KOps/s | |
test_view_pytree | 97.0420μs | 39.2288μs | 25.4915 KOps/s | 25.8846 KOps/s | |
test_view_td | 0.1361ms | 52.5163μs | 19.0417 KOps/s | 19.1283 KOps/s | |
test_unbind_pytree | 76.2130μs | 36.6866μs | 27.2579 KOps/s | 27.7037 KOps/s | |
test_unbind_td | 0.3132ms | 45.6760μs | 21.8933 KOps/s | 20.8676 KOps/s | |
test_split_pytree | 83.8170μs | 38.7662μs | 25.7956 KOps/s | 26.1442 KOps/s | |
test_split_td | 0.1993ms | 58.6200μs | 17.0590 KOps/s | 17.4603 KOps/s | |
test_add_pytree | 0.1187ms | 46.0517μs | 21.7147 KOps/s | 22.1533 KOps/s | |
test_add_td | 0.1674ms | 84.0575μs | 11.8966 KOps/s | 13.1326 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1481ms | 58.3809μs | 17.1289 KOps/s | 17.2941 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.3905ms | 0.1789ms | 5.5911 KOps/s | 5.7267 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1228ms | 58.5316μs | 17.0848 KOps/s | 17.5362 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.3218ms | 0.1443ms | 6.9318 KOps/s | 7.2444 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 77.2850μs | 21.0865μs | 47.4238 KOps/s | 46.4339 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1486ms | 67.7824μs | 14.7531 KOps/s | 15.0297 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1590ms | 77.1224μs | 12.9664 KOps/s | 13.1056 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1281ms | 69.2424μs | 14.4420 KOps/s | 14.4277 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.3781ms | 0.1747ms | 5.7251 KOps/s | 5.8380 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3562ms | 0.1898ms | 5.2698 KOps/s | 5.3808 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1054ms | 45.6623μs | 21.8999 KOps/s | 20.9072 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1515ms | 68.7241μs | 14.5509 KOps/s | 14.6228 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2826ms | 0.1783ms | 5.6099 KOps/s | 5.7548 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.5379ms | 0.2895ms | 3.4538 KOps/s | 3.4354 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.4609ms | 0.2011ms | 4.9730 KOps/s | 4.9412 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.4427ms | 0.1819ms | 5.4966 KOps/s | 5.7712 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1567ms | 63.5334μs | 15.7398 KOps/s | 15.9350 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1229ms | 47.1818μs | 21.1946 KOps/s | 21.0051 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.3080ms | 0.2344ms | 4.2669 KOps/s | 4.3748 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.3350ms | 0.1764ms | 5.6694 KOps/s | 5.6712 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 0.5520ms | 0.1078ms | 9.2786 KOps/s | 9.7006 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.2116ms | 62.3004μs | 16.0513 KOps/s | 17.5354 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1569ms | 77.4595μs | 12.9100 KOps/s | 12.6742 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1459ms | 68.8458μs | 14.5252 KOps/s | 14.0073 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.3776ms | 0.1911ms | 5.2323 KOps/s | 5.1242 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 1.8864ms | 1.6324ms | 612.6060 Ops/s | 619.1677 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.2857ms | 0.1933ms | 5.1725 KOps/s | 5.2033 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 1.5107ms | 1.1104ms | 900.5431 Ops/s | 935.4301 Ops/s | |
test_compile_assign_and_add_stack[compile] | 0.8110ms | 0.4173ms | 2.3964 KOps/s | 2.3280 KOps/s | |
test_compile_assign_and_add_stack[eager] | 5.6426ms | 3.8853ms | 257.3773 Ops/s | 279.3335 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 84.7390μs | 33.9404μs | 29.4634 KOps/s | 27.9032 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 1.0723ms | 50.2563μs | 19.8980 KOps/s | 20.6674 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.1001ms | 30.7766μs | 32.4922 KOps/s | 33.8871 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 95.5990μs | 29.5506μs | 33.8403 KOps/s | 33.5357 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 79.1480μs | 30.2465μs | 33.0617 KOps/s | 34.0793 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 93.2550μs | 29.9999μs | 33.3334 KOps/s | 34.1460 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1581ms | 73.9279μs | 13.5267 KOps/s | 13.6682 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.5170ms | 27.6770μs | 36.1311 KOps/s | 37.1319 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1389ms | 67.0144μs | 14.9222 KOps/s | 14.6581 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 77.5160μs | 23.4359μs | 42.6696 KOps/s | 42.0803 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1628ms | 68.3970μs | 14.6205 KOps/s | 14.6782 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 82.8450μs | 23.4714μs | 42.6051 KOps/s | 42.2930 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1598ms | 74.2371μs | 13.4704 KOps/s | 13.8266 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.8901ms | 27.5914μs | 36.2432 KOps/s | 37.8094 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1415ms | 68.2478μs | 14.6525 KOps/s | 14.7464 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 82.7650μs | 23.2224μs | 43.0619 KOps/s | 42.6491 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1515ms | 68.2072μs | 14.6612 KOps/s | 14.7708 KOps/s | |
test_compile_indexing[int-pytree-eager] | 58.9500μs | 23.3418μs | 42.8416 KOps/s | 42.5321 KOps/s | |
test_mod_add[eager] | 87.1830μs | 26.1035μs | 38.3091 KOps/s | 41.4728 KOps/s | |
test_mod_add[compile] | 82.1850μs | 39.3576μs | 25.4080 KOps/s | 26.5832 KOps/s | |
test_mod_add[compile-overhead] | 80.6110μs | 39.7582μs | 25.1520 KOps/s | 26.3888 KOps/s | |
test_mod_wrap[eager] | 0.4199ms | 0.2127ms | 4.7009 KOps/s | 4.8562 KOps/s | |
test_mod_wrap[compile] | 0.4100ms | 0.2337ms | 4.2781 KOps/s | 4.3430 KOps/s | |
test_mod_wrap[compile-overhead] | 0.3840ms | 0.2320ms | 4.3101 KOps/s | 4.3914 KOps/s | |
test_mod_wrap_and_backward[eager] | 17.9058ms | 12.3975ms | 80.6614 Ops/s | 87.4740 Ops/s | |
test_mod_wrap_and_backward[compile] | 14.1027ms | 11.7904ms | 84.8149 Ops/s | 79.2917 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 17.6497ms | 12.0027ms | 83.3149 Ops/s | 85.2111 Ops/s | |
test_seq_add[eager] | 0.2077ms | 92.3625μs | 10.8269 KOps/s | 11.3372 KOps/s | |
test_seq_add[compile] | 0.4562ms | 72.0855μs | 13.8724 KOps/s | 15.5752 KOps/s | |
test_seq_add[compile-overhead] | 0.1182ms | 62.7258μs | 15.9424 KOps/s | 16.0528 KOps/s | |
test_seq_wrap[eager] | 0.6513ms | 0.3965ms | 2.5223 KOps/s | 2.6395 KOps/s | |
test_seq_wrap[compile] | 0.4845ms | 0.2642ms | 3.7847 KOps/s | 3.6986 KOps/s | |
test_seq_wrap[compile-overhead] | 0.5238ms | 0.2680ms | 3.7316 KOps/s | 3.7563 KOps/s | |
test_func_call_runtime[False-eager] | 0.8932ms | 0.5197ms | 1.9243 KOps/s | 1.8911 KOps/s | |
test_func_call_runtime[False-compile] | 0.9371ms | 0.5024ms | 1.9905 KOps/s | 2.0125 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.6260ms | 0.5013ms | 1.9950 KOps/s | 2.0028 KOps/s | |
test_func_call_runtime[True-eager] | 0.8672ms | 0.7399ms | 1.3515 KOps/s | 1.3261 KOps/s | |
test_func_call_runtime[True-compile] | 0.9196ms | 0.5139ms | 1.9460 KOps/s | 1.9543 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.9642ms | 0.5152ms | 1.9410 KOps/s | 1.9486 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.9300ms | 0.5236ms | 1.9098 KOps/s | 1.8671 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.8042ms | 0.5069ms | 1.9729 KOps/s | 1.9818 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.8915ms | 0.5067ms | 1.9736 KOps/s | 1.9895 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.0911ms | 0.8648ms | 1.1564 KOps/s | 1.1298 KOps/s | |
test_func_call_cm_runtime[True-compile] | 0.9628ms | 0.7389ms | 1.3533 KOps/s | 1.3292 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 1.1818ms | 0.7347ms | 1.3611 KOps/s | 1.3193 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 3.2056ms | 1.8866ms | 530.0577 Ops/s | 539.3361 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 2.9016ms | 1.9577ms | 510.7912 Ops/s | 515.8425 Ops/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 2.6631ms | 1.9461ms | 513.8569 Ops/s | 521.9402 Ops/s | |
test_distributed | 0.3192ms | 0.1230ms | 8.1330 KOps/s | 7.7987 KOps/s | |
test_tdmodule | 46.0870μs | 19.1543μs | 52.2076 KOps/s | 59.4281 KOps/s | |
test_tdmodule_dispatch | 60.9740μs | 38.3454μs | 26.0787 KOps/s | 29.7709 KOps/s | |
test_tdseq | 50.2740μs | 21.4116μs | 46.7036 KOps/s | 52.4572 KOps/s | |
test_tdseq_dispatch | 73.1070μs | 42.9825μs | 23.2653 KOps/s | 26.1806 KOps/s | |
test_instantiation_functorch | 1.7443ms | 1.5816ms | 632.2848 Ops/s | 631.6823 Ops/s | |
test_instantiation_td | 2.0012ms | 1.1526ms | 867.6160 Ops/s | 848.2312 Ops/s | |
test_exec_functorch | 0.4010ms | 0.1869ms | 5.3504 KOps/s | 5.4071 KOps/s | |
test_exec_functional_call | 0.2744ms | 0.1728ms | 5.7854 KOps/s | 5.8092 KOps/s | |
test_exec_td | 0.2638ms | 0.1705ms | 5.8668 KOps/s | 5.9366 KOps/s | |
test_exec_td_decorator | 1.2151ms | 0.2254ms | 4.4367 KOps/s | 4.4003 KOps/s | |
test_vmap_mlp_speed[True-True] | 1.0456ms | 0.6583ms | 1.5191 KOps/s | 1.5691 KOps/s | |
test_vmap_mlp_speed[True-False] | 0.8866ms | 0.6536ms | 1.5300 KOps/s | 1.5787 KOps/s | |
test_vmap_mlp_speed[False-True] | 1.4553ms | 0.5144ms | 1.9439 KOps/s | 2.0070 KOps/s | |
test_vmap_mlp_speed[False-False] | 0.7663ms | 0.5063ms | 1.9749 KOps/s | 2.0027 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 1.3146ms | 0.6401ms | 1.5624 KOps/s | 1.6231 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.9058ms | 0.6372ms | 1.5695 KOps/s | 1.6084 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.6859ms | 0.5224ms | 1.9141 KOps/s | 1.9059 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.8001ms | 0.5236ms | 1.9099 KOps/s | 1.9478 KOps/s | |
test_to_module_speed[True] | 2.1009ms | 1.2808ms | 780.7726 Ops/s | 776.3551 Ops/s | |
test_to_module_speed[False] | 2.0342ms | 1.2517ms | 798.9142 Ops/s | 793.6217 Ops/s | |
test_tc_init | 0.2995ms | 52.7950μs | 18.9412 KOps/s | 24.2522 KOps/s | |
test_tc_init_nested | 0.3221ms | 94.6046μs | 10.5703 KOps/s | 12.0958 KOps/s | |
test_tc_first_layer_tensor | 19.6060μs | 1.5539μs | 643.5299 KOps/s | 653.2868 KOps/s | |
test_tc_first_layer_nontensor | 0.1016ms | 4.7720μs | 209.5539 KOps/s | 208.3185 KOps/s | |
test_tc_second_layer_tensor | 0.1659ms | 2.8883μs | 346.2190 KOps/s | 358.1946 KOps/s | |
test_tc_second_layer_nontensor | 50.1740μs | 5.9420μs | 168.2932 KOps/s | 166.5975 KOps/s | |
test_unbind | 0.4807s | 13.1526ms | 76.0304 Ops/s | 75.0380 Ops/s | |
test_full_like | 8.4463ms | 7.2449ms | 138.0279 Ops/s | 138.4911 Ops/s | |
test_zeros_like | 3.6726ms | 2.9802ms | 335.5440 Ops/s | 321.6460 Ops/s | |
test_ones_like | 3.8754ms | 3.3055ms | 302.5264 Ops/s | 159.2632 Ops/s | |
test_clone | 6.9830ms | 5.5755ms | 179.3566 Ops/s | 120.1433 Ops/s | |
test_squeeze | 71.7750μs | 12.6212μs | 79.2315 KOps/s | 80.3479 KOps/s | |
test_unsqueeze | 0.3482ms | 93.4207μs | 10.7043 KOps/s | 10.7441 KOps/s | |
test_split | 0.3890ms | 0.2017ms | 4.9573 KOps/s | 5.0136 KOps/s | |
test_permute | 0.3026ms | 0.2220ms | 4.5036 KOps/s | 4.4740 KOps/s | |
test_stack | 29.1201ms | 26.1285ms | 38.2724 Ops/s | 37.2066 Ops/s | |
test_cat | 32.1400ms | 26.5840ms | 37.6167 Ops/s | 38.0723 Ops/s |
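
The Change column is empty in these results. As a rough sketch of how a relative change could be derived from the Ops and Ops on Repo HEAD columns, the snippet below normalizes the `Ops/s`, `KOps/s`, and `MOps/s` units before comparing. The helper names `parse_ops` and `relative_change` are hypothetical, and the formula (difference divided by the Repo HEAD value) is an assumption, not necessarily what the benchmark tooling reports.

```python
# Multipliers for the throughput units that appear in the table.
_UNIT_SCALE = {"Ops/s": 1.0, "KOps/s": 1e3, "MOps/s": 1e6}


def parse_ops(cell: str) -> float:
    """Convert a cell such as '47.3775 KOps/s' to operations per second."""
    value, unit = cell.split()
    return float(value) * _UNIT_SCALE[unit]


def relative_change(ops: str, ops_on_head: str) -> float:
    """Fractional change of the PR throughput relative to Repo HEAD."""
    head = parse_ops(ops_on_head)
    return (parse_ops(ops) - head) / head


# Values copied from the test_plain_set_nested and test_memmaptd_index_op rows.
print(f"{relative_change('47.3775 KOps/s', '52.7448 KOps/s'):+.1%}")
print(f"{relative_change('955.8608 Ops/s', '1.0574 KOps/s'):+.1%}")
```

Normalizing units first matters for rows such as test_memmaptd_index_op, where the two columns are reported in different units.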

Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_plain_set_nested | 0.6011ms | 13.8308μs | 72.3026 KOps/s | 66.8500 KOps/s | |
test_plain_set_stack_nested | 47.9710μs | 14.0341μs | 71.2551 KOps/s | 66.1518 KOps/s | |
test_plain_set_nested_inplace | 52.2710μs | 14.9067μs | 67.0840 KOps/s | 62.0839 KOps/s | |
test_plain_set_stack_nested_inplace | 45.3010μs | 14.8726μs | 67.2379 KOps/s | 63.1544 KOps/s | |
test_items | 30.3200μs | 2.9063μs | 344.0746 KOps/s | 342.0789 KOps/s | |
test_items_nested | 0.3784ms | 0.3298ms | 3.0319 KOps/s | 3.0578 KOps/s | |
test_items_nested_locked | 0.5101ms | 0.3311ms | 3.0205 KOps/s | 3.0481 KOps/s | |
test_items_nested_leaf | 84.4910μs | 56.0848μs | 17.8302 KOps/s | 17.7977 KOps/s | |
test_items_stack_nested | 0.3877ms | 0.3352ms | 2.9837 KOps/s | 3.0785 KOps/s | |
test_items_stack_nested_leaf | 82.7720μs | 58.2817μs | 17.1580 KOps/s | 17.4717 KOps/s | |
test_items_stack_nested_locked | 0.4066ms | 0.3342ms | 2.9922 KOps/s | 3.0433 KOps/s | |
test_keys | 25.9200μs | 3.4644μs | 288.6526 KOps/s | 288.9242 KOps/s | |
test_keys_nested | 91.5520μs | 58.0816μs | 17.2171 KOps/s | 17.5110 KOps/s | |
test_keys_nested_locked | 2.2807ms | 63.9575μs | 15.6354 KOps/s | 16.0261 KOps/s | |
test_keys_nested_leaf | 80.6520μs | 49.2428μs | 20.3075 KOps/s | 20.7773 KOps/s | |
test_keys_stack_nested | 90.2620μs | 57.7822μs | 17.3064 KOps/s | 17.5621 KOps/s | |
test_keys_stack_nested_leaf | 84.3320μs | 49.4731μs | 20.2130 KOps/s | 20.4370 KOps/s | |
test_keys_stack_nested_locked | 91.7410μs | 62.7451μs | 15.9375 KOps/s | 16.1824 KOps/s | |
test_values | 5.3468μs | 0.8832μs | 1.1322 MOps/s | 1.1534 MOps/s | |
test_values_nested | 96.2020μs | 41.3024μs | 24.2117 KOps/s | 24.6496 KOps/s | |
test_values_nested_locked | 74.8010μs | 42.8217μs | 23.3526 KOps/s | 23.4504 KOps/s | |
test_values_nested_leaf | 79.6610μs | 35.4964μs | 28.1719 KOps/s | 28.4414 KOps/s | |
test_values_stack_nested | 76.7120μs | 42.0405μs | 23.7866 KOps/s | 24.1615 KOps/s | |
test_values_stack_nested_leaf | 61.9210μs | 36.0656μs | 27.7272 KOps/s | 28.0525 KOps/s | |
test_values_stack_nested_locked | 75.1320μs | 43.8582μs | 22.8008 KOps/s | 23.0919 KOps/s | |
test_membership | 4.5673μs | 0.5107μs | 1.9581 MOps/s | 1.9814 MOps/s | |
test_membership_nested | 25.0605μs | 1.9162μs | 521.8665 KOps/s | 523.7167 KOps/s | |
test_membership_nested_leaf | 18.8855μs | 1.9149μs | 522.2222 KOps/s | 526.7061 KOps/s | |
test_membership_stacked_nested | 40.6410μs | 1.9700μs | 507.6233 KOps/s | 511.9163 KOps/s | |
test_membership_stacked_nested_leaf | 28.7400μs | 1.9967μs | 500.8264 KOps/s | 503.9144 KOps/s | |
test_membership_nested_last | 41.1600μs | 2.8068μs | 356.2787 KOps/s | 352.5149 KOps/s | |
test_membership_nested_leaf_last | 35.8000μs | 2.8371μs | 352.4692 KOps/s | 357.7844 KOps/s | |
test_membership_stacked_nested_last | 45.2810μs | 5.3067μs | 188.4424 KOps/s | 269.6665 KOps/s | |
test_membership_stacked_nested_leaf_last | 30.8500μs | 5.3076μs | 188.4078 KOps/s | 269.3840 KOps/s | |
test_nested_getleaf | 33.8910μs | 6.1948μs | 161.4244 KOps/s | 161.8956 KOps/s | |
test_nested_get | 34.8610μs | 5.7642μs | 173.4832 KOps/s | 175.3526 KOps/s | |
test_stacked_getleaf | 34.7910μs | 6.0525μs | 165.2221 KOps/s | 163.9870 KOps/s | |
test_stacked_get | 43.8010μs | 5.6673μs | 176.4507 KOps/s | 178.2584 KOps/s | |
test_nested_getitemleaf | 45.5700μs | 6.1565μs | 162.4309 KOps/s | 160.7351 KOps/s | |
test_nested_getitem | 33.3310μs | 5.8621μs | 170.5876 KOps/s | 171.6982 KOps/s | |
test_stacked_getitemleaf | 36.7310μs | 6.0947μs | 164.0767 KOps/s | 161.7987 KOps/s | |
test_stacked_getitem | 33.1010μs | 5.7663μs | 173.4208 KOps/s | 174.4298 KOps/s | |
test_lock_nested | 3.0154ms | 0.4239ms | 2.3588 KOps/s | 2.3779 KOps/s | |
test_lock_stack_nested | 0.4288ms | 0.3812ms | 2.6236 KOps/s | 2.6266 KOps/s | |
test_unlock_nested | 0.7474ms | 0.3590ms | 2.7855 KOps/s | 2.8116 KOps/s | |
test_unlock_stack_nested | 0.3496ms | 0.3181ms | 3.1436 KOps/s | 3.1411 KOps/s | |
test_flatten_speed | 0.1079ms | 69.4292μs | 14.4032 KOps/s | 14.4688 KOps/s | |
test_unflatten_speed | 0.3605ms | 0.2841ms | 3.5201 KOps/s | 3.5490 KOps/s | |
test_common_ops | 1.5203ms | 1.2582ms | 794.7620 Ops/s | 761.1505 Ops/s | |
test_creation | 39.2900μs | 1.5414μs | 648.7661 KOps/s | 658.2968 KOps/s | |
test_creation_empty | 71.9020μs | 15.4242μs | 64.8330 KOps/s | 57.6934 KOps/s | |
test_creation_nested_1 | 48.4410μs | 17.1587μs | 58.2793 KOps/s | 53.6452 KOps/s | |
test_creation_nested_2 | 1.3810ms | 20.3870μs | 49.0509 KOps/s | 46.5746 KOps/s | |
test_clone | 66.4220μs | 29.6121μs | 33.7700 KOps/s | 34.9372 KOps/s | |
test_getitem[int] | 93.3077ms | 23.9943μs | 41.6766 KOps/s | 60.3779 KOps/s | |
test_getitem[slice_int] | 0.1817ms | 28.1843μs | 35.4807 KOps/s | 36.1451 KOps/s | |
test_getitem[range] | 0.2409ms | 0.1113ms | 8.9819 KOps/s | 9.0074 KOps/s | |
test_getitem[tuple] | 0.1164ms | 24.0271μs | 41.6196 KOps/s | 42.1826 KOps/s | |
test_getitem[list] | 0.2481ms | 99.2910μs | 10.0714 KOps/s | 10.0387 KOps/s | |
test_setitem_dim[int] | 71.8520μs | 45.6828μs | 21.8901 KOps/s | 21.3095 KOps/s | |
test_setitem_dim[slice_int] | 0.1031ms | 68.9617μs | 14.5008 KOps/s | 14.5864 KOps/s | |
test_setitem_dim[range] | 0.1620ms | 0.1300ms | 7.6904 KOps/s | 7.7604 KOps/s | |
test_setitem_dim[tuple] | 88.3010μs | 62.2328μs | 16.0687 KOps/s | 16.4240 KOps/s | |
test_setitem | 85.5110μs | 44.7539μs | 22.3444 KOps/s | 23.4218 KOps/s | |
test_set | 76.3520μs | 43.4821μs | 22.9979 KOps/s | 23.3526 KOps/s | |
test_set_shared | 0.3397ms | 51.7519μs | 19.3230 KOps/s | 19.8273 KOps/s | |
test_update | 0.1095ms | 50.7977μs | 19.6859 KOps/s | 18.6930 KOps/s | |
test_update_nested | 0.2026ms | 58.6591μs | 17.0477 KOps/s | 16.9198 KOps/s | |
test_update__nested | 0.2111ms | 60.5596μs | 16.5127 KOps/s | 17.1204 KOps/s | |
test_set_nested | 0.1739ms | 44.4377μs | 22.5034 KOps/s | 22.3827 KOps/s | |
test_set_nested_new | 84.6710μs | 47.8801μs | 20.8855 KOps/s | 21.0489 KOps/s | |
test_select | 0.1021ms | 60.8819μs | 16.4252 KOps/s | 16.4293 KOps/s | |
test_select_nested | 0.4501ms | 43.4305μs | 23.0253 KOps/s | 23.6350 KOps/s | |
test_exclude_nested | 0.1248ms | 59.2919μs | 16.8657 KOps/s | 16.6970 KOps/s | |
test_empty[True] | 0.3673ms | 0.2445ms | 4.0898 KOps/s | 4.1187 KOps/s | |
test_empty[False] | 3.5930μs | 0.7423μs | 1.3471 MOps/s | 1.3647 MOps/s | |
test_to | 62.0210μs | 26.0213μs | 38.4300 KOps/s | 40.2871 KOps/s | |
test_to_nonblocking | 53.0710μs | 26.3802μs | 37.9071 KOps/s | 43.0663 KOps/s | |
test_unbind_speed | 1.7698ms | 0.2974ms | 3.3630 KOps/s | 3.5503 KOps/s | |
test_unbind_speed_stack0 | 0.3252ms | 0.2763ms | 3.6197 KOps/s | 3.6389 KOps/s | |
test_unbind_speed_stack1 | 94.1723ms | 0.7041ms | 1.4203 KOps/s | 1.4039 KOps/s | |
test_split | 96.7237ms | 2.2542ms | 443.6112 Ops/s | 465.8515 Ops/s | |
test_chunk | 97.4661ms | 2.2430ms | 445.8298 Ops/s | 462.0457 Ops/s | |
test_creation[device0] | 0.3409ms | 0.1273ms | 7.8574 KOps/s | 7.9188 KOps/s | |
test_creation_from_tensor | 0.3826ms | 0.1337ms | 7.4779 KOps/s | 7.5516 KOps/s | |
test_add_one[memmap_tensor0] | 0.2327ms | 9.8677μs | 101.3407 KOps/s | 111.2682 KOps/s | |
test_contiguous[memmap_tensor0] | 23.2100μs | 2.2040μs | 453.7240 KOps/s | 455.4151 KOps/s | |
test_stack[memmap_tensor0] | 34.7610μs | 7.0582μs | 141.6783 KOps/s | 144.7248 KOps/s | |
test_memmaptd_index | 1.3276ms | 0.4384ms | 2.2809 KOps/s | 2.3274 KOps/s | |
test_memmaptd_index_astensor | 0.7465ms | 0.4927ms | 2.0296 KOps/s | 2.0587 KOps/s | |
test_memmaptd_index_op | 1.4481ms | 1.0527ms | 949.9037 Ops/s | 950.0023 Ops/s | |
test_serialize_model | 0.1311s | 0.1294s | 7.7299 Ops/s | 7.7315 Ops/s | |
test_serialize_model_pickle | 1.3481s | 1.2118s | 0.8252 Ops/s | 0.8231 Ops/s | |
test_serialize_weights | 0.1297s | 0.1283s | 7.7959 Ops/s | 7.7626 Ops/s | |
test_serialize_weights_returnearly | 0.2206s | 64.3938ms | 15.5294 Ops/s | 16.0887 Ops/s | |
test_serialize_weights_pickle | 1.3737s | 1.2170s | 0.8217 Ops/s | 0.8213 Ops/s | |
test_reshape_pytree | 69.7510μs | 37.1764μs | 26.8988 KOps/s | 27.2029 KOps/s | |
test_reshape_td | 0.1616ms | 43.0762μs | 23.2147 KOps/s | 23.6890 KOps/s | |
test_view_pytree | 82.2220μs | 36.4968μs | 27.3996 KOps/s | 27.8826 KOps/s | |
test_view_td | 87.4820μs | 47.2080μs | 21.1828 KOps/s | 21.1943 KOps/s | |
test_unbind_pytree | 61.3310μs | 35.3626μs | 28.2785 KOps/s | 28.7127 KOps/s | |
test_unbind_td | 0.5218ms | 43.1619μs | 23.1686 KOps/s | 23.1658 KOps/s | |
test_split_pytree | 76.1210μs | 48.2659μs | 20.7186 KOps/s | 21.0860 KOps/s | |
test_split_td | 0.6473ms | 59.1935μs | 16.8937 KOps/s | 17.6866 KOps/s | |
test_add_pytree | 0.2049ms | 60.6455μs | 16.4893 KOps/s | 17.7031 KOps/s | |
test_add_td | 0.2411ms | 97.2584μs | 10.2819 KOps/s | 10.7858 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.4148ms | 0.2138ms | 4.6762 KOps/s | 4.6556 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.2986ms | 0.1507ms | 6.6339 KOps/s | 6.5397 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1977ms | 0.1546ms | 6.4673 KOps/s | 6.8193 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2846ms | 0.2043ms | 4.8953 KOps/s | 5.5116 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 68.0110μs | 20.9033μs | 47.8393 KOps/s | 45.0902 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 84.6220μs | 44.4552μs | 22.4945 KOps/s | 22.6616 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1175ms | 65.2214μs | 15.3324 KOps/s | 15.7068 KOps/s | |
test_compile_copy_nested[pytree-eager] | 90.7720μs | 49.6231μs | 20.1519 KOps/s | 20.0575 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.4328ms | 0.3223ms | 3.1032 KOps/s | 3.0978 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.5906ms | 0.2094ms | 4.7748 KOps/s | 4.8104 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.2785ms | 0.1298ms | 7.7027 KOps/s | 7.7021 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.2440ms | 61.7355μs | 16.1981 KOps/s | 16.5967 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.4492ms | 0.3211ms | 3.1140 KOps/s | 3.0978 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.9017ms | 0.6765ms | 1.4781 KOps/s | 1.6232 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3903ms | 0.2494ms | 4.0104 KOps/s | 4.0123 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.4692ms | 0.3230ms | 3.0961 KOps/s | 3.0762 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.4765ms | 72.9181μs | 13.7140 KOps/s | 13.5597 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.5581ms | 0.1365ms | 7.3241 KOps/s | 7.6072 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.9360ms | 0.5473ms | 1.8271 KOps/s | 1.8809 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.4697ms | 0.3235ms | 3.0915 KOps/s | 3.0892 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 57.7910μs | 18.7646μs | 53.2918 KOps/s | 56.4568 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 69.1820μs | 26.9468μs | 37.1102 KOps/s | 36.4072 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1063ms | 69.2121μs | 14.4483 KOps/s | 14.3336 KOps/s | |
test_compile_copy_flat[pytree-eager] | 81.6320μs | 51.0908μs | 19.5730 KOps/s | 19.4631 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 2.3416ms | 0.8204ms | 1.2189 KOps/s | 1.1295 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 3.3832ms | 3.2032ms | 312.1835 Ops/s | 314.9462 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 2.3014ms | 0.8060ms | 1.2407 KOps/s | 1.1420 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 3.4686ms | 3.2579ms | 306.9438 Ops/s | 311.7310 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.1612ms | 0.1127ms | 8.8737 KOps/s | 9.0106 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.1889ms | 61.6349μs | 16.2246 KOps/s | 15.5886 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.2488ms | 0.1054ms | 9.4883 KOps/s | 9.1436 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 93.9620μs | 44.3301μs | 22.5580 KOps/s | 21.7480 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.2571ms | 0.1096ms | 9.1261 KOps/s | 9.0749 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 78.9620μs | 43.8334μs | 22.8136 KOps/s | 21.4394 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.2912ms | 0.1403ms | 7.1285 KOps/s | 7.2080 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1581ms | 26.2214μs | 38.1368 KOps/s | 38.5504 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1778ms | 0.1331ms | 7.5107 KOps/s | 7.5282 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 59.6210μs | 21.4153μs | 46.6957 KOps/s | 48.4160 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.2633ms | 0.1335ms | 7.4921 KOps/s | 7.3643 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 55.2810μs | 21.5304μs | 46.4459 KOps/s | 47.0998 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.2799ms | 0.1409ms | 7.0984 KOps/s | 6.8085 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.5081ms | 25.6873μs | 38.9298 KOps/s | 37.9182 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.2848ms | 0.1346ms | 7.4305 KOps/s | 7.2342 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 0.1941ms | 22.2432μs | 44.9576 KOps/s | 48.8155 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1796ms | 0.1351ms | 7.4042 KOps/s | 7.4600 KOps/s | |
test_compile_indexing[int-pytree-eager] | 59.0810μs | 22.4547μs | 44.5342 KOps/s | 48.9108 KOps/s | |
test_mod_add[eager] | 70.5210μs | 32.9043μs | 30.3911 KOps/s | 30.5484 KOps/s | |
test_mod_add[compile] | 0.3747ms | 72.2202μs | 13.8465 KOps/s | 14.1149 KOps/s | |
test_mod_add[compile-overhead] | 0.2628ms | 0.1375ms | 7.2702 KOps/s | 6.5803 KOps/s | |
test_mod_wrap[eager] | 0.4070ms | 0.2456ms | 4.0723 KOps/s | 4.0561 KOps/s | |
test_mod_wrap[compile] | 0.8695ms | 0.2970ms | 3.3672 KOps/s | 3.3427 KOps/s | |
test_mod_wrap[compile-overhead] | 7.4717ms | 4.0630ms | 246.1245 Ops/s | 248.0468 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.6141ms | 1.4520ms | 688.6942 Ops/s | 689.0922 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.7540ms | 1.4411ms | 693.9028 Ops/s | 699.1749 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.6645ms | 1.0456ms | 956.3751 Ops/s | 981.1245 Ops/s | |
test_seq_add[eager] | 0.2476ms | 98.2997μs | 10.1730 KOps/s | 10.0829 KOps/s | |
test_seq_add[compile] | 0.2221ms | 82.7330μs | 12.0871 KOps/s | 12.2765 KOps/s | |
test_seq_add[compile-overhead] | 0.1594ms | 0.1159ms | 8.6296 KOps/s | 8.5799 KOps/s | |
test_seq_wrap[eager] | 0.5509ms | 0.3955ms | 2.5287 KOps/s | 2.5665 KOps/s | |
test_seq_wrap[compile] | 0.4181ms | 0.3239ms | 3.0875 KOps/s | 3.1566 KOps/s | |
test_seq_wrap[compile-overhead] | 0.3693ms | 0.2238ms | 4.4689 KOps/s | 4.5033 KOps/s | |
test_func_call_runtime[False-eager] | 0.8636ms | 0.7473ms | 1.3382 KOps/s | 1.3276 KOps/s | |
test_func_call_runtime[False-compile] | 0.8905ms | 0.8028ms | 1.2456 KOps/s | 1.2636 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.4295ms | 0.3650ms | 2.7396 KOps/s | 2.7526 KOps/s | |
test_func_call_runtime[True-eager] | 1.0574ms | 0.9200ms | 1.0870 KOps/s | 1.1054 KOps/s | |
test_func_call_runtime[True-compile] | 1.0381ms | 0.8449ms | 1.1836 KOps/s | 1.2053 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.4772ms | 0.4027ms | 2.4835 KOps/s | 2.5127 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.8229ms | 0.7418ms | 1.3481 KOps/s | 1.2712 KOps/s | |
test_func_call_cm_runtime[False-compile] | 1.1151ms | 0.8046ms | 1.2428 KOps/s | 1.2577 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.4818ms | 0.3679ms | 2.7185 KOps/s | 2.7549 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.1552ms | 1.0082ms | 991.8465 Ops/s | 987.9953 Ops/s | |
test_func_call_cm_runtime[True-compile] | 1.0247ms | 0.8687ms | 1.1511 KOps/s | 1.1705 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.5545ms | 0.4254ms | 2.3506 KOps/s | 2.3663 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.5585ms | 2.1027ms | 475.5772 Ops/s | 478.8541 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 1.0714ms | 0.8879ms | 1.1262 KOps/s | 1.1293 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.4738ms | 0.4300ms | 2.3256 KOps/s | 2.3304 KOps/s | |
test_distributed | 3.5515ms | 0.2154ms | 4.6433 KOps/s | 8.2768 KOps/s | |
test_tdmodule | 52.1210μs | 15.2162μs | 65.7196 KOps/s | 59.5100 KOps/s | |
test_tdmodule_dispatch | 51.4510μs | 29.2003μs | 34.2462 KOps/s | 31.3541 KOps/s | |
test_tdseq | 24.4200μs | 15.7007μs | 63.6916 KOps/s | 59.1204 KOps/s | |
test_tdseq_dispatch | 54.8810μs | 31.4402μs | 31.8064 KOps/s | 28.9895 KOps/s | |
test_instantiation_functorch | 1.9888ms | 1.8889ms | 529.4212 Ops/s | 527.4011 Ops/s | |
test_instantiation_td | 1.8058ms | 1.2044ms | 830.2943 Ops/s | 824.7579 Ops/s | |
test_exec_functorch | 0.2410ms | 0.2132ms | 4.6898 KOps/s | 4.6567 KOps/s | |
test_exec_functional_call | 0.2830ms | 0.2128ms | 4.6984 KOps/s | 4.7788 KOps/s | |
test_exec_td | 0.2718ms | 0.2202ms | 4.5419 KOps/s | 4.6411 KOps/s | |
test_exec_td_decorator | 1.0285ms | 0.2598ms | 3.8498 KOps/s | 3.8711 KOps/s | |
test_vmap_mlp_speed[True-True] | 0.8197ms | 0.6866ms | 1.4565 KOps/s | 1.4387 KOps/s | |
test_vmap_mlp_speed[True-False] | 0.8615ms | 0.6958ms | 1.4371 KOps/s | 1.4534 KOps/s | |
test_vmap_mlp_speed[False-True] | 0.7456ms | 0.6058ms | 1.6507 KOps/s | 1.7342 KOps/s | |
test_vmap_mlp_speed[False-False] | 0.7137ms | 0.5933ms | 1.6854 KOps/s | 1.7308 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.7988ms | 0.6754ms | 1.4806 KOps/s | 1.4752 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 1.1112ms | 0.6726ms | 1.4869 KOps/s | 1.4715 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7674ms | 0.5979ms | 1.6724 KOps/s | 1.6899 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7207ms | 0.6040ms | 1.6556 KOps/s | 1.6943 KOps/s | |
test_vmap_transformer_speed[True-True] | 8.6012ms | 8.4465ms | 118.3928 Ops/s | 117.9423 Ops/s | |
test_vmap_transformer_speed[True-False] | 8.7240ms | 8.4323ms | 118.5911 Ops/s | 118.0640 Ops/s | |
test_vmap_transformer_speed[False-True] | 8.3257ms | 8.2074ms | 121.8419 Ops/s | 120.3886 Ops/s | |
test_vmap_transformer_speed[False-False] | 8.5697ms | 8.2636ms | 121.0130 Ops/s | 121.4458 Ops/s | |
test_vmap_transformer_speed_decorator[True-True] | 20.4734ms | 19.8176ms | 50.4603 Ops/s | 50.5923 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 19.7923ms | 19.7156ms | 50.7212 Ops/s | 50.7216 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 19.7281ms | 19.5747ms | 51.0863 Ops/s | 51.2233 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 20.3832ms | 19.6111ms | 50.9914 Ops/s | 51.0252 Ops/s | |
test_to_module_speed[True] | 2.0264ms | 0.9530ms | 1.0493 KOps/s | 1.0417 KOps/s | |
test_to_module_speed[False] | 1.0449ms | 0.9362ms | 1.0681 KOps/s | 1.0753 KOps/s | |
test_tc_init | 63.4510μs | 34.4191μs | 29.0536 KOps/s | 26.8758 KOps/s | |
test_tc_init_nested | 0.1112ms | 68.6681μs | 14.5628 KOps/s | 13.4346 KOps/s | |
test_tc_first_layer_tensor | 3.7043μs | 0.6898μs | 1.4498 MOps/s | 1.4466 MOps/s | |
test_tc_first_layer_nontensor | 20.3300μs | 2.2984μs | 435.0936 KOps/s | 438.2595 KOps/s | |
test_tc_second_layer_tensor | 30.2180μs | 1.4068μs | 710.8160 KOps/s | 710.1403 KOps/s | |
test_tc_second_layer_nontensor | 31.9610μs | 2.9607μs | 337.7533 KOps/s | 332.1934 KOps/s | |
test_unbind | 0.1973s | 12.4247ms | 80.4849 Ops/s | 90.2867 Ops/s | |
test_full_like | 0.7253ms | 0.5725ms | 1.7468 KOps/s | 1.7382 KOps/s | |
test_zeros_like | 0.2860ms | 0.1978ms | 5.0563 KOps/s | 5.0479 KOps/s | |
test_ones_like | 0.2337ms | 0.1976ms | 5.0604 KOps/s | 5.0567 KOps/s | |
test_clone | 0.5645ms | 0.4142ms | 2.4140 KOps/s | 2.4095 KOps/s | |
test_squeeze | 0.1394ms | 9.9153μs | 100.8546 KOps/s | 101.4435 KOps/s | |
test_unsqueeze | 0.2243ms | 74.8127μs | 13.3667 KOps/s | 13.5164 KOps/s | |
test_split | 0.4250ms | 0.1591ms | 6.2845 KOps/s | 6.4024 KOps/s | |
test_permute | 0.2688ms | 0.1846ms | 5.4157 KOps/s | 5.6204 KOps/s | |
test_stack | 1.2759ms | 0.8576ms | 1.1660 KOps/s | 1.1773 KOps/s | |
test_cat | 1.2700ms | 1.2317ms | 811.9148 Ops/s | 811.7354 Ops/s |
Labels
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Stack from ghstack (oldest at bottom):