-
Notifications
You must be signed in to change notification settings - Fork 79
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BugFix] select_out_keys for Prob sequential #1103
Merged
Merged
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
vmoens
added a commit
that referenced
this pull request
Nov 22, 2024
ghstack-source-id: fc4868eab4153c5a2df9cbda2d7ec1c08a7ed377 Pull Request resolved: #1103
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 32.3110μs | 17.3479μs | 57.6440 KOps/s | 57.1023 KOps/s | |
test_plain_set_stack_nested | 45.8160μs | 18.0353μs | 55.4468 KOps/s | 56.3722 KOps/s | |
test_plain_set_nested_inplace | 60.8740μs | 19.5831μs | 51.0645 KOps/s | 51.8488 KOps/s | |
test_plain_set_stack_nested_inplace | 69.0990μs | 19.4454μs | 51.4262 KOps/s | 51.7727 KOps/s | |
test_items | 22.7330μs | 4.1281μs | 242.2427 KOps/s | 238.0223 KOps/s | |
test_items_nested | 0.4292ms | 0.3456ms | 2.8934 KOps/s | 2.9296 KOps/s | |
test_items_nested_locked | 0.6053ms | 0.3470ms | 2.8820 KOps/s | 2.9196 KOps/s | |
test_items_nested_leaf | 0.1217ms | 71.5122μs | 13.9836 KOps/s | 14.1453 KOps/s | |
test_items_stack_nested | 0.4038ms | 0.3495ms | 2.8614 KOps/s | 2.8975 KOps/s | |
test_items_stack_nested_leaf | 0.1448ms | 74.1514μs | 13.4859 KOps/s | 13.5657 KOps/s | |
test_items_stack_nested_locked | 0.5582ms | 0.3474ms | 2.8785 KOps/s | 2.8690 KOps/s | |
test_keys | 33.2220μs | 3.4886μs | 286.6491 KOps/s | 285.2909 KOps/s | |
test_keys_nested | 0.2603ms | 0.1376ms | 7.2678 KOps/s | 7.4014 KOps/s | |
test_keys_nested_locked | 1.9458ms | 0.1415ms | 7.0688 KOps/s | 7.0585 KOps/s | |
test_keys_nested_leaf | 0.2392ms | 0.1167ms | 8.5667 KOps/s | 8.5216 KOps/s | |
test_keys_stack_nested | 0.2240ms | 0.1369ms | 7.3035 KOps/s | 7.3084 KOps/s | |
test_keys_stack_nested_leaf | 0.1714ms | 0.1171ms | 8.5408 KOps/s | 8.5010 KOps/s | |
test_keys_stack_nested_locked | 0.2761ms | 0.1418ms | 7.0524 KOps/s | 7.0321 KOps/s | |
test_values | 6.9110μs | 1.0307μs | 970.2143 KOps/s | 953.6190 KOps/s | |
test_values_nested | 0.1079ms | 54.7850μs | 18.2532 KOps/s | 18.3544 KOps/s | |
test_values_nested_locked | 0.1177ms | 55.3582μs | 18.0642 KOps/s | 18.2408 KOps/s | |
test_values_nested_leaf | 0.1534ms | 59.3564μs | 16.8474 KOps/s | 16.6743 KOps/s | |
test_values_stack_nested | 0.1037ms | 56.0683μs | 17.8354 KOps/s | 18.0357 KOps/s | |
test_values_stack_nested_leaf | 0.1212ms | 60.7833μs | 16.4519 KOps/s | 16.5090 KOps/s | |
test_values_stack_nested_locked | 0.1151ms | 56.3557μs | 17.7444 KOps/s | 17.9323 KOps/s | |
test_membership | 4.8863μs | 0.7664μs | 1.3048 MOps/s | 1.4025 MOps/s | |
test_membership_nested | 29.1650μs | 2.7567μs | 362.7554 KOps/s | 367.9181 KOps/s | |
test_membership_nested_leaf | 29.8760μs | 2.8225μs | 354.2998 KOps/s | 366.1705 KOps/s | |
test_membership_stacked_nested | 30.5570μs | 2.7352μs | 365.6096 KOps/s | 373.6079 KOps/s | |
test_membership_stacked_nested_leaf | 49.6430μs | 2.7703μs | 360.9754 KOps/s | 370.2828 KOps/s | |
test_membership_nested_last | 31.3590μs | 4.0374μs | 247.6821 KOps/s | 248.2188 KOps/s | |
test_membership_nested_leaf_last | 32.8410μs | 4.0932μs | 244.3100 KOps/s | 245.2776 KOps/s | |
test_membership_stacked_nested_last | 33.7730μs | 4.0750μs | 245.3988 KOps/s | 252.2685 KOps/s | |
test_membership_stacked_nested_leaf_last | 46.3570μs | 4.1075μs | 243.4576 KOps/s | 244.6066 KOps/s | |
test_nested_getleaf | 39.7750μs | 10.9783μs | 91.0887 KOps/s | 92.8976 KOps/s | |
test_nested_get | 40.3760μs | 10.2269μs | 97.7818 KOps/s | 98.6833 KOps/s | |
test_stacked_getleaf | 41.7390μs | 10.8252μs | 92.3770 KOps/s | 94.5397 KOps/s | |
test_stacked_get | 49.1220μs | 10.2685μs | 97.3855 KOps/s | 98.6733 KOps/s | |
test_nested_getitemleaf | 38.0710μs | 11.2391μs | 88.9748 KOps/s | 90.6488 KOps/s | |
test_nested_getitem | 57.3280μs | 10.4744μs | 95.4708 KOps/s | 98.5424 KOps/s | |
test_stacked_getitemleaf | 71.5940μs | 10.8999μs | 91.7436 KOps/s | 90.2474 KOps/s | |
test_stacked_getitem | 37.8410μs | 10.3358μs | 96.7509 KOps/s | 97.0833 KOps/s | |
test_lock_nested | 2.8479ms | 0.4558ms | 2.1938 KOps/s | 2.2651 KOps/s | |
test_lock_stack_nested | 0.6901ms | 0.4173ms | 2.3964 KOps/s | 2.4372 KOps/s | |
test_unlock_nested | 0.8101ms | 0.3676ms | 2.7201 KOps/s | 2.6819 KOps/s | |
test_unlock_stack_nested | 0.5199ms | 0.3364ms | 2.9723 KOps/s | 3.0088 KOps/s | |
test_flatten_speed | 0.1580ms | 91.6496μs | 10.9111 KOps/s | 11.1302 KOps/s | |
test_unflatten_speed | 0.8746ms | 0.4752ms | 2.1042 KOps/s | 2.0677 KOps/s | |
test_common_ops | 4.1297ms | 0.7743ms | 1.2915 KOps/s | 1.3403 KOps/s | |
test_creation | 23.3840μs | 2.1026μs | 475.5939 KOps/s | 477.4025 KOps/s | |
test_creation_empty | 33.3530μs | 10.3727μs | 96.4067 KOps/s | 105.2997 KOps/s | |
test_creation_nested_1 | 1.2526ms | 13.2475μs | 75.4860 KOps/s | 81.1270 KOps/s | |
test_creation_nested_2 | 56.0050μs | 17.2080μs | 58.1125 KOps/s | 60.4695 KOps/s | |
test_clone | 0.1397ms | 13.0528μs | 76.6117 KOps/s | 79.1631 KOps/s | |
test_getitem[int] | 0.7819ms | 12.8968μs | 77.5386 KOps/s | 81.6404 KOps/s | |
test_getitem[slice_int] | 0.1462ms | 24.6929μs | 40.4974 KOps/s | 44.1075 KOps/s | |
test_getitem[range] | 0.2480ms | 50.5019μs | 19.8012 KOps/s | 20.7851 KOps/s | |
test_getitem[tuple] | 0.1269ms | 20.5027μs | 48.7742 KOps/s | 51.0078 KOps/s | |
test_getitem[list] | 0.2248ms | 45.0789μs | 22.1833 KOps/s | 23.0812 KOps/s | |
test_setitem_dim[int] | 52.9100μs | 27.2346μs | 36.7181 KOps/s | 40.9042 KOps/s | |
test_setitem_dim[slice_int] | 0.1164ms | 55.7554μs | 17.9355 KOps/s | 19.8156 KOps/s | |
test_setitem_dim[range] | 0.1212ms | 74.5165μs | 13.4198 KOps/s | 13.4798 KOps/s | |
test_setitem_dim[tuple] | 0.1025ms | 42.4246μs | 23.5712 KOps/s | 25.2379 KOps/s | |
test_setitem | 76.5330μs | 19.6116μs | 50.9904 KOps/s | 53.0372 KOps/s | |
test_set | 0.1371ms | 19.4080μs | 51.5253 KOps/s | 54.3913 KOps/s | |
test_set_shared | 1.1729ms | 0.1688ms | 5.9257 KOps/s | 6.0086 KOps/s | |
test_update | 0.2212ms | 21.7689μs | 45.9370 KOps/s | 49.0469 KOps/s | |
test_update_nested | 0.1068ms | 31.9084μs | 31.3397 KOps/s | 33.4903 KOps/s | |
test_update__nested | 0.5186ms | 32.6552μs | 30.6230 KOps/s | 31.4231 KOps/s | |
test_set_nested | 97.2920μs | 21.4150μs | 46.6962 KOps/s | 48.7241 KOps/s | |
test_set_nested_new | 93.7650μs | 26.0730μs | 38.3538 KOps/s | 39.9530 KOps/s | |
test_select | 0.1474ms | 42.0244μs | 23.7957 KOps/s | 24.8398 KOps/s | |
test_select_nested | 0.1146ms | 59.6663μs | 16.7599 KOps/s | 17.2092 KOps/s | |
test_exclude_nested | 0.1138ms | 74.8725μs | 13.3560 KOps/s | 13.3650 KOps/s | |
test_empty[True] | 0.6548ms | 0.3516ms | 2.8441 KOps/s | 2.8719 KOps/s | |
test_empty[False] | 11.1910μs | 1.2510μs | 799.3329 KOps/s | 820.1727 KOps/s | |
test_unbind_speed | 0.5906ms | 0.2640ms | 3.7882 KOps/s | 3.8413 KOps/s | |
test_unbind_speed_stack0 | 0.5000ms | 0.2620ms | 3.8174 KOps/s | 3.8940 KOps/s | |
test_unbind_speed_stack1 | 0.1069s | 0.7846ms | 1.2746 KOps/s | 1.4095 KOps/s | |
test_split | 0.1060s | 1.7464ms | 572.6140 Ops/s | 584.1621 Ops/s | |
test_chunk | 0.1101s | 1.7628ms | 567.2686 Ops/s | 585.4421 Ops/s | |
test_consolidate_njt[False-None] | 8.9875ms | 8.3498ms | 119.7639 Ops/s | 123.9488 Ops/s | |
test_creation[device0] | 3.4660ms | 95.6332μs | 10.4566 KOps/s | 10.7706 KOps/s | |
test_creation_from_tensor | 0.3113ms | 96.5410μs | 10.3583 KOps/s | 10.5986 KOps/s | |
test_add_one[memmap_tensor0] | 0.2721ms | 5.1090μs | 195.7328 KOps/s | 202.6682 KOps/s | |
test_contiguous[memmap_tensor0] | 16.6910μs | 0.5085μs | 1.9667 MOps/s | 1.9323 MOps/s | |
test_stack[memmap_tensor0] | 45.5450μs | 3.4863μs | 286.8374 KOps/s | 297.2361 KOps/s | |
test_memmaptd_index | 1.0473ms | 0.2436ms | 4.1056 KOps/s | 4.2906 KOps/s | |
test_memmaptd_index_astensor | 0.6648ms | 0.3235ms | 3.0907 KOps/s | 3.1652 KOps/s | |
test_memmaptd_index_op | 1.0070ms | 0.6000ms | 1.6667 KOps/s | 1.7762 KOps/s | |
test_serialize_model | 0.1271s | 0.1194s | 8.3784 Ops/s | 7.3560 Ops/s | |
test_serialize_model_pickle | 0.4434s | 0.3904s | 2.5613 Ops/s | 2.4870 Ops/s | |
test_serialize_weights | 0.1214s | 0.1143s | 8.7498 Ops/s | 8.6854 Ops/s | |
test_serialize_weights_returnearly | 0.1682s | 0.1592s | 6.2818 Ops/s | 6.2167 Ops/s | |
test_serialize_weights_pickle | 0.6402s | 0.4902s | 2.0399 Ops/s | 2.5706 Ops/s | |
test_serialize_weights_filesystem | 0.1560s | 0.1434s | 6.9740 Ops/s | 6.8921 Ops/s | |
test_serialize_model_filesystem | 0.1690s | 0.1469s | 6.8073 Ops/s | 6.6175 Ops/s | |
test_reshape_pytree | 72.6160μs | 27.0898μs | 36.9142 KOps/s | 37.8091 KOps/s | |
test_reshape_td | 89.2880μs | 33.8179μs | 29.5701 KOps/s | 32.1012 KOps/s | |
test_view_pytree | 72.8860μs | 26.8187μs | 37.2873 KOps/s | 37.8218 KOps/s | |
test_view_td | 87.0240μs | 38.0932μs | 26.2514 KOps/s | 26.8866 KOps/s | |
test_unbind_pytree | 66.2440μs | 30.2798μs | 33.0253 KOps/s | 34.4440 KOps/s | |
test_unbind_td | 0.3398ms | 39.2204μs | 25.4969 KOps/s | 26.2799 KOps/s | |
test_split_pytree | 80.0000μs | 29.6696μs | 33.7045 KOps/s | 34.0655 KOps/s | |
test_split_td | 0.1999ms | 44.7705μs | 22.3361 KOps/s | 23.5823 KOps/s | |
test_add_pytree | 85.6410μs | 36.4245μs | 27.4541 KOps/s | 27.9030 KOps/s | |
test_add_td | 0.1749ms | 54.4902μs | 18.3519 KOps/s | 18.7403 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1477ms | 62.7537μs | 15.9353 KOps/s | 16.2917 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.4061ms | 0.1643ms | 6.0869 KOps/s | 6.1697 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1100ms | 46.1818μs | 21.6535 KOps/s | 22.0789 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2624ms | 0.1206ms | 8.2913 KOps/s | 7.9767 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 65.2920μs | 25.7893μs | 38.7758 KOps/s | 39.0551 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1277ms | 53.2593μs | 18.7761 KOps/s | 18.6206 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1708ms | 80.9688μs | 12.3504 KOps/s | 12.6003 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1332ms | 68.7428μs | 14.5470 KOps/s | 14.8719 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.1979ms | 0.1056ms | 9.4694 KOps/s | 9.5712 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3470ms | 0.2034ms | 4.9171 KOps/s | 4.9523 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1106ms | 45.8949μs | 21.7889 KOps/s | 22.2308 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.4848ms | 63.2218μs | 15.8173 KOps/s | 16.2907 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.1842ms | 0.1041ms | 9.6075 KOps/s | 9.7473 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.3968ms | 0.2038ms | 4.9072 KOps/s | 5.0046 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3674ms | 0.2147ms | 4.6584 KOps/s | 4.6894 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2389ms | 0.1112ms | 8.9901 KOps/s | 9.6015 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.3053ms | 55.7501μs | 17.9372 KOps/s | 18.9056 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1562ms | 46.9720μs | 21.2893 KOps/s | 21.8918 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 1.3103ms | 0.1617ms | 6.1856 KOps/s | 6.3188 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.1968ms | 0.1037ms | 9.6417 KOps/s | 9.8434 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 60.2530μs | 22.2509μs | 44.9420 KOps/s | 46.9586 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1369ms | 59.4219μs | 16.8288 KOps/s | 17.0519 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.2130ms | 81.9176μs | 12.2074 KOps/s | 12.4947 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1053ms | 69.3795μs | 14.4135 KOps/s | 14.8563 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.4556ms | 0.2088ms | 4.7884 KOps/s | 4.8225 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.3028ms | 1.3057ms | 765.8573 Ops/s | 788.1997 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.4359ms | 0.2074ms | 4.8224 KOps/s | 4.9781 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 1.3222ms | 0.7891ms | 1.2673 KOps/s | 1.2996 KOps/s | |
test_compile_assign_and_add_stack[compile] | 0.8182ms | 0.4644ms | 2.1532 KOps/s | 2.2328 KOps/s | |
test_compile_assign_and_add_stack[eager] | 4.0396ms | 2.6825ms | 372.7835 Ops/s | 395.7572 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 95.6390μs | 36.2389μs | 27.5946 KOps/s | 27.5010 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.4674ms | 33.9015μs | 29.4972 KOps/s | 30.0841 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 85.8970μs | 29.8345μs | 33.5182 KOps/s | 34.9232 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 70.6570μs | 23.7773μs | 42.0570 KOps/s | 42.2616 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 89.5890μs | 30.8022μs | 32.4653 KOps/s | 33.5459 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 83.8480μs | 24.0165μs | 41.6380 KOps/s | 43.4372 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1051ms | 52.5357μs | 19.0347 KOps/s | 19.9559 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.5777ms | 19.8656μs | 50.3382 KOps/s | 52.4258 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1107ms | 45.1225μs | 22.1619 KOps/s | 23.2562 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 67.2560μs | 19.2437μs | 51.9650 KOps/s | 54.2391 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1123ms | 45.8314μs | 21.8191 KOps/s | 22.6755 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 82.4590μs | 18.8207μs | 53.1330 KOps/s | 54.2135 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1306ms | 53.7903μs | 18.5907 KOps/s | 19.1207 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.9403ms | 19.7245μs | 50.6984 KOps/s | 52.1660 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1203ms | 46.2151μs | 21.6380 KOps/s | 22.8509 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 64.0010μs | 18.9126μs | 52.8749 KOps/s | 53.6867 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1315ms | 46.1258μs | 21.6798 KOps/s | 22.7404 KOps/s | |
test_compile_indexing[int-pytree-eager] | 70.1320μs | 19.0443μs | 52.5093 KOps/s | 54.5637 KOps/s | |
test_mod_add[eager] | 73.4880μs | 26.5046μs | 37.7293 KOps/s | 38.6442 KOps/s | |
test_mod_add[compile] | 0.1003ms | 47.1180μs | 21.2233 KOps/s | 22.5871 KOps/s | |
test_mod_add[compile-overhead] | 0.1460ms | 46.9661μs | 21.2920 KOps/s | 22.6658 KOps/s | |
test_mod_wrap[eager] | 0.4212ms | 0.2223ms | 4.4982 KOps/s | 4.8116 KOps/s | |
test_mod_wrap[compile] | 1.8326ms | 0.2110ms | 4.7389 KOps/s | 4.8109 KOps/s | |
test_mod_wrap[compile-overhead] | 1.8385ms | 0.2075ms | 4.8202 KOps/s | 4.9490 KOps/s | |
test_mod_wrap_and_backward[eager] | 13.8486ms | 11.8630ms | 84.2957 Ops/s | 93.4573 Ops/s | |
test_mod_wrap_and_backward[compile] | 19.2123ms | 12.7674ms | 78.3244 Ops/s | 86.4500 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 16.1457ms | 13.5113ms | 74.0121 Ops/s | 83.7396 Ops/s | |
test_seq_add[eager] | 0.1667ms | 94.1373μs | 10.6228 KOps/s | 11.2343 KOps/s | |
test_seq_add[compile] | 0.1683ms | 63.4870μs | 15.7513 KOps/s | 16.7482 KOps/s | |
test_seq_add[compile-overhead] | 0.1633ms | 60.5125μs | 16.5255 KOps/s | 17.1960 KOps/s | |
test_seq_wrap[eager] | 0.6039ms | 0.4004ms | 2.4973 KOps/s | 2.6055 KOps/s | |
test_seq_wrap[compile] | 0.4194ms | 0.2338ms | 4.2771 KOps/s | 4.3910 KOps/s | |
test_seq_wrap[compile-overhead] | 0.4071ms | 0.2330ms | 4.2913 KOps/s | 4.4039 KOps/s | |
test_func_call_runtime[False-eager] | 0.9416ms | 0.5736ms | 1.7432 KOps/s | 1.8588 KOps/s | |
test_func_call_runtime[False-compile] | 0.6063ms | 0.4376ms | 2.2849 KOps/s | 2.3516 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.7163ms | 0.4330ms | 2.3097 KOps/s | 2.3444 KOps/s | |
test_func_call_runtime[True-eager] | 1.3061ms | 0.7886ms | 1.2680 KOps/s | 1.3507 KOps/s | |
test_func_call_runtime[True-compile] | 0.5947ms | 0.4711ms | 2.1227 KOps/s | 2.0936 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.6321ms | 0.4681ms | 2.1363 KOps/s | 2.1016 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.7288ms | 0.5617ms | 1.7802 KOps/s | 1.8422 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.5849ms | 0.4349ms | 2.2992 KOps/s | 2.3107 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.5593ms | 0.4322ms | 2.3135 KOps/s | 2.3200 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.1136ms | 0.9139ms | 1.0942 KOps/s | 1.1134 KOps/s | |
test_func_call_cm_runtime[True-compile] | 0.6064ms | 0.4976ms | 2.0095 KOps/s | 1.9992 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.6023ms | 0.4992ms | 2.0032 KOps/s | 2.0013 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.4801ms | 1.9317ms | 517.6917 Ops/s | 525.9307 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.9709ms | 0.5326ms | 1.8774 KOps/s | 1.9029 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.8413ms | 0.5238ms | 1.9090 KOps/s | 1.9055 KOps/s | |
test_distributed | 0.2840ms | 0.1274ms | 7.8520 KOps/s | 7.7980 KOps/s | |
test_tdmodule | 36.6590μs | 18.6679μs | 53.5679 KOps/s | 55.2453 KOps/s | |
test_tdmodule_dispatch | 68.1080μs | 38.4518μs | 26.0066 KOps/s | 26.6889 KOps/s | |
test_tdseq | 48.9620μs | 21.4415μs | 46.6386 KOps/s | 49.2967 KOps/s | |
test_tdseq_dispatch | 73.2780μs | 44.2060μs | 22.6214 KOps/s | 23.3866 KOps/s | |
test_instantiation_functorch | 2.0094ms | 1.6101ms | 621.0951 Ops/s | 647.4610 Ops/s | |
test_exec_functorch | 0.3780ms | 0.1848ms | 5.4107 KOps/s | 5.5104 KOps/s | |
test_exec_functional_call | 0.2813ms | 0.1761ms | 5.6773 KOps/s | 5.7332 KOps/s | |
test_exec_td_decorator | 0.5201ms | 0.2318ms | 4.3140 KOps/s | 4.3846 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.8412ms | 0.6517ms | 1.5345 KOps/s | 1.5963 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.9370ms | 0.6526ms | 1.5323 KOps/s | 1.5915 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7494ms | 0.5322ms | 1.8791 KOps/s | 1.9389 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.8495ms | 0.5342ms | 1.8721 KOps/s | 1.9251 KOps/s | |
test_to_module_speed[True] | 1.7420ms | 1.3255ms | 754.4572 Ops/s | 785.5193 Ops/s | |
test_to_module_speed[False] | 1.5354ms | 1.2760ms | 783.7237 Ops/s | 796.9698 Ops/s | |
test_tc_init | 79.5000μs | 43.5695μs | 22.9519 KOps/s | 22.4661 KOps/s | |
test_tc_init_nested | 0.1623ms | 88.2609μs | 11.3300 KOps/s | 10.9646 KOps/s | |
test_tc_first_layer_tensor | 20.0770μs | 1.5065μs | 663.7948 KOps/s | 672.3221 KOps/s | |
test_tc_first_layer_nontensor | 51.4800μs | 4.8307μs | 207.0084 KOps/s | 210.1796 KOps/s | |
test_tc_second_layer_tensor | 28.4840μs | 2.7911μs | 358.2876 KOps/s | 364.4651 KOps/s | |
test_tc_second_layer_nontensor | 43.9830μs | 6.1958μs | 161.3991 KOps/s | 164.8824 KOps/s | |
test_unbind | 0.2281s | 13.5853ms | 73.6091 Ops/s | 83.9181 Ops/s | |
test_full_like | 8.6645ms | 7.5581ms | 132.3076 Ops/s | 80.5155 Ops/s | |
test_zeros_like | 3.7804ms | 2.9350ms | 340.7098 Ops/s | 136.7721 Ops/s | |
test_ones_like | 4.3630ms | 3.3143ms | 301.7216 Ops/s | 132.6607 Ops/s | |
test_clone | 6.7077ms | 5.1552ms | 193.9782 Ops/s | 108.1048 Ops/s | |
test_squeeze | 77.4750μs | 12.0818μs | 82.7688 KOps/s | 85.9653 KOps/s | |
test_unsqueeze | 0.1668ms | 92.1394μs | 10.8531 KOps/s | 11.6731 KOps/s | |
test_split | 0.3390ms | 0.1942ms | 5.1492 KOps/s | 5.4477 KOps/s | |
test_permute | 0.3492ms | 0.2203ms | 4.5396 KOps/s | 4.6289 KOps/s | |
test_stack | 29.5379ms | 25.1070ms | 39.8296 Ops/s | 40.1225 Ops/s | |
test_cat | 25.8227ms | 24.9296ms | 40.1130 Ops/s | 39.1932 Ops/s |
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 25.2600μs | 10.4431μs | 95.7566 KOps/s | 95.4027 KOps/s | |
test_plain_set_stack_nested | 31.9910μs | 10.4606μs | 95.5964 KOps/s | 95.1474 KOps/s | |
test_plain_set_nested_inplace | 92.8220μs | 11.3136μs | 88.3890 KOps/s | 88.5546 KOps/s | |
test_plain_set_stack_nested_inplace | 39.4400μs | 11.2414μs | 88.9567 KOps/s | 88.8756 KOps/s | |
test_items | 48.6310μs | 2.8915μs | 345.8467 KOps/s | 344.3042 KOps/s | |
test_items_nested | 0.3743ms | 0.3186ms | 3.1390 KOps/s | 3.1827 KOps/s | |
test_items_nested_locked | 0.3711ms | 0.3202ms | 3.1231 KOps/s | 3.1286 KOps/s | |
test_items_nested_leaf | 81.3210μs | 58.4374μs | 17.1123 KOps/s | 17.1276 KOps/s | |
test_items_stack_nested | 0.3753ms | 0.3192ms | 3.1329 KOps/s | 3.1397 KOps/s | |
test_items_stack_nested_leaf | 0.1035ms | 59.7207μs | 16.7446 KOps/s | 16.9064 KOps/s | |
test_items_stack_nested_locked | 0.3722ms | 0.3226ms | 3.1000 KOps/s | 3.1331 KOps/s | |
test_keys | 29.3900μs | 3.4665μs | 288.4750 KOps/s | 289.5451 KOps/s | |
test_keys_nested | 0.1032ms | 69.5825μs | 14.3714 KOps/s | 14.2364 KOps/s | |
test_keys_nested_locked | 0.6693ms | 75.2019μs | 13.2975 KOps/s | 13.1545 KOps/s | |
test_keys_nested_leaf | 0.2561ms | 60.8786μs | 16.4261 KOps/s | 16.3428 KOps/s | |
test_keys_stack_nested | 0.2766ms | 70.3166μs | 14.2214 KOps/s | 14.1315 KOps/s | |
test_keys_stack_nested_leaf | 0.2546ms | 61.2323μs | 16.3312 KOps/s | 16.0695 KOps/s | |
test_keys_stack_nested_locked | 0.2626ms | 75.2262μs | 13.2932 KOps/s | 13.1865 KOps/s | |
test_values | 32.3187μs | 0.8474μs | 1.1801 MOps/s | 1.1872 MOps/s | |
test_values_nested | 60.0910μs | 31.2241μs | 32.0266 KOps/s | 32.1851 KOps/s | |
test_values_nested_locked | 0.2124ms | 32.7604μs | 30.5247 KOps/s | 30.7875 KOps/s | |
test_values_nested_leaf | 63.4210μs | 33.5935μs | 29.7676 KOps/s | 29.9320 KOps/s | |
test_values_stack_nested | 62.7510μs | 31.4987μs | 31.7473 KOps/s | 31.7761 KOps/s | |
test_values_stack_nested_leaf | 63.1610μs | 33.9651μs | 29.4420 KOps/s | 29.3530 KOps/s | |
test_values_stack_nested_locked | 0.1015ms | 33.1762μs | 30.1421 KOps/s | 30.3358 KOps/s | |
test_membership | 2.5095μs | 0.5095μs | 1.9627 MOps/s | 1.9660 MOps/s | |
test_membership_nested | 13.9705μs | 1.8410μs | 543.1832 KOps/s | 531.9150 KOps/s | |
test_membership_nested_leaf | 12.9267μs | 1.8078μs | 553.1564 KOps/s | 529.8595 KOps/s | |
test_membership_stacked_nested | 44.3100μs | 1.9273μs | 518.8661 KOps/s | 521.5446 KOps/s | |
test_membership_stacked_nested_leaf | 31.3510μs | 1.8960μs | 527.4312 KOps/s | 521.1797 KOps/s | |
test_membership_nested_last | 26.5400μs | 2.8595μs | 349.7147 KOps/s | 358.3806 KOps/s | |
test_membership_nested_leaf_last | 32.5610μs | 2.8780μs | 347.4673 KOps/s | 355.9653 KOps/s | |
test_membership_stacked_nested_last | 47.4610μs | 4.5114μs | 221.6600 KOps/s | 265.6981 KOps/s | |
test_membership_stacked_nested_leaf_last | 19.1600μs | 4.5099μs | 221.7347 KOps/s | 268.6838 KOps/s | |
test_nested_getleaf | 34.3600μs | 5.9583μs | 167.8321 KOps/s | 167.7987 KOps/s | |
test_nested_get | 35.9700μs | 5.7004μs | 175.4250 KOps/s | 175.2508 KOps/s | |
test_stacked_getleaf | 42.2410μs | 5.9448μs | 168.2147 KOps/s | 167.0028 KOps/s | |
test_stacked_get | 32.5800μs | 5.6998μs | 175.4459 KOps/s | 176.4689 KOps/s | |
test_nested_getitemleaf | 27.3700μs | 6.0732μs | 164.6587 KOps/s | 165.5792 KOps/s | |
test_nested_getitem | 32.5310μs | 5.7465μs | 174.0184 KOps/s | 174.0991 KOps/s | |
test_stacked_getitemleaf | 62.7110μs | 6.0301μs | 165.8359 KOps/s | 164.7669 KOps/s | |
test_stacked_getitem | 40.5900μs | 5.7261μs | 174.6384 KOps/s | 174.9400 KOps/s | |
test_lock_nested | 9.0931ms | 0.3612ms | 2.7684 KOps/s | 2.7570 KOps/s | |
test_lock_stack_nested | 0.4522ms | 0.3233ms | 3.0932 KOps/s | 3.0259 KOps/s | |
test_unlock_nested | 0.7128ms | 0.2939ms | 3.4025 KOps/s | 3.3063 KOps/s | |
test_unlock_stack_nested | 0.3831ms | 0.2617ms | 3.8217 KOps/s | 3.7034 KOps/s | |
test_flatten_speed | 0.1127ms | 71.9303μs | 13.9023 KOps/s | 13.6666 KOps/s | |
test_unflatten_speed | 0.4150ms | 0.2845ms | 3.5153 KOps/s | 3.4313 KOps/s | |
test_common_ops | 1.6621ms | 0.5523ms | 1.8106 KOps/s | 1.7716 KOps/s | |
test_creation | 89.1010μs | 1.4179μs | 705.2907 KOps/s | 705.7674 KOps/s | |
test_creation_empty | 32.8500μs | 7.0614μs | 141.6156 KOps/s | 141.0323 KOps/s | |
test_creation_nested_1 | 79.6110μs | 8.5943μs | 116.3559 KOps/s | 115.1995 KOps/s | |
test_creation_nested_2 | 40.3510μs | 10.9645μs | 91.2037 KOps/s | 89.4064 KOps/s | |
test_clone | 80.6010μs | 9.5258μs | 104.9776 KOps/s | 101.2994 KOps/s | |
test_getitem[int] | 1.3696ms | 10.2091μs | 97.9519 KOps/s | 94.3976 KOps/s | |
test_getitem[slice_int] | 0.1073ms | 19.1965μs | 52.0929 KOps/s | 50.0475 KOps/s | |
test_getitem[range] | 0.1301ms | 34.9024μs | 28.6514 KOps/s | 28.2398 KOps/s | |
test_getitem[tuple] | 0.1089ms | 17.3158μs | 57.7508 KOps/s | 56.1865 KOps/s | |
test_getitem[list] | 0.2100ms | 31.1754μs | 32.0765 KOps/s | 31.7344 KOps/s | |
test_setitem_dim[int] | 37.3410μs | 17.0829μs | 58.5380 KOps/s | 58.4215 KOps/s | |
test_setitem_dim[slice_int] | 55.6710μs | 34.4654μs | 29.0146 KOps/s | 28.7845 KOps/s | |
test_setitem_dim[range] | 80.5510μs | 49.7460μs | 20.1021 KOps/s | 19.9912 KOps/s | |
test_setitem_dim[tuple] | 51.3810μs | 29.7304μs | 33.6356 KOps/s | 33.4548 KOps/s | |
test_setitem | 91.0610μs | 13.1734μs | 75.9108 KOps/s | 72.8154 KOps/s | |
test_set | 0.1502ms | 12.7596μs | 78.3722 KOps/s | 75.0484 KOps/s | |
test_set_shared | 1.6236ms | 0.1422ms | 7.0331 KOps/s | 6.9458 KOps/s | |
test_update | 0.2936ms | 15.0968μs | 66.2391 KOps/s | 64.3253 KOps/s | |
test_update_nested | 0.1076ms | 19.4317μs | 51.4622 KOps/s | 49.3626 KOps/s | |
test_update__nested | 1.2064ms | 22.5622μs | 44.3219 KOps/s | 43.3114 KOps/s | |
test_set_nested | 79.8110μs | 13.5521μs | 73.7891 KOps/s | 69.8318 KOps/s | |
test_set_nested_new | 75.4810μs | 15.6733μs | 63.8028 KOps/s | 60.8578 KOps/s | |
test_select | 0.1498ms | 27.7301μs | 36.0619 KOps/s | 35.0304 KOps/s | |
test_select_nested | 70.2000μs | 41.2167μs | 24.2620 KOps/s | 23.7863 KOps/s | |
test_exclude_nested | 0.1053ms | 57.9133μs | 17.2672 KOps/s | 17.1402 KOps/s | |
test_empty[True] | 0.3332ms | 0.2558ms | 3.9098 KOps/s | 3.9341 KOps/s | |
test_empty[False] | 3.5360μs | 0.7486μs | 1.3359 MOps/s | 1.3392 MOps/s | |
test_to | 81.3610μs | 53.9249μs | 18.5443 KOps/s | 19.1165 KOps/s | |
test_to_nonblocking | 0.2396ms | 44.6055μs | 22.4188 KOps/s | 22.5474 KOps/s | |
test_unbind_speed | 1.9662ms | 0.2237ms | 4.4710 KOps/s | 4.3636 KOps/s | |
test_unbind_speed_stack0 | 0.4118ms | 0.2255ms | 4.4346 KOps/s | 4.3002 KOps/s | |
test_unbind_speed_stack1 | 93.1453ms | 0.6266ms | 1.5960 KOps/s | 1.5671 KOps/s | |
test_split | 94.7678ms | 1.6524ms | 605.1943 Ops/s | 581.8528 Ops/s | |
test_chunk | 1.5910ms | 1.4018ms | 713.3447 Ops/s | 695.1956 Ops/s | |
test_consolidate[False-None] | 97.3763ms | 2.7642ms | 361.7625 Ops/s | 353.8992 Ops/s | |
test_consolidate[default-None] | 1.7495ms | 1.6093ms | 621.3731 Ops/s | 600.3787 Ops/s | |
test_consolidate[reduce-overhead-None] | 1.8546ms | 1.6576ms | 603.2891 Ops/s | 584.6911 Ops/s | |
test_consolidate_njt[False-None] | 6.5325ms | 6.3600ms | 157.2323 Ops/s | 156.9122 Ops/s | |
test_to[False-False-None] | 1.8098ms | 1.6416ms | 609.1770 Ops/s | 607.3954 Ops/s | |
test_to[True-False-None] | 1.4839ms | 1.2354ms | 809.4637 Ops/s | 771.7661 Ops/s | |
test_to[within-False-None] | 4.0901ms | 3.9046ms | 256.1056 Ops/s | 255.7307 Ops/s | |
test_to[True-default-None] | 5.4416ms | 5.1652ms | 193.6039 Ops/s | 194.6325 Ops/s | |
test_to_njt[False-False-None] | 7.0803ms | 6.8321ms | 146.3688 Ops/s | 147.1729 Ops/s | |
test_to_njt[True-False-None] | 5.7593ms | 5.3809ms | 185.8426 Ops/s | 185.1195 Ops/s | |
test_to_njt[within-False-None] | 12.0254ms | 11.8419ms | 84.4456 Ops/s | 84.3405 Ops/s | |
test_creation[device0] | 0.4609ms | 77.1473μs | 12.9622 KOps/s | 13.0128 KOps/s | |
test_creation_from_tensor | 0.5182ms | 81.3523μs | 12.2922 KOps/s | 12.5585 KOps/s | |
test_add_one[memmap_tensor0] | 0.4354ms | 5.9874μs | 167.0188 KOps/s | 159.5478 KOps/s | |
test_contiguous[memmap_tensor0] | 3.7206μs | 0.3952μs | 2.5303 MOps/s | 2.5945 MOps/s | |
test_stack[memmap_tensor0] | 40.6210μs | 4.1886μs | 238.7414 KOps/s | 228.2482 KOps/s | |
test_memmaptd_index | 1.6425ms | 0.2404ms | 4.1604 KOps/s | 4.1177 KOps/s | |
test_memmaptd_index_astensor | 0.5775ms | 0.2971ms | 3.3658 KOps/s | 3.3731 KOps/s | |
test_memmaptd_index_op | 1.1080ms | 0.5376ms | 1.8603 KOps/s | 1.8118 KOps/s | |
test_serialize_model | 0.4269s | 0.1723s | 5.8042 Ops/s | 7.6771 Ops/s | |
test_serialize_model_pickle | 1.3713s | 1.2208s | 0.8192 Ops/s | 0.8246 Ops/s | |
test_serialize_weights | 0.1311s | 0.1299s | 7.6978 Ops/s | 7.6969 Ops/s | |
test_serialize_weights_returnearly | 0.3177s | 52.8056ms | 18.9374 Ops/s | 13.0855 Ops/s | |
test_serialize_weights_pickle | 1.3585s | 1.2173s | 0.8215 Ops/s | 0.8123 Ops/s | |
test_reshape_pytree | 61.8410μs | 21.6897μs | 46.1047 KOps/s | 45.4510 KOps/s | |
test_reshape_td | 62.5010μs | 25.5561μs | 39.1296 KOps/s | 37.9597 KOps/s | |
test_view_pytree | 0.1815ms | 21.6581μs | 46.1721 KOps/s | 46.7514 KOps/s | |
test_view_td | 0.1624ms | 27.9226μs | 35.8133 KOps/s | 33.1264 KOps/s | |
test_unbind_pytree | 0.1199ms | 26.9105μs | 37.1603 KOps/s | 36.1457 KOps/s | |
test_unbind_td | 0.7678ms | 34.3593μs | 29.1042 KOps/s | 28.3659 KOps/s | |
test_split_pytree | 0.1606ms | 28.8629μs | 34.6465 KOps/s | 33.4576 KOps/s | |
test_split_td | 1.0248ms | 37.1137μs | 26.9443 KOps/s | 26.0545 KOps/s | |
test_add_pytree | 0.1749ms | 31.5999μs | 31.6457 KOps/s | 30.2634 KOps/s | |
test_add_td | 0.1485ms | 41.8262μs | 23.9085 KOps/s | 23.2105 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.2856ms | 0.1207ms | 8.2828 KOps/s | 8.1732 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.3046ms | 0.1237ms | 8.0845 KOps/s | 8.0123 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.2780ms | 95.1959μs | 10.5046 KOps/s | 10.3356 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 1.5619ms | 0.1470ms | 6.8024 KOps/s | 6.7119 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 0.2041ms | 22.4930μs | 44.4582 KOps/s | 43.8068 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1509ms | 26.4346μs | 37.8292 KOps/s | 37.3404 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1563ms | 63.8323μs | 15.6661 KOps/s | 15.4685 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1813ms | 48.7852μs | 20.4980 KOps/s | 20.0453 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.2930ms | 0.1413ms | 7.0790 KOps/s | 6.9156 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3566ms | 0.2081ms | 4.8053 KOps/s | 4.8524 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.2951ms | 98.6340μs | 10.1385 KOps/s | 10.2806 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.2373ms | 53.1053μs | 18.8305 KOps/s | 19.1002 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2970ms | 0.1375ms | 7.2715 KOps/s | 7.2526 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.6319ms | 0.4660ms | 2.1460 KOps/s | 2.0694 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3889ms | 0.2466ms | 4.0549 KOps/s | 4.0314 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2894ms | 0.1431ms | 6.9882 KOps/s | 6.8936 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.2231ms | 60.7875μs | 16.4507 KOps/s | 16.1596 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.2493ms | 99.0240μs | 10.0986 KOps/s | 10.1790 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.5566ms | 0.4032ms | 2.4804 KOps/s | 2.4603 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.2799ms | 0.1370ms | 7.3008 KOps/s | 7.3875 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 0.1800ms | 19.6943μs | 50.7761 KOps/s | 53.7948 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 53.2410μs | 27.1542μs | 36.8267 KOps/s | 36.2621 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1765ms | 69.0184μs | 14.4889 KOps/s | 14.3638 KOps/s | |
test_compile_copy_flat[pytree-eager] | 91.3610μs | 51.1190μs | 19.5622 KOps/s | 19.5059 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 1.6013ms | 0.3866ms | 2.5864 KOps/s | 2.1900 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.8103ms | 2.5596ms | 390.6877 Ops/s | 395.8013 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 1.5773ms | 0.3780ms | 2.6452 KOps/s | 2.2899 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 2.7785ms | 2.5849ms | 386.8670 Ops/s | 382.3897 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.3271ms | 0.1166ms | 8.5791 KOps/s | 8.6410 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5734ms | 81.6573μs | 12.2463 KOps/s | 12.9931 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.3010ms | 0.1097ms | 9.1191 KOps/s | 9.7166 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.2477ms | 69.5748μs | 14.3730 KOps/s | 15.0044 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.3014ms | 0.1108ms | 9.0288 KOps/s | 9.6655 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.2702ms | 70.1304μs | 14.2592 KOps/s | 14.9735 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.2432ms | 99.6477μs | 10.0354 KOps/s | 10.0732 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.2336ms | 17.0131μs | 58.7782 KOps/s | 59.2528 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.2620ms | 95.9526μs | 10.4218 KOps/s | 10.4888 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 0.1517ms | 15.1828μs | 65.8638 KOps/s | 63.7376 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.2823ms | 95.2578μs | 10.4978 KOps/s | 9.9765 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 0.2205ms | 15.3649μs | 65.0833 KOps/s | 64.6706 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.2969ms | 0.1027ms | 9.7358 KOps/s | 9.7082 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.6072ms | 16.0110μs | 62.4572 KOps/s | 55.1714 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.2393ms | 95.0587μs | 10.5198 KOps/s | 10.3117 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 0.1505ms | 15.2315μs | 65.6533 KOps/s | 63.1555 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.2456ms | 95.0686μs | 10.5187 KOps/s | 10.3516 KOps/s | |
test_compile_indexing[int-pytree-eager] | 0.1241ms | 15.3142μs | 65.2988 KOps/s | 64.6312 KOps/s | |
test_mod_add[eager] | 0.1796ms | 29.7847μs | 33.5742 KOps/s | 33.5075 KOps/s | |
test_mod_add[compile] | 0.2489ms | 77.0884μs | 12.9721 KOps/s | 12.9482 KOps/s | |
test_mod_add[compile-overhead] | 0.3137ms | 0.1623ms | 6.1622 KOps/s | 5.9494 KOps/s | |
test_mod_wrap[eager] | 0.3798ms | 0.2331ms | 4.2905 KOps/s | 4.2644 KOps/s | |
test_mod_wrap[compile] | 1.5996ms | 0.2885ms | 3.4659 KOps/s | 3.5815 KOps/s | |
test_mod_wrap[compile-overhead] | 7.5129ms | 3.8147ms | 262.1442 Ops/s | 259.4481 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.5571ms | 1.3069ms | 765.1928 Ops/s | 700.0868 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.3996ms | 1.2490ms | 800.6573 Ops/s | 737.8164 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.3420ms | 0.8955ms | 1.1166 KOps/s | 928.2526 Ops/s | |
test_seq_add[eager] | 0.2681ms | 94.5156μs | 10.5803 KOps/s | 10.6626 KOps/s | |
test_seq_add[compile] | 0.2579ms | 89.6176μs | 11.1585 KOps/s | 11.7538 KOps/s | |
test_seq_add[compile-overhead] | 0.3009ms | 0.1338ms | 7.4749 KOps/s | 7.9454 KOps/s | |
test_seq_wrap[eager] | 0.5832ms | 0.3801ms | 2.6312 KOps/s | 2.7177 KOps/s | |
test_seq_wrap[compile] | 1.6381ms | 0.2977ms | 3.3588 KOps/s | 3.3801 KOps/s | |
test_seq_wrap[compile-overhead] | 0.4050ms | 0.2228ms | 4.4884 KOps/s | 4.5711 KOps/s | |
test_func_call_runtime[False-eager] | 0.9598ms | 0.7497ms | 1.3338 KOps/s | 1.3913 KOps/s | |
test_func_call_runtime[False-compile] | 0.9762ms | 0.7644ms | 1.3081 KOps/s | 1.3597 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.5071ms | 0.3525ms | 2.8370 KOps/s | 2.8115 KOps/s | |
test_func_call_runtime[True-eager] | 1.1907ms | 0.8813ms | 1.1347 KOps/s | 1.1353 KOps/s | |
test_func_call_runtime[True-compile] | 0.9331ms | 0.7553ms | 1.3239 KOps/s | 1.3251 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.5262ms | 0.3745ms | 2.6703 KOps/s | 2.6642 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.8966ms | 0.7025ms | 1.4235 KOps/s | 1.4038 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.9602ms | 0.7378ms | 1.3553 KOps/s | 1.3571 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.5717ms | 0.3641ms | 2.7468 KOps/s | 2.7997 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.2126ms | 1.0104ms | 989.7386 Ops/s | 1.0226 KOps/s | |
test_func_call_cm_runtime[True-compile] | 1.1072ms | 0.8264ms | 1.2101 KOps/s | 1.2745 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.5918ms | 0.4077ms | 2.4527 KOps/s | 2.4816 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.4692ms | 1.9768ms | 505.8798 Ops/s | 499.1610 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.9650ms | 0.7972ms | 1.2544 KOps/s | 1.2467 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.5306ms | 0.4017ms | 2.4894 KOps/s | 2.4721 KOps/s | |
test_distributed | 2.8786ms | 0.2918ms | 3.4268 KOps/s | 8.8836 KOps/s | |
test_tdmodule | 0.1127ms | 13.6452μs | 73.2859 KOps/s | 77.0764 KOps/s | |
test_tdmodule_dispatch | 56.2710μs | 28.5149μs | 35.0694 KOps/s | 36.8755 KOps/s | |
test_tdseq | 35.4310μs | 15.0260μs | 66.5513 KOps/s | 68.3434 KOps/s | |
test_tdseq_dispatch | 0.1271ms | 31.5139μs | 31.7320 KOps/s | 33.0181 KOps/s | |
test_instantiation_functorch | 1.6813ms | 1.5015ms | 666.0142 Ops/s | 661.3632 Ops/s | |
test_exec_functorch | 0.2563ms | 0.1388ms | 7.2035 KOps/s | 7.1712 KOps/s | |
test_exec_functional_call | 0.2605ms | 0.1300ms | 7.6928 KOps/s | 7.6504 KOps/s | |
test_exec_td_decorator | 0.4065ms | 0.1718ms | 5.8216 KOps/s | 5.7377 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.8416ms | 0.6590ms | 1.5174 KOps/s | 1.5325 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.8569ms | 0.6544ms | 1.5282 KOps/s | 1.5304 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7982ms | 0.5872ms | 1.7031 KOps/s | 1.6823 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7537ms | 0.5697ms | 1.7555 KOps/s | 1.7426 KOps/s | |
test_vmap_transformer_speed_decorator[True-True] | 19.1063ms | 18.4692ms | 54.1441 Ops/s | 54.0438 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 19.2378ms | 18.5196ms | 53.9968 Ops/s | 53.9754 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 19.1170ms | 18.3971ms | 54.3563 Ops/s | 54.4659 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 19.0936ms | 18.3173ms | 54.5933 Ops/s | 54.2469 Ops/s | |
test_to_module_speed[True] | 1.1569ms | 0.9168ms | 1.0907 KOps/s | 1.0846 KOps/s | |
test_to_module_speed[False] | 1.4096ms | 0.9008ms | 1.1101 KOps/s | 1.1059 KOps/s | |
test_tc_init | 71.9600μs | 34.6601μs | 28.8516 KOps/s | 29.1429 KOps/s | |
test_tc_init_nested | 0.2059ms | 70.0073μs | 14.2842 KOps/s | 13.9422 KOps/s | |
test_tc_first_layer_tensor | 5.9600μs | 0.6827μs | 1.4648 MOps/s | 1.4500 MOps/s | |
test_tc_first_layer_nontensor | 27.9900μs | 2.2695μs | 440.6278 KOps/s | 438.0590 KOps/s | |
test_tc_second_layer_tensor | 23.8328μs | 1.4060μs | 711.2436 KOps/s | 710.0264 KOps/s | |
test_tc_second_layer_nontensor | 28.3000μs | 3.0302μs | 330.0136 KOps/s | 326.0231 KOps/s | |
test_unbind | 0.2387s | 9.7897ms | 102.1478 Ops/s | 154.1525 Ops/s | |
test_full_like | 9.6928ms | 9.2747ms | 107.8206 Ops/s | 106.4979 Ops/s | |
test_zeros_like | 5.2714ms | 4.3368ms | 230.5822 Ops/s | 230.1625 Ops/s | |
test_ones_like | 4.9970ms | 4.3492ms | 229.9285 Ops/s | 230.0691 Ops/s | |
test_clone | 11.7366ms | 9.2937ms | 107.5998 Ops/s | 151.8730 Ops/s | |
test_squeeze | 55.3510μs | 9.2672μs | 107.9074 KOps/s | 109.5122 KOps/s | |
test_unsqueeze | 0.2094ms | 71.3555μs | 14.0143 KOps/s | 14.1322 KOps/s | |
test_split | 0.3846ms | 0.1568ms | 6.3793 KOps/s | 6.4795 KOps/s | |
test_permute | 0.3904ms | 0.1806ms | 5.5385 KOps/s | 5.7813 KOps/s | |
test_stack | 51.7984ms | 51.2395ms | 19.5162 Ops/s | 19.6076 Ops/s | |
test_cat | 51.4460ms | 50.8419ms | 19.6688 Ops/s | 19.7915 Ops/s |
vmoens
added a commit
that referenced
this pull request
Nov 22, 2024
ghstack-source-id: 6d6a10c52d5558017497e4e3cf1995820350a952 Pull Request resolved: #1103
vmoens
added a commit
that referenced
this pull request
Nov 22, 2024
ghstack-source-id: e7267e4d756b8ac3ea0727c72fed4ffac9a74e4e Pull Request resolved: #1103
vmoens
added a commit
that referenced
this pull request
Nov 23, 2024
ghstack-source-id: 9d15fa52028fb9b055e4b562bf740e4640b61cfd Pull Request resolved: #1103
vmoens
added a commit
that referenced
this pull request
Nov 23, 2024
ghstack-source-id: 336a1ba18ff51b2a2fcb041cd18cc4c198ec211e Pull Request resolved: #1103
vmoens
added a commit
that referenced
this pull request
Nov 23, 2024
ghstack-source-id: d4a264a77251a71964c05834daeaa029ed142417 Pull Request resolved: #1103
vmoens
added a commit
that referenced
this pull request
Nov 23, 2024
ghstack-source-id: a566ae225c54f07a680b4bf380b16d8e797f62ea Pull Request resolved: #1103
vmoens
added a commit
that referenced
this pull request
Nov 23, 2024
ghstack-source-id: a566ae225c54f07a680b4bf380b16d8e797f62ea Pull Request resolved: #1103
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Stack from ghstack (oldest at bottom):