-
Notifications
You must be signed in to change notification settings - Fork 79
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BugFix] Softly revert get changes #950
Merged
Merged
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 54.2620μs | 22.2430μs | 44.9580 KOps/s | 49.6402 KOps/s | |
test_plain_set_stack_nested | 0.1458ms | 22.9661μs | 43.5424 KOps/s | 49.1414 KOps/s | |
test_plain_set_nested_inplace | 0.1156ms | 24.2887μs | 41.1714 KOps/s | 44.5578 KOps/s | |
test_plain_set_stack_nested_inplace | 59.8220μs | 24.3796μs | 41.0180 KOps/s | 45.0048 KOps/s | |
test_items | 29.6060μs | 2.6544μs | 376.7313 KOps/s | 377.6866 KOps/s | |
test_items_nested | 0.6188ms | 0.3384ms | 2.9555 KOps/s | 2.9813 KOps/s | |
test_items_nested_locked | 2.8456ms | 0.3384ms | 2.9548 KOps/s | 2.9734 KOps/s | |
test_items_nested_leaf | 0.1704ms | 86.6794μs | 11.5368 KOps/s | 11.9114 KOps/s | |
test_items_stack_nested | 0.6599ms | 0.3367ms | 2.9702 KOps/s | 2.9832 KOps/s | |
test_items_stack_nested_leaf | 0.1691ms | 87.7462μs | 11.3965 KOps/s | 12.2899 KOps/s | |
test_items_stack_nested_locked | 0.4274ms | 0.3369ms | 2.9679 KOps/s | 2.9743 KOps/s | |
test_keys | 34.1740μs | 3.9819μs | 251.1374 KOps/s | 257.5556 KOps/s | |
test_keys_nested | 0.3843ms | 0.1485ms | 6.7347 KOps/s | 7.0358 KOps/s | |
test_keys_nested_locked | 0.7614ms | 0.1496ms | 6.6846 KOps/s | 6.7509 KOps/s | |
test_keys_nested_leaf | 0.3823ms | 0.1265ms | 7.9036 KOps/s | 8.2417 KOps/s | |
test_keys_stack_nested | 0.2421ms | 0.1434ms | 6.9718 KOps/s | 7.1047 KOps/s | |
test_keys_stack_nested_leaf | 0.2421ms | 0.1221ms | 8.1931 KOps/s | 8.2569 KOps/s | |
test_keys_stack_nested_locked | 0.3057ms | 0.1472ms | 6.7956 KOps/s | 6.8347 KOps/s | |
test_values | 28.8915μs | 1.1939μs | 837.6168 KOps/s | 818.7262 KOps/s | |
test_values_nested | 0.2236ms | 50.0388μs | 19.9845 KOps/s | 19.8245 KOps/s | |
test_values_nested_locked | 0.1021ms | 49.7042μs | 20.1190 KOps/s | 19.6680 KOps/s | |
test_values_nested_leaf | 83.9070μs | 44.8037μs | 22.3196 KOps/s | 22.0196 KOps/s | |
test_values_stack_nested | 0.1227ms | 50.5051μs | 19.8000 KOps/s | 18.7741 KOps/s | |
test_values_stack_nested_leaf | 0.1269ms | 44.5803μs | 22.4314 KOps/s | 22.0439 KOps/s | |
test_values_stack_nested_locked | 0.1276ms | 50.6037μs | 19.7614 KOps/s | 19.3764 KOps/s | |
test_membership | 16.5810μs | 0.8969μs | 1.1150 MOps/s | 1.1112 MOps/s | |
test_membership_nested | 24.5060μs | 2.6141μs | 382.5442 KOps/s | 381.5341 KOps/s | |
test_membership_nested_leaf | 37.1200μs | 2.6211μs | 381.5189 KOps/s | 366.1926 KOps/s | |
test_membership_stacked_nested | 24.3450μs | 2.6200μs | 381.6792 KOps/s | 374.6617 KOps/s | |
test_membership_stacked_nested_leaf | 52.4760μs | 2.6337μs | 379.6868 KOps/s | 376.6879 KOps/s | |
test_membership_nested_last | 31.6290μs | 3.8735μs | 258.1664 KOps/s | 253.7737 KOps/s | |
test_membership_nested_leaf_last | 61.7780μs | 3.8427μs | 260.2349 KOps/s | 252.5206 KOps/s | |
test_membership_stacked_nested_last | 38.6020μs | 3.8622μs | 258.9211 KOps/s | 77.7265 KOps/s | |
test_membership_stacked_nested_leaf_last | 28.0720μs | 3.8392μs | 260.4681 KOps/s | 77.5213 KOps/s | |
test_nested_getleaf | 78.9970μs | 10.5688μs | 94.6185 KOps/s | 94.0290 KOps/s | |
test_nested_get | 37.2900μs | 10.0842μs | 99.1650 KOps/s | 102.1309 KOps/s | |
test_stacked_getleaf | 34.7750μs | 10.5103μs | 95.1445 KOps/s | 96.7563 KOps/s | |
test_stacked_get | 0.2593ms | 10.5553μs | 94.7392 KOps/s | 103.1889 KOps/s | |
test_nested_getitemleaf | 54.3810μs | 11.0780μs | 90.2687 KOps/s | 91.5389 KOps/s | |
test_nested_getitem | 38.9630μs | 10.1397μs | 98.6224 KOps/s | 100.6004 KOps/s | |
test_stacked_getitemleaf | 61.2550μs | 10.8357μs | 92.2874 KOps/s | 92.9886 KOps/s | |
test_stacked_getitem | 34.2640μs | 10.0694μs | 99.3109 KOps/s | 101.7217 KOps/s | |
test_lock_nested | 92.1199ms | 0.6063ms | 1.6494 KOps/s | 1.9728 KOps/s | |
test_lock_stack_nested | 0.8608ms | 0.4671ms | 2.1409 KOps/s | 2.2211 KOps/s | |
test_unlock_nested | 89.8652ms | 0.5125ms | 1.9512 KOps/s | 2.3590 KOps/s | |
test_unlock_stack_nested | 0.5672ms | 0.3817ms | 2.6196 KOps/s | 2.7211 KOps/s | |
test_flatten_speed | 0.5263ms | 0.1061ms | 9.4252 KOps/s | 9.7582 KOps/s | |
test_unflatten_speed | 0.6739ms | 0.4600ms | 2.1738 KOps/s | 2.1900 KOps/s | |
test_common_ops | 1.7532ms | 1.1125ms | 898.9014 Ops/s | 929.9893 Ops/s | |
test_creation | 21.1290μs | 2.0788μs | 481.0551 KOps/s | 485.2517 KOps/s | |
test_creation_empty | 85.6780μs | 18.9661μs | 52.7256 KOps/s | 61.2964 KOps/s | |
test_creation_nested_1 | 55.2930μs | 22.2072μs | 45.0304 KOps/s | 49.0192 KOps/s | |
test_creation_nested_2 | 67.4460μs | 26.2509μs | 38.0940 KOps/s | 39.6791 KOps/s | |
test_clone | 62.5770μs | 16.4717μs | 60.7104 KOps/s | 59.6678 KOps/s | |
test_getitem[int] | 1.4798ms | 16.7292μs | 59.7756 KOps/s | 57.5865 KOps/s | |
test_getitem[slice_int] | 0.1259ms | 31.2639μs | 31.9857 KOps/s | 31.3306 KOps/s | |
test_getitem[range] | 0.1651ms | 56.3077μs | 17.7595 KOps/s | 17.3350 KOps/s | |
test_getitem[tuple] | 0.1426ms | 25.4553μs | 39.2845 KOps/s | 38.3090 KOps/s | |
test_getitem[list] | 0.2221ms | 51.4906μs | 19.4210 KOps/s | 18.5535 KOps/s | |
test_setitem_dim[int] | 65.2220μs | 42.6378μs | 23.4534 KOps/s | 23.4878 KOps/s | |
test_setitem_dim[slice_int] | 0.1142ms | 73.4676μs | 13.6114 KOps/s | 13.8513 KOps/s | |
test_setitem_dim[range] | 0.1712ms | 95.6585μs | 10.4539 KOps/s | 10.8255 KOps/s | |
test_setitem_dim[tuple] | 0.1200ms | 60.1630μs | 16.6215 KOps/s | 17.3979 KOps/s | |
test_setitem | 0.1386ms | 30.2686μs | 33.0375 KOps/s | 36.0688 KOps/s | |
test_set | 0.1140ms | 29.2293μs | 34.2122 KOps/s | 36.6911 KOps/s | |
test_set_shared | 4.1298ms | 0.2165ms | 4.6190 KOps/s | 4.4940 KOps/s | |
test_update | 0.1410ms | 36.5816μs | 27.3361 KOps/s | 29.6886 KOps/s | |
test_update_nested | 0.1320ms | 45.9571μs | 21.7594 KOps/s | 22.7864 KOps/s | |
test_update__nested | 0.1911ms | 34.4792μs | 29.0030 KOps/s | 29.3842 KOps/s | |
test_set_nested | 0.1674ms | 31.1794μs | 32.0725 KOps/s | 33.4946 KOps/s | |
test_set_nested_new | 0.1446ms | 35.9070μs | 27.8497 KOps/s | 28.8953 KOps/s | |
test_select | 0.1267ms | 51.9101μs | 19.2641 KOps/s | 19.4145 KOps/s | |
test_select_nested | 0.1241ms | 58.8276μs | 16.9988 KOps/s | 17.1133 KOps/s | |
test_exclude_nested | 0.1415ms | 77.4747μs | 12.9074 KOps/s | 12.9384 KOps/s | |
test_empty[True] | 0.4389ms | 0.3242ms | 3.0848 KOps/s | 3.1403 KOps/s | |
test_empty[False] | 11.5040μs | 1.1483μs | 870.8409 KOps/s | 863.5897 KOps/s | |
test_unbind_speed | 0.6359ms | 0.3098ms | 3.2276 KOps/s | 3.1497 KOps/s | |
test_unbind_speed_stack0 | 0.5264ms | 0.3025ms | 3.3059 KOps/s | 3.4018 KOps/s | |
test_unbind_speed_stack1 | 94.0431ms | 0.8074ms | 1.2385 KOps/s | 1.4358 KOps/s | |
test_split | 84.4260ms | 2.1653ms | 461.8256 Ops/s | 460.3768 Ops/s | |
test_chunk | 90.3007ms | 2.2337ms | 447.6976 Ops/s | 459.3073 Ops/s | |
test_creation[device0] | 0.4338ms | 0.1207ms | 8.2868 KOps/s | 8.2253 KOps/s | |
test_creation_from_tensor | 4.1233ms | 0.1217ms | 8.2173 KOps/s | 8.2586 KOps/s | |
test_add_one[memmap_tensor0] | 0.4937ms | 7.7870μs | 128.4187 KOps/s | 122.8018 KOps/s | |
test_contiguous[memmap_tensor0] | 19.0860μs | 2.0107μs | 497.3389 KOps/s | 488.6262 KOps/s | |
test_stack[memmap_tensor0] | 43.7810μs | 5.8490μs | 170.9685 KOps/s | 168.5574 KOps/s | |
test_memmaptd_index | 0.9700ms | 0.4088ms | 2.4460 KOps/s | 2.3756 KOps/s | |
test_memmaptd_index_astensor | 0.9378ms | 0.4915ms | 2.0344 KOps/s | 1.9886 KOps/s | |
test_memmaptd_index_op | 1.6720ms | 1.0526ms | 949.9862 Ops/s | 972.0745 Ops/s | |
test_serialize_model | 0.1278s | 0.1181s | 8.4684 Ops/s | 7.6924 Ops/s | |
test_serialize_model_pickle | 0.4468s | 0.3984s | 2.5101 Ops/s | 2.4496 Ops/s | |
test_serialize_weights | 0.2079s | 0.1305s | 7.6655 Ops/s | 8.5431 Ops/s | |
test_serialize_weights_returnearly | 0.1833s | 0.1633s | 6.1225 Ops/s | 6.1667 Ops/s | |
test_serialize_weights_pickle | 0.5024s | 0.4527s | 2.2091 Ops/s | 1.1869 Ops/s | |
test_serialize_weights_filesystem | 0.1479s | 0.1437s | 6.9573 Ops/s | 6.5753 Ops/s | |
test_serialize_model_filesystem | 0.2358s | 0.1607s | 6.2238 Ops/s | 6.9578 Ops/s | |
test_reshape_pytree | 94.4070μs | 40.0640μs | 24.9600 KOps/s | 24.9945 KOps/s | |
test_reshape_td | 98.6850μs | 47.3145μs | 21.1352 KOps/s | 21.4608 KOps/s | |
test_view_pytree | 86.4020μs | 40.0969μs | 24.9396 KOps/s | 24.9181 KOps/s | |
test_view_td | 0.1145ms | 54.4001μs | 18.3823 KOps/s | 19.1476 KOps/s | |
test_unbind_pytree | 91.8820μs | 37.6546μs | 26.5572 KOps/s | 26.5440 KOps/s | |
test_unbind_td | 0.4287ms | 46.0020μs | 21.7382 KOps/s | 21.4104 KOps/s | |
test_split_pytree | 80.6300μs | 40.5102μs | 24.6851 KOps/s | 24.6575 KOps/s | |
test_split_td | 0.4672ms | 58.1202μs | 17.2057 KOps/s | 16.5500 KOps/s | |
test_add_pytree | 91.6620μs | 46.3288μs | 21.5848 KOps/s | 21.1204 KOps/s | |
test_add_td | 0.2437ms | 85.9922μs | 11.6290 KOps/s | 12.3158 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1199ms | 54.7079μs | 18.2789 KOps/s | 18.3417 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.3871ms | 0.1902ms | 5.2571 KOps/s | 4.9929 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.2273ms | 54.7952μs | 18.2498 KOps/s | 18.2300 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2912ms | 0.1445ms | 6.9218 KOps/s | 6.8240 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 54.5210μs | 20.1628μs | 49.5963 KOps/s | 47.9031 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1344ms | 63.6929μs | 15.7003 KOps/s | 15.3231 KOps/s | |
test_compile_copy_nested[pytree-compile] | 4.7918ms | 79.5637μs | 12.5685 KOps/s | 12.5963 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1659ms | 72.3604μs | 13.8197 KOps/s | 13.7490 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.2989ms | 0.1743ms | 5.7357 KOps/s | 5.6872 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.2993ms | 0.1934ms | 5.1705 KOps/s | 5.2014 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 83.6760μs | 38.1400μs | 26.2192 KOps/s | 25.7356 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.4770ms | 70.5264μs | 14.1791 KOps/s | 13.8351 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2595ms | 0.1710ms | 5.8485 KOps/s | 5.6957 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.4348ms | 0.2944ms | 3.3969 KOps/s | 3.4036 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3045ms | 0.2048ms | 4.8838 KOps/s | 4.7867 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.5409ms | 0.1789ms | 5.5903 KOps/s | 5.6156 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.4384ms | 62.2848μs | 16.0553 KOps/s | 15.4692 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1023ms | 39.8779μs | 25.0766 KOps/s | 25.4008 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.6095ms | 0.2436ms | 4.1047 KOps/s | 4.1975 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.2888ms | 0.1725ms | 5.7982 KOps/s | 5.7259 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 0.2078ms | 0.1074ms | 9.3153 KOps/s | 9.0736 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1186ms | 55.9972μs | 17.8580 KOps/s | 17.2769 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1871ms | 81.1089μs | 12.3291 KOps/s | 12.3584 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1712ms | 71.9178μs | 13.9048 KOps/s | 13.7935 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.2838ms | 0.1884ms | 5.3089 KOps/s | 5.2750 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 3.2730ms | 1.6461ms | 607.5031 Ops/s | 601.8823 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.6026ms | 0.1907ms | 5.2436 KOps/s | 5.2720 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 1.3878ms | 1.0955ms | 912.8399 Ops/s | 914.9189 Ops/s | |
test_compile_assign_and_add_stack[compile] | 0.7034ms | 0.4146ms | 2.4117 KOps/s | 2.3938 KOps/s | |
test_compile_assign_and_add_stack[eager] | 6.9768ms | 3.9288ms | 254.5314 Ops/s | 267.2659 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.2336ms | 33.5990μs | 29.7628 KOps/s | 30.2766 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 1.0547ms | 46.4911μs | 21.5095 KOps/s | 19.9726 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 81.8030μs | 27.8988μs | 35.8438 KOps/s | 35.0547 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 94.9880μs | 29.4477μs | 33.9585 KOps/s | 32.1031 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1160ms | 27.9172μs | 35.8202 KOps/s | 34.7225 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.1095ms | 29.7575μs | 33.6049 KOps/s | 31.3476 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.3476ms | 71.6317μs | 13.9603 KOps/s | 13.2134 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 3.4526ms | 27.8312μs | 35.9310 KOps/s | 34.7684 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1386ms | 66.8645μs | 14.9556 KOps/s | 14.4161 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 84.6380μs | 24.1127μs | 41.4719 KOps/s | 40.0408 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1350ms | 67.8474μs | 14.7390 KOps/s | 14.4883 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 64.5810μs | 24.4231μs | 40.9448 KOps/s | 40.2830 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 1.9929ms | 72.3103μs | 13.8293 KOps/s | 13.3506 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.7627ms | 27.3875μs | 36.5130 KOps/s | 35.1414 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1259ms | 66.4828μs | 15.0415 KOps/s | 14.8175 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 65.8230μs | 24.2341μs | 41.2642 KOps/s | 40.9161 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.3355ms | 68.9884μs | 14.4952 KOps/s | 14.6295 KOps/s | |
test_compile_indexing[int-pytree-eager] | 63.4790μs | 24.0126μs | 41.6448 KOps/s | 40.7984 KOps/s | |
test_mod_add[eager] | 0.1563ms | 24.9050μs | 40.1525 KOps/s | 41.9347 KOps/s | |
test_mod_add[compile] | 94.1760μs | 36.1708μs | 27.6466 KOps/s | 27.4501 KOps/s | |
test_mod_add[compile-overhead] | 0.1273ms | 37.8390μs | 26.4277 KOps/s | 27.3483 KOps/s | |
test_mod_wrap[eager] | 0.4413ms | 0.2130ms | 4.6950 KOps/s | 4.6680 KOps/s | |
test_mod_wrap[compile] | 1.4565ms | 0.2330ms | 4.2925 KOps/s | 4.2095 KOps/s | |
test_mod_wrap[compile-overhead] | 0.4904ms | 0.2257ms | 4.4303 KOps/s | 4.3402 KOps/s | |
test_mod_wrap_and_backward[eager] | 11.8448ms | 10.8749ms | 91.9549 Ops/s | 87.5404 Ops/s | |
test_mod_wrap_and_backward[compile] | 13.2671ms | 11.0813ms | 90.2418 Ops/s | 85.9913 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 15.0551ms | 11.5224ms | 86.7875 Ops/s | 90.2115 Ops/s | |
test_seq_add[eager] | 0.2633ms | 89.6255μs | 11.1575 KOps/s | 11.7325 KOps/s | |
test_seq_add[compile] | 0.1866ms | 60.5190μs | 16.5237 KOps/s | 16.1344 KOps/s | |
test_seq_add[compile-overhead] | 0.1665ms | 60.1710μs | 16.6193 KOps/s | 16.9235 KOps/s | |
test_seq_wrap[eager] | 0.6475ms | 0.3871ms | 2.5830 KOps/s | 2.7113 KOps/s | |
test_seq_wrap[compile] | 0.7321ms | 0.2672ms | 3.7428 KOps/s | 3.7714 KOps/s | |
test_seq_wrap[compile-overhead] | 0.4725ms | 0.2640ms | 3.7873 KOps/s | 3.7702 KOps/s | |
test_func_call_runtime[False-eager] | 0.9266ms | 0.5329ms | 1.8764 KOps/s | 1.8412 KOps/s | |
test_func_call_runtime[False-compile] | 0.6785ms | 0.4935ms | 2.0263 KOps/s | 1.9895 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 1.0359ms | 0.4976ms | 2.0097 KOps/s | 1.9886 KOps/s | |
test_func_call_runtime[True-eager] | 1.0171ms | 0.7564ms | 1.3221 KOps/s | 1.2889 KOps/s | |
test_func_call_runtime[True-compile] | 0.7114ms | 0.5072ms | 1.9717 KOps/s | 1.8793 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.8825ms | 0.5079ms | 1.9687 KOps/s | 1.8711 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.9018ms | 0.5346ms | 1.8704 KOps/s | 1.8165 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.6501ms | 0.4940ms | 2.0245 KOps/s | 1.9337 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.6153ms | 0.4957ms | 2.0173 KOps/s | 1.9414 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.2608ms | 0.8760ms | 1.1416 KOps/s | 1.0783 KOps/s | |
test_func_call_cm_runtime[True-compile] | 1.0910ms | 0.8373ms | 1.1943 KOps/s | 1.1241 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.9634ms | 0.8420ms | 1.1877 KOps/s | 1.1528 KOps/s | |
test_distributed | 0.3563ms | 0.1317ms | 7.5943 KOps/s | 7.5024 KOps/s | |
test_tdmodule | 0.1174ms | 17.9491μs | 55.7131 KOps/s | 64.1415 KOps/s | |
test_tdmodule_dispatch | 64.0290μs | 37.5747μs | 26.6137 KOps/s | 29.6721 KOps/s | |
test_tdseq | 51.0750μs | 19.6425μs | 50.9100 KOps/s | 58.7712 KOps/s | |
test_tdseq_dispatch | 71.7340μs | 41.1089μs | 24.3257 KOps/s | 27.1711 KOps/s | |
test_instantiation_functorch | 1.8407ms | 1.6278ms | 614.3311 Ops/s | 592.5961 Ops/s | |
test_instantiation_td | 1.7732ms | 1.1714ms | 853.6841 Ops/s | 853.3606 Ops/s | |
test_exec_functorch | 0.3251ms | 0.1809ms | 5.5267 KOps/s | 5.5247 KOps/s | |
test_exec_functional_call | 0.3328ms | 0.1713ms | 5.8383 KOps/s | 5.7752 KOps/s | |
test_exec_td | 0.2719ms | 0.1757ms | 5.6913 KOps/s | 5.7117 KOps/s | |
test_exec_td_decorator | 1.0723ms | 0.2401ms | 4.1655 KOps/s | 4.3940 KOps/s | |
test_vmap_mlp_speed[True-True] | 1.0519ms | 0.5981ms | 1.6720 KOps/s | 1.7134 KOps/s | |
test_vmap_mlp_speed[True-False] | 0.7843ms | 0.5763ms | 1.7351 KOps/s | 1.7236 KOps/s | |
test_vmap_mlp_speed[False-True] | 0.7459ms | 0.4757ms | 2.1021 KOps/s | 2.0542 KOps/s | |
test_vmap_mlp_speed[False-False] | 0.8482ms | 0.4767ms | 2.0977 KOps/s | 2.0616 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 1.2988ms | 0.6335ms | 1.5786 KOps/s | 1.5403 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.9399ms | 0.6353ms | 1.5741 KOps/s | 1.5325 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7336ms | 0.5225ms | 1.9137 KOps/s | 1.8363 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 1.0050ms | 0.5235ms | 1.9101 KOps/s | 1.8285 KOps/s | |
test_to_module_speed[True] | 2.1485ms | 1.3380ms | 747.3790 Ops/s | 733.1241 Ops/s | |
test_to_module_speed[False] | 1.4173ms | 1.3003ms | 769.0478 Ops/s | 776.5428 Ops/s | |
test_tc_init | 83.9670μs | 45.5164μs | 21.9701 KOps/s | 23.3398 KOps/s | |
test_tc_init_nested | 0.2110ms | 94.8433μs | 10.5437 KOps/s | 11.6640 KOps/s | |
test_tc_first_layer_tensor | 25.7180μs | 1.4303μs | 699.1619 KOps/s | 671.3518 KOps/s | |
test_tc_first_layer_nontensor | 30.7870μs | 4.1903μs | 238.6442 KOps/s | 225.9259 KOps/s | |
test_tc_second_layer_tensor | 25.4780μs | 2.6706μs | 374.4407 KOps/s | 372.0306 KOps/s | |
test_tc_second_layer_nontensor | 32.5810μs | 5.4208μs | 184.4747 KOps/s | 183.5538 KOps/s | |
test_unbind | 0.4571s | 13.8122ms | 72.3996 Ops/s | 63.8742 Ops/s | |
test_full_like | 9.2468ms | 7.2404ms | 138.1140 Ops/s | 102.3873 Ops/s | |
test_zeros_like | 12.3081ms | 6.6363ms | 150.6873 Ops/s | 128.2236 Ops/s | |
test_ones_like | 12.8032ms | 7.3603ms | 135.8632 Ops/s | 118.1992 Ops/s | |
test_clone | 14.6502ms | 8.9758ms | 111.4109 Ops/s | 93.6129 Ops/s | |
test_squeeze | 63.5990μs | 12.5608μs | 79.6127 KOps/s | 77.1291 KOps/s | |
test_unsqueeze | 0.1732ms | 92.0450μs | 10.8642 KOps/s | 10.2745 KOps/s | |
test_split | 0.4909ms | 0.1983ms | 5.0439 KOps/s | 4.8053 KOps/s | |
test_permute | 0.3807ms | 0.2185ms | 4.5760 KOps/s | 4.4670 KOps/s | |
test_stack | 32.7312ms | 25.5840ms | 39.0869 Ops/s | 34.8550 Ops/s | |
test_cat | 35.0363ms | 25.3951ms | 39.3777 Ops/s | 35.3363 Ops/s |
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 0.1520ms | 16.7182μs | 59.8151 KOps/s | 63.2210 KOps/s | |
test_plain_set_stack_nested | 35.3310μs | 16.8412μs | 59.3783 KOps/s | 63.0315 KOps/s | |
test_plain_set_nested_inplace | 53.7110μs | 17.8906μs | 55.8954 KOps/s | 59.3114 KOps/s | |
test_plain_set_stack_nested_inplace | 41.6310μs | 17.8598μs | 55.9918 KOps/s | 59.1493 KOps/s | |
test_items | 18.6710μs | 4.7019μs | 212.6781 KOps/s | 210.7114 KOps/s | |
test_items_nested | 0.4006ms | 0.3645ms | 2.7434 KOps/s | 2.6952 KOps/s | |
test_items_nested_locked | 0.4005ms | 0.3666ms | 2.7277 KOps/s | 2.6380 KOps/s | |
test_items_nested_leaf | 0.1126ms | 83.4886μs | 11.9777 KOps/s | 11.9713 KOps/s | |
test_items_stack_nested | 0.4374ms | 0.3663ms | 2.7301 KOps/s | 2.6724 KOps/s | |
test_items_stack_nested_leaf | 0.2677ms | 84.5307μs | 11.8300 KOps/s | 11.7402 KOps/s | |
test_items_stack_nested_locked | 0.5797ms | 0.3722ms | 2.6871 KOps/s | 2.7061 KOps/s | |
test_keys | 18.4200μs | 4.3723μs | 228.7141 KOps/s | 227.1713 KOps/s | |
test_keys_nested | 0.1070ms | 67.4549μs | 14.8247 KOps/s | 14.9675 KOps/s | |
test_keys_nested_locked | 0.6482ms | 73.0549μs | 13.6883 KOps/s | 13.6780 KOps/s | |
test_keys_nested_leaf | 90.8420μs | 57.9577μs | 17.2540 KOps/s | 17.4631 KOps/s | |
test_keys_stack_nested | 95.0220μs | 67.9018μs | 14.7272 KOps/s | 14.6920 KOps/s | |
test_keys_stack_nested_leaf | 94.5120μs | 58.1485μs | 17.1974 KOps/s | 17.0572 KOps/s | |
test_keys_stack_nested_locked | 0.1168ms | 73.2853μs | 13.6453 KOps/s | 13.5667 KOps/s | |
test_values | 9.4500μs | 1.7703μs | 564.8636 KOps/s | 568.7677 KOps/s | |
test_values_nested | 0.1077ms | 33.5186μs | 29.8342 KOps/s | 29.8287 KOps/s | |
test_values_nested_locked | 91.2620μs | 35.6280μs | 28.0678 KOps/s | 28.1341 KOps/s | |
test_values_nested_leaf | 53.8910μs | 29.7205μs | 33.6468 KOps/s | 33.6459 KOps/s | |
test_values_stack_nested | 73.4120μs | 34.4495μs | 29.0280 KOps/s | 29.0576 KOps/s | |
test_values_stack_nested_leaf | 54.7720μs | 30.4142μs | 32.8793 KOps/s | 32.7249 KOps/s | |
test_values_stack_nested_locked | 90.2020μs | 36.2351μs | 27.5976 KOps/s | 27.5467 KOps/s | |
test_membership | 1.3330μs | 0.5353μs | 1.8682 MOps/s | 1.8275 MOps/s | |
test_membership_nested | 9.2050μs | 1.9806μs | 504.9014 KOps/s | 485.3293 KOps/s | |
test_membership_nested_leaf | 14.2750μs | 1.9998μs | 500.0440 KOps/s | 512.0470 KOps/s | |
test_membership_stacked_nested | 20.9600μs | 2.0204μs | 494.9551 KOps/s | 497.7068 KOps/s | |
test_membership_stacked_nested_leaf | 19.9610μs | 2.0201μs | 495.0172 KOps/s | 498.2423 KOps/s | |
test_membership_nested_last | 21.9410μs | 2.9468μs | 339.3543 KOps/s | 339.9525 KOps/s | |
test_membership_nested_leaf_last | 15.8400μs | 2.9623μs | 337.5739 KOps/s | 337.7384 KOps/s | |
test_membership_stacked_nested_last | 24.4410μs | 2.9653μs | 337.2375 KOps/s | 340.4713 KOps/s | |
test_membership_stacked_nested_leaf_last | 22.6810μs | 2.9293μs | 341.3741 KOps/s | 339.8723 KOps/s | |
test_nested_getleaf | 27.7300μs | 7.8766μs | 126.9590 KOps/s | 128.8636 KOps/s | |
test_nested_get | 45.0620μs | 7.3785μs | 135.5281 KOps/s | 136.6831 KOps/s | |
test_stacked_getleaf | 27.4810μs | 7.9717μs | 125.4436 KOps/s | 127.9245 KOps/s | |
test_stacked_get | 66.3620μs | 7.4105μs | 134.9443 KOps/s | 137.4962 KOps/s | |
test_nested_getitemleaf | 21.6000μs | 8.1373μs | 122.8915 KOps/s | 123.0272 KOps/s | |
test_nested_getitem | 26.8710μs | 7.6614μs | 130.5238 KOps/s | 130.4013 KOps/s | |
test_stacked_getitemleaf | 26.3710μs | 8.1663μs | 122.4551 KOps/s | 122.9005 KOps/s | |
test_stacked_getitem | 21.2700μs | 7.7029μs | 129.8213 KOps/s | 130.5585 KOps/s | |
test_lock_nested | 0.9220ms | 0.4682ms | 2.1359 KOps/s | 2.1064 KOps/s | |
test_lock_stack_nested | 0.5665ms | 0.4358ms | 2.2945 KOps/s | 2.2752 KOps/s | |
test_unlock_nested | 0.8221ms | 0.3887ms | 2.5726 KOps/s | 2.5195 KOps/s | |
test_unlock_stack_nested | 0.4916ms | 0.3560ms | 2.8091 KOps/s | 2.7832 KOps/s | |
test_flatten_speed | 94.7136ms | 0.1166ms | 8.5748 KOps/s | 9.6622 KOps/s | |
test_unflatten_speed | 0.3730ms | 0.3143ms | 3.1817 KOps/s | 3.1985 KOps/s | |
test_common_ops | 1.6142ms | 1.3858ms | 721.5999 Ops/s | 754.2480 Ops/s | |
test_creation | 15.9400μs | 1.6823μs | 594.4358 KOps/s | 606.7345 KOps/s | |
test_creation_empty | 0.1522ms | 16.7539μs | 59.6878 KOps/s | 66.6796 KOps/s | |
test_creation_nested_1 | 0.1227ms | 18.6656μs | 53.5746 KOps/s | 59.1377 KOps/s | |
test_creation_nested_2 | 43.7710μs | 21.4322μs | 46.6587 KOps/s | 50.6193 KOps/s | |
test_clone | 0.1843ms | 30.5742μs | 32.7074 KOps/s | 31.1291 KOps/s | |
test_getitem[int] | 1.1452ms | 18.2971μs | 54.6535 KOps/s | 57.2756 KOps/s | |
test_getitem[slice_int] | 0.1657ms | 29.0626μs | 34.4085 KOps/s | 32.0008 KOps/s | |
test_getitem[range] | 0.2483ms | 0.1173ms | 8.5239 KOps/s | 8.3945 KOps/s | |
test_getitem[tuple] | 0.1403ms | 25.1228μs | 39.8045 KOps/s | 38.9600 KOps/s | |
test_getitem[list] | 0.3313ms | 0.1074ms | 9.3142 KOps/s | 9.3414 KOps/s | |
test_setitem_dim[int] | 0.1886ms | 56.4243μs | 17.7229 KOps/s | 18.3092 KOps/s | |
test_setitem_dim[slice_int] | 0.1222ms | 83.6051μs | 11.9610 KOps/s | 12.9284 KOps/s | |
test_setitem_dim[range] | 0.2882ms | 0.1538ms | 6.5036 KOps/s | 7.1129 KOps/s | |
test_setitem_dim[tuple] | 0.2219ms | 78.7870μs | 12.6924 KOps/s | 14.1073 KOps/s | |
test_setitem | 0.2225ms | 47.5003μs | 21.0525 KOps/s | 21.3771 KOps/s | |
test_set | 0.2212ms | 46.8459μs | 21.3466 KOps/s | 23.0017 KOps/s | |
test_set_shared | 0.3809ms | 55.2698μs | 18.0931 KOps/s | 17.6869 KOps/s | |
test_update | 0.1997ms | 52.9035μs | 18.9023 KOps/s | 19.2854 KOps/s | |
test_update_nested | 0.2164ms | 64.5432μs | 15.4935 KOps/s | 16.4802 KOps/s | |
test_update__nested | 0.2498ms | 68.5947μs | 14.5784 KOps/s | 14.8238 KOps/s | |
test_set_nested | 0.2018ms | 48.6185μs | 20.5683 KOps/s | 21.5903 KOps/s | |
test_set_nested_new | 0.2033ms | 53.1300μs | 18.8218 KOps/s | 19.8630 KOps/s | |
test_select | 0.2264ms | 69.0563μs | 14.4809 KOps/s | 15.1323 KOps/s | |
test_select_nested | 76.2320μs | 51.1973μs | 19.5323 KOps/s | 19.5541 KOps/s | |
test_exclude_nested | 95.4830μs | 69.7362μs | 14.3397 KOps/s | 14.3816 KOps/s | |
test_empty[True] | 0.3489ms | 0.2831ms | 3.5322 KOps/s | 3.4527 KOps/s | |
test_empty[False] | 1.8520μs | 0.8589μs | 1.1643 MOps/s | 1.1606 MOps/s | |
test_to | 64.0210μs | 25.6985μs | 38.9128 KOps/s | 36.5339 KOps/s | |
test_to_nonblocking | 47.5610μs | 25.1218μs | 39.8060 KOps/s | 36.3296 KOps/s | |
test_unbind_speed | 0.4483ms | 0.3027ms | 3.3032 KOps/s | 3.3020 KOps/s | |
test_unbind_speed_stack0 | 0.3504ms | 0.3023ms | 3.3084 KOps/s | 3.2492 KOps/s | |
test_unbind_speed_stack1 | 91.5653ms | 0.7703ms | 1.2982 KOps/s | 1.2655 KOps/s | |
test_split | 93.7822ms | 2.4013ms | 416.4395 Ops/s | 418.0717 Ops/s | |
test_chunk | 2.3164ms | 2.1866ms | 457.3348 Ops/s | 419.9192 Ops/s | |
test_creation[device0] | 0.2519ms | 0.1064ms | 9.4014 KOps/s | 9.3549 KOps/s | |
test_creation_from_tensor | 0.2762ms | 0.1092ms | 9.1593 KOps/s | 9.6149 KOps/s | |
test_add_one[memmap_tensor0] | 0.1567ms | 9.3393μs | 107.0739 KOps/s | 110.5579 KOps/s | |
test_contiguous[memmap_tensor0] | 0.1706ms | 2.2186μs | 450.7359 KOps/s | 444.7646 KOps/s | |
test_stack[memmap_tensor0] | 0.2021ms | 7.0617μs | 141.6099 KOps/s | 145.1317 KOps/s | |
test_memmaptd_index | 1.3299ms | 0.4385ms | 2.2807 KOps/s | 2.3179 KOps/s | |
test_memmaptd_index_astensor | 99.1266ms | 0.5555ms | 1.8002 KOps/s | 2.0240 KOps/s | |
test_memmaptd_index_op | 1.6459ms | 1.0705ms | 934.1620 Ops/s | 960.9092 Ops/s | |
test_serialize_model | 94.9624ms | 90.5848ms | 11.0394 Ops/s | 10.8415 Ops/s | |
test_serialize_model_pickle | 1.3511s | 1.2363s | 0.8089 Ops/s | 0.8084 Ops/s | |
test_serialize_weights | 87.8333ms | 86.3346ms | 11.5828 Ops/s | 9.6327 Ops/s | |
test_serialize_weights_returnearly | 55.8990ms | 51.8285ms | 19.2944 Ops/s | 14.7924 Ops/s | |
test_serialize_weights_pickle | 1.3525s | 1.2371s | 0.8083 Ops/s | 0.8036 Ops/s | |
test_reshape_pytree | 0.2375ms | 38.5326μs | 25.9521 KOps/s | 25.3404 KOps/s | |
test_reshape_td | 0.2139ms | 44.3874μs | 22.5289 KOps/s | 21.3552 KOps/s | |
test_view_pytree | 0.1412ms | 38.1601μs | 26.2054 KOps/s | 25.0454 KOps/s | |
test_view_td | 0.2166ms | 49.8220μs | 20.0715 KOps/s | 19.0823 KOps/s | |
test_unbind_pytree | 0.2543ms | 37.0495μs | 26.9910 KOps/s | 26.6588 KOps/s | |
test_unbind_td | 0.3940ms | 45.1729μs | 22.1372 KOps/s | 21.7790 KOps/s | |
test_split_pytree | 0.3451ms | 50.5006μs | 19.8017 KOps/s | 19.7961 KOps/s | |
test_split_td | 0.1986ms | 62.1371μs | 16.0935 KOps/s | 15.4912 KOps/s | |
test_add_pytree | 0.2030ms | 60.3920μs | 16.5585 KOps/s | 14.8743 KOps/s | |
test_add_td | 0.2464ms | 97.2783μs | 10.2798 KOps/s | 9.9439 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.4137ms | 0.2146ms | 4.6608 KOps/s | 4.5778 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.3216ms | 0.1757ms | 5.6907 KOps/s | 5.6836 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.2995ms | 0.1495ms | 6.6887 KOps/s | 6.6468 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.3503ms | 0.1991ms | 5.0227 KOps/s | 4.9636 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 0.1441ms | 22.2131μs | 45.0185 KOps/s | 44.4145 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1160ms | 48.1909μs | 20.7508 KOps/s | 20.7220 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1534ms | 74.4271μs | 13.4360 KOps/s | 13.5859 KOps/s | |
test_compile_copy_nested[pytree-eager] | 86.3020μs | 60.0953μs | 16.6402 KOps/s | 16.5738 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.5142ms | 0.3357ms | 2.9785 KOps/s | 2.9739 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3737ms | 0.2223ms | 4.4986 KOps/s | 4.4006 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.2789ms | 0.1339ms | 7.4708 KOps/s | 7.4368 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.2103ms | 64.3834μs | 15.5320 KOps/s | 15.0760 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.5322ms | 0.3360ms | 2.9762 KOps/s | 2.9927 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.8650ms | 0.6599ms | 1.5153 KOps/s | 1.5186 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3855ms | 0.2692ms | 3.7150 KOps/s | 3.6404 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.4806ms | 0.3369ms | 2.9683 KOps/s | 2.9596 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.2266ms | 76.2062μs | 13.1223 KOps/s | 12.8944 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.3094ms | 0.1351ms | 7.4046 KOps/s | 7.3650 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.7225ms | 0.5641ms | 1.7727 KOps/s | 1.7731 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.5186ms | 0.3343ms | 2.9913 KOps/s | 2.9878 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 0.2280ms | 19.1853μs | 52.1231 KOps/s | 51.1114 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 65.8110μs | 31.4363μs | 31.8104 KOps/s | 30.3017 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.2738ms | 76.7027μs | 13.0374 KOps/s | 12.9762 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.2456ms | 60.8345μs | 16.4380 KOps/s | 16.4059 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 2.4685ms | 0.8578ms | 1.1658 KOps/s | 1.0650 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 3.5343ms | 3.3746ms | 296.3301 Ops/s | 288.6321 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 2.4560ms | 0.8498ms | 1.1767 KOps/s | 1.0780 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 3.6320ms | 3.4249ms | 291.9808 Ops/s | 293.0317 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.2544ms | 0.1139ms | 8.7814 KOps/s | 8.7196 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.2490ms | 67.2042μs | 14.8800 KOps/s | 15.2632 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.2588ms | 0.1061ms | 9.4253 KOps/s | 9.2449 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.2240ms | 50.8688μs | 19.6584 KOps/s | 21.2943 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.2796ms | 0.1102ms | 9.0724 KOps/s | 9.2951 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.2309ms | 50.9432μs | 19.6297 KOps/s | 21.1715 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.2714ms | 0.1429ms | 6.9998 KOps/s | 6.8128 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1774ms | 26.7036μs | 37.4481 KOps/s | 36.5091 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.2838ms | 0.1342ms | 7.4494 KOps/s | 7.3311 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 0.1248ms | 23.0287μs | 43.4242 KOps/s | 43.1354 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.3075ms | 0.1336ms | 7.4857 KOps/s | 7.2590 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 61.7120μs | 22.8365μs | 43.7896 KOps/s | 42.6197 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.3053ms | 0.1419ms | 7.0458 KOps/s | 6.9395 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.4484ms | 26.8689μs | 37.2178 KOps/s | 37.0497 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.3274ms | 0.1343ms | 7.4433 KOps/s | 7.3541 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 80.7520μs | 23.0704μs | 43.3455 KOps/s | 43.4090 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.3459ms | 0.1339ms | 7.4689 KOps/s | 7.3553 KOps/s | |
test_compile_indexing[int-pytree-eager] | 0.3255ms | 23.0272μs | 43.4270 KOps/s | 43.2019 KOps/s | |
test_mod_add[eager] | 0.2512ms | 33.6670μs | 29.7027 KOps/s | 30.0902 KOps/s | |
test_mod_add[compile] | 0.2090ms | 70.3575μs | 14.2131 KOps/s | 13.5692 KOps/s | |
test_mod_add[compile-overhead] | 0.2642ms | 0.1378ms | 7.2558 KOps/s | 6.2185 KOps/s | |
test_mod_wrap[eager] | 0.4415ms | 0.2720ms | 3.6760 KOps/s | 3.7922 KOps/s | |
test_mod_wrap[compile] | 1.2215ms | 0.3065ms | 3.2623 KOps/s | 3.2158 KOps/s | |
test_mod_wrap[compile-overhead] | 8.2738ms | 4.3381ms | 230.5172 Ops/s | 222.6593 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.5811ms | 1.3812ms | 724.0173 Ops/s | 719.0035 Ops/s | |
test_mod_wrap_and_backward[compile] | 2.7295ms | 1.3681ms | 730.9550 Ops/s | 718.4106 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.3813ms | 0.9240ms | 1.0822 KOps/s | 1.0733 KOps/s | |
test_seq_add[eager] | 0.2597ms | 0.1031ms | 9.6994 KOps/s | 9.7898 KOps/s | |
test_seq_add[compile] | 0.2280ms | 84.1553μs | 11.8828 KOps/s | 11.7829 KOps/s | |
test_seq_add[compile-overhead] | 0.2712ms | 0.1198ms | 8.3458 KOps/s | 8.3008 KOps/s | |
test_seq_wrap[eager] | 0.5794ms | 0.4010ms | 2.4935 KOps/s | 2.4348 KOps/s | |
test_seq_wrap[compile] | 0.4907ms | 0.3195ms | 3.1295 KOps/s | 3.0490 KOps/s | |
test_seq_wrap[compile-overhead] | 0.4041ms | 0.2293ms | 4.3607 KOps/s | 4.3043 KOps/s | |
test_func_call_runtime[False-eager] | 1.0438ms | 0.8093ms | 1.2356 KOps/s | 1.3050 KOps/s | |
test_func_call_runtime[False-compile] | 1.0184ms | 0.8095ms | 1.2353 KOps/s | 1.2231 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.5797ms | 0.3738ms | 2.6751 KOps/s | 2.6247 KOps/s | |
test_func_call_runtime[True-eager] | 1.1571ms | 0.9533ms | 1.0489 KOps/s | 1.0381 KOps/s | |
test_func_call_runtime[True-compile] | 1.1090ms | 0.8584ms | 1.1649 KOps/s | 1.1464 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.5587ms | 0.4180ms | 2.3925 KOps/s | 2.3489 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.9431ms | 0.7545ms | 1.3254 KOps/s | 1.2347 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.9828ms | 0.8185ms | 1.2218 KOps/s | 1.2091 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.5182ms | 0.3738ms | 2.6755 KOps/s | 2.6232 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.2072ms | 1.0538ms | 948.9175 Ops/s | 900.8351 Ops/s | |
test_func_call_cm_runtime[True-compile] | 1.2034ms | 1.0398ms | 961.7157 Ops/s | 949.0728 Ops/s | |
test_func_call_cm_runtime[True-compile-overhead] | 1.2925ms | 1.0554ms | 947.4691 Ops/s | 950.7106 Ops/s | |
test_distributed | 1.1018ms | 73.2294μs | 13.6557 KOps/s | 13.8871 KOps/s | |
test_tdmodule | 54.9610μs | 15.9612μs | 62.6521 KOps/s | 68.1826 KOps/s | |
test_tdmodule_dispatch | 48.2110μs | 32.6764μs | 30.6031 KOps/s | 33.0504 KOps/s | |
test_tdseq | 31.7210μs | 16.5848μs | 60.2960 KOps/s | 63.0168 KOps/s | |
test_tdseq_dispatch | 50.9010μs | 34.1535μs | 29.2796 KOps/s | 30.2842 KOps/s | |
test_instantiation_functorch | 2.1906ms | 2.0357ms | 491.2337 Ops/s | 476.0856 Ops/s | |
test_instantiation_td | 2.1257ms | 1.3451ms | 743.4496 Ops/s | 737.9527 Ops/s | |
test_exec_functorch | 0.4097ms | 0.2309ms | 4.3305 KOps/s | 4.3305 KOps/s | |
test_exec_functional_call | 0.4045ms | 0.2267ms | 4.4120 KOps/s | 4.4128 KOps/s | |
test_exec_td | 0.4033ms | 0.2264ms | 4.4174 KOps/s | 4.2906 KOps/s | |
test_exec_td_decorator | 0.5897ms | 0.2788ms | 3.5863 KOps/s | 3.4989 KOps/s | |
test_vmap_mlp_speed[True-True] | 0.8321ms | 0.6550ms | 1.5267 KOps/s | 1.5121 KOps/s | |
test_vmap_mlp_speed[True-False] | 0.7938ms | 0.6533ms | 1.5307 KOps/s | 1.5300 KOps/s | |
test_vmap_mlp_speed[False-True] | 0.8137ms | 0.6028ms | 1.6590 KOps/s | 1.7292 KOps/s | |
test_vmap_mlp_speed[False-False] | 0.8017ms | 0.5927ms | 1.6871 KOps/s | 1.7351 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 1.5948ms | 0.7174ms | 1.3940 KOps/s | 1.4220 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.9241ms | 0.7144ms | 1.3997 KOps/s | 1.4190 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.8577ms | 0.6260ms | 1.5975 KOps/s | 1.6139 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.8485ms | 0.6330ms | 1.5798 KOps/s | 1.6097 KOps/s | |
test_vmap_transformer_speed[True-True] | 9.0158ms | 8.8393ms | 113.1305 Ops/s | 113.2016 Ops/s | |
test_vmap_transformer_speed[True-False] | 9.2702ms | 8.7970ms | 113.6756 Ops/s | 113.4200 Ops/s | |
test_vmap_transformer_speed[False-True] | 8.9875ms | 8.7366ms | 114.4615 Ops/s | 114.7378 Ops/s | |
test_vmap_transformer_speed[False-False] | 8.8626ms | 8.7075ms | 114.8432 Ops/s | 114.7891 Ops/s | |
test_vmap_transformer_speed_decorator[True-True] | 21.0927ms | 20.8231ms | 48.0235 Ops/s | 48.1073 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 21.0641ms | 20.8835ms | 47.8848 Ops/s | 47.9258 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 21.5130ms | 20.7393ms | 48.2177 Ops/s | 48.4467 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 20.9555ms | 20.7089ms | 48.2885 Ops/s | 48.3938 Ops/s | |
test_to_module_speed[True] | 2.4273ms | 1.1579ms | 863.5981 Ops/s | 866.8960 Ops/s | |
test_to_module_speed[False] | 1.6255ms | 1.1366ms | 879.8518 Ops/s | 881.2403 Ops/s | |
test_tc_init | 82.1120μs | 38.2978μs | 26.1111 KOps/s | 27.7478 KOps/s | |
test_tc_init_nested | 0.2037ms | 76.5708μs | 13.0598 KOps/s | 13.6811 KOps/s | |
test_tc_first_layer_tensor | 16.8700μs | 0.9272μs | 1.0786 MOps/s | 1.2783 MOps/s | |
test_tc_first_layer_nontensor | 21.4510μs | 2.5531μs | 391.6814 KOps/s | 392.0741 KOps/s | |
test_tc_second_layer_tensor | 23.8500μs | 1.7318μs | 577.4196 KOps/s | 619.6230 KOps/s | |
test_tc_second_layer_nontensor | 18.7710μs | 3.4240μs | 292.0579 KOps/s | 296.5386 KOps/s | |
test_unbind | 0.1883s | 13.1743ms | 75.9052 Ops/s | 81.1499 Ops/s | |
test_full_like | 0.7615ms | 0.5789ms | 1.7273 KOps/s | 1.7313 KOps/s | |
test_zeros_like | 0.3482ms | 0.1979ms | 5.0534 KOps/s | 5.0546 KOps/s | |
test_ones_like | 0.3485ms | 0.1978ms | 5.0565 KOps/s | 5.0564 KOps/s | |
test_clone | 0.5906ms | 0.4150ms | 2.4095 KOps/s | 2.4069 KOps/s | |
test_squeeze | 29.0110μs | 10.7612μs | 92.9266 KOps/s | 91.8588 KOps/s | |
test_unsqueeze | 0.2439ms | 79.4173μs | 12.5917 KOps/s | 12.2763 KOps/s | |
test_split | 0.4228ms | 0.1723ms | 5.8030 KOps/s | 5.5362 KOps/s | |
test_permute | 0.2646ms | 0.1882ms | 5.3124 KOps/s | 5.2233 KOps/s | |
test_stack | 1.3779ms | 0.9133ms | 1.0949 KOps/s | 1.1380 KOps/s | |
test_cat | 1.3827ms | 1.2320ms | 811.7191 Ops/s | 811.4115 Ops/s |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
bug
Something isn't working
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Description
Describe your changes in detail.
Motivation and Context
Why is this change required? What problem does it solve?
If it fixes an open issue, please link to the issue here.
You can use the syntax
close #15213
if this solves the issue #15213Types of changes
What types of changes does your code introduce? Remove all that do not apply:
Checklist
Go over all the following points, and put an
x
in all the boxes that apply.If you are unsure about any of these, don't hesitate to ask. We are here to help!