-
Notifications
You must be signed in to change notification settings - Fork 79
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BugFix] Better repr of lazy stacks #1076
Merged
Merged
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 53.6810μs | 17.3672μs | 57.5799 KOps/s | 54.9837 KOps/s | |
test_plain_set_stack_nested | 41.1870μs | 17.5980μs | 56.8248 KOps/s | 54.7021 KOps/s | |
test_plain_set_nested_inplace | 59.9120μs | 19.0114μs | 52.6000 KOps/s | 50.2596 KOps/s | |
test_plain_set_stack_nested_inplace | 44.7440μs | 18.8513μs | 53.0467 KOps/s | 50.0457 KOps/s | |
test_items | 22.4320μs | 4.0742μs | 245.4488 KOps/s | 241.3768 KOps/s | |
test_items_nested | 0.5947ms | 0.3411ms | 2.9321 KOps/s | 2.9789 KOps/s | |
test_items_nested_locked | 0.4588ms | 0.3401ms | 2.9403 KOps/s | 2.9728 KOps/s | |
test_items_nested_leaf | 0.1340ms | 71.1671μs | 14.0514 KOps/s | 14.2940 KOps/s | |
test_items_stack_nested | 0.5241ms | 0.3431ms | 2.9150 KOps/s | 2.9531 KOps/s | |
test_items_stack_nested_leaf | 0.1355ms | 73.2988μs | 13.6428 KOps/s | 13.9255 KOps/s | |
test_items_stack_nested_locked | 0.4950ms | 0.3420ms | 2.9237 KOps/s | 2.9553 KOps/s | |
test_keys | 21.0290μs | 3.4740μs | 287.8495 KOps/s | 284.2116 KOps/s | |
test_keys_nested | 0.2304ms | 0.1374ms | 7.2757 KOps/s | 7.3662 KOps/s | |
test_keys_nested_locked | 1.9482ms | 0.1409ms | 7.0957 KOps/s | 6.9683 KOps/s | |
test_keys_nested_leaf | 0.2213ms | 0.1148ms | 8.7073 KOps/s | 8.4672 KOps/s | |
test_keys_stack_nested | 0.2329ms | 0.1335ms | 7.4907 KOps/s | 7.3516 KOps/s | |
test_keys_stack_nested_leaf | 0.2039ms | 0.1131ms | 8.8382 KOps/s | 8.5105 KOps/s | |
test_keys_stack_nested_locked | 0.2030ms | 0.1390ms | 7.1932 KOps/s | 6.9487 KOps/s | |
test_values | 4.5004μs | 1.0347μs | 966.4913 KOps/s | 974.1011 KOps/s | |
test_values_nested | 0.1091ms | 55.6029μs | 17.9847 KOps/s | 18.1615 KOps/s | |
test_values_nested_locked | 0.1258ms | 55.4226μs | 18.0432 KOps/s | 18.1436 KOps/s | |
test_values_nested_leaf | 0.1128ms | 59.5822μs | 16.7835 KOps/s | 16.7788 KOps/s | |
test_values_stack_nested | 0.1071ms | 57.0728μs | 17.5215 KOps/s | 17.3921 KOps/s | |
test_values_stack_nested_leaf | 0.1133ms | 59.7491μs | 16.7366 KOps/s | 16.8043 KOps/s | |
test_values_stack_nested_locked | 0.1049ms | 57.4968μs | 17.3923 KOps/s | 17.8985 KOps/s | |
test_membership | 4.0490μs | 0.7469μs | 1.3388 MOps/s | 1.4400 MOps/s | |
test_membership_nested | 36.5780μs | 2.7506μs | 363.5538 KOps/s | 370.2335 KOps/s | |
test_membership_nested_leaf | 21.3700μs | 2.7202μs | 367.6247 KOps/s | 364.2112 KOps/s | |
test_membership_stacked_nested | 28.4530μs | 2.7398μs | 364.9898 KOps/s | 372.5485 KOps/s | |
test_membership_stacked_nested_leaf | 29.2950μs | 2.6992μs | 370.4795 KOps/s | 364.8796 KOps/s | |
test_membership_nested_last | 23.1640μs | 4.0713μs | 245.6230 KOps/s | 246.7485 KOps/s | |
test_membership_nested_leaf_last | 29.7560μs | 4.1055μs | 243.5753 KOps/s | 246.1091 KOps/s | |
test_membership_stacked_nested_last | 28.9640μs | 5.6645μs | 176.5390 KOps/s | 247.4631 KOps/s | |
test_membership_stacked_nested_leaf_last | 36.1280μs | 5.6434μs | 177.1977 KOps/s | 247.6163 KOps/s | |
test_nested_getleaf | 25.7780μs | 10.4823μs | 95.3986 KOps/s | 95.2165 KOps/s | |
test_nested_get | 28.7130μs | 9.9091μs | 100.9168 KOps/s | 100.3478 KOps/s | |
test_stacked_getleaf | 37.8420μs | 10.6578μs | 93.8283 KOps/s | 95.5410 KOps/s | |
test_stacked_get | 34.5250μs | 9.9531μs | 100.4707 KOps/s | 100.8608 KOps/s | |
test_nested_getitemleaf | 35.0560μs | 11.0832μs | 90.2269 KOps/s | 92.2387 KOps/s | |
test_nested_getitem | 38.8730μs | 10.2293μs | 97.7583 KOps/s | 98.5538 KOps/s | |
test_stacked_getitemleaf | 32.0800μs | 11.1268μs | 89.8727 KOps/s | 91.7299 KOps/s | |
test_stacked_getitem | 31.4090μs | 10.1262μs | 98.7539 KOps/s | 97.5147 KOps/s | |
test_lock_nested | 5.2066ms | 0.4418ms | 2.2633 KOps/s | 2.2521 KOps/s | |
test_lock_stack_nested | 0.5021ms | 0.4068ms | 2.4582 KOps/s | 2.4097 KOps/s | |
test_unlock_nested | 0.8956ms | 0.3598ms | 2.7793 KOps/s | 2.7377 KOps/s | |
test_unlock_stack_nested | 0.6949ms | 0.3255ms | 3.0724 KOps/s | 2.9741 KOps/s | |
test_flatten_speed | 0.1560ms | 91.4200μs | 10.9385 KOps/s | 11.0594 KOps/s | |
test_unflatten_speed | 0.6367ms | 0.4638ms | 2.1562 KOps/s | 2.1259 KOps/s | |
test_common_ops | 4.3356ms | 0.7571ms | 1.3208 KOps/s | 1.2343 KOps/s | |
test_creation | 21.1190μs | 2.0658μs | 484.0628 KOps/s | 484.5022 KOps/s | |
test_creation_empty | 38.4630μs | 10.1870μs | 98.1642 KOps/s | 83.0823 KOps/s | |
test_creation_nested_1 | 42.1790μs | 12.9270μs | 77.3574 KOps/s | 66.1088 KOps/s | |
test_creation_nested_2 | 62.1860μs | 16.7476μs | 59.7101 KOps/s | 52.4644 KOps/s | |
test_clone | 60.0730μs | 13.2553μs | 75.4414 KOps/s | 74.4847 KOps/s | |
test_getitem[int] | 1.4174ms | 12.4440μs | 80.3603 KOps/s | 81.0942 KOps/s | |
test_getitem[slice_int] | 0.1404ms | 23.0669μs | 43.3522 KOps/s | 41.7612 KOps/s | |
test_getitem[range] | 0.2421ms | 47.4819μs | 21.0607 KOps/s | 20.3268 KOps/s | |
test_getitem[tuple] | 0.1394ms | 19.4104μs | 51.5188 KOps/s | 51.5902 KOps/s | |
test_getitem[list] | 0.1616ms | 42.9891μs | 23.2617 KOps/s | 22.9004 KOps/s | |
test_setitem_dim[int] | 70.6220μs | 25.1133μs | 39.8195 KOps/s | 38.9654 KOps/s | |
test_setitem_dim[slice_int] | 0.2076ms | 50.9687μs | 19.6199 KOps/s | 19.8236 KOps/s | |
test_setitem_dim[range] | 0.1372ms | 74.2355μs | 13.4706 KOps/s | 13.0615 KOps/s | |
test_setitem_dim[tuple] | 77.5660μs | 40.0243μs | 24.9848 KOps/s | 24.7311 KOps/s | |
test_setitem | 71.1240μs | 19.3557μs | 51.6643 KOps/s | 46.5998 KOps/s | |
test_set | 62.1670μs | 18.6339μs | 53.6656 KOps/s | 47.4178 KOps/s | |
test_set_shared | 2.1318ms | 0.1681ms | 5.9505 KOps/s | 5.9288 KOps/s | |
test_update | 0.1149ms | 21.1724μs | 47.2313 KOps/s | 41.4448 KOps/s | |
test_update_nested | 86.6830μs | 30.9064μs | 32.3557 KOps/s | 29.2080 KOps/s | |
test_update__nested | 1.0200ms | 32.1639μs | 31.0908 KOps/s | 30.1694 KOps/s | |
test_set_nested | 71.3640μs | 20.9079μs | 47.8289 KOps/s | 43.4973 KOps/s | |
test_set_nested_new | 81.9840μs | 25.7332μs | 38.8604 KOps/s | 36.0729 KOps/s | |
test_select | 93.5850μs | 41.5270μs | 24.0807 KOps/s | 22.5931 KOps/s | |
test_select_nested | 0.1405ms | 59.3845μs | 16.8394 KOps/s | 17.0250 KOps/s | |
test_exclude_nested | 0.3482ms | 74.5153μs | 13.4201 KOps/s | 13.5408 KOps/s | |
test_empty[True] | 0.6310ms | 0.3452ms | 2.8965 KOps/s | 2.9054 KOps/s | |
test_empty[False] | 8.9395μs | 1.2660μs | 789.8834 KOps/s | 810.9634 KOps/s | |
test_unbind_speed | 0.3568ms | 0.2601ms | 3.8450 KOps/s | 3.8368 KOps/s | |
test_unbind_speed_stack0 | 0.4710ms | 0.2531ms | 3.9514 KOps/s | 3.8890 KOps/s | |
test_unbind_speed_stack1 | 0.1076s | 0.7524ms | 1.3291 KOps/s | 1.5890 KOps/s | |
test_split | 1.7902ms | 1.5602ms | 640.9603 Ops/s | 587.1769 Ops/s | |
test_chunk | 92.1879ms | 1.8445ms | 542.1530 Ops/s | 592.5612 Ops/s | |
test_consolidate_njt[False-None] | 8.7679ms | 8.0550ms | 124.1467 Ops/s | 121.7984 Ops/s | |
test_creation[device0] | 0.1767ms | 90.7818μs | 11.0154 KOps/s | 10.6997 KOps/s | |
test_creation_from_tensor | 3.8072ms | 93.6932μs | 10.6731 KOps/s | 10.5949 KOps/s | |
test_add_one[memmap_tensor0] | 0.1070ms | 4.9944μs | 200.2240 KOps/s | 192.5943 KOps/s | |
test_contiguous[memmap_tensor0] | 18.9560μs | 0.5067μs | 1.9734 MOps/s | 1.9586 MOps/s | |
test_stack[memmap_tensor0] | 33.7630μs | 3.5167μs | 284.3579 KOps/s | 281.8057 KOps/s | |
test_memmaptd_index | 1.1275ms | 0.2353ms | 4.2491 KOps/s | 4.1674 KOps/s | |
test_memmaptd_index_astensor | 0.6735ms | 0.3138ms | 3.1869 KOps/s | 3.2051 KOps/s | |
test_memmaptd_index_op | 1.0831ms | 0.5812ms | 1.7207 KOps/s | 1.6193 KOps/s | |
test_serialize_model | 0.1216s | 0.1150s | 8.6969 Ops/s | 7.7138 Ops/s | |
test_serialize_model_pickle | 0.4548s | 0.3851s | 2.5966 Ops/s | 2.4769 Ops/s | |
test_serialize_weights | 0.2055s | 0.1236s | 8.0895 Ops/s | 8.9597 Ops/s | |
test_serialize_weights_returnearly | 0.1873s | 0.1594s | 6.2752 Ops/s | 6.3521 Ops/s | |
test_serialize_weights_pickle | 0.5559s | 0.4410s | 2.2677 Ops/s | 2.5107 Ops/s | |
test_serialize_weights_filesystem | 0.1499s | 0.1407s | 7.1094 Ops/s | 6.5418 Ops/s | |
test_serialize_model_filesystem | 0.2332s | 0.1603s | 6.2392 Ops/s | 6.7176 Ops/s | |
test_reshape_pytree | 85.8720μs | 26.8985μs | 37.1768 KOps/s | 35.7071 KOps/s | |
test_reshape_td | 71.4840μs | 32.5468μs | 30.7250 KOps/s | 32.1192 KOps/s | |
test_view_pytree | 0.1123ms | 27.1066μs | 36.8914 KOps/s | 36.8051 KOps/s | |
test_view_td | 84.7990μs | 36.4604μs | 27.4270 KOps/s | 27.5490 KOps/s | |
test_unbind_pytree | 67.3170μs | 29.9419μs | 33.3980 KOps/s | 33.7211 KOps/s | |
test_unbind_td | 0.3282ms | 38.5956μs | 25.9097 KOps/s | 25.7003 KOps/s | |
test_split_pytree | 75.1410μs | 29.4062μs | 34.0064 KOps/s | 33.7189 KOps/s | |
test_split_td | 0.5271ms | 43.0583μs | 23.2243 KOps/s | 22.9252 KOps/s | |
test_add_pytree | 93.6360μs | 36.2789μs | 27.5642 KOps/s | 26.9395 KOps/s | |
test_add_td | 0.1542ms | 54.1474μs | 18.4681 KOps/s | 16.6301 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1117ms | 61.9473μs | 16.1427 KOps/s | 15.9877 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.3403ms | 0.1604ms | 6.2348 KOps/s | 6.2356 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1078ms | 45.5828μs | 21.9381 KOps/s | 21.5461 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2612ms | 0.1208ms | 8.2753 KOps/s | 8.1832 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 57.3470μs | 25.4410μs | 39.3066 KOps/s | 38.2567 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1428ms | 52.8366μs | 18.9263 KOps/s | 18.7316 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1670ms | 78.2755μs | 12.7754 KOps/s | 12.5046 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1304ms | 67.6613μs | 14.7795 KOps/s | 14.6502 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.2206ms | 0.1047ms | 9.5489 KOps/s | 9.4646 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3390ms | 0.2000ms | 4.9998 KOps/s | 4.9424 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1151ms | 44.6318μs | 22.4055 KOps/s | 21.7748 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.4666ms | 61.6442μs | 16.2221 KOps/s | 15.8419 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.1854ms | 0.1023ms | 9.7716 KOps/s | 9.5790 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.3883ms | 0.2057ms | 4.8625 KOps/s | 4.8244 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3377ms | 0.2125ms | 4.7062 KOps/s | 4.6870 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2387ms | 0.1093ms | 9.1509 KOps/s | 9.5041 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.2012ms | 54.4157μs | 18.3770 KOps/s | 18.2391 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.2192ms | 46.8783μs | 21.3318 KOps/s | 21.4608 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.2023ms | 0.1644ms | 6.0812 KOps/s | 6.1947 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.1857ms | 0.1044ms | 9.5741 KOps/s | 9.4624 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 77.9360μs | 21.4325μs | 46.6581 KOps/s | 48.2293 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1279ms | 57.6577μs | 17.3437 KOps/s | 16.9433 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1447ms | 82.5293μs | 12.1169 KOps/s | 12.1978 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1301ms | 69.8363μs | 14.3192 KOps/s | 14.5506 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.3026ms | 0.2096ms | 4.7716 KOps/s | 4.7223 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 1.4128ms | 1.2896ms | 775.4208 Ops/s | 766.6530 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.2803ms | 0.2033ms | 4.9196 KOps/s | 4.9012 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 1.6469ms | 0.7938ms | 1.2597 KOps/s | 1.2422 KOps/s | |
test_compile_assign_and_add_stack[compile] | 0.7599ms | 0.4557ms | 2.1945 KOps/s | 2.1827 KOps/s | |
test_compile_assign_and_add_stack[eager] | 4.5998ms | 2.6544ms | 376.7362 Ops/s | 367.0909 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.1029ms | 37.3042μs | 26.8066 KOps/s | 27.9354 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.4334ms | 33.4739μs | 29.8740 KOps/s | 30.4202 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 86.4920μs | 29.9267μs | 33.4149 KOps/s | 34.1300 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 84.2180μs | 23.6075μs | 42.3594 KOps/s | 41.8739 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 72.9170μs | 30.6449μs | 32.6319 KOps/s | 33.3198 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 73.1070μs | 23.8572μs | 41.9160 KOps/s | 42.5251 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1496ms | 52.5662μs | 19.0236 KOps/s | 19.7320 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.4422ms | 18.5012μs | 54.0505 KOps/s | 50.1013 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1114ms | 44.1594μs | 22.6452 KOps/s | 22.8040 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 69.3610μs | 18.7885μs | 53.2240 KOps/s | 52.1680 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1287ms | 45.2598μs | 22.0946 KOps/s | 22.6154 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 59.6410μs | 18.8256μs | 53.1190 KOps/s | 52.6519 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1324ms | 53.6252μs | 18.6479 KOps/s | 19.4865 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.8563ms | 18.6833μs | 53.5238 KOps/s | 50.3021 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1421ms | 45.5081μs | 21.9741 KOps/s | 22.5851 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 58.8400μs | 18.8832μs | 52.9572 KOps/s | 53.1122 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1116ms | 45.4677μs | 21.9936 KOps/s | 22.5833 KOps/s | |
test_compile_indexing[int-pytree-eager] | 52.9290μs | 18.6766μs | 53.5428 KOps/s | 53.3860 KOps/s | |
test_mod_add[eager] | 85.1590μs | 26.5656μs | 37.6427 KOps/s | 36.4452 KOps/s | |
test_mod_add[compile] | 87.7040μs | 44.9307μs | 22.2565 KOps/s | 22.6155 KOps/s | |
test_mod_add[compile-overhead] | 0.1090ms | 44.6655μs | 22.3886 KOps/s | 22.2786 KOps/s | |
test_mod_wrap[eager] | 0.4061ms | 0.2103ms | 4.7548 KOps/s | 4.7084 KOps/s | |
test_mod_wrap[compile] | 1.3104ms | 0.2052ms | 4.8721 KOps/s | 4.9012 KOps/s | |
test_mod_wrap[compile-overhead] | 1.3946ms | 0.1993ms | 5.0171 KOps/s | 4.9667 KOps/s | |
test_mod_wrap_and_backward[eager] | 16.1648ms | 11.7337ms | 85.2249 Ops/s | 84.4811 Ops/s | |
test_mod_wrap_and_backward[compile] | 15.7440ms | 13.1389ms | 76.1101 Ops/s | 73.8649 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 19.5961ms | 12.9941ms | 76.9582 Ops/s | 77.1895 Ops/s | |
test_seq_add[eager] | 0.2049ms | 93.8364μs | 10.6568 KOps/s | 10.5469 KOps/s | |
test_seq_add[compile] | 0.1514ms | 60.2857μs | 16.5877 KOps/s | 16.1993 KOps/s | |
test_seq_add[compile-overhead] | 0.1171ms | 58.2528μs | 17.1666 KOps/s | 16.4974 KOps/s | |
test_seq_wrap[eager] | 0.7235ms | 0.3869ms | 2.5846 KOps/s | 2.5015 KOps/s | |
test_seq_wrap[compile] | 0.4066ms | 0.2259ms | 4.4261 KOps/s | 4.3583 KOps/s | |
test_seq_wrap[compile-overhead] | 0.3126ms | 0.2216ms | 4.5133 KOps/s | 4.3252 KOps/s | |
test_func_call_runtime[False-eager] | 0.9583ms | 0.5403ms | 1.8508 KOps/s | 1.8136 KOps/s | |
test_func_call_runtime[False-compile] | 0.7971ms | 0.4278ms | 2.3374 KOps/s | 2.2955 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.5384ms | 0.4224ms | 2.3675 KOps/s | 2.2901 KOps/s | |
test_func_call_runtime[True-eager] | 0.9169ms | 0.7474ms | 1.3380 KOps/s | 1.3125 KOps/s | |
test_func_call_runtime[True-compile] | 0.8573ms | 0.4714ms | 2.1212 KOps/s | 2.1104 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.5785ms | 0.4698ms | 2.1288 KOps/s | 2.0770 KOps/s | |
test_func_call_cm_runtime[False-eager] | 1.0600ms | 0.5440ms | 1.8382 KOps/s | 1.8427 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.5424ms | 0.4279ms | 2.3372 KOps/s | 2.3054 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.5278ms | 0.4285ms | 2.3337 KOps/s | 2.3092 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.4430ms | 0.8948ms | 1.1175 KOps/s | 1.1149 KOps/s | |
test_func_call_cm_runtime[True-compile] | 0.7507ms | 0.4935ms | 2.0263 KOps/s | 1.9907 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.8861ms | 0.4935ms | 2.0262 KOps/s | 1.9926 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.7281ms | 1.8595ms | 537.7791 Ops/s | 521.5600 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.9260ms | 0.5147ms | 1.9429 KOps/s | 1.8686 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.6265ms | 0.5126ms | 1.9510 KOps/s | 1.8625 KOps/s | |
test_distributed | 0.3312ms | 0.1253ms | 7.9835 KOps/s | 7.6101 KOps/s | |
test_tdmodule | 71.9840μs | 18.3626μs | 54.4585 KOps/s | 46.7693 KOps/s | |
test_tdmodule_dispatch | 55.9450μs | 36.4539μs | 27.4319 KOps/s | 25.3525 KOps/s | |
test_tdseq | 44.6130μs | 21.3200μs | 46.9042 KOps/s | 43.5719 KOps/s | |
test_tdseq_dispatch | 63.7290μs | 41.9255μs | 23.8519 KOps/s | 22.3642 KOps/s | |
test_instantiation_functorch | 2.0149ms | 1.5228ms | 656.6669 Ops/s | 655.2430 Ops/s | |
test_exec_functorch | 0.2898ms | 0.1790ms | 5.5866 KOps/s | 5.5526 KOps/s | |
test_exec_functional_call | 0.4236ms | 0.1715ms | 5.8300 KOps/s | 5.8456 KOps/s | |
test_exec_td_decorator | 0.4430ms | 0.2256ms | 4.4332 KOps/s | 4.4517 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.9670ms | 0.6262ms | 1.5970 KOps/s | 1.5327 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 1.1542ms | 0.6348ms | 1.5752 KOps/s | 1.4984 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.6928ms | 0.5108ms | 1.9579 KOps/s | 1.8724 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.6299ms | 0.5102ms | 1.9601 KOps/s | 1.8768 KOps/s | |
test_to_module_speed[True] | 2.0509ms | 1.3051ms | 766.2158 Ops/s | 769.4951 Ops/s | |
test_to_module_speed[False] | 1.6795ms | 1.2638ms | 791.2914 Ops/s | 799.8932 Ops/s | |
test_tc_init | 0.1331ms | 44.7368μs | 22.3530 KOps/s | 21.8502 KOps/s | |
test_tc_init_nested | 0.1581ms | 84.7442μs | 11.8002 KOps/s | 10.8135 KOps/s | |
test_tc_first_layer_tensor | 37.4300μs | 1.5050μs | 664.4648 KOps/s | 659.1101 KOps/s | |
test_tc_first_layer_nontensor | 26.1490μs | 4.6314μs | 215.9176 KOps/s | 213.6458 KOps/s | |
test_tc_second_layer_tensor | 38.3110μs | 2.8013μs | 356.9792 KOps/s | 360.9306 KOps/s | |
test_tc_second_layer_nontensor | 35.1650μs | 5.9543μs | 167.9464 KOps/s | 168.7486 KOps/s | |
test_unbind | 0.2154s | 13.3603ms | 74.8488 Ops/s | 83.4037 Ops/s | |
test_full_like | 14.7735ms | 12.9095ms | 77.4623 Ops/s | 143.0219 Ops/s | |
test_zeros_like | 11.5579ms | 7.2392ms | 138.1371 Ops/s | 371.8699 Ops/s | |
test_ones_like | 14.7841ms | 7.4488ms | 134.2490 Ops/s | 319.7593 Ops/s | |
test_clone | 14.3271ms | 8.7731ms | 113.9852 Ops/s | 204.6920 Ops/s | |
test_squeeze | 57.6580μs | 11.7628μs | 85.0141 KOps/s | 85.2087 KOps/s | |
test_unsqueeze | 0.1601ms | 90.1386μs | 11.0940 KOps/s | 11.3528 KOps/s | |
test_split | 0.4636ms | 0.1870ms | 5.3475 KOps/s | 5.3382 KOps/s | |
test_permute | 0.4036ms | 0.2182ms | 4.5826 KOps/s | 4.6172 KOps/s | |
test_stack | 26.9269ms | 24.0736ms | 41.5393 Ops/s | 41.2286 Ops/s | |
test_cat | 24.3831ms | 23.9502ms | 41.7534 Ops/s | 42.0149 Ops/s |
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 34.6020μs | 10.1823μs | 98.2093 KOps/s | 96.0977 KOps/s | |
test_plain_set_stack_nested | 43.5830μs | 10.2484μs | 97.5760 KOps/s | 95.5481 KOps/s | |
test_plain_set_nested_inplace | 72.4540μs | 11.0967μs | 90.1167 KOps/s | 88.5409 KOps/s | |
test_plain_set_stack_nested_inplace | 38.1620μs | 11.1174μs | 89.9490 KOps/s | 88.9636 KOps/s | |
test_items | 0.1199ms | 3.0279μs | 330.2618 KOps/s | 341.5931 KOps/s | |
test_items_nested | 0.3460ms | 0.3212ms | 3.1131 KOps/s | 3.1514 KOps/s | |
test_items_nested_locked | 0.3646ms | 0.3205ms | 3.1197 KOps/s | 3.1312 KOps/s | |
test_items_nested_leaf | 0.1249ms | 58.4615μs | 17.1053 KOps/s | 17.2946 KOps/s | |
test_items_stack_nested | 0.3677ms | 0.3231ms | 3.0949 KOps/s | 3.1258 KOps/s | |
test_items_stack_nested_leaf | 0.1721ms | 59.1416μs | 16.9086 KOps/s | 16.8794 KOps/s | |
test_items_stack_nested_locked | 0.4062ms | 0.3220ms | 3.1052 KOps/s | 3.1143 KOps/s | |
test_keys | 30.8210μs | 3.4441μs | 290.3512 KOps/s | 290.2077 KOps/s | |
test_keys_nested | 0.1417ms | 70.3943μs | 14.2057 KOps/s | 14.2523 KOps/s | |
test_keys_nested_locked | 2.8584ms | 75.7368μs | 13.2036 KOps/s | 13.1367 KOps/s | |
test_keys_nested_leaf | 91.6750μs | 61.2345μs | 16.3307 KOps/s | 16.2369 KOps/s | |
test_keys_stack_nested | 0.1120ms | 70.7067μs | 14.1429 KOps/s | 13.9740 KOps/s | |
test_keys_stack_nested_leaf | 0.1000ms | 61.7243μs | 16.2011 KOps/s | 15.8775 KOps/s | |
test_keys_stack_nested_locked | 0.1318ms | 76.1715μs | 13.1283 KOps/s | 13.1489 KOps/s | |
test_values | 6.6837μs | 0.8438μs | 1.1851 MOps/s | 1.1869 MOps/s | |
test_values_nested | 54.5230μs | 31.2133μs | 32.0377 KOps/s | 32.1917 KOps/s | |
test_values_nested_locked | 55.8930μs | 32.7872μs | 30.4997 KOps/s | 30.7028 KOps/s | |
test_values_nested_leaf | 0.1004ms | 33.8820μs | 29.5142 KOps/s | 29.8666 KOps/s | |
test_values_stack_nested | 69.0540μs | 31.6986μs | 31.5471 KOps/s | 31.4536 KOps/s | |
test_values_stack_nested_leaf | 57.8930μs | 34.2329μs | 29.2117 KOps/s | 29.1563 KOps/s | |
test_values_stack_nested_locked | 64.3630μs | 33.0975μs | 30.2138 KOps/s | 30.1436 KOps/s | |
test_membership | 1.9426μs | 0.5048μs | 1.9810 MOps/s | 1.9814 MOps/s | |
test_membership_nested | 14.5755μs | 1.9122μs | 522.9489 KOps/s | 520.4250 KOps/s | |
test_membership_nested_leaf | 93.8245μs | 1.9286μs | 518.5127 KOps/s | 509.5950 KOps/s | |
test_membership_stacked_nested | 26.3020μs | 2.0122μs | 496.9590 KOps/s | 503.2057 KOps/s | |
test_membership_stacked_nested_leaf | 18.0610μs | 2.0180μs | 495.5288 KOps/s | 498.3972 KOps/s | |
test_membership_nested_last | 28.6720μs | 2.8333μs | 352.9478 KOps/s | 356.4412 KOps/s | |
test_membership_nested_leaf_last | 25.7510μs | 2.8615μs | 349.4680 KOps/s | 349.9063 KOps/s | |
test_membership_stacked_nested_last | 27.6110μs | 3.2908μs | 303.8770 KOps/s | 305.7269 KOps/s | |
test_membership_stacked_nested_leaf_last | 31.9920μs | 3.2971μs | 303.2939 KOps/s | 307.6735 KOps/s | |
test_nested_getleaf | 27.2920μs | 5.9865μs | 167.0418 KOps/s | 165.3164 KOps/s | |
test_nested_get | 30.8820μs | 5.7015μs | 175.3922 KOps/s | 174.9582 KOps/s | |
test_stacked_getleaf | 88.8750μs | 6.0001μs | 166.6630 KOps/s | 167.2222 KOps/s | |
test_stacked_get | 38.8620μs | 5.6623μs | 176.6064 KOps/s | 174.3474 KOps/s | |
test_nested_getitemleaf | 30.7310μs | 6.0775μs | 164.5410 KOps/s | 163.6658 KOps/s | |
test_nested_getitem | 34.7420μs | 5.7650μs | 173.4608 KOps/s | 171.2445 KOps/s | |
test_stacked_getitemleaf | 31.6310μs | 6.0850μs | 164.3392 KOps/s | 163.9898 KOps/s | |
test_stacked_getitem | 0.1676ms | 5.7515μs | 173.8664 KOps/s | 173.8906 KOps/s | |
test_lock_nested | 7.3868ms | 0.3735ms | 2.6775 KOps/s | 2.7504 KOps/s | |
test_lock_stack_nested | 0.4638ms | 0.3367ms | 2.9699 KOps/s | 3.0369 KOps/s | |
test_unlock_nested | 0.7822ms | 0.3076ms | 3.2509 KOps/s | 3.3455 KOps/s | |
test_unlock_stack_nested | 0.3093ms | 0.2769ms | 3.6117 KOps/s | 3.7370 KOps/s | |
test_flatten_speed | 94.5150μs | 72.1598μs | 13.8581 KOps/s | 13.9019 KOps/s | |
test_unflatten_speed | 0.3245ms | 0.2914ms | 3.4314 KOps/s | 3.4526 KOps/s | |
test_common_ops | 1.7633ms | 0.5807ms | 1.7221 KOps/s | 1.7360 KOps/s | |
test_creation | 0.1605ms | 1.5040μs | 664.9141 KOps/s | 666.3993 KOps/s | |
test_creation_empty | 39.2020μs | 6.7715μs | 147.6772 KOps/s | 141.0319 KOps/s | |
test_creation_nested_1 | 31.9920μs | 8.2945μs | 120.5611 KOps/s | 117.2644 KOps/s | |
test_creation_nested_2 | 44.5730μs | 10.8119μs | 92.4910 KOps/s | 89.8967 KOps/s | |
test_clone | 45.8920μs | 10.8592μs | 92.0881 KOps/s | 97.4668 KOps/s | |
test_getitem[int] | 1.4610ms | 11.1550μs | 89.6456 KOps/s | 95.5430 KOps/s | |
test_getitem[slice_int] | 0.1919ms | 21.5484μs | 46.4072 KOps/s | 46.8747 KOps/s | |
test_getitem[range] | 0.1455ms | 39.5678μs | 25.2730 KOps/s | 26.4961 KOps/s | |
test_getitem[tuple] | 0.2542ms | 18.8717μs | 52.9893 KOps/s | 54.1124 KOps/s | |
test_getitem[list] | 0.2540ms | 33.8003μs | 29.5855 KOps/s | 30.4636 KOps/s | |
test_setitem_dim[int] | 91.4140μs | 19.7121μs | 50.7303 KOps/s | 51.5893 KOps/s | |
test_setitem_dim[slice_int] | 60.4930μs | 38.6624μs | 25.8649 KOps/s | 24.4998 KOps/s | |
test_setitem_dim[range] | 0.1878ms | 55.1011μs | 18.1485 KOps/s | 17.9889 KOps/s | |
test_setitem_dim[tuple] | 71.0030μs | 33.6152μs | 29.7484 KOps/s | 31.6501 KOps/s | |
test_setitem | 92.8640μs | 14.6095μs | 68.4487 KOps/s | 70.7637 KOps/s | |
test_set | 88.2950μs | 14.2060μs | 70.3927 KOps/s | 72.6478 KOps/s | |
test_set_shared | 1.6216ms | 0.1469ms | 6.8083 KOps/s | 6.8398 KOps/s | |
test_update | 0.4915ms | 17.2070μs | 58.1159 KOps/s | 64.1102 KOps/s | |
test_update_nested | 0.2869ms | 21.0562μs | 47.4919 KOps/s | 49.8554 KOps/s | |
test_update__nested | 0.1381ms | 24.3879μs | 41.0040 KOps/s | 42.5880 KOps/s | |
test_set_nested | 89.4440μs | 15.1303μs | 66.0927 KOps/s | 68.8897 KOps/s | |
test_set_nested_new | 85.3740μs | 17.4302μs | 57.3718 KOps/s | 59.1747 KOps/s | |
test_select | 0.1669ms | 28.8220μs | 34.6957 KOps/s | 35.4121 KOps/s | |
test_select_nested | 74.6340μs | 41.9757μs | 23.8233 KOps/s | 23.9014 KOps/s | |
test_exclude_nested | 89.1850μs | 59.3023μs | 16.8627 KOps/s | 16.8871 KOps/s | |
test_empty[True] | 0.2988ms | 0.2562ms | 3.9036 KOps/s | 3.8635 KOps/s | |
test_empty[False] | 3.5592μs | 0.7370μs | 1.3568 MOps/s | 1.3372 MOps/s | |
test_to | 86.7140μs | 54.6000μs | 18.3150 KOps/s | 18.2256 KOps/s | |
test_to_nonblocking | 0.1991ms | 49.3272μs | 20.2728 KOps/s | 20.5791 KOps/s | |
test_unbind_speed | 0.2765ms | 0.2361ms | 4.2355 KOps/s | 4.3327 KOps/s | |
test_unbind_speed_stack0 | 0.3613ms | 0.2321ms | 4.3091 KOps/s | 4.3425 KOps/s | |
test_unbind_speed_stack1 | 94.1598ms | 0.6549ms | 1.5269 KOps/s | 1.5511 KOps/s | |
test_split | 96.2714ms | 1.5898ms | 629.0144 Ops/s | 624.6448 Ops/s | |
test_chunk | 96.3091ms | 1.6105ms | 620.9378 Ops/s | 623.5865 Ops/s | |
test_consolidate[False-None] | 97.7003ms | 2.9383ms | 340.3313 Ops/s | 337.4637 Ops/s | |
test_consolidate[default-None] | 2.0288ms | 1.6364ms | 611.0972 Ops/s | 605.2933 Ops/s | |
test_consolidate[reduce-overhead-None] | 1.8039ms | 1.6567ms | 603.6228 Ops/s | 589.9326 Ops/s | |
test_consolidate_njt[False-None] | 7.1847ms | 6.6851ms | 149.5854 Ops/s | 151.3051 Ops/s | |
test_to[False-False-None] | 2.1584ms | 1.7508ms | 571.1705 Ops/s | 574.0497 Ops/s | |
test_to[True-False-None] | 1.7964ms | 1.3909ms | 718.9706 Ops/s | 743.3893 Ops/s | |
test_to[within-False-None] | 0.3449s | 5.5174ms | 181.2453 Ops/s | 243.9720 Ops/s | |
test_to[True-default-None] | 5.7705ms | 5.2938ms | 188.9005 Ops/s | 198.8959 Ops/s | |
test_to_njt[False-False-None] | 7.5253ms | 7.0966ms | 140.9134 Ops/s | 143.9289 Ops/s | |
test_to_njt[True-False-None] | 6.2410ms | 5.6222ms | 177.8663 Ops/s | 173.3391 Ops/s | |
test_to_njt[within-False-None] | 13.1502ms | 12.5742ms | 79.5280 Ops/s | 77.2664 Ops/s | |
test_creation[device0] | 0.4768ms | 83.8392μs | 11.9276 KOps/s | 12.0450 KOps/s | |
test_creation_from_tensor | 0.5124ms | 87.5003μs | 11.4285 KOps/s | 11.3559 KOps/s | |
test_add_one[memmap_tensor0] | 0.3972ms | 7.3130μs | 136.7423 KOps/s | 144.7304 KOps/s | |
test_contiguous[memmap_tensor0] | 19.5835μs | 0.4237μs | 2.3601 MOps/s | 2.3249 MOps/s | |
test_stack[memmap_tensor0] | 0.4015ms | 4.6134μs | 216.7609 KOps/s | 228.9296 KOps/s | |
test_memmaptd_index | 2.0482ms | 0.2595ms | 3.8534 KOps/s | 4.0430 KOps/s | |
test_memmaptd_index_astensor | 0.5922ms | 0.3209ms | 3.1158 KOps/s | 3.2634 KOps/s | |
test_memmaptd_index_op | 0.9944ms | 0.5790ms | 1.7271 KOps/s | 1.7834 KOps/s | |
test_serialize_model | 0.1305s | 0.1298s | 7.7029 Ops/s | 5.3945 Ops/s | |
test_serialize_model_pickle | 1.3625s | 1.2181s | 0.8210 Ops/s | 0.8230 Ops/s | |
test_serialize_weights | 0.1303s | 0.1295s | 7.7201 Ops/s | 7.6822 Ops/s | |
test_serialize_weights_returnearly | 0.3742s | 68.5731ms | 14.5830 Ops/s | 23.4360 Ops/s | |
test_serialize_weights_pickle | 1.3485s | 1.2114s | 0.8255 Ops/s | 0.8152 Ops/s | |
test_reshape_pytree | 0.1569ms | 22.3953μs | 44.6522 KOps/s | 44.5657 KOps/s | |
test_reshape_td | 0.1596ms | 27.0213μs | 37.0079 KOps/s | 37.6589 KOps/s | |
test_view_pytree | 0.1348ms | 22.1948μs | 45.0555 KOps/s | 45.5099 KOps/s | |
test_view_td | 0.1678ms | 29.7875μs | 33.5712 KOps/s | 35.0475 KOps/s | |
test_unbind_pytree | 80.1440μs | 27.8077μs | 35.9613 KOps/s | 36.2657 KOps/s | |
test_unbind_td | 0.6383ms | 35.4146μs | 28.2369 KOps/s | 28.1515 KOps/s | |
test_split_pytree | 0.1484ms | 29.8239μs | 33.5301 KOps/s | 34.0598 KOps/s | |
test_split_td | 0.7539ms | 38.6296μs | 25.8869 KOps/s | 25.9459 KOps/s | |
test_add_pytree | 0.1490ms | 34.6833μs | 28.8323 KOps/s | 28.3500 KOps/s | |
test_add_td | 0.1514ms | 45.9897μs | 21.7440 KOps/s | 20.6874 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1703ms | 0.1210ms | 8.2634 KOps/s | 7.9847 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.2697ms | 0.1249ms | 8.0091 KOps/s | 7.7121 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.2426ms | 0.1010ms | 9.8961 KOps/s | 9.8052 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 1.1111ms | 0.1540ms | 6.4951 KOps/s | 6.3616 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 0.1040ms | 23.1066μs | 43.2777 KOps/s | 42.1182 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1146ms | 27.0778μs | 36.9307 KOps/s | 35.3962 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.2246ms | 64.7517μs | 15.4436 KOps/s | 15.2846 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1656ms | 49.5018μs | 20.2013 KOps/s | 20.2421 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.2933ms | 0.1458ms | 6.8600 KOps/s | 6.7018 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3531ms | 0.2100ms | 4.7609 KOps/s | 4.7469 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.2471ms | 0.1005ms | 9.9519 KOps/s | 10.0067 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1980ms | 53.6061μs | 18.6546 KOps/s | 17.8991 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.3273ms | 0.1443ms | 6.9318 KOps/s | 6.8194 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.9038ms | 0.5054ms | 1.9788 KOps/s | 1.9415 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.6205ms | 0.2495ms | 4.0086 KOps/s | 3.9558 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2340ms | 0.1444ms | 6.9260 KOps/s | 6.9294 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.4552ms | 62.8659μs | 15.9069 KOps/s | 15.5624 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.5121ms | 99.3646μs | 10.0639 KOps/s | 10.1912 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.6062ms | 0.4256ms | 2.3495 KOps/s | 2.3089 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.5355ms | 0.1402ms | 7.1328 KOps/s | 7.0582 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 0.4056ms | 19.5685μs | 51.1025 KOps/s | 54.2296 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.4084ms | 27.5388μs | 36.3124 KOps/s | 35.0560 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.4429ms | 70.4073μs | 14.2031 KOps/s | 14.4243 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.4184ms | 51.7424μs | 19.3265 KOps/s | 19.3247 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 1.7011ms | 0.4591ms | 2.1783 KOps/s | 2.1778 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 3.0322ms | 2.7899ms | 358.4299 Ops/s | 374.9893 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 1.6423ms | 0.4476ms | 2.2339 KOps/s | 2.2163 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 3.0144ms | 2.8023ms | 356.8500 Ops/s | 360.9072 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.2734ms | 0.1208ms | 8.2792 KOps/s | 8.1871 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5702ms | 83.8492μs | 11.9262 KOps/s | 11.3951 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.2627ms | 0.1122ms | 8.9088 KOps/s | 9.0075 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.2549ms | 71.2261μs | 14.0398 KOps/s | 13.5603 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.2660ms | 0.1175ms | 8.5115 KOps/s | 8.8215 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.2637ms | 74.0152μs | 13.5107 KOps/s | 13.5663 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.2555ms | 0.1035ms | 9.6611 KOps/s | 9.6162 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.4057ms | 17.7829μs | 56.2337 KOps/s | 56.9402 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.5017ms | 0.1038ms | 9.6355 KOps/s | 10.1254 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 0.1175ms | 16.1418μs | 61.9509 KOps/s | 62.9286 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.5156ms | 0.1048ms | 9.5451 KOps/s | 10.0390 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 0.1346ms | 15.8791μs | 62.9757 KOps/s | 63.7282 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.5059ms | 0.1089ms | 9.1790 KOps/s | 9.1823 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.5981ms | 17.2729μs | 57.8942 KOps/s | 57.7925 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.4964ms | 99.2797μs | 10.0726 KOps/s | 10.0095 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 0.1247ms | 15.9224μs | 62.8045 KOps/s | 62.4770 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.2378ms | 98.5960μs | 10.1424 KOps/s | 10.0179 KOps/s | |
test_compile_indexing[int-pytree-eager] | 0.1608ms | 15.9523μs | 62.6871 KOps/s | 63.5667 KOps/s | |
test_mod_add[eager] | 0.2003ms | 31.2572μs | 31.9927 KOps/s | 30.6392 KOps/s | |
test_mod_add[compile] | 0.4629ms | 77.7420μs | 12.8631 KOps/s | 12.6423 KOps/s | |
test_mod_add[compile-overhead] | 0.3125ms | 0.1629ms | 6.1374 KOps/s | 5.7304 KOps/s | |
test_mod_wrap[eager] | 0.3791ms | 0.2477ms | 4.0366 KOps/s | 3.7391 KOps/s | |
test_mod_wrap[compile] | 1.5775ms | 0.2880ms | 3.4716 KOps/s | 3.4564 KOps/s | |
test_mod_wrap[compile-overhead] | 7.4462ms | 3.9852ms | 250.9308 Ops/s | 321.0841 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.5617ms | 1.3740ms | 727.7842 Ops/s | 695.9980 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.4270ms | 1.2773ms | 782.8928 Ops/s | 777.2548 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.3693ms | 0.9261ms | 1.0798 KOps/s | 1.0662 KOps/s | |
test_seq_add[eager] | 0.2825ms | 94.6481μs | 10.5655 KOps/s | 9.8468 KOps/s | |
test_seq_add[compile] | 0.2367ms | 88.5423μs | 11.2940 KOps/s | 11.3599 KOps/s | |
test_seq_add[compile-overhead] | 0.2808ms | 0.1298ms | 7.7067 KOps/s | 7.7647 KOps/s | |
test_seq_wrap[eager] | 0.5327ms | 0.3843ms | 2.6023 KOps/s | 2.4393 KOps/s | |
test_seq_wrap[compile] | 0.4590ms | 0.3031ms | 3.2993 KOps/s | 3.2621 KOps/s | |
test_seq_wrap[compile-overhead] | 0.3588ms | 0.2257ms | 4.4305 KOps/s | 4.4286 KOps/s | |
test_func_call_runtime[False-eager] | 0.9579ms | 0.7640ms | 1.3089 KOps/s | 1.2604 KOps/s | |
test_func_call_runtime[False-compile] | 1.1019ms | 0.7583ms | 1.3188 KOps/s | 1.3159 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.4602ms | 0.3662ms | 2.7304 KOps/s | 2.7248 KOps/s | |
test_func_call_runtime[True-eager] | 1.0699ms | 0.9175ms | 1.0899 KOps/s | 1.0682 KOps/s | |
test_func_call_runtime[True-compile] | 0.9641ms | 0.7872ms | 1.2703 KOps/s | 1.2770 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.5331ms | 0.3858ms | 2.5923 KOps/s | 2.5912 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.9178ms | 0.7572ms | 1.3206 KOps/s | 1.3188 KOps/s | |
test_func_call_cm_runtime[False-compile] | 1.0349ms | 0.7694ms | 1.2997 KOps/s | 1.3166 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.4432ms | 0.3660ms | 2.7325 KOps/s | 2.7192 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.1885ms | 1.0254ms | 975.2053 Ops/s | 968.4188 Ops/s | |
test_func_call_cm_runtime[True-compile] | 0.9796ms | 0.8178ms | 1.2228 KOps/s | 1.2288 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.5762ms | 0.4122ms | 2.4259 KOps/s | 2.4147 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.6079ms | 2.1031ms | 475.4776 Ops/s | 471.5288 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.9912ms | 0.8245ms | 1.2129 KOps/s | 1.2201 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.5309ms | 0.4153ms | 2.4077 KOps/s | 2.4049 KOps/s | |
test_distributed | 5.4884ms | 0.1749ms | 5.7172 KOps/s | 8.7089 KOps/s | |
test_tdmodule | 26.1620μs | 12.9235μs | 77.3781 KOps/s | 70.3322 KOps/s | |
test_tdmodule_dispatch | 0.3216ms | 25.7678μs | 38.8081 KOps/s | 37.4646 KOps/s | |
test_tdseq | 32.8510μs | 14.4655μs | 69.1298 KOps/s | 65.5719 KOps/s | |
test_tdseq_dispatch | 49.8230μs | 29.0729μs | 34.3963 KOps/s | 33.1810 KOps/s | |
test_instantiation_functorch | 1.6529ms | 1.5717ms | 636.2679 Ops/s | 643.0600 Ops/s | |
test_exec_functorch | 0.1993ms | 0.1477ms | 6.7702 KOps/s | 6.8733 KOps/s | |
test_exec_functional_call | 0.2785ms | 0.1422ms | 7.0333 KOps/s | 7.1324 KOps/s | |
test_exec_td_decorator | 0.3802ms | 0.1858ms | 5.3815 KOps/s | 5.4126 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.8284ms | 0.6785ms | 1.4739 KOps/s | 1.4753 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.8379ms | 0.6767ms | 1.4778 KOps/s | 1.4731 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.8382ms | 0.5968ms | 1.6755 KOps/s | 1.6796 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7723ms | 0.5992ms | 1.6688 KOps/s | 1.6787 KOps/s | |
test_vmap_transformer_speed_decorator[True-True] | 19.7322ms | 19.5683ms | 51.1032 Ops/s | 50.5350 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 20.2447ms | 19.6038ms | 51.0105 Ops/s | 50.6313 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 19.6611ms | 19.4615ms | 51.3835 Ops/s | 49.7990 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 19.6521ms | 19.4745ms | 51.3491 Ops/s | 50.7458 Ops/s | |
test_to_module_speed[True] | 1.0477ms | 0.9309ms | 1.0742 KOps/s | 1.0749 KOps/s | |
test_to_module_speed[False] | 1.2732ms | 0.9150ms | 1.0928 KOps/s | 1.0919 KOps/s | |
test_tc_init | 67.8130μs | 33.9797μs | 29.4294 KOps/s | 29.4953 KOps/s | |
test_tc_init_nested | 0.1188ms | 70.9259μs | 14.0992 KOps/s | 14.5826 KOps/s | |
test_tc_first_layer_tensor | 4.8017μs | 0.7196μs | 1.3896 MOps/s | 1.3061 MOps/s | |
test_tc_first_layer_nontensor | 30.8820μs | 2.3950μs | 417.5402 KOps/s | 423.4711 KOps/s | |
test_tc_second_layer_tensor | 10.8340μs | 1.4900μs | 671.1400 KOps/s | 655.3326 KOps/s | |
test_tc_second_layer_nontensor | 24.8610μs | 3.2227μs | 310.2968 KOps/s | 321.7122 KOps/s | |
test_unbind | 0.2371s | 10.1079ms | 98.9329 Ops/s | 148.2861 Ops/s | |
test_full_like | 10.3272ms | 9.5750ms | 104.4384 Ops/s | 103.6498 Ops/s | |
test_zeros_like | 5.2907ms | 4.3677ms | 228.9526 Ops/s | 232.5578 Ops/s | |
test_ones_like | 9.7273ms | 7.3665ms | 135.7505 Ops/s | 234.2589 Ops/s | |
test_clone | 7.3848ms | 6.7799ms | 147.4949 Ops/s | 146.9658 Ops/s | |
test_squeeze | 0.1135ms | 9.7661μs | 102.3953 KOps/s | 105.5985 KOps/s | |
test_unsqueeze | 0.1813ms | 72.0271μs | 13.8837 KOps/s | 13.7451 KOps/s | |
test_split | 0.4007ms | 0.1699ms | 5.8867 KOps/s | 6.1737 KOps/s | |
test_permute | 0.3147ms | 0.1776ms | 5.6307 KOps/s | 5.5457 KOps/s | |
test_stack | 54.7683ms | 54.3620ms | 18.3952 Ops/s | 18.4875 Ops/s | |
test_cat | 52.6036ms | 51.8209ms | 19.2972 Ops/s | 19.1144 Ops/s |
vmoens
added a commit
that referenced
this pull request
Nov 6, 2024
ghstack-source-id: 7256b4c95b239bf9e6467c0ea687abe2c9179922 Pull Request resolved: #1076
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Stack from ghstack (oldest at bottom):