-
Notifications
You must be signed in to change notification settings - Fork 79
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BugFix] Fix get for nestedkeys with default in tensorclass #1211
Merged
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
3 tasks
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 41.2970μs | 21.0975μs | 47.3990 KOps/s | 48.7173 KOps/s | |
test_plain_set_stack_nested | 56.5150μs | 21.3987μs | 46.7319 KOps/s | 48.5732 KOps/s | |
test_plain_set_nested_inplace | 56.8660μs | 23.2200μs | 43.0664 KOps/s | 44.4481 KOps/s | |
test_plain_set_stack_nested_inplace | 53.8410μs | 23.1042μs | 43.2821 KOps/s | 44.2785 KOps/s | |
test_items | 33.9630μs | 4.2105μs | 237.5005 KOps/s | 244.7400 KOps/s | |
test_items_nested | 0.5749ms | 0.4041ms | 2.4745 KOps/s | 2.4863 KOps/s | |
test_items_nested_locked | 0.7719ms | 0.4058ms | 2.4645 KOps/s | 2.4737 KOps/s | |
test_items_nested_leaf | 0.1616ms | 77.4093μs | 12.9183 KOps/s | 13.0079 KOps/s | |
test_items_stack_nested | 0.7416ms | 0.4112ms | 2.4318 KOps/s | 2.4763 KOps/s | |
test_items_stack_nested_leaf | 0.1605ms | 80.7940μs | 12.3772 KOps/s | 12.7532 KOps/s | |
test_items_stack_nested_locked | 0.5002ms | 0.4050ms | 2.4691 KOps/s | 2.4623 KOps/s | |
test_keys | 29.7650μs | 3.5113μs | 284.7918 KOps/s | 286.4989 KOps/s | |
test_keys_nested | 0.2078ms | 0.1628ms | 6.1416 KOps/s | 6.1448 KOps/s | |
test_keys_nested_locked | 0.8682ms | 0.1685ms | 5.9363 KOps/s | 5.9409 KOps/s | |
test_keys_nested_leaf | 0.1817ms | 0.1424ms | 7.0203 KOps/s | 7.0989 KOps/s | |
test_keys_stack_nested | 0.2116ms | 0.1608ms | 6.2188 KOps/s | 6.2346 KOps/s | |
test_keys_stack_nested_leaf | 0.1772ms | 0.1411ms | 7.0869 KOps/s | 7.3530 KOps/s | |
test_keys_stack_nested_locked | 0.2122ms | 0.1678ms | 5.9607 KOps/s | 6.0575 KOps/s | |
test_values | 8.5635μs | 1.0530μs | 949.7033 KOps/s | 952.7548 KOps/s | |
test_values_nested | 0.1189ms | 62.0936μs | 16.1047 KOps/s | 16.3184 KOps/s | |
test_values_nested_locked | 0.1243ms | 62.0653μs | 16.1121 KOps/s | 15.6481 KOps/s | |
test_values_nested_leaf | 0.1338ms | 71.6576μs | 13.9553 KOps/s | 14.0624 KOps/s | |
test_values_stack_nested | 0.1203ms | 63.5883μs | 15.7262 KOps/s | 15.6414 KOps/s | |
test_values_stack_nested_leaf | 0.1399ms | 71.7497μs | 13.9373 KOps/s | 14.4991 KOps/s | |
test_values_stack_nested_locked | 0.1317ms | 63.2394μs | 15.8129 KOps/s | 15.7646 KOps/s | |
test_membership | 15.8900μs | 0.8641μs | 1.1573 MOps/s | 1.4071 MOps/s | |
test_membership_nested | 35.8070μs | 2.8958μs | 345.3293 KOps/s | 347.7956 KOps/s | |
test_membership_nested_leaf | 19.6570μs | 2.9183μs | 342.6698 KOps/s | 346.8222 KOps/s | |
test_membership_stacked_nested | 23.4230μs | 2.9208μs | 342.3741 KOps/s | 348.0158 KOps/s | |
test_membership_stacked_nested_leaf | 23.0930μs | 2.8896μs | 346.0683 KOps/s | 347.5217 KOps/s | |
test_membership_nested_last | 51.8090μs | 4.3600μs | 229.3552 KOps/s | 228.1391 KOps/s | |
test_membership_nested_leaf_last | 40.4530μs | 4.4096μs | 226.7801 KOps/s | 224.1548 KOps/s | |
test_membership_stacked_nested_last | 33.7830μs | 5.2530μs | 190.3681 KOps/s | 74.8151 KOps/s | |
test_membership_stacked_nested_leaf_last | 31.8200μs | 5.2786μs | 189.4451 KOps/s | 74.9572 KOps/s | |
test_nested_getleaf | 33.0320μs | 11.3096μs | 88.4202 KOps/s | 96.3593 KOps/s | |
test_nested_get | 30.6870μs | 10.8774μs | 91.9335 KOps/s | 101.9186 KOps/s | |
test_stacked_getleaf | 48.8740μs | 11.4385μs | 87.4239 KOps/s | 97.7895 KOps/s | |
test_stacked_get | 29.9360μs | 10.8370μs | 92.2767 KOps/s | 102.1108 KOps/s | |
test_nested_getitemleaf | 35.2360μs | 12.1437μs | 82.3471 KOps/s | 91.9894 KOps/s | |
test_nested_getitem | 45.4750μs | 11.5327μs | 86.7101 KOps/s | 96.4582 KOps/s | |
test_stacked_getitemleaf | 31.5490μs | 11.9127μs | 83.9440 KOps/s | 93.0179 KOps/s | |
test_stacked_getitem | 37.0190μs | 11.4882μs | 87.0456 KOps/s | 96.7908 KOps/s | |
test_lock_nested | 6.7322ms | 0.4229ms | 2.3647 KOps/s | 2.4320 KOps/s | |
test_lock_stack_nested | 0.6584ms | 0.4228ms | 2.3655 KOps/s | 2.4516 KOps/s | |
test_unlock_nested | 0.4260ms | 0.3342ms | 2.9919 KOps/s | 2.9446 KOps/s | |
test_unlock_stack_nested | 0.5310ms | 0.3376ms | 2.9618 KOps/s | 3.0314 KOps/s | |
test_flatten_speed | 0.2079ms | 0.1006ms | 9.9387 KOps/s | 9.8463 KOps/s | |
test_unflatten_speed | 0.6352ms | 0.5350ms | 1.8691 KOps/s | 1.9718 KOps/s | |
test_common_ops | 0.9726ms | 0.8128ms | 1.2304 KOps/s | 1.1957 KOps/s | |
test_creation | 33.6130μs | 2.6105μs | 383.0665 KOps/s | 405.9510 KOps/s | |
test_creation_empty | 38.9020μs | 13.3447μs | 74.9360 KOps/s | 79.9108 KOps/s | |
test_creation_nested_1 | 41.1570μs | 16.3145μs | 61.2951 KOps/s | 64.2953 KOps/s | |
test_creation_nested_2 | 53.7400μs | 20.9747μs | 47.6765 KOps/s | 49.6301 KOps/s | |
test_clone | 64.5900μs | 13.7813μs | 72.5619 KOps/s | 73.7167 KOps/s | |
test_getitem[int] | 0.9051ms | 13.1561μs | 76.0105 KOps/s | 78.5636 KOps/s | |
test_getitem[slice_int] | 0.1279ms | 25.0587μs | 39.9063 KOps/s | 40.6341 KOps/s | |
test_getitem[range] | 0.1606ms | 52.4879μs | 19.0520 KOps/s | 20.5570 KOps/s | |
test_getitem[tuple] | 0.1241ms | 20.7158μs | 48.2723 KOps/s | 49.4695 KOps/s | |
test_getitem[list] | 0.1595ms | 46.4327μs | 21.5365 KOps/s | 23.0080 KOps/s | |
test_setitem_dim[int] | 60.6430μs | 25.0930μs | 39.8517 KOps/s | 39.6207 KOps/s | |
test_setitem_dim[slice_int] | 94.5160μs | 50.7685μs | 19.6972 KOps/s | 20.0573 KOps/s | |
test_setitem_dim[range] | 0.1381ms | 78.7211μs | 12.7031 KOps/s | 13.1409 KOps/s | |
test_setitem_dim[tuple] | 73.9080μs | 40.1818μs | 24.8869 KOps/s | 25.0630 KOps/s | |
test_setitem | 71.5230μs | 21.4361μs | 46.6504 KOps/s | 47.7877 KOps/s | |
test_set | 86.1310μs | 21.1249μs | 47.3375 KOps/s | 48.9219 KOps/s | |
test_set_shared | 0.3662ms | 0.1808ms | 5.5297 KOps/s | 5.4479 KOps/s | |
test_update | 0.1109ms | 24.3781μs | 41.0205 KOps/s | 42.0779 KOps/s | |
test_update_nested | 94.0550μs | 33.8700μs | 29.5247 KOps/s | 29.5284 KOps/s | |
test_update__nested | 0.4651ms | 33.9695μs | 29.4381 KOps/s | 29.5119 KOps/s | |
test_set_nested | 72.8660μs | 23.3041μs | 42.9108 KOps/s | 43.9302 KOps/s | |
test_set_nested_new | 80.0090μs | 27.7783μs | 35.9994 KOps/s | 36.5824 KOps/s | |
test_select | 0.9736ms | 43.9297μs | 22.7636 KOps/s | 22.5518 KOps/s | |
test_select_nested | 0.1319ms | 63.7027μs | 15.6979 KOps/s | 15.9148 KOps/s | |
test_exclude_nested | 0.1517ms | 81.7923μs | 12.2261 KOps/s | 12.4380 KOps/s | |
test_empty[True] | 0.7307ms | 0.4121ms | 2.4266 KOps/s | 2.4639 KOps/s | |
test_empty[False] | 9.3272μs | 1.3798μs | 724.7238 KOps/s | 706.5737 KOps/s | |
test_unbind_speed | 0.3841ms | 0.2730ms | 3.6630 KOps/s | 3.6468 KOps/s | |
test_unbind_speed_stack0 | 0.4707ms | 0.2693ms | 3.7132 KOps/s | 3.8329 KOps/s | |
test_unbind_speed_stack1 | 0.1024s | 0.7269ms | 1.3757 KOps/s | 1.2810 KOps/s | |
test_split | 0.1018s | 1.7673ms | 565.8323 Ops/s | 579.1086 Ops/s | |
test_chunk | 0.1047s | 1.7684ms | 565.4789 Ops/s | 638.6596 Ops/s | |
test_consolidate_njt[False-None] | 8.5542ms | 8.1495ms | 122.7072 Ops/s | 108.2749 Ops/s | |
test_creation[device0] | 3.4058ms | 92.1524μs | 10.8516 KOps/s | 10.8434 KOps/s | |
test_creation_from_tensor | 0.2150ms | 93.9803μs | 10.6405 KOps/s | 10.5186 KOps/s | |
test_add_one[memmap_tensor0] | 0.1003ms | 4.9960μs | 200.1596 KOps/s | 202.0771 KOps/s | |
test_contiguous[memmap_tensor0] | 10.4890μs | 0.5373μs | 1.8613 MOps/s | 1.9492 MOps/s | |
test_stack[memmap_tensor0] | 27.5410μs | 3.4495μs | 289.8979 KOps/s | 300.6608 KOps/s | |
test_memmaptd_index | 1.2487ms | 0.2283ms | 4.3794 KOps/s | 4.4565 KOps/s | |
test_memmaptd_index_astensor | 0.5273ms | 0.3147ms | 3.1780 KOps/s | 3.2484 KOps/s | |
test_memmaptd_index_op | 1.0497ms | 0.6063ms | 1.6493 KOps/s | 1.6703 KOps/s | |
test_serialize_model | 0.2090s | 0.1283s | 7.7962 Ops/s | 8.7276 Ops/s | |
test_serialize_model_pickle | 0.4340s | 0.3879s | 2.5777 Ops/s | 2.5362 Ops/s | |
test_serialize_weights | 0.1212s | 0.1143s | 8.7460 Ops/s | 8.9305 Ops/s | |
test_serialize_weights_returnearly | 0.3295s | 0.1833s | 5.4555 Ops/s | 5.7023 Ops/s | |
test_serialize_weights_pickle | 1.0544s | 0.7027s | 1.4232 Ops/s | 1.2027 Ops/s | |
test_serialize_weights_filesystem | 0.2557s | 0.1546s | 6.4667 Ops/s | 6.9826 Ops/s | |
test_serialize_model_filesystem | 0.1488s | 0.1408s | 7.1009 Ops/s | 6.7794 Ops/s | |
test_reshape_pytree | 68.5580μs | 26.1064μs | 38.3048 KOps/s | 38.1752 KOps/s | |
test_reshape_td | 72.2150μs | 33.1188μs | 30.1944 KOps/s | 29.8519 KOps/s | |
test_view_pytree | 66.2440μs | 26.3745μs | 37.9154 KOps/s | 38.2273 KOps/s | |
test_view_td | 0.1036ms | 38.8200μs | 25.7599 KOps/s | 26.5242 KOps/s | |
test_unbind_pytree | 74.9300μs | 29.8135μs | 33.5419 KOps/s | 33.3249 KOps/s | |
test_unbind_td | 0.3489ms | 40.0786μs | 24.9510 KOps/s | 25.0972 KOps/s | |
test_split_pytree | 75.5010μs | 29.3133μs | 34.1142 KOps/s | 34.2521 KOps/s | |
test_split_td | 0.5208ms | 45.6766μs | 21.8931 KOps/s | 22.3884 KOps/s | |
test_add_pytree | 0.1079ms | 34.9200μs | 28.6369 KOps/s | 28.2312 KOps/s | |
test_add_td | 0.1244ms | 59.5397μs | 16.7955 KOps/s | 17.1717 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1231ms | 67.2236μs | 14.8757 KOps/s | 15.0449 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.6439ms | 0.1729ms | 5.7847 KOps/s | 5.9455 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1440ms | 45.5173μs | 21.9697 KOps/s | 21.6422 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2301ms | 0.1178ms | 8.4901 KOps/s | 8.4195 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 89.5270μs | 28.9546μs | 34.5368 KOps/s | 35.8037 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1278ms | 59.6610μs | 16.7614 KOps/s | 17.2739 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1518ms | 79.6792μs | 12.5503 KOps/s | 12.6727 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1287ms | 66.6680μs | 14.9997 KOps/s | 15.0754 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.1992ms | 0.1092ms | 9.1607 KOps/s | 9.2914 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.4213ms | 0.2168ms | 4.6131 KOps/s | 4.6696 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.2234ms | 48.4961μs | 20.6202 KOps/s | 20.6775 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.2005ms | 66.8540μs | 14.9580 KOps/s | 15.2795 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.1847ms | 0.1011ms | 9.8866 KOps/s | 9.9411 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.3456ms | 0.2022ms | 4.9448 KOps/s | 5.0205 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.4253ms | 0.2337ms | 4.2788 KOps/s | 4.2830 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2162ms | 0.1122ms | 8.9091 KOps/s | 9.2443 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.3038ms | 63.5556μs | 15.7343 KOps/s | 16.1193 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1749ms | 49.6059μs | 20.1589 KOps/s | 19.9647 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.2440ms | 0.1583ms | 6.3153 KOps/s | 6.3949 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.2323ms | 0.1026ms | 9.7468 KOps/s | 9.8907 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 49.8540μs | 22.0536μs | 45.3441 KOps/s | 46.6687 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1341ms | 69.5944μs | 14.3690 KOps/s | 15.1530 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1820ms | 83.6681μs | 11.9520 KOps/s | 12.2432 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1587ms | 68.3786μs | 14.6245 KOps/s | 14.8903 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.3002ms | 0.2166ms | 4.6176 KOps/s | 4.6085 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 1.9870ms | 1.3857ms | 721.6512 Ops/s | 716.8705 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.4880ms | 0.2099ms | 4.7647 KOps/s | 4.6527 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 1.0333ms | 0.8231ms | 1.2149 KOps/s | 1.2293 KOps/s | |
test_compile_assign_and_add_stack[compile] | 0.8475ms | 0.4595ms | 2.1763 KOps/s | 2.1809 KOps/s | |
test_compile_assign_and_add_stack[eager] | 3.6995ms | 2.8335ms | 352.9180 Ops/s | 353.1213 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.1025ms | 38.5586μs | 25.9346 KOps/s | 25.3787 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5307ms | 34.1947μs | 29.2443 KOps/s | 30.0857 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.1005ms | 30.8458μs | 32.4193 KOps/s | 32.3329 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 70.6510μs | 23.1574μs | 43.1828 KOps/s | 41.9026 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 91.8910μs | 31.5819μs | 31.6637 KOps/s | 31.2978 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 55.2930μs | 23.1212μs | 43.2504 KOps/s | 41.6993 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1033ms | 53.5708μs | 18.6669 KOps/s | 18.6865 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.4695ms | 20.1524μs | 49.6219 KOps/s | 48.4865 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1017ms | 45.8213μs | 21.8239 KOps/s | 21.7831 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 56.9260μs | 19.1201μs | 52.3009 KOps/s | 52.3075 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 97.3920μs | 46.3834μs | 21.5594 KOps/s | 21.2538 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 52.6890μs | 18.9554μs | 52.7553 KOps/s | 52.9796 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1087ms | 55.0297μs | 18.1720 KOps/s | 18.1987 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.9171ms | 20.0102μs | 49.9744 KOps/s | 49.7976 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 94.9770μs | 46.4628μs | 21.5226 KOps/s | 20.9122 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 51.4960μs | 19.1640μs | 52.1811 KOps/s | 52.6326 KOps/s | |
test_compile_indexing[int-pytree-compile] | 95.4580μs | 46.3050μs | 21.5959 KOps/s | 21.0491 KOps/s | |
test_compile_indexing[int-pytree-eager] | 55.2730μs | 18.9914μs | 52.6554 KOps/s | 53.2244 KOps/s | |
test_mod_add[eager] | 83.8160μs | 35.8859μs | 27.8661 KOps/s | 27.2295 KOps/s | |
test_mod_add[compile] | 0.2084ms | 68.3220μs | 14.6366 KOps/s | 14.8205 KOps/s | |
test_mod_add[compile-overhead] | 0.1358ms | 64.8709μs | 15.4152 KOps/s | 15.0624 KOps/s | |
test_mod_wrap[eager] | 0.4440ms | 0.2197ms | 4.5514 KOps/s | 4.5288 KOps/s | |
test_mod_wrap[compile] | 2.7032ms | 0.2347ms | 4.2614 KOps/s | 4.2879 KOps/s | |
test_mod_wrap[compile-overhead] | 0.4460ms | 0.2322ms | 4.3063 KOps/s | 4.3043 KOps/s | |
test_mod_wrap_and_backward[eager] | 16.0338ms | 13.2525ms | 75.4576 Ops/s | 91.2782 Ops/s | |
test_mod_wrap_and_backward[compile] | 13.0434ms | 11.3132ms | 88.3921 Ops/s | 92.9863 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 12.8760ms | 11.1205ms | 89.9236 Ops/s | 92.6948 Ops/s | |
test_seq_add[eager] | 0.2000ms | 0.1180ms | 8.4714 KOps/s | 8.2910 KOps/s | |
test_seq_add[compile] | 0.1341ms | 79.8150μs | 12.5290 KOps/s | 12.7388 KOps/s | |
test_seq_add[compile-overhead] | 0.1503ms | 78.0501μs | 12.8123 KOps/s | 13.0877 KOps/s | |
test_seq_wrap[eager] | 0.6013ms | 0.4466ms | 2.2392 KOps/s | 2.1528 KOps/s | |
test_seq_wrap[compile] | 0.3388ms | 0.2433ms | 4.1101 KOps/s | 3.9961 KOps/s | |
test_seq_wrap[compile-overhead] | 0.3761ms | 0.2429ms | 4.1168 KOps/s | 4.0083 KOps/s | |
test_func_call_runtime[False-eager] | 0.6485ms | 0.5398ms | 1.8526 KOps/s | 1.8397 KOps/s | |
test_func_call_runtime[False-compile] | 0.8035ms | 0.4447ms | 2.2486 KOps/s | 2.2342 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.6141ms | 0.4414ms | 2.2655 KOps/s | 2.2373 KOps/s | |
test_func_call_runtime[True-eager] | 1.1938ms | 0.7537ms | 1.3269 KOps/s | 1.3064 KOps/s | |
test_func_call_runtime[True-compile] | 0.5744ms | 0.4627ms | 2.1611 KOps/s | 2.1255 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.9547ms | 0.4646ms | 2.1523 KOps/s | 2.1301 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.7342ms | 0.5357ms | 1.8668 KOps/s | 1.8532 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.6183ms | 0.4405ms | 2.2704 KOps/s | 2.2113 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.5648ms | 0.4408ms | 2.2686 KOps/s | 2.2209 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.6005ms | 0.8977ms | 1.1140 KOps/s | 1.1034 KOps/s | |
test_func_call_cm_runtime[True-compile] | 1.3541ms | 0.7891ms | 1.2672 KOps/s | 1.2284 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.9016ms | 0.7919ms | 1.2628 KOps/s | 1.2066 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.8874ms | 1.9109ms | 523.3113 Ops/s | 513.9987 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 1.1331ms | 0.5399ms | 1.8523 KOps/s | 1.7629 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.9546ms | 0.5423ms | 1.8438 KOps/s | 1.8245 KOps/s | |
test_distributed | 0.2386ms | 0.1289ms | 7.7588 KOps/s | 7.8443 KOps/s | |
test_tdmodule | 83.1450μs | 27.5251μs | 36.3304 KOps/s | 35.9063 KOps/s | |
test_tdmodule_dispatch | 89.4770μs | 54.9102μs | 18.2116 KOps/s | 19.8699 KOps/s | |
test_tdseq | 58.9000μs | 30.5722μs | 32.7094 KOps/s | 31.9034 KOps/s | |
test_tdseq_dispatch | 83.9670μs | 57.7461μs | 17.3172 KOps/s | 17.8809 KOps/s | |
test_instantiation_functorch | 2.3201ms | 1.5555ms | 642.8891 Ops/s | 662.4767 Ops/s | |
test_exec_functorch | 0.4237ms | 0.1828ms | 5.4690 KOps/s | 5.6156 KOps/s | |
test_exec_functional_call | 0.2827ms | 0.1705ms | 5.8653 KOps/s | 5.8744 KOps/s | |
test_exec_td_decorator | 0.4681ms | 0.2330ms | 4.2927 KOps/s | 4.3855 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 1.0276ms | 0.6666ms | 1.5002 KOps/s | 1.5023 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.9001ms | 0.6619ms | 1.5108 KOps/s | 1.5057 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.8563ms | 0.5389ms | 1.8555 KOps/s | 1.8713 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.8671ms | 0.5333ms | 1.8753 KOps/s | 1.8701 KOps/s | |
test_to_module_speed[True] | 2.2263ms | 1.3814ms | 723.8916 Ops/s | 753.5332 Ops/s | |
test_to_module_speed[False] | 1.8452ms | 1.3390ms | 746.8121 Ops/s | 776.1250 Ops/s | |
test_tc_init | 0.1044ms | 47.3964μs | 21.0987 KOps/s | 20.1085 KOps/s | |
test_tc_init_nested | 0.2028ms | 95.9423μs | 10.4229 KOps/s | 10.1108 KOps/s | |
test_tc_first_layer_tensor | 16.6110μs | 1.5540μs | 643.5008 KOps/s | 580.0745 KOps/s | |
test_tc_first_layer_nontensor | 44.0620μs | 4.9823μs | 200.7096 KOps/s | 201.8775 KOps/s | |
test_tc_second_layer_tensor | 23.2530μs | 2.8655μs | 348.9752 KOps/s | 311.1409 KOps/s | |
test_tc_second_layer_nontensor | 47.7390μs | 6.2784μs | 159.2752 KOps/s | 154.6483 KOps/s | |
test_unbind | 0.2352s | 13.8955ms | 71.9658 Ops/s | 77.6679 Ops/s | |
test_full_like | 9.3345ms | 7.9109ms | 126.4081 Ops/s | 128.0069 Ops/s | |
test_zeros_like | 5.9790ms | 2.8308ms | 353.2629 Ops/s | 360.9436 Ops/s | |
test_ones_like | 4.6266ms | 3.1823ms | 314.2426 Ops/s | 319.6287 Ops/s | |
test_clone | 7.0485ms | 5.3164ms | 188.0975 Ops/s | 139.6760 Ops/s | |
test_squeeze | 59.4600μs | 12.3594μs | 80.9099 KOps/s | 81.6699 KOps/s | |
test_unsqueeze | 0.1841ms | 91.4457μs | 10.9355 KOps/s | 10.9102 KOps/s | |
test_split | 0.4854ms | 0.1959ms | 5.1046 KOps/s | 5.2305 KOps/s | |
test_permute | 0.3480ms | 0.2012ms | 4.9693 KOps/s | 4.9301 KOps/s | |
test_stack | 32.4199ms | 25.7551ms | 38.8273 Ops/s | 35.7972 Ops/s | |
test_cat | 27.5516ms | 25.4399ms | 39.3083 Ops/s | 38.8349 Ops/s |
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 27.4410μs | 11.1697μs | 89.5282 KOps/s | 75.3457 KOps/s | |
test_plain_set_stack_nested | 35.1100μs | 11.2417μs | 88.9548 KOps/s | 74.5748 KOps/s | |
test_plain_set_nested_inplace | 40.0300μs | 12.1366μs | 82.3954 KOps/s | 70.1996 KOps/s | |
test_plain_set_stack_nested_inplace | 35.3510μs | 12.2272μs | 81.7849 KOps/s | 70.0550 KOps/s | |
test_items | 27.7210μs | 2.9220μs | 342.2362 KOps/s | 347.1277 KOps/s | |
test_items_nested | 0.4462ms | 0.3752ms | 2.6654 KOps/s | 2.6712 KOps/s | |
test_items_nested_locked | 0.4481ms | 0.3746ms | 2.6693 KOps/s | 2.6953 KOps/s | |
test_items_nested_leaf | 86.0600μs | 57.7176μs | 17.3257 KOps/s | 17.1050 KOps/s | |
test_items_stack_nested | 0.5237ms | 0.3783ms | 2.6437 KOps/s | 2.6506 KOps/s | |
test_items_stack_nested_leaf | 90.0010μs | 58.0536μs | 17.2255 KOps/s | 16.6786 KOps/s | |
test_items_stack_nested_locked | 0.4057ms | 0.3781ms | 2.6451 KOps/s | 2.6671 KOps/s | |
test_keys | 28.1000μs | 3.4359μs | 291.0449 KOps/s | 290.5446 KOps/s | |
test_keys_nested | 0.1173ms | 87.6921μs | 11.4035 KOps/s | 11.0728 KOps/s | |
test_keys_nested_locked | 0.8381ms | 93.6892μs | 10.6736 KOps/s | 10.3798 KOps/s | |
test_keys_nested_leaf | 0.1216ms | 78.2304μs | 12.7828 KOps/s | 12.3569 KOps/s | |
test_keys_stack_nested | 0.1264ms | 87.8747μs | 11.3798 KOps/s | 10.9458 KOps/s | |
test_keys_stack_nested_leaf | 0.1084ms | 78.2725μs | 12.7759 KOps/s | 12.1726 KOps/s | |
test_keys_stack_nested_locked | 0.1312ms | 94.0990μs | 10.6271 KOps/s | 10.4499 KOps/s | |
test_values | 3.7271μs | 0.8399μs | 1.1906 MOps/s | 750.0695 KOps/s | |
test_values_nested | 65.7100μs | 37.6754μs | 26.5425 KOps/s | 26.2856 KOps/s | |
test_values_nested_locked | 66.9010μs | 39.4061μs | 25.3768 KOps/s | 24.9847 KOps/s | |
test_values_nested_leaf | 66.9700μs | 42.0306μs | 23.7922 KOps/s | 23.1012 KOps/s | |
test_values_stack_nested | 68.3810μs | 37.5723μs | 26.6153 KOps/s | 25.5577 KOps/s | |
test_values_stack_nested_leaf | 67.8400μs | 42.3646μs | 23.6046 KOps/s | 22.9873 KOps/s | |
test_values_stack_nested_locked | 71.6100μs | 39.8196μs | 25.1132 KOps/s | 24.6220 KOps/s | |
test_membership | 1.9035μs | 0.5055μs | 1.9782 MOps/s | 1.9362 MOps/s | |
test_membership_nested | 19.0000μs | 2.0242μs | 494.0144 KOps/s | 473.9095 KOps/s | |
test_membership_nested_leaf | 16.7155μs | 2.0572μs | 486.0945 KOps/s | 488.5031 KOps/s | |
test_membership_stacked_nested | 27.1210μs | 2.1301μs | 469.4662 KOps/s | 465.1070 KOps/s | |
test_membership_stacked_nested_leaf | 26.9900μs | 2.1269μs | 470.1583 KOps/s | 474.0185 KOps/s | |
test_membership_nested_last | 39.1400μs | 3.1274μs | 319.7574 KOps/s | 320.8407 KOps/s | |
test_membership_nested_leaf_last | 38.4400μs | 3.0721μs | 325.5103 KOps/s | 324.1483 KOps/s | |
test_membership_stacked_nested_last | 33.6410μs | 3.2043μs | 312.0831 KOps/s | 182.8469 KOps/s | |
test_membership_stacked_nested_leaf_last | 30.6710μs | 3.1618μs | 316.2751 KOps/s | 185.2401 KOps/s | |
test_nested_getleaf | 41.8400μs | 6.0778μs | 164.5338 KOps/s | 161.3863 KOps/s | |
test_nested_get | 33.3300μs | 5.8707μs | 170.3380 KOps/s | 170.7787 KOps/s | |
test_stacked_getleaf | 32.6310μs | 6.1627μs | 162.2662 KOps/s | 162.3358 KOps/s | |
test_stacked_get | 35.1410μs | 5.8107μs | 172.0969 KOps/s | 171.7535 KOps/s | |
test_nested_getitemleaf | 40.5600μs | 6.3747μs | 156.8696 KOps/s | 153.4432 KOps/s | |
test_nested_getitem | 38.0000μs | 6.0750μs | 164.6085 KOps/s | 160.9475 KOps/s | |
test_stacked_getitemleaf | 43.2500μs | 6.4579μs | 154.8488 KOps/s | 154.6110 KOps/s | |
test_stacked_getitem | 31.1700μs | 6.0572μs | 165.0917 KOps/s | 164.1889 KOps/s | |
test_lock_nested | 0.3968ms | 0.3320ms | 3.0118 KOps/s | 2.9335 KOps/s | |
test_lock_stack_nested | 0.3783ms | 0.3366ms | 2.9713 KOps/s | 2.9860 KOps/s | |
test_unlock_nested | 0.3591ms | 0.2726ms | 3.6687 KOps/s | 3.6422 KOps/s | |
test_unlock_stack_nested | 0.3784ms | 0.2757ms | 3.6271 KOps/s | 3.6425 KOps/s | |
test_flatten_speed | 0.1144ms | 74.4272μs | 13.4359 KOps/s | 13.0673 KOps/s | |
test_unflatten_speed | 0.3776ms | 0.3261ms | 3.0669 KOps/s | 3.0442 KOps/s | |
test_common_ops | 0.7068ms | 0.5626ms | 1.7774 KOps/s | 1.4871 KOps/s | |
test_creation | 80.5410μs | 1.7387μs | 575.1429 KOps/s | 571.2719 KOps/s | |
test_creation_empty | 30.5700μs | 6.4425μs | 155.2197 KOps/s | 99.8913 KOps/s | |
test_creation_nested_1 | 30.2900μs | 7.9544μs | 125.7158 KOps/s | 87.3140 KOps/s | |
test_creation_nested_2 | 38.6500μs | 10.5840μs | 94.4820 KOps/s | 69.9811 KOps/s | |
test_clone | 37.8610μs | 10.1899μs | 98.1360 KOps/s | 96.1003 KOps/s | |
test_getitem[int] | 1.4158ms | 10.6170μs | 94.1882 KOps/s | 96.9382 KOps/s | |
test_getitem[slice_int] | 0.1298ms | 20.6536μs | 48.4178 KOps/s | 49.6060 KOps/s | |
test_getitem[range] | 0.1235ms | 36.7199μs | 27.2332 KOps/s | 27.7197 KOps/s | |
test_getitem[tuple] | 0.1050ms | 17.9746μs | 55.6340 KOps/s | 57.0815 KOps/s | |
test_getitem[list] | 0.1213ms | 31.7110μs | 31.5348 KOps/s | 31.2410 KOps/s | |
test_setitem_dim[int] | 36.8510μs | 19.0273μs | 52.5561 KOps/s | 52.2826 KOps/s | |
test_setitem_dim[slice_int] | 58.9600μs | 37.6643μs | 26.5503 KOps/s | 26.3784 KOps/s | |
test_setitem_dim[range] | 76.4000μs | 51.2338μs | 19.5184 KOps/s | 19.0364 KOps/s | |
test_setitem_dim[tuple] | 54.5900μs | 32.2150μs | 31.0415 KOps/s | 30.9984 KOps/s | |
test_setitem | 73.9610μs | 13.7255μs | 72.8572 KOps/s | 62.7656 KOps/s | |
test_set | 73.2910μs | 13.0159μs | 76.8289 KOps/s | 64.5667 KOps/s | |
test_set_shared | 0.5091ms | 0.1572ms | 6.3633 KOps/s | 6.3553 KOps/s | |
test_update | 0.4067ms | 15.0321μs | 66.5242 KOps/s | 51.4918 KOps/s | |
test_update_nested | 81.8900μs | 20.2381μs | 49.4117 KOps/s | 40.0459 KOps/s | |
test_update__nested | 0.5769ms | 24.6458μs | 40.5749 KOps/s | 39.7688 KOps/s | |
test_set_nested | 87.3200μs | 14.5424μs | 68.7645 KOps/s | 59.6960 KOps/s | |
test_set_nested_new | 90.8300μs | 16.6145μs | 60.1883 KOps/s | 52.0346 KOps/s | |
test_select | 58.0610μs | 28.4056μs | 35.2043 KOps/s | 31.9785 KOps/s | |
test_select_nested | 85.2410μs | 45.5571μs | 21.9505 KOps/s | 22.3122 KOps/s | |
test_exclude_nested | 88.3910μs | 63.9595μs | 15.6349 KOps/s | 15.7172 KOps/s | |
test_empty[True] | 0.6890ms | 0.2993ms | 3.3409 KOps/s | 3.3593 KOps/s | |
test_empty[False] | 3.3141μs | 0.8358μs | 1.1964 MOps/s | 1.1924 MOps/s | |
test_to | 86.4910μs | 56.5176μs | 17.6936 KOps/s | 17.8370 KOps/s | |
test_to_nonblocking | 90.0710μs | 48.4065μs | 20.6584 KOps/s | 21.3313 KOps/s | |
test_unbind_speed | 0.2712ms | 0.2323ms | 4.3049 KOps/s | 4.2215 KOps/s | |
test_unbind_speed_stack0 | 0.2656ms | 0.2349ms | 4.2567 KOps/s | 4.3003 KOps/s | |
test_unbind_speed_stack1 | 94.0365ms | 0.7223ms | 1.3845 KOps/s | 1.3894 KOps/s | |
test_split | 94.9062ms | 1.5949ms | 627.0097 Ops/s | 631.5403 Ops/s | |
test_chunk | 96.9287ms | 1.6097ms | 621.2372 Ops/s | 627.7284 Ops/s | |
test_consolidate[False-None] | 97.1368ms | 2.9735ms | 336.3081 Ops/s | 368.6059 Ops/s | |
test_consolidate[default-None] | 1.7922ms | 1.6899ms | 591.7410 Ops/s | 598.9217 Ops/s | |
test_consolidate[reduce-overhead-None] | 1.8322ms | 1.7460ms | 572.7241 Ops/s | 587.2988 Ops/s | |
test_consolidate_njt[False-None] | 6.8182ms | 6.5004ms | 153.8377 Ops/s | 155.5010 Ops/s | |
test_to[False-False-None] | 1.8469ms | 1.7604ms | 568.0559 Ops/s | 573.3153 Ops/s | |
test_to[True-False-None] | 1.5540ms | 1.3239ms | 755.3310 Ops/s | 781.0233 Ops/s | |
test_to[within-False-None] | 4.3965ms | 4.1636ms | 240.1770 Ops/s | 178.5545 Ops/s | |
test_to[True-default-None] | 5.5866ms | 5.2945ms | 188.8745 Ops/s | 190.6869 Ops/s | |
test_to_njt[False-False-None] | 7.1104ms | 6.8268ms | 146.4824 Ops/s | 145.1540 Ops/s | |
test_to_njt[True-False-None] | 5.6310ms | 5.4240ms | 184.3646 Ops/s | 179.7739 Ops/s | |
test_to_njt[within-False-None] | 12.3183ms | 12.1423ms | 82.3570 Ops/s | 82.0778 Ops/s | |
test_creation[device0] | 0.4549ms | 78.6757μs | 12.7104 KOps/s | 12.6673 KOps/s | |
test_creation_from_tensor | 0.6177ms | 82.2467μs | 12.1585 KOps/s | 12.0369 KOps/s | |
test_add_one[memmap_tensor0] | 0.2529ms | 6.4319μs | 155.4739 KOps/s | 154.2897 KOps/s | |
test_contiguous[memmap_tensor0] | 1.8675μs | 0.3992μs | 2.5052 MOps/s | 2.4973 MOps/s | |
test_stack[memmap_tensor0] | 43.4410μs | 4.2602μs | 234.7289 KOps/s | 240.7432 KOps/s | |
test_memmaptd_index | 1.5017ms | 0.2341ms | 4.2721 KOps/s | 4.3795 KOps/s | |
test_memmaptd_index_astensor | 0.4417ms | 0.2971ms | 3.3660 KOps/s | 3.3898 KOps/s | |
test_memmaptd_index_op | 0.7189ms | 0.5274ms | 1.8960 KOps/s | 1.6998 KOps/s | |
test_serialize_model | 0.1324s | 0.1305s | 7.6603 Ops/s | 7.6585 Ops/s | |
test_serialize_model_pickle | 1.3494s | 1.1858s | 0.8433 Ops/s | 0.8263 Ops/s | |
test_serialize_weights | 0.1306s | 0.1298s | 7.7049 Ops/s | 7.7108 Ops/s | |
test_serialize_weights_returnearly | 0.4292s | 69.8852ms | 14.3092 Ops/s | 23.7892 Ops/s | |
test_serialize_weights_pickle | 1.3759s | 1.2189s | 0.8204 Ops/s | 0.8236 Ops/s | |
test_reshape_pytree | 67.2400μs | 21.8590μs | 45.7477 KOps/s | 46.0054 KOps/s | |
test_reshape_td | 67.7900μs | 27.1268μs | 36.8639 KOps/s | 36.9385 KOps/s | |
test_view_pytree | 55.5710μs | 21.7536μs | 45.9695 KOps/s | 45.8628 KOps/s | |
test_view_td | 84.8800μs | 31.7017μs | 31.5440 KOps/s | 31.3744 KOps/s | |
test_unbind_pytree | 55.8200μs | 27.5697μs | 36.2718 KOps/s | 36.6240 KOps/s | |
test_unbind_td | 0.7722ms | 35.7984μs | 27.9342 KOps/s | 28.3139 KOps/s | |
test_split_pytree | 0.1218ms | 29.7880μs | 33.5706 KOps/s | 34.0007 KOps/s | |
test_split_td | 0.9405ms | 38.2931μs | 26.1144 KOps/s | 26.1243 KOps/s | |
test_add_pytree | 74.6910μs | 33.5654μs | 29.7926 KOps/s | 30.0502 KOps/s | |
test_add_td | 88.6500μs | 47.6117μs | 21.0032 KOps/s | 19.8561 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1769ms | 0.1230ms | 8.1327 KOps/s | 7.7041 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.2556ms | 0.1338ms | 7.4712 KOps/s | 7.4875 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1409ms | 94.8735μs | 10.5404 KOps/s | 10.3661 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2165ms | 0.1489ms | 6.7162 KOps/s | 6.7255 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 57.0510μs | 24.4590μs | 40.8847 KOps/s | 41.9335 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 54.9710μs | 29.1360μs | 34.3219 KOps/s | 33.6024 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1216ms | 63.5038μs | 15.7471 KOps/s | 15.4458 KOps/s | |
test_compile_copy_nested[pytree-eager] | 79.0910μs | 48.6915μs | 20.5374 KOps/s | 20.1245 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.2015ms | 0.1411ms | 7.0860 KOps/s | 7.0892 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3341ms | 0.2177ms | 4.5924 KOps/s | 4.6136 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1507ms | 97.2148μs | 10.2865 KOps/s | 9.9613 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1185ms | 53.9431μs | 18.5380 KOps/s | 17.5657 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.1749ms | 0.1346ms | 7.4308 KOps/s | 7.4062 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.6132ms | 0.4822ms | 2.0738 KOps/s | 2.0760 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3958ms | 0.2609ms | 3.8322 KOps/s | 3.7987 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.1826ms | 0.1433ms | 6.9801 KOps/s | 7.0075 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1471ms | 65.9313μs | 15.1673 KOps/s | 14.6398 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1420ms | 99.0031μs | 10.1007 KOps/s | 10.0451 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.4697ms | 0.4114ms | 2.4305 KOps/s | 2.4266 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.1734ms | 0.1344ms | 7.4392 KOps/s | 7.3051 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 49.7100μs | 18.6694μs | 53.5637 KOps/s | 53.6566 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 63.4100μs | 31.2058μs | 32.0454 KOps/s | 31.5155 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1258ms | 69.8558μs | 14.3152 KOps/s | 14.3603 KOps/s | |
test_compile_copy_flat[pytree-eager] | 82.2300μs | 51.1754μs | 19.5406 KOps/s | 19.3033 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 1.6298ms | 0.3915ms | 2.5542 KOps/s | 2.1978 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.7436ms | 2.6245ms | 381.0279 Ops/s | 371.1404 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 1.5856ms | 0.4370ms | 2.2884 KOps/s | 2.2799 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 2.8462ms | 2.6785ms | 373.3404 Ops/s | 371.3994 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.1715ms | 0.1218ms | 8.2080 KOps/s | 8.5446 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5618ms | 82.1589μs | 12.1715 KOps/s | 12.6749 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.1701ms | 0.1142ms | 8.7564 KOps/s | 9.3180 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.1206ms | 67.4788μs | 14.8195 KOps/s | 14.7407 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1773ms | 0.1135ms | 8.8128 KOps/s | 9.3419 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.1208ms | 71.4401μs | 13.9977 KOps/s | 14.8065 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1506ms | 0.1055ms | 9.4816 KOps/s | 9.8664 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1455ms | 17.1335μs | 58.3653 KOps/s | 57.0590 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1442ms | 98.9408μs | 10.1071 KOps/s | 10.2143 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 54.4300μs | 15.7741μs | 63.3952 KOps/s | 63.9281 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1822ms | 95.5610μs | 10.4645 KOps/s | 10.2452 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 56.2910μs | 16.5159μs | 60.5477 KOps/s | 63.3007 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1684ms | 0.1056ms | 9.4706 KOps/s | 9.8496 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.5977ms | 17.3693μs | 57.5729 KOps/s | 58.1931 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1467ms | 95.6479μs | 10.4550 KOps/s | 10.2145 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 43.2700μs | 15.6841μs | 63.7589 KOps/s | 63.8172 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1542ms | 95.4416μs | 10.4776 KOps/s | 10.3028 KOps/s | |
test_compile_indexing[int-pytree-eager] | 53.1010μs | 16.5292μs | 60.4990 KOps/s | 64.3438 KOps/s | |
test_mod_add[eager] | 89.0600μs | 35.5178μs | 28.1549 KOps/s | 25.7233 KOps/s | |
test_mod_add[compile] | 0.1278ms | 78.1809μs | 12.7908 KOps/s | 11.8328 KOps/s | |
test_mod_add[compile-overhead] | 0.3302ms | 0.1676ms | 5.9651 KOps/s | 5.6940 KOps/s | |
test_mod_wrap[eager] | 0.3301ms | 0.2472ms | 4.0459 KOps/s | 3.8232 KOps/s | |
test_mod_wrap[compile] | 0.3895ms | 0.2817ms | 3.5497 KOps/s | 3.5084 KOps/s | |
test_mod_wrap[compile-overhead] | 7.1334ms | 3.7857ms | 264.1518 Ops/s | 263.7635 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.4596ms | 1.3539ms | 738.6278 Ops/s | 683.6757 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.5969ms | 1.2677ms | 788.8249 Ops/s | 722.6421 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.3750ms | 0.9290ms | 1.0765 KOps/s | 956.6002 Ops/s | |
test_seq_add[eager] | 0.1580ms | 0.1112ms | 8.9921 KOps/s | 8.4671 KOps/s | |
test_seq_add[compile] | 0.1342ms | 90.7204μs | 11.0229 KOps/s | 11.5442 KOps/s | |
test_seq_add[compile-overhead] | 0.1698ms | 0.1284ms | 7.7874 KOps/s | 7.4837 KOps/s | |
test_seq_wrap[eager] | 0.5216ms | 0.4354ms | 2.2965 KOps/s | 2.3200 KOps/s | |
test_seq_wrap[compile] | 0.3655ms | 0.3011ms | 3.3216 KOps/s | 3.1290 KOps/s | |
test_seq_wrap[compile-overhead] | 0.2943ms | 0.2220ms | 4.5047 KOps/s | 4.2576 KOps/s | |
test_func_call_runtime[False-eager] | 0.7871ms | 0.7283ms | 1.3730 KOps/s | 1.3291 KOps/s | |
test_func_call_runtime[False-compile] | 0.8041ms | 0.7351ms | 1.3604 KOps/s | 1.3521 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.4433ms | 0.3601ms | 2.7773 KOps/s | 2.7768 KOps/s | |
test_func_call_runtime[True-eager] | 0.9760ms | 0.8893ms | 1.1245 KOps/s | 1.1097 KOps/s | |
test_func_call_runtime[True-compile] | 0.8295ms | 0.7586ms | 1.3181 KOps/s | 1.2673 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.4346ms | 0.3811ms | 2.6239 KOps/s | 2.6103 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.7878ms | 0.7295ms | 1.3707 KOps/s | 1.3625 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.9030ms | 0.7411ms | 1.3494 KOps/s | 1.3428 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.4668ms | 0.3652ms | 2.7386 KOps/s | 2.7660 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.0756ms | 0.9984ms | 1.0016 KOps/s | 986.5633 Ops/s | |
test_func_call_cm_runtime[True-compile] | 1.1180ms | 0.9856ms | 1.0146 KOps/s | 1.0150 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 1.0629ms | 0.9795ms | 1.0209 KOps/s | 1.0093 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.5085ms | 2.0845ms | 479.7335 Ops/s | 470.9893 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.8546ms | 0.7992ms | 1.2513 KOps/s | 1.2306 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.4645ms | 0.4139ms | 2.4158 KOps/s | 2.4099 KOps/s | |
test_distributed | 3.0981ms | 0.2008ms | 4.9792 KOps/s | 8.4217 KOps/s | |
test_tdmodule | 0.5739ms | 18.9320μs | 52.8206 KOps/s | 47.0875 KOps/s | |
test_tdmodule_dispatch | 0.1723ms | 33.0443μs | 30.2624 KOps/s | 26.3316 KOps/s | |
test_tdseq | 41.1010μs | 19.5371μs | 51.1847 KOps/s | 45.6837 KOps/s | |
test_tdseq_dispatch | 63.9510μs | 35.9214μs | 27.8385 KOps/s | 24.3729 KOps/s | |
test_instantiation_functorch | 1.6498ms | 1.5463ms | 646.6924 Ops/s | 646.5730 Ops/s | |
test_exec_functorch | 0.1846ms | 0.1435ms | 6.9692 KOps/s | 6.9945 KOps/s | |
test_exec_functional_call | 0.1751ms | 0.1334ms | 7.4965 KOps/s | 7.5182 KOps/s | |
test_exec_td_decorator | 0.3751ms | 0.1831ms | 5.4613 KOps/s | 5.3969 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.7485ms | 0.6751ms | 1.4812 KOps/s | 1.4468 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.7952ms | 0.6788ms | 1.4732 KOps/s | 1.4439 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7084ms | 0.5972ms | 1.6744 KOps/s | 1.6646 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7090ms | 0.5956ms | 1.6791 KOps/s | 1.6667 KOps/s | |
test_vmap_transformer_speed_decorator[True-True] | 19.2919ms | 19.1858ms | 52.1218 Ops/s | 51.7255 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 19.2285ms | 19.1594ms | 52.1937 Ops/s | 51.8416 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 19.2333ms | 19.0298ms | 52.5491 Ops/s | 52.1104 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 19.2681ms | 18.9839ms | 52.6761 Ops/s | 52.2277 Ops/s | |
test_to_module_speed[True] | 1.4445ms | 0.9676ms | 1.0335 KOps/s | 1.0192 KOps/s | |
test_to_module_speed[False] | 1.0135ms | 0.9457ms | 1.0574 KOps/s | 1.0382 KOps/s | |
test_tc_init | 63.1300μs | 33.6806μs | 29.6907 KOps/s | 26.0085 KOps/s | |
test_tc_init_nested | 97.9710μs | 68.6676μs | 14.5629 KOps/s | 12.6657 KOps/s | |
test_tc_first_layer_tensor | 31.6910μs | 0.7951μs | 1.2578 MOps/s | 1.4612 MOps/s | |
test_tc_first_layer_nontensor | 24.1000μs | 2.2287μs | 448.7002 KOps/s | 452.2342 KOps/s | |
test_tc_second_layer_tensor | 21.6800μs | 1.3809μs | 724.1467 KOps/s | 713.9297 KOps/s | |
test_tc_second_layer_nontensor | 25.4200μs | 2.9270μs | 341.6432 KOps/s | 338.8124 KOps/s | |
test_unbind | 0.2158s | 12.0190ms | 83.2016 Ops/s | 143.6766 Ops/s | |
test_full_like | 9.2554ms | 9.0940ms | 109.9628 Ops/s | 107.6753 Ops/s | |
test_zeros_like | 9.2868ms | 7.1551ms | 139.7601 Ops/s | 231.4381 Ops/s | |
test_ones_like | 9.2139ms | 7.2330ms | 138.2550 Ops/s | 231.0697 Ops/s | |
test_clone | 6.7601ms | 6.3410ms | 157.7026 Ops/s | 157.2195 Ops/s | |
test_squeeze | 79.0910μs | 9.5283μs | 104.9506 KOps/s | 107.2107 KOps/s | |
test_unsqueeze | 0.1225ms | 71.7254μs | 13.9421 KOps/s | 13.6734 KOps/s | |
test_split | 0.3695ms | 0.1583ms | 6.3188 KOps/s | 6.2742 KOps/s | |
test_permute | 0.2401ms | 0.1869ms | 5.3502 KOps/s | 5.6490 KOps/s | |
test_stack | 50.2264ms | 50.0182ms | 19.9927 Ops/s | 19.9692 Ops/s | |
test_cat | 50.2454ms | 49.8336ms | 20.0668 Ops/s | 19.9517 Ops/s |
vmoens
added a commit
that referenced
this pull request
Feb 6, 2025
ghstack-source-id: 1ab62b35bb34b0a0ddaa8b1afc06a48702f41fdf Pull Request resolved: #1211
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
bug
Something isn't working
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
suitable for minor
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Stack from ghstack (oldest at bottom):
zip(..., strict=True)
in TDModules #1212