-
Notifications
You must be signed in to change notification settings - Fork 79
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BugFix] Fix number check in array parsing (np>=2 compatibility) #999
Merged
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
vmoens
added a commit
that referenced
this pull request
Sep 17, 2024
ghstack-source-id: fa8f983381fd1019f9bd371389f728d84140a80c Pull Request resolved: #999
vmoens
added a commit
that referenced
this pull request
Sep 17, 2024
ghstack-source-id: 867b59d12577ce4e84a0aa0bd17afcc135ed291c Pull Request resolved: #999
vmoens
added a commit
that referenced
this pull request
Sep 17, 2024
ghstack-source-id: bdf9ab4f4c61ef5a407d76ebd5433d0d6a897b59 Pull Request resolved: #999
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 38.4210μs | 19.8095μs | 50.4809 KOps/s | 48.5027 KOps/s | |
test_plain_set_stack_nested | 71.1930μs | 19.7886μs | 50.5341 KOps/s | 48.0801 KOps/s | |
test_plain_set_nested_inplace | 65.4920μs | 21.6906μs | 46.1029 KOps/s | 44.6459 KOps/s | |
test_plain_set_stack_nested_inplace | 0.1504ms | 21.8656μs | 45.7339 KOps/s | 44.0759 KOps/s | |
test_items | 42.6990μs | 4.3780μs | 228.4152 KOps/s | 243.9068 KOps/s | |
test_items_nested | 0.7459ms | 0.3567ms | 2.8037 KOps/s | 2.8016 KOps/s | |
test_items_nested_locked | 0.8277ms | 0.3576ms | 2.7963 KOps/s | 2.7966 KOps/s | |
test_items_nested_leaf | 0.1527ms | 69.6365μs | 14.3603 KOps/s | 14.5355 KOps/s | |
test_items_stack_nested | 0.6451ms | 0.3593ms | 2.7834 KOps/s | 2.7460 KOps/s | |
test_items_stack_nested_leaf | 0.1301ms | 71.0671μs | 14.0712 KOps/s | 14.3239 KOps/s | |
test_items_stack_nested_locked | 0.6248ms | 0.3578ms | 2.7950 KOps/s | 2.7856 KOps/s | |
test_keys | 37.8710μs | 3.9717μs | 251.7826 KOps/s | 282.8754 KOps/s | |
test_keys_nested | 0.1769ms | 0.1027ms | 9.7410 KOps/s | 10.1439 KOps/s | |
test_keys_nested_locked | 0.7169ms | 0.1075ms | 9.2983 KOps/s | 9.5089 KOps/s | |
test_keys_nested_leaf | 0.1522ms | 82.4981μs | 12.1215 KOps/s | 12.1692 KOps/s | |
test_keys_stack_nested | 0.1695ms | 99.6175μs | 10.0384 KOps/s | 10.0796 KOps/s | |
test_keys_stack_nested_leaf | 0.1416ms | 81.3795μs | 12.2881 KOps/s | 12.2503 KOps/s | |
test_keys_stack_nested_locked | 0.1825ms | 0.1061ms | 9.4223 KOps/s | 9.5436 KOps/s | |
test_values | 11.2335μs | 1.1671μs | 856.8564 KOps/s | 900.5919 KOps/s | |
test_values_nested | 0.1236ms | 74.5054μs | 13.4218 KOps/s | 13.5544 KOps/s | |
test_values_nested_locked | 0.1362ms | 74.2625μs | 13.4657 KOps/s | 13.6504 KOps/s | |
test_values_nested_leaf | 0.1105ms | 61.8846μs | 16.1591 KOps/s | 16.0812 KOps/s | |
test_values_stack_nested | 0.1581ms | 74.9483μs | 13.3425 KOps/s | 13.6492 KOps/s | |
test_values_stack_nested_leaf | 0.1140ms | 61.6335μs | 16.2249 KOps/s | 16.1042 KOps/s | |
test_values_stack_nested_locked | 0.1365ms | 75.3324μs | 13.2745 KOps/s | 13.6343 KOps/s | |
test_membership | 30.0160μs | 0.8471μs | 1.1804 MOps/s | 1.1144 MOps/s | |
test_membership_nested | 31.4280μs | 2.6828μs | 372.7439 KOps/s | 365.2778 KOps/s | |
test_membership_nested_leaf | 22.4620μs | 2.7145μs | 368.3910 KOps/s | 362.7005 KOps/s | |
test_membership_stacked_nested | 30.5280μs | 2.7370μs | 365.3681 KOps/s | 363.2645 KOps/s | |
test_membership_stacked_nested_leaf | 36.2580μs | 2.7246μs | 367.0199 KOps/s | 359.9597 KOps/s | |
test_membership_nested_last | 22.9720μs | 3.8578μs | 259.2134 KOps/s | 251.1625 KOps/s | |
test_membership_nested_leaf_last | 32.5010μs | 3.8884μs | 257.1783 KOps/s | 249.7245 KOps/s | |
test_membership_stacked_nested_last | 43.7020μs | 3.8478μs | 259.8899 KOps/s | 253.2176 KOps/s | |
test_membership_stacked_nested_leaf_last | 29.1550μs | 3.8901μs | 257.0639 KOps/s | 254.1571 KOps/s | |
test_nested_getleaf | 0.1401ms | 10.6369μs | 94.0121 KOps/s | 95.3581 KOps/s | |
test_nested_get | 0.1646ms | 10.3509μs | 96.6101 KOps/s | 99.8596 KOps/s | |
test_stacked_getleaf | 0.1723ms | 10.8947μs | 91.7874 KOps/s | 95.7451 KOps/s | |
test_stacked_get | 44.6530μs | 10.0426μs | 99.5753 KOps/s | 100.9735 KOps/s | |
test_nested_getitemleaf | 54.0710μs | 10.9716μs | 91.1446 KOps/s | 90.4454 KOps/s | |
test_nested_getitem | 45.8260μs | 10.2669μs | 97.4004 KOps/s | 99.6818 KOps/s | |
test_stacked_getitemleaf | 0.2400ms | 10.8786μs | 91.9234 KOps/s | 90.9965 KOps/s | |
test_stacked_getitem | 0.1277ms | 10.2610μs | 97.4563 KOps/s | 99.1017 KOps/s | |
test_lock_nested | 91.5955ms | 0.5788ms | 1.7277 KOps/s | 2.0753 KOps/s | |
test_lock_stack_nested | 0.6038ms | 0.4594ms | 2.1769 KOps/s | 2.1951 KOps/s | |
test_unlock_nested | 99.2661ms | 0.5035ms | 1.9859 KOps/s | 2.4915 KOps/s | |
test_unlock_stack_nested | 0.5561ms | 0.3735ms | 2.6776 KOps/s | 2.6616 KOps/s | |
test_flatten_speed | 0.2197ms | 89.5695μs | 11.1645 KOps/s | 11.3618 KOps/s | |
test_unflatten_speed | 0.7446ms | 0.4657ms | 2.1473 KOps/s | 2.1781 KOps/s | |
test_common_ops | 2.2889ms | 1.1151ms | 896.7781 Ops/s | 863.6591 Ops/s | |
test_creation | 27.5120μs | 2.1971μs | 455.1432 KOps/s | 469.7966 KOps/s | |
test_creation_empty | 85.5100μs | 17.4895μs | 57.1771 KOps/s | 54.0268 KOps/s | |
test_creation_nested_1 | 0.1002ms | 20.8838μs | 47.8840 KOps/s | 45.7117 KOps/s | |
test_creation_nested_2 | 79.4080μs | 24.7883μs | 40.3416 KOps/s | 38.4066 KOps/s | |
test_clone | 1.3805ms | 16.4232μs | 60.8896 KOps/s | 58.6224 KOps/s | |
test_getitem[int] | 0.7365ms | 16.8905μs | 59.2049 KOps/s | 60.3035 KOps/s | |
test_getitem[slice_int] | 0.1595ms | 30.0362μs | 33.2932 KOps/s | 32.9113 KOps/s | |
test_getitem[range] | 0.2027ms | 55.1203μs | 18.1421 KOps/s | 17.2967 KOps/s | |
test_getitem[tuple] | 0.1670ms | 25.2241μs | 39.6447 KOps/s | 39.2543 KOps/s | |
test_getitem[list] | 0.1976ms | 51.1505μs | 19.5501 KOps/s | 18.8818 KOps/s | |
test_setitem_dim[int] | 54.5110μs | 31.3594μs | 31.8884 KOps/s | 30.0019 KOps/s | |
test_setitem_dim[slice_int] | 0.1292ms | 60.6903μs | 16.4771 KOps/s | 16.2869 KOps/s | |
test_setitem_dim[range] | 0.2054ms | 83.4793μs | 11.9790 KOps/s | 11.9230 KOps/s | |
test_setitem_dim[tuple] | 0.1094ms | 48.8976μs | 20.4509 KOps/s | 20.5505 KOps/s | |
test_setitem | 96.8810μs | 29.0209μs | 34.4579 KOps/s | 32.9649 KOps/s | |
test_set | 0.3155ms | 27.9440μs | 35.7859 KOps/s | 33.7095 KOps/s | |
test_set_shared | 7.4537ms | 0.2132ms | 4.6905 KOps/s | 4.5531 KOps/s | |
test_update | 0.2912ms | 35.5220μs | 28.1516 KOps/s | 26.8587 KOps/s | |
test_update_nested | 1.0785ms | 45.8770μs | 21.7974 KOps/s | 20.9965 KOps/s | |
test_update__nested | 0.2996ms | 33.2037μs | 30.1171 KOps/s | 29.1089 KOps/s | |
test_set_nested | 0.3518ms | 30.9465μs | 32.3138 KOps/s | 31.3773 KOps/s | |
test_set_nested_new | 0.1123ms | 35.7686μs | 27.9575 KOps/s | 26.5033 KOps/s | |
test_select | 0.1294ms | 53.3484μs | 18.7447 KOps/s | 18.4849 KOps/s | |
test_select_nested | 0.1409ms | 59.3664μs | 16.8446 KOps/s | 16.8785 KOps/s | |
test_exclude_nested | 0.1477ms | 74.7407μs | 13.3796 KOps/s | 13.2731 KOps/s | |
test_empty[True] | 0.5894ms | 0.3158ms | 3.1665 KOps/s | 3.1693 KOps/s | |
test_empty[False] | 14.0035μs | 1.1948μs | 836.9519 KOps/s | 843.0833 KOps/s | |
test_unbind_speed | 0.6177ms | 0.2959ms | 3.3795 KOps/s | 3.3383 KOps/s | |
test_unbind_speed_stack0 | 0.4481ms | 0.2934ms | 3.4081 KOps/s | 3.3953 KOps/s | |
test_unbind_speed_stack1 | 97.5352ms | 0.8022ms | 1.2466 KOps/s | 1.3188 KOps/s | |
test_split | 3.2370ms | 1.9370ms | 516.2752 Ops/s | 456.5424 Ops/s | |
test_chunk | 94.1987ms | 2.2860ms | 437.4456 Ops/s | 452.0026 Ops/s | |
test_creation[device0] | 0.1906ms | 0.1137ms | 8.7989 KOps/s | 8.2714 KOps/s | |
test_creation_from_tensor | 4.0315ms | 0.1167ms | 8.5669 KOps/s | 8.4651 KOps/s | |
test_add_one[memmap_tensor0] | 0.1163ms | 6.9679μs | 143.5151 KOps/s | 136.4298 KOps/s | |
test_contiguous[memmap_tensor0] | 25.0570μs | 1.8996μs | 526.4320 KOps/s | 526.9050 KOps/s | |
test_stack[memmap_tensor0] | 57.1560μs | 5.4772μs | 182.5754 KOps/s | 175.8322 KOps/s | |
test_memmaptd_index | 1.0679ms | 0.4007ms | 2.4955 KOps/s | 2.5881 KOps/s | |
test_memmaptd_index_astensor | 0.7754ms | 0.4766ms | 2.0982 KOps/s | 2.1482 KOps/s | |
test_memmaptd_index_op | 1.5025ms | 0.9918ms | 1.0083 KOps/s | 981.3538 Ops/s | |
test_serialize_model | 0.1246s | 0.1182s | 8.4591 Ops/s | 8.2158 Ops/s | |
test_serialize_model_pickle | 0.4706s | 0.4003s | 2.4981 Ops/s | 2.5249 Ops/s | |
test_serialize_weights | 0.1257s | 0.1163s | 8.5992 Ops/s | 8.3921 Ops/s | |
test_serialize_weights_returnearly | 0.2604s | 0.1756s | 5.6940 Ops/s | 6.3004 Ops/s | |
test_serialize_weights_pickle | 0.4787s | 0.4060s | 2.4633 Ops/s | 1.1605 Ops/s | |
test_serialize_weights_filesystem | 0.1502s | 0.1456s | 6.8704 Ops/s | 6.4637 Ops/s | |
test_serialize_model_filesystem | 0.2487s | 0.1646s | 6.0764 Ops/s | 6.9851 Ops/s | |
test_reshape_pytree | 84.9890μs | 37.8978μs | 26.3867 KOps/s | 25.8687 KOps/s | |
test_reshape_td | 88.5760μs | 44.7040μs | 22.3694 KOps/s | 21.6863 KOps/s | |
test_view_pytree | 81.0420μs | 38.0059μs | 26.3117 KOps/s | 25.8671 KOps/s | |
test_view_td | 0.1227ms | 49.9245μs | 20.0303 KOps/s | 18.9367 KOps/s | |
test_unbind_pytree | 73.6170μs | 35.1762μs | 28.4283 KOps/s | 28.1853 KOps/s | |
test_unbind_td | 0.3465ms | 43.4008μs | 23.0410 KOps/s | 22.3616 KOps/s | |
test_split_pytree | 99.9860μs | 39.4832μs | 25.3272 KOps/s | 26.5076 KOps/s | |
test_split_td | 0.2373ms | 55.6600μs | 17.9662 KOps/s | 17.4131 KOps/s | |
test_add_pytree | 95.8290μs | 41.9446μs | 23.8410 KOps/s | 22.3741 KOps/s | |
test_add_td | 0.1595ms | 77.2134μs | 12.9511 KOps/s | 12.2399 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1167ms | 57.1063μs | 17.5112 KOps/s | 17.5569 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.3657ms | 0.1749ms | 5.7179 KOps/s | 5.6942 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1085ms | 56.1045μs | 17.8239 KOps/s | 12.7760 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.3873ms | 0.1377ms | 7.2636 KOps/s | 7.0910 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 85.9100μs | 21.8433μs | 45.7805 KOps/s | 47.6833 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1297ms | 66.1382μs | 15.1199 KOps/s | 14.6962 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1538ms | 74.8147μs | 13.3664 KOps/s | 13.2806 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1299ms | 67.8455μs | 14.7394 KOps/s | 14.8104 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.2864ms | 0.1718ms | 5.8212 KOps/s | 5.7798 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3630ms | 0.1887ms | 5.3005 KOps/s | 5.3493 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1001ms | 46.3930μs | 21.5550 KOps/s | 21.2853 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1274ms | 67.8163μs | 14.7457 KOps/s | 14.9630 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2981ms | 0.1738ms | 5.7533 KOps/s | 5.7654 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.6424ms | 0.2809ms | 3.5604 KOps/s | 3.5208 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3820ms | 0.2018ms | 4.9564 KOps/s | 5.0024 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.3473ms | 0.1727ms | 5.7919 KOps/s | 5.7743 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1572ms | 62.1164μs | 16.0988 KOps/s | 15.9552 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1153ms | 46.5925μs | 21.4627 KOps/s | 21.0159 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.4893ms | 0.2283ms | 4.3811 KOps/s | 4.2267 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.2878ms | 0.1736ms | 5.7615 KOps/s | 5.7599 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 0.2079ms | 0.1022ms | 9.7875 KOps/s | 9.6263 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1234ms | 56.6895μs | 17.6400 KOps/s | 16.7330 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1893ms | 78.1116μs | 12.8022 KOps/s | 13.0306 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1926ms | 68.3245μs | 14.6360 KOps/s | 14.8148 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.2954ms | 0.1972ms | 5.0698 KOps/s | 5.1704 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.0262ms | 1.6306ms | 613.2840 Ops/s | 617.2924 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.3477ms | 0.1939ms | 5.1561 KOps/s | 5.2351 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 1.3702ms | 1.0670ms | 937.1771 Ops/s | 937.8228 Ops/s | |
test_compile_assign_and_add_stack[compile] | 0.7292ms | 0.4228ms | 2.3654 KOps/s | 2.4054 KOps/s | |
test_compile_assign_and_add_stack[eager] | 3.8164ms | 3.6704ms | 272.4523 Ops/s | 264.2362 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.2675ms | 34.4220μs | 29.0512 KOps/s | 28.9684 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.6658ms | 46.2419μs | 21.6254 KOps/s | 21.1766 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.1071ms | 28.8531μs | 34.6584 KOps/s | 33.7492 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 75.2410μs | 27.4688μs | 36.4049 KOps/s | 36.2255 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 93.9960μs | 28.6806μs | 34.8668 KOps/s | 33.6290 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 83.0950μs | 26.6157μs | 37.5718 KOps/s | 36.1356 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1889ms | 73.5696μs | 13.5926 KOps/s | 13.6240 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.5356ms | 26.6651μs | 37.5021 KOps/s | 36.2076 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1316ms | 67.6838μs | 14.7746 KOps/s | 14.5442 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 95.6990μs | 22.7910μs | 43.8769 KOps/s | 43.2312 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1517ms | 68.3635μs | 14.6277 KOps/s | 14.4986 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 0.1914ms | 22.6583μs | 44.1338 KOps/s | 43.5674 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1509ms | 74.0370μs | 13.5068 KOps/s | 13.8683 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.9247ms | 26.6509μs | 37.5221 KOps/s | 36.3597 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1688ms | 67.4333μs | 14.8295 KOps/s | 14.1776 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 0.1060ms | 22.0382μs | 45.3758 KOps/s | 43.3151 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.4127ms | 67.8408μs | 14.7404 KOps/s | 14.5413 KOps/s | |
test_compile_indexing[int-pytree-eager] | 85.2990μs | 22.1556μs | 45.1354 KOps/s | 43.4301 KOps/s | |
test_mod_add[eager] | 87.4640μs | 25.1605μs | 39.7449 KOps/s | 38.3365 KOps/s | |
test_mod_add[compile] | 0.1153ms | 39.0774μs | 25.5902 KOps/s | 26.7957 KOps/s | |
test_mod_add[compile-overhead] | 0.1039ms | 40.0235μs | 24.9853 KOps/s | 26.3384 KOps/s | |
test_mod_wrap[eager] | 0.3832ms | 0.2060ms | 4.8537 KOps/s | 5.0260 KOps/s | |
test_mod_wrap[compile] | 0.3135ms | 0.2262ms | 4.4206 KOps/s | 4.4141 KOps/s | |
test_mod_wrap[compile-overhead] | 0.3898ms | 0.2204ms | 4.5368 KOps/s | 4.3870 KOps/s | |
test_mod_wrap_and_backward[eager] | 12.3492ms | 10.9749ms | 91.1172 Ops/s | 90.5053 Ops/s | |
test_mod_wrap_and_backward[compile] | 12.1706ms | 10.8229ms | 92.3971 Ops/s | 78.8386 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 12.2722ms | 10.8651ms | 92.0377 Ops/s | 78.4642 Ops/s | |
test_seq_add[eager] | 0.1632ms | 89.3984μs | 11.1859 KOps/s | 11.0824 KOps/s | |
test_seq_add[compile] | 0.1150ms | 63.1836μs | 15.8269 KOps/s | 15.6747 KOps/s | |
test_seq_add[compile-overhead] | 0.1282ms | 62.3210μs | 16.0460 KOps/s | 15.9222 KOps/s | |
test_seq_wrap[eager] | 0.6285ms | 0.3665ms | 2.7284 KOps/s | 2.6047 KOps/s | |
test_seq_wrap[compile] | 5.3693ms | 0.2614ms | 3.8258 KOps/s | 3.7625 KOps/s | |
test_seq_wrap[compile-overhead] | 1.3734ms | 0.2589ms | 3.8621 KOps/s | 3.7922 KOps/s | |
test_func_call_runtime[False-eager] | 0.8626ms | 0.4848ms | 2.0628 KOps/s | 2.0023 KOps/s | |
test_func_call_runtime[False-compile] | 1.0396ms | 0.4870ms | 2.0536 KOps/s | 1.9944 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.6029ms | 0.4799ms | 2.0838 KOps/s | 1.9903 KOps/s | |
test_func_call_runtime[True-eager] | 0.8715ms | 0.6984ms | 1.4317 KOps/s | 1.3762 KOps/s | |
test_func_call_runtime[True-compile] | 0.8278ms | 0.4996ms | 2.0014 KOps/s | 1.9938 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.8650ms | 0.4977ms | 2.0094 KOps/s | 1.9721 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.8523ms | 0.4862ms | 2.0568 KOps/s | 2.0368 KOps/s | |
test_func_call_cm_runtime[False-compile] | 1.0492ms | 0.5081ms | 1.9679 KOps/s | 1.9693 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.6494ms | 0.4818ms | 2.0754 KOps/s | 1.9541 KOps/s | |
test_func_call_cm_runtime[True-eager] | 0.9952ms | 0.8267ms | 1.2096 KOps/s | 1.1695 KOps/s | |
test_func_call_cm_runtime[True-compile] | 1.0022ms | 0.6945ms | 1.4400 KOps/s | 1.3501 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.9582ms | 0.6997ms | 1.4291 KOps/s | 1.3862 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.5390ms | 1.8258ms | 547.6911 Ops/s | 531.2093 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 3.1950ms | 1.8901ms | 529.0835 Ops/s | 520.9236 Ops/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 2.9753ms | 1.8809ms | 531.6536 Ops/s | 517.3321 Ops/s | |
test_distributed | 0.5119ms | 0.1255ms | 7.9701 KOps/s | 7.4780 KOps/s | |
test_tdmodule | 33.3620μs | 17.7086μs | 56.4696 KOps/s | 51.9032 KOps/s | |
test_tdmodule_dispatch | 66.6950μs | 35.6516μs | 28.0492 KOps/s | 26.5659 KOps/s | |
test_tdseq | 37.6100μs | 19.9119μs | 50.2212 KOps/s | 47.4015 KOps/s | |
test_tdseq_dispatch | 68.3880μs | 40.4125μs | 24.7448 KOps/s | 23.5739 KOps/s | |
test_instantiation_functorch | 1.8356ms | 1.5225ms | 656.8141 Ops/s | 632.0929 Ops/s | |
test_instantiation_td | 1.9061ms | 1.1497ms | 869.7969 Ops/s | 854.6135 Ops/s | |
test_exec_functorch | 0.6122ms | 0.1793ms | 5.5772 KOps/s | 5.4501 KOps/s | |
test_exec_functional_call | 0.3490ms | 0.1676ms | 5.9663 KOps/s | 6.0465 KOps/s | |
test_exec_td | 0.4079ms | 0.1633ms | 6.1253 KOps/s | 6.1739 KOps/s | |
test_exec_td_decorator | 0.9330ms | 0.2142ms | 4.6680 KOps/s | 4.6070 KOps/s | |
test_vmap_mlp_speed[True-True] | 0.9924ms | 0.6351ms | 1.5745 KOps/s | 1.5681 KOps/s | |
test_vmap_mlp_speed[True-False] | 0.9405ms | 0.6260ms | 1.5974 KOps/s | 1.6047 KOps/s | |
test_vmap_mlp_speed[False-True] | 0.8661ms | 0.4892ms | 2.0439 KOps/s | 2.0785 KOps/s | |
test_vmap_mlp_speed[False-False] | 0.9294ms | 0.4857ms | 2.0588 KOps/s | 2.0636 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 1.5362ms | 0.6101ms | 1.6391 KOps/s | 1.6442 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.8088ms | 0.6040ms | 1.6557 KOps/s | 1.6300 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7737ms | 0.4985ms | 2.0059 KOps/s | 1.9981 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7422ms | 0.4971ms | 2.0118 KOps/s | 2.0033 KOps/s | |
test_to_module_speed[True] | 2.5293ms | 1.2769ms | 783.1576 Ops/s | 760.0149 Ops/s | |
test_to_module_speed[False] | 2.0390ms | 1.2733ms | 785.3554 Ops/s | 776.7833 Ops/s | |
test_tc_init | 82.5140μs | 43.4190μs | 23.0314 KOps/s | 21.4822 KOps/s | |
test_tc_init_nested | 0.1493ms | 84.4776μs | 11.8375 KOps/s | 10.8325 KOps/s | |
test_tc_first_layer_tensor | 20.9900μs | 1.5500μs | 645.1497 KOps/s | 643.4956 KOps/s | |
test_tc_first_layer_nontensor | 42.6190μs | 4.9622μs | 201.5249 KOps/s | 208.9628 KOps/s | |
test_tc_second_layer_tensor | 24.3850μs | 2.8342μs | 352.8279 KOps/s | 346.0527 KOps/s | |
test_tc_second_layer_nontensor | 37.2500μs | 6.3441μs | 157.6261 KOps/s | 161.9332 KOps/s | |
test_unbind | 0.4753s | 13.4613ms | 74.2869 Ops/s | 63.7104 Ops/s | |
test_full_like | 20.6506ms | 12.4465ms | 80.3436 Ops/s | 102.9791 Ops/s | |
test_zeros_like | 14.3546ms | 7.6277ms | 131.1009 Ops/s | 324.4049 Ops/s | |
test_ones_like | 13.3937ms | 7.5087ms | 133.1793 Ops/s | 299.6888 Ops/s | |
test_clone | 14.9913ms | 9.1281ms | 109.5513 Ops/s | 170.5978 Ops/s | |
test_squeeze | 62.9570μs | 12.7457μs | 78.4579 KOps/s | 76.3815 KOps/s | |
test_unsqueeze | 0.2924ms | 90.8720μs | 11.0045 KOps/s | 10.5551 KOps/s | |
test_split | 0.3391ms | 0.1899ms | 5.2651 KOps/s | 5.0114 KOps/s | |
test_permute | 0.3654ms | 0.2138ms | 4.6774 KOps/s | 4.5000 KOps/s | |
test_stack | 27.9960ms | 25.8100ms | 38.7447 Ops/s | 39.5397 Ops/s | |
test_cat | 31.1312ms | 25.4561ms | 39.2834 Ops/s | 38.9400 Ops/s |
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 0.1147ms | 14.7169μs | 67.9493 KOps/s | 70.7691 KOps/s | |
test_plain_set_stack_nested | 45.9400μs | 14.4911μs | 69.0077 KOps/s | 71.3100 KOps/s | |
test_plain_set_nested_inplace | 54.5110μs | 15.8968μs | 62.9058 KOps/s | 65.6420 KOps/s | |
test_plain_set_stack_nested_inplace | 44.6510μs | 15.4091μs | 64.8965 KOps/s | 66.1759 KOps/s | |
test_items | 29.9110μs | 2.8541μs | 350.3716 KOps/s | 342.5932 KOps/s | |
test_items_nested | 0.3720ms | 0.3295ms | 3.0345 KOps/s | 3.0112 KOps/s | |
test_items_nested_locked | 0.3739ms | 0.3279ms | 3.0494 KOps/s | 3.0246 KOps/s | |
test_items_nested_leaf | 81.8510μs | 55.5066μs | 18.0159 KOps/s | 17.8671 KOps/s | |
test_items_stack_nested | 0.3821ms | 0.3239ms | 3.0878 KOps/s | 3.0165 KOps/s | |
test_items_stack_nested_leaf | 88.9510μs | 56.5590μs | 17.6807 KOps/s | 17.3524 KOps/s | |
test_items_stack_nested_locked | 0.3840ms | 0.3315ms | 3.0164 KOps/s | 3.0552 KOps/s | |
test_keys | 33.0400μs | 3.4239μs | 292.0682 KOps/s | 293.6231 KOps/s | |
test_keys_nested | 83.6620μs | 56.4525μs | 17.7140 KOps/s | 18.2813 KOps/s | |
test_keys_nested_locked | 2.5685ms | 62.7144μs | 15.9453 KOps/s | 16.1089 KOps/s | |
test_keys_nested_leaf | 0.1086ms | 47.0493μs | 21.2543 KOps/s | 21.0757 KOps/s | |
test_keys_stack_nested | 85.7010μs | 55.7596μs | 17.9341 KOps/s | 18.2275 KOps/s | |
test_keys_stack_nested_leaf | 97.5620μs | 48.0839μs | 20.7970 KOps/s | 20.7079 KOps/s | |
test_keys_stack_nested_locked | 94.0210μs | 60.7620μs | 16.4577 KOps/s | 16.2829 KOps/s | |
test_values | 4.0583μs | 0.8472μs | 1.1804 MOps/s | 1.2011 MOps/s | |
test_values_nested | 58.4910μs | 40.8789μs | 24.4625 KOps/s | 24.5124 KOps/s | |
test_values_nested_locked | 71.0310μs | 42.7490μs | 23.3924 KOps/s | 23.2944 KOps/s | |
test_values_nested_leaf | 59.9510μs | 35.4260μs | 28.2278 KOps/s | 28.2037 KOps/s | |
test_values_stack_nested | 75.2320μs | 41.4817μs | 24.1070 KOps/s | 23.9840 KOps/s | |
test_values_stack_nested_leaf | 77.6220μs | 35.5863μs | 28.1007 KOps/s | 28.2026 KOps/s | |
test_values_stack_nested_locked | 74.1110μs | 43.1447μs | 23.1778 KOps/s | 22.9853 KOps/s | |
test_membership | 1.6421μs | 0.4995μs | 2.0022 MOps/s | 1.9897 MOps/s | |
test_membership_nested | 17.1405μs | 1.9076μs | 524.2231 KOps/s | 531.2675 KOps/s | |
test_membership_nested_leaf | 15.9755μs | 1.9118μs | 523.0743 KOps/s | 540.0284 KOps/s | |
test_membership_stacked_nested | 24.5600μs | 1.9432μs | 514.6210 KOps/s | 525.8887 KOps/s | |
test_membership_stacked_nested_leaf | 33.7910μs | 1.9423μs | 514.8476 KOps/s | 518.7014 KOps/s | |
test_membership_nested_last | 27.1400μs | 2.8642μs | 349.1398 KOps/s | 359.1361 KOps/s | |
test_membership_nested_leaf_last | 27.4100μs | 2.8131μs | 355.4755 KOps/s | 363.1856 KOps/s | |
test_membership_stacked_nested_last | 41.1210μs | 7.9590μs | 125.6443 KOps/s | 210.8753 KOps/s | |
test_membership_stacked_nested_leaf_last | 24.5800μs | 7.8146μs | 127.9657 KOps/s | 212.7440 KOps/s | |
test_nested_getleaf | 29.5310μs | 6.0637μs | 164.9160 KOps/s | 164.6661 KOps/s | |
test_nested_get | 27.4110μs | 5.7208μs | 174.7994 KOps/s | 175.8635 KOps/s | |
test_stacked_getleaf | 32.0500μs | 6.0475μs | 165.3581 KOps/s | 163.3499 KOps/s | |
test_stacked_get | 25.1610μs | 5.7209μs | 174.7992 KOps/s | 178.8267 KOps/s | |
test_nested_getitemleaf | 29.7110μs | 6.1337μs | 163.0327 KOps/s | 161.8152 KOps/s | |
test_nested_getitem | 27.7300μs | 5.7853μs | 172.8531 KOps/s | 172.4962 KOps/s | |
test_stacked_getitemleaf | 36.9210μs | 6.0832μs | 164.3883 KOps/s | 163.8443 KOps/s | |
test_stacked_getitem | 34.2210μs | 5.6870μs | 175.8384 KOps/s | 173.5926 KOps/s | |
test_lock_nested | 4.7097ms | 0.4244ms | 2.3561 KOps/s | 2.3866 KOps/s | |
test_lock_stack_nested | 0.4180ms | 0.3708ms | 2.6966 KOps/s | 2.6860 KOps/s | |
test_unlock_nested | 0.7722ms | 0.3591ms | 2.7848 KOps/s | 2.8036 KOps/s | |
test_unlock_stack_nested | 0.3521ms | 0.3088ms | 3.2382 KOps/s | 3.2134 KOps/s | |
test_flatten_speed | 0.1462ms | 69.3596μs | 14.4176 KOps/s | 14.2669 KOps/s | |
test_unflatten_speed | 0.3459ms | 0.2867ms | 3.4882 KOps/s | 3.5050 KOps/s | |
test_common_ops | 1.5612ms | 1.2724ms | 785.9400 Ops/s | 796.8171 Ops/s | |
test_creation | 26.4700μs | 1.4954μs | 668.7048 KOps/s | 669.1785 KOps/s | |
test_creation_empty | 44.3210μs | 16.7462μs | 59.7151 KOps/s | 62.5183 KOps/s | |
test_creation_nested_1 | 53.4410μs | 18.5056μs | 54.0378 KOps/s | 57.0517 KOps/s | |
test_creation_nested_2 | 47.2110μs | 21.0210μs | 47.5715 KOps/s | 48.9843 KOps/s | |
test_clone | 59.0110μs | 29.4012μs | 34.0122 KOps/s | 33.5641 KOps/s | |
test_getitem[int] | 1.3069ms | 15.9602μs | 62.6559 KOps/s | 61.6586 KOps/s | |
test_getitem[slice_int] | 0.1196ms | 27.1910μs | 36.7769 KOps/s | 37.0718 KOps/s | |
test_getitem[range] | 0.2190ms | 0.1101ms | 9.0797 KOps/s | 9.1494 KOps/s | |
test_getitem[tuple] | 0.1166ms | 23.5987μs | 42.3752 KOps/s | 42.2090 KOps/s | |
test_getitem[list] | 0.1988ms | 97.9544μs | 10.2088 KOps/s | 10.1963 KOps/s | |
test_setitem_dim[int] | 67.6210μs | 45.3942μs | 22.0292 KOps/s | 22.1552 KOps/s | |
test_setitem_dim[slice_int] | 0.1001ms | 66.5719μs | 15.0214 KOps/s | 14.8444 KOps/s | |
test_setitem_dim[range] | 0.1738ms | 0.1268ms | 7.8880 KOps/s | 7.8812 KOps/s | |
test_setitem_dim[tuple] | 88.0110μs | 61.0650μs | 16.3760 KOps/s | 16.3399 KOps/s | |
test_setitem | 82.9120μs | 43.0192μs | 23.2454 KOps/s | 23.5532 KOps/s | |
test_set | 83.0020μs | 42.4016μs | 23.5840 KOps/s | 23.9533 KOps/s | |
test_set_shared | 0.3083ms | 51.3038μs | 19.4917 KOps/s | 19.2732 KOps/s | |
test_update | 93.7510μs | 51.2990μs | 19.4935 KOps/s | 19.8010 KOps/s | |
test_update_nested | 91.2820μs | 58.6683μs | 17.0450 KOps/s | 17.6523 KOps/s | |
test_update__nested | 97.2410μs | 60.5091μs | 16.5264 KOps/s | 16.8470 KOps/s | |
test_set_nested | 80.5320μs | 44.7858μs | 22.3285 KOps/s | 22.8653 KOps/s | |
test_set_nested_new | 84.2720μs | 48.2859μs | 20.7100 KOps/s | 20.9243 KOps/s | |
test_select | 0.1017ms | 61.9894μs | 16.1318 KOps/s | 16.3019 KOps/s | |
test_select_nested | 0.5746ms | 42.6454μs | 23.4492 KOps/s | 23.0602 KOps/s | |
test_exclude_nested | 95.7420μs | 60.3953μs | 16.5576 KOps/s | 16.7444 KOps/s | |
test_empty[True] | 0.2947ms | 0.2443ms | 4.0929 KOps/s | 4.0742 KOps/s | |
test_empty[False] | 2.9130μs | 0.7409μs | 1.3497 MOps/s | 1.3164 MOps/s | |
test_to | 55.1410μs | 25.5370μs | 39.1589 KOps/s | 39.3872 KOps/s | |
test_to_nonblocking | 53.5010μs | 23.6045μs | 42.3647 KOps/s | 40.7979 KOps/s | |
test_unbind_speed | 0.3418ms | 0.2806ms | 3.5641 KOps/s | 3.5801 KOps/s | |
test_unbind_speed_stack0 | 0.3401ms | 0.2690ms | 3.7171 KOps/s | 3.7067 KOps/s | |
test_unbind_speed_stack1 | 0.6790ms | 0.6381ms | 1.5671 KOps/s | 1.4394 KOps/s | |
test_split | 94.0752ms | 2.1532ms | 464.4348 Ops/s | 464.4564 Ops/s | |
test_chunk | 93.4650ms | 2.1602ms | 462.9161 Ops/s | 459.2503 Ops/s | |
test_creation[device0] | 0.3413ms | 0.1251ms | 7.9952 KOps/s | 7.9681 KOps/s | |
test_creation_from_tensor | 0.3378ms | 0.1274ms | 7.8464 KOps/s | 7.8061 KOps/s | |
test_add_one[memmap_tensor0] | 0.2403ms | 8.7728μs | 113.9892 KOps/s | 117.4325 KOps/s | |
test_contiguous[memmap_tensor0] | 30.2710μs | 2.2207μs | 450.3147 KOps/s | 454.5871 KOps/s | |
test_stack[memmap_tensor0] | 39.4910μs | 6.6089μs | 151.3100 KOps/s | 150.6485 KOps/s | |
test_memmaptd_index | 1.1517ms | 0.4204ms | 2.3788 KOps/s | 2.3813 KOps/s | |
test_memmaptd_index_astensor | 0.9534ms | 0.4778ms | 2.0929 KOps/s | 2.0832 KOps/s | |
test_memmaptd_index_op | 1.4215ms | 1.0489ms | 953.3441 Ops/s | 976.2743 Ops/s | |
test_serialize_model | 0.1298s | 0.1292s | 7.7403 Ops/s | 7.7165 Ops/s | |
test_serialize_model_pickle | 1.3606s | 1.2189s | 0.8204 Ops/s | 0.8247 Ops/s | |
test_serialize_weights | 0.2213s | 0.1419s | 7.0465 Ops/s | 7.6758 Ops/s | |
test_serialize_weights_returnearly | 0.2162s | 55.6281ms | 17.9765 Ops/s | 16.2658 Ops/s | |
test_serialize_weights_pickle | 1.3757s | 1.2166s | 0.8220 Ops/s | 0.8139 Ops/s | |
test_reshape_pytree | 71.2610μs | 35.7951μs | 27.9368 KOps/s | 28.2200 KOps/s | |
test_reshape_td | 0.4226ms | 42.2475μs | 23.6701 KOps/s | 23.8460 KOps/s | |
test_view_pytree | 62.2510μs | 35.4919μs | 28.1755 KOps/s | 28.0069 KOps/s | |
test_view_td | 0.4388ms | 47.5306μs | 21.0391 KOps/s | 21.0635 KOps/s | |
test_unbind_pytree | 75.2110μs | 34.8218μs | 28.7176 KOps/s | 29.0371 KOps/s | |
test_unbind_td | 0.4249ms | 43.7699μs | 22.8467 KOps/s | 23.6352 KOps/s | |
test_split_pytree | 90.6010μs | 46.3859μs | 21.5583 KOps/s | 21.4822 KOps/s | |
test_split_td | 0.6660ms | 56.0938μs | 17.8273 KOps/s | 18.0502 KOps/s | |
test_add_pytree | 95.3910μs | 57.9688μs | 17.2507 KOps/s | 17.4350 KOps/s | |
test_add_td | 0.4929ms | 93.8494μs | 10.6554 KOps/s | 10.8605 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.4131ms | 0.2105ms | 4.7503 KOps/s | 4.7879 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.5491ms | 0.1505ms | 6.6465 KOps/s | 6.6148 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1858ms | 0.1455ms | 6.8744 KOps/s | 6.8691 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.5856ms | 0.1869ms | 5.3491 KOps/s | 5.0325 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 49.4210μs | 21.4795μs | 46.5560 KOps/s | 46.5285 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.4347ms | 43.0468μs | 23.2306 KOps/s | 22.6780 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.2624ms | 64.9106μs | 15.4058 KOps/s | 15.8480 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.4329ms | 49.7606μs | 20.0962 KOps/s | 20.4889 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.7154ms | 0.3187ms | 3.1376 KOps/s | 3.1542 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.6047ms | 0.2088ms | 4.7882 KOps/s | 4.6235 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1649ms | 0.1284ms | 7.7898 KOps/s | 7.3918 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.4643ms | 59.8903μs | 16.6972 KOps/s | 15.7980 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.4905ms | 0.3188ms | 3.1369 KOps/s | 3.1498 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.7238ms | 0.6291ms | 1.5895 KOps/s | 1.5918 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3576ms | 0.2480ms | 4.0330 KOps/s | 4.0085 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.3651ms | 0.3164ms | 3.1608 KOps/s | 3.0211 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1217ms | 69.8409μs | 14.3183 KOps/s | 13.8600 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1916ms | 0.1282ms | 7.7998 KOps/s | 7.7777 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.6264ms | 0.5411ms | 1.8482 KOps/s | 1.8550 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.3715ms | 0.3156ms | 3.1690 KOps/s | 3.1587 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 89.4310μs | 18.0793μs | 55.3118 KOps/s | 54.4360 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1000ms | 26.7165μs | 37.4301 KOps/s | 36.7716 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1322ms | 70.0555μs | 14.2744 KOps/s | 14.2311 KOps/s | |
test_compile_copy_flat[pytree-eager] | 84.8920μs | 51.2267μs | 19.5211 KOps/s | 19.5633 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 2.2972ms | 0.8119ms | 1.2316 KOps/s | 1.1235 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 3.5458ms | 3.2310ms | 309.5042 Ops/s | 319.4211 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 2.2627ms | 0.8079ms | 1.2378 KOps/s | 1.1250 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 3.2521ms | 3.1809ms | 314.3755 Ops/s | 311.3846 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.1669ms | 0.1099ms | 9.1024 KOps/s | 9.1847 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.1864ms | 60.6886μs | 16.4776 KOps/s | 16.3259 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.1381ms | 0.1038ms | 9.6375 KOps/s | 9.6208 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.1540ms | 42.8503μs | 23.3370 KOps/s | 21.8671 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1626ms | 0.1049ms | 9.5312 KOps/s | 9.2340 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 93.4420μs | 42.4760μs | 23.5427 KOps/s | 22.8597 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1784ms | 0.1374ms | 7.2784 KOps/s | 7.2634 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1666ms | 24.8982μs | 40.1636 KOps/s | 39.9213 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1792ms | 0.1302ms | 7.6829 KOps/s | 7.6432 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 57.9310μs | 20.6954μs | 48.3199 KOps/s | 47.8878 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1815ms | 0.1313ms | 7.6177 KOps/s | 7.4496 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 84.4710μs | 20.6005μs | 48.5426 KOps/s | 47.0439 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1753ms | 0.1383ms | 7.2296 KOps/s | 7.2190 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.5322ms | 24.6684μs | 40.5377 KOps/s | 39.6789 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.2829ms | 0.1313ms | 7.6178 KOps/s | 7.5412 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 56.2700μs | 20.6794μs | 48.3572 KOps/s | 48.5992 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1957ms | 0.1311ms | 7.6266 KOps/s | 7.3933 KOps/s | |
test_compile_indexing[int-pytree-eager] | 57.1010μs | 20.3913μs | 49.0405 KOps/s | 48.1782 KOps/s | |
test_mod_add[eager] | 79.2720μs | 32.1709μs | 31.0840 KOps/s | 31.4233 KOps/s | |
test_mod_add[compile] | 0.2232ms | 71.1515μs | 14.0545 KOps/s | 13.6349 KOps/s | |
test_mod_add[compile-overhead] | 0.2671ms | 0.1362ms | 7.3406 KOps/s | 7.0298 KOps/s | |
test_mod_wrap[eager] | 0.3231ms | 0.2370ms | 4.2194 KOps/s | 3.9099 KOps/s | |
test_mod_wrap[compile] | 0.6992ms | 0.2990ms | 3.3440 KOps/s | 3.3648 KOps/s | |
test_mod_wrap[compile-overhead] | 7.5751ms | 3.9957ms | 250.2710 Ops/s | 253.7883 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.7440ms | 1.3481ms | 741.7587 Ops/s | 687.2420 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.7169ms | 1.3176ms | 758.9669 Ops/s | 695.9793 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.3112ms | 0.8915ms | 1.1217 KOps/s | 947.5567 Ops/s | |
test_seq_add[eager] | 0.1506ms | 97.8678μs | 10.2179 KOps/s | 10.0308 KOps/s | |
test_seq_add[compile] | 0.1388ms | 81.4139μs | 12.2829 KOps/s | 12.1687 KOps/s | |
test_seq_add[compile-overhead] | 0.1633ms | 0.1134ms | 8.8203 KOps/s | 8.6802 KOps/s | |
test_seq_wrap[eager] | 0.4745ms | 0.3751ms | 2.6662 KOps/s | 2.5769 KOps/s | |
test_seq_wrap[compile] | 0.3993ms | 0.3090ms | 3.2362 KOps/s | 3.1599 KOps/s | |
test_seq_wrap[compile-overhead] | 0.2862ms | 0.2187ms | 4.5724 KOps/s | 4.5844 KOps/s | |
test_func_call_runtime[False-eager] | 0.8098ms | 0.7151ms | 1.3983 KOps/s | 1.3426 KOps/s | |
test_func_call_runtime[False-compile] | 0.9267ms | 0.7719ms | 1.2955 KOps/s | 1.2705 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.4022ms | 0.3549ms | 2.8179 KOps/s | 2.7991 KOps/s | |
test_func_call_runtime[True-eager] | 1.0246ms | 0.8821ms | 1.1336 KOps/s | 1.1020 KOps/s | |
test_func_call_runtime[True-compile] | 1.0332ms | 0.7953ms | 1.2574 KOps/s | 1.2380 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.5146ms | 0.3766ms | 2.6551 KOps/s | 2.6598 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.8581ms | 0.7175ms | 1.3936 KOps/s | 1.3472 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.9773ms | 0.7732ms | 1.2934 KOps/s | 1.2647 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.4929ms | 0.3569ms | 2.8016 KOps/s | 2.7923 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.1196ms | 0.9679ms | 1.0332 KOps/s | 992.2607 Ops/s | |
test_func_call_cm_runtime[True-compile] | 0.9570ms | 0.8239ms | 1.2137 KOps/s | 1.1907 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.4763ms | 0.4024ms | 2.4853 KOps/s | 2.4770 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.4851ms | 2.0521ms | 487.3107 Ops/s | 477.7378 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 1.2466ms | 0.8392ms | 1.1916 KOps/s | 1.1149 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.4934ms | 0.4068ms | 2.4585 KOps/s | 2.4390 KOps/s | |
test_distributed | 4.2306ms | 0.1971ms | 5.0746 KOps/s | 8.4422 KOps/s | |
test_tdmodule | 55.1310μs | 15.4623μs | 64.6734 KOps/s | 61.6996 KOps/s | |
test_tdmodule_dispatch | 50.0610μs | 30.4262μs | 32.8664 KOps/s | 34.1244 KOps/s | |
test_tdseq | 37.9310μs | 15.9731μs | 62.6051 KOps/s | 62.1004 KOps/s | |
test_tdseq_dispatch | 56.8510μs | 32.6544μs | 30.6237 KOps/s | 30.9775 KOps/s | |
test_instantiation_functorch | 1.9589ms | 1.8483ms | 541.0362 Ops/s | 534.0351 Ops/s | |
test_instantiation_td | 1.8533ms | 1.2240ms | 816.9686 Ops/s | 834.2453 Ops/s | |
test_exec_functorch | 0.3182ms | 0.2187ms | 4.5725 KOps/s | 4.7479 KOps/s | |
test_exec_functional_call | 0.2782ms | 0.2213ms | 4.5193 KOps/s | 4.7969 KOps/s | |
test_exec_td | 0.2833ms | 0.2250ms | 4.4436 KOps/s | 4.6224 KOps/s | |
test_exec_td_decorator | 0.5717ms | 0.2674ms | 3.7395 KOps/s | 3.8682 KOps/s | |
test_vmap_mlp_speed[True-True] | 0.8220ms | 0.6957ms | 1.4374 KOps/s | 1.4595 KOps/s | |
test_vmap_mlp_speed[True-False] | 0.7656ms | 0.6825ms | 1.4651 KOps/s | 1.4558 KOps/s | |
test_vmap_mlp_speed[False-True] | 0.6333ms | 0.5700ms | 1.7543 KOps/s | 1.7303 KOps/s | |
test_vmap_mlp_speed[False-False] | 0.6179ms | 0.5716ms | 1.7493 KOps/s | 1.7335 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 1.2009ms | 0.6691ms | 1.4946 KOps/s | 1.4809 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.7925ms | 0.6707ms | 1.4910 KOps/s | 1.4746 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.6917ms | 0.5853ms | 1.7084 KOps/s | 1.6868 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7121ms | 0.5908ms | 1.6925 KOps/s | 1.6893 KOps/s | |
test_vmap_transformer_speed[True-True] | 9.1597ms | 8.3333ms | 120.0008 Ops/s | 119.0690 Ops/s | |
test_vmap_transformer_speed[True-False] | 8.3100ms | 8.2573ms | 121.1046 Ops/s | 119.3712 Ops/s | |
test_vmap_transformer_speed[False-True] | 8.1274ms | 8.0612ms | 124.0511 Ops/s | 122.3798 Ops/s | |
test_vmap_transformer_speed[False-False] | 8.1691ms | 8.0779ms | 123.7947 Ops/s | 122.0096 Ops/s | |
test_vmap_transformer_speed_decorator[True-True] | 20.0525ms | 19.3018ms | 51.8087 Ops/s | 51.3629 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 19.4321ms | 19.3628ms | 51.6453 Ops/s | 51.0118 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 19.2859ms | 19.2113ms | 52.0527 Ops/s | 51.7465 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 19.3091ms | 19.2040ms | 52.0724 Ops/s | 51.6986 Ops/s | |
test_to_module_speed[True] | 2.0240ms | 0.9417ms | 1.0620 KOps/s | 1.0371 KOps/s | |
test_to_module_speed[False] | 1.0160ms | 0.9171ms | 1.0903 KOps/s | 1.0695 KOps/s | |
test_tc_init | 73.1410μs | 36.2384μs | 27.5950 KOps/s | 28.1108 KOps/s | |
test_tc_init_nested | 0.1055ms | 73.1124μs | 13.6776 KOps/s | 13.6855 KOps/s | |
test_tc_first_layer_tensor | 15.5131μs | 0.6906μs | 1.4479 MOps/s | 1.4508 MOps/s | |
test_tc_first_layer_nontensor | 33.1700μs | 2.2495μs | 444.5510 KOps/s | 447.5195 KOps/s | |
test_tc_second_layer_tensor | 7.2525μs | 1.3792μs | 725.0404 KOps/s | 722.4722 KOps/s | |
test_tc_second_layer_nontensor | 28.9710μs | 2.9239μs | 342.0086 KOps/s | 340.4652 KOps/s | |
test_unbind | 0.1952s | 10.9018ms | 91.7278 Ops/s | 101.4254 Ops/s | |
test_full_like | 0.6491ms | 0.5739ms | 1.7424 KOps/s | 1.7374 KOps/s | |
test_zeros_like | 0.2635ms | 0.1979ms | 5.0543 KOps/s | 5.0523 KOps/s | |
test_ones_like | 0.2319ms | 0.1977ms | 5.0591 KOps/s | 5.0570 KOps/s | |
test_clone | 0.4577ms | 0.4144ms | 2.4131 KOps/s | 2.4140 KOps/s | |
test_squeeze | 36.1910μs | 9.6456μs | 103.6747 KOps/s | 102.9745 KOps/s | |
test_unsqueeze | 0.2167ms | 75.1298μs | 13.3103 KOps/s | 13.3092 KOps/s | |
test_split | 0.4489ms | 0.1599ms | 6.2557 KOps/s | 6.3777 KOps/s | |
test_permute | 0.2489ms | 0.1804ms | 5.5438 KOps/s | 5.5986 KOps/s | |
test_stack | 1.2680ms | 0.8554ms | 1.1690 KOps/s | 1.1415 KOps/s | |
test_cat | 1.2654ms | 1.2315ms | 812.0003 Ops/s | 811.8053 Ops/s |
vmoens
added a commit
that referenced
this pull request
Sep 17, 2024
ghstack-source-id: bdf9ab4f4c61ef5a407d76ebd5433d0d6a897b59 Pull Request resolved: #999
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
bug
Something isn't working
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Stack from ghstack (oldest at bottom):