-
Notifications
You must be signed in to change notification settings - Fork 79
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Feature] str2td #953
Merged
Merged
[Feature] str2td #953
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
vmoens
added a commit
that referenced
this pull request
Aug 9, 2024
ghstack-source-id: af2afacb11521683b977659bc14212853af17a56 Pull Request resolved: #953
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 54.2510μs | 19.7582μs | 50.6118 KOps/s | 50.3875 KOps/s | |
test_plain_set_stack_nested | 47.2690μs | 20.1838μs | 49.5448 KOps/s | 49.4162 KOps/s | |
test_plain_set_nested_inplace | 55.2040μs | 21.4807μs | 46.5535 KOps/s | 46.3358 KOps/s | |
test_plain_set_stack_nested_inplace | 67.4870μs | 21.4250μs | 46.6745 KOps/s | 43.6517 KOps/s | |
test_items | 22.6320μs | 4.1907μs | 238.6215 KOps/s | 242.8176 KOps/s | |
test_items_nested | 0.7146ms | 0.3375ms | 2.9631 KOps/s | 3.0730 KOps/s | |
test_items_nested_locked | 0.5257ms | 0.3375ms | 2.9630 KOps/s | 3.0792 KOps/s | |
test_items_nested_leaf | 0.2016ms | 85.8746μs | 11.6449 KOps/s | 11.7256 KOps/s | |
test_items_stack_nested | 0.5181ms | 0.3378ms | 2.9607 KOps/s | 3.0509 KOps/s | |
test_items_stack_nested_leaf | 0.1608ms | 83.8456μs | 11.9267 KOps/s | 11.8423 KOps/s | |
test_items_stack_nested_locked | 0.4327ms | 0.3368ms | 2.9689 KOps/s | 3.0575 KOps/s | |
test_keys | 36.3480μs | 3.5281μs | 283.4426 KOps/s | 281.9610 KOps/s | |
test_keys_nested | 0.1830ms | 96.8303μs | 10.3273 KOps/s | 10.2215 KOps/s | |
test_keys_nested_locked | 1.8026ms | 0.1033ms | 9.6839 KOps/s | 9.8528 KOps/s | |
test_keys_nested_leaf | 0.1835ms | 81.9226μs | 12.2066 KOps/s | 11.9699 KOps/s | |
test_keys_stack_nested | 0.1646ms | 95.2988μs | 10.4933 KOps/s | 10.4800 KOps/s | |
test_keys_stack_nested_leaf | 0.1763ms | 79.8440μs | 12.5244 KOps/s | 12.5806 KOps/s | |
test_keys_stack_nested_locked | 0.1950ms | 99.8303μs | 10.0170 KOps/s | 10.0667 KOps/s | |
test_values | 10.3118μs | 1.0635μs | 940.3354 KOps/s | 927.7392 KOps/s | |
test_values_nested | 90.3690μs | 48.0000μs | 20.8333 KOps/s | 20.2710 KOps/s | |
test_values_nested_locked | 84.0570μs | 47.9436μs | 20.8578 KOps/s | 20.7636 KOps/s | |
test_values_nested_leaf | 0.1223ms | 42.6563μs | 23.4432 KOps/s | 23.7696 KOps/s | |
test_values_stack_nested | 87.8950μs | 48.1655μs | 20.7618 KOps/s | 20.7810 KOps/s | |
test_values_stack_nested_leaf | 89.6070μs | 41.4550μs | 24.1226 KOps/s | 24.5493 KOps/s | |
test_values_stack_nested_locked | 92.8730μs | 48.3440μs | 20.6851 KOps/s | 20.8164 KOps/s | |
test_membership | 4.8934μs | 0.6946μs | 1.4397 MOps/s | 1.1675 MOps/s | |
test_membership_nested | 19.9070μs | 2.5975μs | 384.9801 KOps/s | 385.0532 KOps/s | |
test_membership_nested_leaf | 30.1760μs | 2.5992μs | 384.7379 KOps/s | 390.4880 KOps/s | |
test_membership_stacked_nested | 29.5960μs | 2.6109μs | 383.0093 KOps/s | 388.8942 KOps/s | |
test_membership_stacked_nested_leaf | 31.6990μs | 2.6699μs | 374.5459 KOps/s | 383.7570 KOps/s | |
test_membership_nested_last | 31.8000μs | 3.8150μs | 262.1244 KOps/s | 267.5132 KOps/s | |
test_membership_nested_leaf_last | 37.0890μs | 3.8052μs | 262.7995 KOps/s | 264.6012 KOps/s | |
test_membership_stacked_nested_last | 32.0100μs | 4.8205μs | 207.4469 KOps/s | 235.1893 KOps/s | |
test_membership_stacked_nested_leaf_last | 29.4250μs | 4.8454μs | 206.3823 KOps/s | 233.8531 KOps/s | |
test_nested_getleaf | 30.5970μs | 10.5913μs | 94.4173 KOps/s | 92.6239 KOps/s | |
test_nested_get | 46.3570μs | 9.9968μs | 100.0316 KOps/s | 98.2999 KOps/s | |
test_stacked_getleaf | 37.4300μs | 10.6161μs | 94.1964 KOps/s | 93.9600 KOps/s | |
test_stacked_get | 48.3900μs | 9.9846μs | 100.1538 KOps/s | 98.5857 KOps/s | |
test_nested_getitemleaf | 39.2530μs | 10.9713μs | 91.1465 KOps/s | 90.7708 KOps/s | |
test_nested_getitem | 49.5120μs | 10.2791μs | 97.2849 KOps/s | 96.6023 KOps/s | |
test_stacked_getitemleaf | 38.7520μs | 10.8718μs | 91.9810 KOps/s | 91.4686 KOps/s | |
test_stacked_getitem | 39.8140μs | 10.2275μs | 97.7753 KOps/s | 98.3970 KOps/s | |
test_lock_nested | 89.0178ms | 0.5719ms | 1.7486 KOps/s | 2.0881 KOps/s | |
test_lock_stack_nested | 0.7954ms | 0.4486ms | 2.2290 KOps/s | 2.2463 KOps/s | |
test_unlock_nested | 91.0712ms | 0.4974ms | 2.0104 KOps/s | 2.4880 KOps/s | |
test_unlock_stack_nested | 0.4663ms | 0.3627ms | 2.7568 KOps/s | 2.7346 KOps/s | |
test_flatten_speed | 0.2121ms | 0.1042ms | 9.5994 KOps/s | 9.4464 KOps/s | |
test_unflatten_speed | 0.5479ms | 0.4585ms | 2.1809 KOps/s | 2.2039 KOps/s | |
test_common_ops | 4.7204ms | 1.0877ms | 919.3928 Ops/s | 893.1228 Ops/s | |
test_creation | 84.0770μs | 2.1312μs | 469.2158 KOps/s | 470.4667 KOps/s | |
test_creation_empty | 45.4340μs | 16.4765μs | 60.6925 KOps/s | 56.8370 KOps/s | |
test_creation_nested_1 | 49.2220μs | 19.4350μs | 51.4534 KOps/s | 48.5301 KOps/s | |
test_creation_nested_2 | 51.9470μs | 23.9940μs | 41.6771 KOps/s | 40.2020 KOps/s | |
test_clone | 0.1829ms | 17.0409μs | 58.6825 KOps/s | 56.4214 KOps/s | |
test_getitem[int] | 0.8981ms | 16.9210μs | 59.0981 KOps/s | 60.3314 KOps/s | |
test_getitem[slice_int] | 0.1478ms | 31.0716μs | 32.1838 KOps/s | 31.3587 KOps/s | |
test_getitem[range] | 0.2848ms | 58.6529μs | 17.0494 KOps/s | 16.9522 KOps/s | |
test_getitem[tuple] | 0.1627ms | 25.7219μs | 38.8773 KOps/s | 39.7771 KOps/s | |
test_getitem[list] | 0.2508ms | 52.7425μs | 18.9600 KOps/s | 18.2714 KOps/s | |
test_setitem_dim[int] | 55.4040μs | 37.7643μs | 26.4800 KOps/s | 25.1791 KOps/s | |
test_setitem_dim[slice_int] | 0.1033ms | 66.9104μs | 14.9454 KOps/s | 14.3978 KOps/s | |
test_setitem_dim[range] | 0.1812ms | 92.2885μs | 10.8356 KOps/s | 10.7824 KOps/s | |
test_setitem_dim[tuple] | 0.1015ms | 55.5250μs | 18.0099 KOps/s | 17.8822 KOps/s | |
test_setitem | 0.1132ms | 29.4427μs | 33.9642 KOps/s | 33.1172 KOps/s | |
test_set | 94.9670μs | 29.1112μs | 34.3510 KOps/s | 33.7323 KOps/s | |
test_set_shared | 1.1779ms | 0.2114ms | 4.7304 KOps/s | 4.6508 KOps/s | |
test_update | 0.1515ms | 35.1994μs | 28.4095 KOps/s | 27.8572 KOps/s | |
test_update_nested | 0.2050ms | 44.8585μs | 22.2923 KOps/s | 21.5270 KOps/s | |
test_update__nested | 0.1139ms | 34.9556μs | 28.6077 KOps/s | 28.0498 KOps/s | |
test_set_nested | 90.7090μs | 30.9450μs | 32.3154 KOps/s | 31.2196 KOps/s | |
test_set_nested_new | 0.1244ms | 36.1416μs | 27.6689 KOps/s | 26.7400 KOps/s | |
test_select | 0.1859ms | 55.1884μs | 18.1198 KOps/s | 18.3203 KOps/s | |
test_select_nested | 0.1513ms | 59.6532μs | 16.7636 KOps/s | 16.7950 KOps/s | |
test_exclude_nested | 0.1416ms | 75.1848μs | 13.3006 KOps/s | 13.3519 KOps/s | |
test_empty[True] | 0.5184ms | 0.3147ms | 3.1774 KOps/s | 3.1661 KOps/s | |
test_empty[False] | 7.3076μs | 1.1684μs | 855.8460 KOps/s | 806.6329 KOps/s | |
test_unbind_speed | 0.4799ms | 0.3076ms | 3.2505 KOps/s | 3.2537 KOps/s | |
test_unbind_speed_stack0 | 0.5134ms | 0.2937ms | 3.4043 KOps/s | 3.4512 KOps/s | |
test_unbind_speed_stack1 | 99.6902ms | 0.8107ms | 1.2335 KOps/s | 1.3829 KOps/s | |
test_split | 93.9126ms | 2.2113ms | 452.2271 Ops/s | 459.8287 Ops/s | |
test_chunk | 3.2903ms | 2.0247ms | 493.9019 Ops/s | 463.0873 Ops/s | |
test_creation[device0] | 4.2086ms | 0.1202ms | 8.3224 KOps/s | 8.5212 KOps/s | |
test_creation_from_tensor | 0.2442ms | 0.1185ms | 8.4412 KOps/s | 8.4520 KOps/s | |
test_add_one[memmap_tensor0] | 0.1649ms | 7.7185μs | 129.5586 KOps/s | 131.6916 KOps/s | |
test_contiguous[memmap_tensor0] | 18.7850μs | 1.9813μs | 504.7129 KOps/s | 523.2414 KOps/s | |
test_stack[memmap_tensor0] | 49.2720μs | 5.9927μs | 166.8694 KOps/s | 170.2663 KOps/s | |
test_memmaptd_index | 1.1580ms | 0.4011ms | 2.4929 KOps/s | 2.4904 KOps/s | |
test_memmaptd_index_astensor | 0.7868ms | 0.4765ms | 2.0985 KOps/s | 2.0610 KOps/s | |
test_memmaptd_index_op | 2.4675ms | 0.9897ms | 1.0104 KOps/s | 962.5372 Ops/s | |
test_serialize_model | 0.2224s | 0.1309s | 7.6393 Ops/s | 8.4288 Ops/s | |
test_serialize_model_pickle | 0.4743s | 0.3976s | 2.5151 Ops/s | 2.5174 Ops/s | |
test_serialize_weights | 0.1216s | 0.1172s | 8.5304 Ops/s | 7.7280 Ops/s | |
test_serialize_weights_returnearly | 0.2440s | 0.1709s | 5.8499 Ops/s | 6.2873 Ops/s | |
test_serialize_weights_pickle | 0.6552s | 0.4519s | 2.2129 Ops/s | 1.1831 Ops/s | |
test_serialize_weights_filesystem | 0.1507s | 0.1411s | 7.0850 Ops/s | 7.0371 Ops/s | |
test_serialize_model_filesystem | 0.1497s | 0.1436s | 6.9623 Ops/s | 6.4835 Ops/s | |
test_reshape_pytree | 86.3310μs | 38.6706μs | 25.8594 KOps/s | 25.7619 KOps/s | |
test_reshape_td | 0.1037ms | 45.8130μs | 21.8279 KOps/s | 21.0762 KOps/s | |
test_view_pytree | 0.1040ms | 39.4188μs | 25.3686 KOps/s | 25.7704 KOps/s | |
test_view_td | 0.1334ms | 52.0600μs | 19.2086 KOps/s | 18.8265 KOps/s | |
test_unbind_pytree | 0.1076ms | 36.8332μs | 27.1495 KOps/s | 27.4717 KOps/s | |
test_unbind_td | 0.3416ms | 44.9734μs | 22.2354 KOps/s | 21.6201 KOps/s | |
test_split_pytree | 96.1800μs | 38.5283μs | 25.9549 KOps/s | 25.8173 KOps/s | |
test_split_td | 0.5026ms | 58.5245μs | 17.0869 KOps/s | 17.2799 KOps/s | |
test_add_pytree | 0.1128ms | 45.1001μs | 22.1729 KOps/s | 21.6743 KOps/s | |
test_add_td | 0.1517ms | 78.1791μs | 12.7911 KOps/s | 12.2287 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1298ms | 58.6194μs | 17.0592 KOps/s | 17.3803 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.3372ms | 0.1870ms | 5.3473 KOps/s | 5.3208 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1355ms | 57.3402μs | 17.4398 KOps/s | 17.8565 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2772ms | 0.1434ms | 6.9730 KOps/s | 6.9255 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 73.9380μs | 20.9325μs | 47.7725 KOps/s | 48.2755 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1379ms | 66.7807μs | 14.9744 KOps/s | 14.9747 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1524ms | 75.4920μs | 13.2464 KOps/s | 13.2888 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1464ms | 68.0274μs | 14.6999 KOps/s | 14.8574 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.3631ms | 0.1768ms | 5.6576 KOps/s | 5.7662 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3686ms | 0.1874ms | 5.3370 KOps/s | 5.2173 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1127ms | 48.0682μs | 20.8038 KOps/s | 21.3354 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.5302ms | 68.1759μs | 14.6679 KOps/s | 14.3295 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.3807ms | 0.1785ms | 5.6035 KOps/s | 5.7385 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.4838ms | 0.2957ms | 3.3820 KOps/s | 3.3258 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.4679ms | 0.2077ms | 4.8153 KOps/s | 4.8936 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.3899ms | 0.1783ms | 5.6070 KOps/s | 5.7896 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1235ms | 62.7999μs | 15.9236 KOps/s | 15.9881 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1136ms | 48.5809μs | 20.5842 KOps/s | 20.4383 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.3468ms | 0.2396ms | 4.1733 KOps/s | 4.0896 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.3403ms | 0.1811ms | 5.5226 KOps/s | 5.6928 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 0.2402ms | 0.1053ms | 9.4973 KOps/s | 9.7851 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1335ms | 57.7456μs | 17.3173 KOps/s | 16.8463 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1632ms | 78.5821μs | 12.7255 KOps/s | 12.8730 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1206ms | 69.2044μs | 14.4499 KOps/s | 14.6087 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.4085ms | 0.1970ms | 5.0772 KOps/s | 5.1383 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.2179ms | 1.6323ms | 612.6331 Ops/s | 596.4690 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.2738ms | 0.1974ms | 5.0655 KOps/s | 5.1330 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 1.2610ms | 1.1166ms | 895.5898 Ops/s | 878.2217 Ops/s | |
test_compile_assign_and_add_stack[compile] | 0.7638ms | 0.4154ms | 2.4072 KOps/s | 2.3449 KOps/s | |
test_compile_assign_and_add_stack[eager] | 3.9383ms | 3.6216ms | 276.1188 Ops/s | 258.0824 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 77.1450μs | 34.9693μs | 28.5965 KOps/s | 28.4616 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.4004s | 75.1795μs | 13.3015 KOps/s | 20.2657 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.1026ms | 30.7955μs | 32.4723 KOps/s | 32.9977 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 83.2160μs | 28.3454μs | 35.2791 KOps/s | 34.3835 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 88.5760μs | 31.0732μs | 32.1821 KOps/s | 32.9461 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 79.8290μs | 28.1752μs | 35.4922 KOps/s | 34.0101 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1636ms | 76.3105μs | 13.1044 KOps/s | 13.5401 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.5738ms | 27.8241μs | 35.9400 KOps/s | 35.7627 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1685ms | 71.2371μs | 14.0376 KOps/s | 14.2861 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 70.6620μs | 23.2935μs | 42.9304 KOps/s | 43.2406 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1286ms | 69.9958μs | 14.2866 KOps/s | 14.8800 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 74.4690μs | 23.2109μs | 43.0832 KOps/s | 43.3988 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1341ms | 75.4622μs | 13.2517 KOps/s | 13.6054 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 1.0381ms | 28.0051μs | 35.7078 KOps/s | 35.8726 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1724ms | 70.8194μs | 14.1204 KOps/s | 14.5739 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 70.1910μs | 23.2447μs | 43.0205 KOps/s | 42.9175 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1532ms | 70.5461μs | 14.1751 KOps/s | 14.4403 KOps/s | |
test_compile_indexing[int-pytree-eager] | 68.2370μs | 22.9970μs | 43.4839 KOps/s | 43.7762 KOps/s | |
test_mod_add[eager] | 73.1570μs | 22.9583μs | 43.5572 KOps/s | 39.4356 KOps/s | |
test_mod_add[compile] | 0.1070ms | 40.1728μs | 24.8924 KOps/s | 23.8041 KOps/s | |
test_mod_add[compile-overhead] | 84.7780μs | 40.2789μs | 24.8269 KOps/s | 24.2594 KOps/s | |
test_mod_wrap[eager] | 0.4121ms | 0.2105ms | 4.7514 KOps/s | 4.6807 KOps/s | |
test_mod_wrap[compile] | 0.3666ms | 0.2352ms | 4.2515 KOps/s | 4.3026 KOps/s | |
test_mod_wrap[compile-overhead] | 0.3240ms | 0.2299ms | 4.3488 KOps/s | 4.3408 KOps/s | |
test_mod_wrap_and_backward[eager] | 11.5221ms | 10.5810ms | 94.5086 Ops/s | 93.4037 Ops/s | |
test_mod_wrap_and_backward[compile] | 12.1789ms | 10.7582ms | 92.9523 Ops/s | 88.5089 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 11.9226ms | 10.8010ms | 92.5838 Ops/s | 83.4666 Ops/s | |
test_seq_add[eager] | 0.1787ms | 84.7723μs | 11.7963 KOps/s | 11.1183 KOps/s | |
test_seq_add[compile] | 0.1528ms | 66.2800μs | 15.0875 KOps/s | 15.4125 KOps/s | |
test_seq_add[compile-overhead] | 0.1369ms | 63.7724μs | 15.6808 KOps/s | 15.9715 KOps/s | |
test_seq_wrap[eager] | 0.6012ms | 0.3763ms | 2.6573 KOps/s | 2.5957 KOps/s | |
test_seq_wrap[compile] | 0.5359ms | 0.2685ms | 3.7244 KOps/s | 3.6953 KOps/s | |
test_seq_wrap[compile-overhead] | 0.4851ms | 0.2671ms | 3.7434 KOps/s | 3.6885 KOps/s | |
test_func_call_runtime[False-eager] | 0.9439ms | 0.5310ms | 1.8833 KOps/s | 1.9074 KOps/s | |
test_func_call_runtime[False-compile] | 0.8496ms | 0.5055ms | 1.9784 KOps/s | 1.9594 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.9395ms | 0.5013ms | 1.9946 KOps/s | 1.9764 KOps/s | |
test_func_call_runtime[True-eager] | 1.2784ms | 0.7459ms | 1.3407 KOps/s | 1.3207 KOps/s | |
test_func_call_runtime[True-compile] | 0.6399ms | 0.5158ms | 1.9386 KOps/s | 1.9283 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.9218ms | 0.5154ms | 1.9401 KOps/s | 1.9262 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.7143ms | 0.5237ms | 1.9094 KOps/s | 1.9381 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.9198ms | 0.5054ms | 1.9787 KOps/s | 1.9775 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.6074ms | 0.5019ms | 1.9923 KOps/s | 1.9616 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.0511ms | 0.8684ms | 1.1516 KOps/s | 1.1306 KOps/s | |
test_func_call_cm_runtime[True-compile] | 0.8945ms | 0.7404ms | 1.3506 KOps/s | 1.3408 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.8974ms | 0.7308ms | 1.3683 KOps/s | 1.3203 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.6149ms | 1.8730ms | 533.9011 Ops/s | 525.6203 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 2.7310ms | 1.9188ms | 521.1678 Ops/s | 510.9846 Ops/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 2.8024ms | 1.9145ms | 522.3354 Ops/s | 510.1659 Ops/s | |
test_distributed | 0.2255ms | 0.1230ms | 8.1291 KOps/s | 7.8411 KOps/s | |
test_tdmodule | 29.3040μs | 16.0796μs | 62.1906 KOps/s | 55.7239 KOps/s | |
test_tdmodule_dispatch | 84.9190μs | 34.2183μs | 29.2242 KOps/s | 27.7822 KOps/s | |
test_tdseq | 54.3920μs | 19.4346μs | 51.4546 KOps/s | 48.7185 KOps/s | |
test_tdseq_dispatch | 71.5730μs | 40.3985μs | 24.7534 KOps/s | 24.4642 KOps/s | |
test_instantiation_functorch | 2.5748ms | 1.6051ms | 623.0157 Ops/s | 621.0659 Ops/s | |
test_instantiation_td | 1.8812ms | 1.1767ms | 849.8182 Ops/s | 841.6440 Ops/s | |
test_exec_functorch | 0.3400ms | 0.1825ms | 5.4787 KOps/s | 5.3926 KOps/s | |
test_exec_functional_call | 0.3223ms | 0.1759ms | 5.6841 KOps/s | 5.6208 KOps/s | |
test_exec_td | 0.3081ms | 0.1676ms | 5.9652 KOps/s | 5.7977 KOps/s | |
test_exec_td_decorator | 0.8653ms | 0.2239ms | 4.4670 KOps/s | 4.4263 KOps/s | |
test_vmap_mlp_speed[True-True] | 0.9647ms | 0.6409ms | 1.5602 KOps/s | 1.5229 KOps/s | |
test_vmap_mlp_speed[True-False] | 0.9870ms | 0.6367ms | 1.5705 KOps/s | 1.5283 KOps/s | |
test_vmap_mlp_speed[False-True] | 0.6179ms | 0.4944ms | 2.0228 KOps/s | 1.9564 KOps/s | |
test_vmap_mlp_speed[False-False] | 0.9615ms | 0.5092ms | 1.9640 KOps/s | 1.9550 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 1.3066ms | 0.6155ms | 1.6247 KOps/s | 1.5661 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.8801ms | 0.6178ms | 1.6186 KOps/s | 1.5573 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.9245ms | 0.5132ms | 1.9486 KOps/s | 1.9031 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.6929ms | 0.5093ms | 1.9634 KOps/s | 1.9056 KOps/s | |
test_to_module_speed[True] | 1.7065ms | 1.2786ms | 782.0872 Ops/s | 771.8578 Ops/s | |
test_to_module_speed[False] | 1.7648ms | 1.2479ms | 801.3408 Ops/s | 791.1296 Ops/s | |
test_tc_init | 87.1920μs | 43.0597μs | 23.2236 KOps/s | 23.1329 KOps/s | |
test_tc_init_nested | 0.1538ms | 85.9261μs | 11.6379 KOps/s | 11.4905 KOps/s | |
test_tc_first_layer_tensor | 18.8750μs | 1.5948μs | 627.0506 KOps/s | 675.6225 KOps/s | |
test_tc_first_layer_nontensor | 39.7840μs | 4.7654μs | 209.8446 KOps/s | 217.0363 KOps/s | |
test_tc_second_layer_tensor | 27.7620μs | 2.8945μs | 345.4868 KOps/s | 360.2096 KOps/s | |
test_tc_second_layer_nontensor | 32.1200μs | 6.0721μs | 164.6865 KOps/s | 163.6429 KOps/s | |
test_unbind | 0.4572s | 12.7866ms | 78.2067 Ops/s | 64.4978 Ops/s | |
test_full_like | 8.1585ms | 6.9807ms | 143.2522 Ops/s | 139.5363 Ops/s | |
test_zeros_like | 3.2738ms | 2.6615ms | 375.7263 Ops/s | 154.1934 Ops/s | |
test_ones_like | 14.3302ms | 6.6020ms | 151.4681 Ops/s | 123.4016 Ops/s | |
test_clone | 13.0379ms | 8.2263ms | 121.5615 Ops/s | 104.5588 Ops/s | |
test_squeeze | 71.7140μs | 12.3976μs | 80.6606 KOps/s | 81.0435 KOps/s | |
test_unsqueeze | 0.1672ms | 90.3444μs | 11.0687 KOps/s | 10.8349 KOps/s | |
test_split | 0.4739ms | 0.1953ms | 5.1211 KOps/s | 5.1059 KOps/s | |
test_permute | 0.3594ms | 0.2189ms | 4.5693 KOps/s | 4.5661 KOps/s | |
test_stack | 33.2928ms | 25.5657ms | 39.1149 Ops/s | 41.5363 Ops/s | |
test_cat | 33.4592ms | 25.3897ms | 39.3861 Ops/s | 41.9868 Ops/s |
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 0.4743ms | 13.7088μs | 72.9459 KOps/s | 68.0308 KOps/s | |
test_plain_set_stack_nested | 39.6910μs | 14.0089μs | 71.3831 KOps/s | 67.3871 KOps/s | |
test_plain_set_nested_inplace | 45.1610μs | 14.7500μs | 67.7967 KOps/s | 62.9640 KOps/s | |
test_plain_set_stack_nested_inplace | 47.2200μs | 14.9624μs | 66.8344 KOps/s | 63.7270 KOps/s | |
test_items | 33.8010μs | 2.8452μs | 351.4683 KOps/s | 349.6420 KOps/s | |
test_items_nested | 0.3637ms | 0.3127ms | 3.1976 KOps/s | 3.2116 KOps/s | |
test_items_nested_locked | 0.3737ms | 0.3127ms | 3.1981 KOps/s | 3.1570 KOps/s | |
test_items_nested_leaf | 97.3210μs | 62.9724μs | 15.8800 KOps/s | 15.9115 KOps/s | |
test_items_stack_nested | 0.3733ms | 0.3145ms | 3.1792 KOps/s | 3.2054 KOps/s | |
test_items_stack_nested_leaf | 96.2210μs | 64.5668μs | 15.4878 KOps/s | 15.9401 KOps/s | |
test_items_stack_nested_locked | 0.3701ms | 0.3139ms | 3.1853 KOps/s | 3.1956 KOps/s | |
test_keys | 32.0210μs | 3.3990μs | 294.2083 KOps/s | 294.0127 KOps/s | |
test_keys_nested | 83.8910μs | 55.5977μs | 17.9864 KOps/s | 18.1646 KOps/s | |
test_keys_nested_locked | 0.8248ms | 60.9095μs | 16.4178 KOps/s | 16.7641 KOps/s | |
test_keys_nested_leaf | 74.2710μs | 47.2139μs | 21.1802 KOps/s | 21.4172 KOps/s | |
test_keys_stack_nested | 86.8610μs | 55.7566μs | 17.9351 KOps/s | 18.0189 KOps/s | |
test_keys_stack_nested_leaf | 77.1710μs | 47.4506μs | 21.0746 KOps/s | 21.4875 KOps/s | |
test_keys_stack_nested_locked | 90.1110μs | 60.2971μs | 16.5845 KOps/s | 16.6465 KOps/s | |
test_values | 9.4452μs | 0.8164μs | 1.2249 MOps/s | 1.2435 MOps/s | |
test_values_nested | 52.2910μs | 27.3312μs | 36.5882 KOps/s | 36.1003 KOps/s | |
test_values_nested_locked | 56.6910μs | 29.5147μs | 33.8814 KOps/s | 34.1316 KOps/s | |
test_values_nested_leaf | 51.0810μs | 24.0352μs | 41.6056 KOps/s | 41.0027 KOps/s | |
test_values_stack_nested | 52.8710μs | 27.9831μs | 35.7359 KOps/s | 36.0172 KOps/s | |
test_values_stack_nested_leaf | 48.9800μs | 24.5665μs | 40.7058 KOps/s | 41.1467 KOps/s | |
test_values_stack_nested_locked | 62.8510μs | 30.1648μs | 33.1513 KOps/s | 34.0484 KOps/s | |
test_membership | 2.1315μs | 0.4700μs | 2.1275 MOps/s | 2.1137 MOps/s | |
test_membership_nested | 12.1350μs | 1.7935μs | 557.5776 KOps/s | 563.9411 KOps/s | |
test_membership_nested_leaf | 11.7367μs | 1.7417μs | 574.1385 KOps/s | 564.6197 KOps/s | |
test_membership_stacked_nested | 24.4900μs | 1.8093μs | 552.6853 KOps/s | 548.6632 KOps/s | |
test_membership_stacked_nested_leaf | 29.8000μs | 1.7990μs | 555.8709 KOps/s | 538.8158 KOps/s | |
test_membership_nested_last | 71.4110μs | 2.6451μs | 378.0530 KOps/s | 372.1097 KOps/s | |
test_membership_nested_leaf_last | 26.1700μs | 2.6359μs | 379.3795 KOps/s | 373.6340 KOps/s | |
test_membership_stacked_nested_last | 25.4400μs | 3.0219μs | 330.9174 KOps/s | 380.0421 KOps/s | |
test_membership_stacked_nested_leaf_last | 36.5800μs | 3.0099μs | 332.2404 KOps/s | 375.3886 KOps/s | |
test_nested_getleaf | 25.3300μs | 6.1552μs | 162.4633 KOps/s | 164.6943 KOps/s | |
test_nested_get | 34.3410μs | 5.8109μs | 172.0891 KOps/s | 173.9414 KOps/s | |
test_stacked_getleaf | 35.7400μs | 6.1585μs | 162.3775 KOps/s | 166.6915 KOps/s | |
test_stacked_get | 29.5300μs | 5.7120μs | 175.0697 KOps/s | 174.5312 KOps/s | |
test_nested_getitemleaf | 26.7600μs | 6.2050μs | 161.1614 KOps/s | 163.5467 KOps/s | |
test_nested_getitem | 30.1100μs | 5.8161μs | 171.9362 KOps/s | 173.8755 KOps/s | |
test_stacked_getitemleaf | 41.3100μs | 6.1506μs | 162.5859 KOps/s | 164.5978 KOps/s | |
test_stacked_getitem | 32.0510μs | 5.7191μs | 174.8526 KOps/s | 175.5656 KOps/s | |
test_lock_nested | 1.3286ms | 0.4154ms | 2.4070 KOps/s | 2.3809 KOps/s | |
test_lock_stack_nested | 0.4387ms | 0.3816ms | 2.6207 KOps/s | 2.6232 KOps/s | |
test_unlock_nested | 0.7472ms | 0.3562ms | 2.8078 KOps/s | 2.7960 KOps/s | |
test_unlock_stack_nested | 0.3659ms | 0.3208ms | 3.1174 KOps/s | 3.1205 KOps/s | |
test_flatten_speed | 0.1663ms | 81.8554μs | 12.2167 KOps/s | 12.6031 KOps/s | |
test_unflatten_speed | 0.3326ms | 0.2865ms | 3.4904 KOps/s | 3.5064 KOps/s | |
test_common_ops | 92.0901ms | 1.3693ms | 730.3248 Ops/s | 761.0714 Ops/s | |
test_creation | 30.4200μs | 1.5137μs | 660.6469 KOps/s | 659.5173 KOps/s | |
test_creation_empty | 50.7500μs | 14.9576μs | 66.8556 KOps/s | 58.2012 KOps/s | |
test_creation_nested_1 | 46.5800μs | 16.6530μs | 60.0492 KOps/s | 53.0669 KOps/s | |
test_creation_nested_2 | 48.8210μs | 19.3630μs | 51.6448 KOps/s | 46.3838 KOps/s | |
test_clone | 65.5210μs | 28.8232μs | 34.6943 KOps/s | 34.8493 KOps/s | |
test_getitem[int] | 1.2619ms | 16.0810μs | 62.1853 KOps/s | 61.0078 KOps/s | |
test_getitem[slice_int] | 0.1200ms | 27.7694μs | 36.0109 KOps/s | 35.9655 KOps/s | |
test_getitem[range] | 0.2180ms | 0.1092ms | 9.1600 KOps/s | 9.1329 KOps/s | |
test_getitem[tuple] | 0.1203ms | 24.3246μs | 41.1106 KOps/s | 41.6137 KOps/s | |
test_getitem[list] | 0.1960ms | 99.0681μs | 10.0941 KOps/s | 10.1807 KOps/s | |
test_setitem_dim[int] | 75.0410μs | 50.8740μs | 19.6564 KOps/s | 19.0281 KOps/s | |
test_setitem_dim[slice_int] | 0.1084ms | 74.9354μs | 13.3448 KOps/s | 13.1548 KOps/s | |
test_setitem_dim[range] | 0.1799ms | 0.1344ms | 7.4428 KOps/s | 7.3053 KOps/s | |
test_setitem_dim[tuple] | 93.3210μs | 67.7284μs | 14.7648 KOps/s | 14.6068 KOps/s | |
test_setitem | 81.5210μs | 41.7133μs | 23.9732 KOps/s | 23.8213 KOps/s | |
test_set | 99.9520μs | 41.1619μs | 24.2943 KOps/s | 24.3181 KOps/s | |
test_set_shared | 0.3570ms | 50.7299μs | 19.7122 KOps/s | 19.7654 KOps/s | |
test_update | 84.9810μs | 49.2449μs | 20.3067 KOps/s | 18.7318 KOps/s | |
test_update_nested | 0.1144ms | 55.5403μs | 18.0050 KOps/s | 16.3352 KOps/s | |
test_update__nested | 0.3857ms | 60.3270μs | 16.5763 KOps/s | 15.1637 KOps/s | |
test_set_nested | 79.7710μs | 43.4074μs | 23.0375 KOps/s | 23.0470 KOps/s | |
test_set_nested_new | 0.1266ms | 47.1341μs | 21.2161 KOps/s | 20.9971 KOps/s | |
test_select | 97.9610μs | 60.5858μs | 16.5055 KOps/s | 15.7541 KOps/s | |
test_select_nested | 0.2244ms | 43.3117μs | 23.0885 KOps/s | 23.7251 KOps/s | |
test_exclude_nested | 95.9410μs | 60.1923μs | 16.6134 KOps/s | 16.7417 KOps/s | |
test_empty[True] | 0.2926ms | 0.2418ms | 4.1354 KOps/s | 4.1288 KOps/s | |
test_empty[False] | 2.9880μs | 0.7573μs | 1.3204 MOps/s | 1.3510 MOps/s | |
test_to | 59.6110μs | 25.0426μs | 39.9319 KOps/s | 34.9360 KOps/s | |
test_to_nonblocking | 64.9310μs | 24.3882μs | 41.0035 KOps/s | 41.4692 KOps/s | |
test_unbind_speed | 1.5204ms | 0.2770ms | 3.6101 KOps/s | 3.3132 KOps/s | |
test_unbind_speed_stack0 | 0.3213ms | 0.2751ms | 3.6355 KOps/s | 3.3822 KOps/s | |
test_unbind_speed_stack1 | 91.5579ms | 0.7161ms | 1.3965 KOps/s | 1.3795 KOps/s | |
test_split | 92.7777ms | 2.1880ms | 457.0444 Ops/s | 459.7868 Ops/s | |
test_chunk | 95.0311ms | 2.2020ms | 454.1397 Ops/s | 458.9797 Ops/s | |
test_creation[device0] | 0.3530ms | 0.1272ms | 7.8604 KOps/s | 7.8479 KOps/s | |
test_creation_from_tensor | 0.3473ms | 0.1325ms | 7.5498 KOps/s | 7.4258 KOps/s | |
test_add_one[memmap_tensor0] | 0.1283ms | 8.9551μs | 111.6688 KOps/s | 115.6200 KOps/s | |
test_contiguous[memmap_tensor0] | 24.9510μs | 2.2105μs | 452.3773 KOps/s | 460.4108 KOps/s | |
test_stack[memmap_tensor0] | 40.7300μs | 6.6893μs | 149.4923 KOps/s | 145.8618 KOps/s | |
test_memmaptd_index | 1.0460ms | 0.4226ms | 2.3661 KOps/s | 2.2632 KOps/s | |
test_memmaptd_index_astensor | 0.7376ms | 0.4784ms | 2.0905 KOps/s | 2.0114 KOps/s | |
test_memmaptd_index_op | 1.4075ms | 1.0093ms | 990.7431 Ops/s | 951.6112 Ops/s | |
test_serialize_model | 0.1303s | 0.1295s | 7.7231 Ops/s | 7.7068 Ops/s | |
test_serialize_model_pickle | 1.3490s | 1.2118s | 0.8252 Ops/s | 0.8249 Ops/s | |
test_serialize_weights | 0.1291s | 0.1280s | 7.8121 Ops/s | 7.7651 Ops/s | |
test_serialize_weights_returnearly | 0.2323s | 61.3899ms | 16.2893 Ops/s | 18.2389 Ops/s | |
test_serialize_weights_pickle | 1.3719s | 1.2164s | 0.8221 Ops/s | 0.8213 Ops/s | |
test_reshape_pytree | 0.4193ms | 35.8649μs | 27.8824 KOps/s | 27.8446 KOps/s | |
test_reshape_td | 84.7310μs | 41.5846μs | 24.0473 KOps/s | 24.6233 KOps/s | |
test_view_pytree | 0.4247ms | 35.2929μs | 28.3343 KOps/s | 28.3625 KOps/s | |
test_view_td | 88.2010μs | 47.3397μs | 21.1239 KOps/s | 21.5831 KOps/s | |
test_unbind_pytree | 0.4109ms | 33.7302μs | 29.6470 KOps/s | 29.4641 KOps/s | |
test_unbind_td | 0.3955ms | 42.2103μs | 23.6909 KOps/s | 23.9867 KOps/s | |
test_split_pytree | 0.4238ms | 46.0956μs | 21.6940 KOps/s | 21.4422 KOps/s | |
test_split_td | 0.1520ms | 56.7946μs | 17.6073 KOps/s | 15.5292 KOps/s | |
test_add_pytree | 0.4322ms | 56.0042μs | 17.8558 KOps/s | 18.1090 KOps/s | |
test_add_td | 0.1558ms | 90.5976μs | 11.0378 KOps/s | 10.8596 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.4080ms | 0.2123ms | 4.7099 KOps/s | 4.7741 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.5497ms | 0.1556ms | 6.4274 KOps/s | 6.3558 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.5425ms | 0.1437ms | 6.9609 KOps/s | 6.9129 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2595ms | 0.1804ms | 5.5421 KOps/s | 5.5738 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 0.4206ms | 21.7546μs | 45.9673 KOps/s | 47.1873 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1214ms | 42.7597μs | 23.3865 KOps/s | 22.5447 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.2659ms | 64.3071μs | 15.5504 KOps/s | 15.4964 KOps/s | |
test_compile_copy_nested[pytree-eager] | 92.0410μs | 49.1940μs | 20.3277 KOps/s | 20.1499 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.4564ms | 0.3190ms | 3.1347 KOps/s | 3.1306 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3079ms | 0.2079ms | 4.8101 KOps/s | 4.8899 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1827ms | 0.1265ms | 7.9065 KOps/s | 7.5651 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1285ms | 59.8053μs | 16.7209 KOps/s | 16.1217 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.4524ms | 0.3161ms | 3.1637 KOps/s | 3.1085 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.7164ms | 0.6072ms | 1.6470 KOps/s | 1.6555 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3583ms | 0.2442ms | 4.0946 KOps/s | 4.0787 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.3577ms | 0.3206ms | 3.1192 KOps/s | 3.0992 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1501ms | 70.9825μs | 14.0880 KOps/s | 14.2611 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.2059ms | 0.1285ms | 7.7817 KOps/s | 7.5357 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.6159ms | 0.5289ms | 1.8908 KOps/s | 1.9593 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.3688ms | 0.3175ms | 3.1495 KOps/s | 3.1292 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 69.3810μs | 18.9785μs | 52.6911 KOps/s | 53.1097 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 64.7210μs | 27.2402μs | 36.7105 KOps/s | 36.4830 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1078ms | 70.0636μs | 14.2727 KOps/s | 14.1920 KOps/s | |
test_compile_copy_flat[pytree-eager] | 83.6820μs | 51.4170μs | 19.4488 KOps/s | 19.4877 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 2.3257ms | 0.8100ms | 1.2346 KOps/s | 1.1223 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 3.2530ms | 3.1172ms | 320.7978 Ops/s | 315.4240 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 2.2741ms | 0.7977ms | 1.2537 KOps/s | 1.1394 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 3.3088ms | 3.1754ms | 314.9207 Ops/s | 314.4383 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.1553ms | 0.1131ms | 8.8424 KOps/s | 9.2014 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.1960ms | 63.7983μs | 15.6744 KOps/s | 16.1381 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.1453ms | 0.1025ms | 9.7521 KOps/s | 9.3182 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.1669ms | 44.4635μs | 22.4903 KOps/s | 21.5227 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1457ms | 0.1038ms | 9.6372 KOps/s | 9.0992 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 84.1220μs | 42.8070μs | 23.3606 KOps/s | 21.8371 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1985ms | 0.1389ms | 7.2010 KOps/s | 7.2642 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1677ms | 25.2651μs | 39.5803 KOps/s | 38.9236 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1801ms | 0.1339ms | 7.4702 KOps/s | 7.4520 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 65.9810μs | 21.0951μs | 47.4044 KOps/s | 48.6960 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.2249ms | 0.1333ms | 7.4996 KOps/s | 7.5551 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 53.3610μs | 20.8990μs | 47.8491 KOps/s | 48.9941 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1702ms | 0.1379ms | 7.2534 KOps/s | 7.2428 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.4804ms | 25.0352μs | 39.9438 KOps/s | 39.6744 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1760ms | 0.1306ms | 7.6543 KOps/s | 7.5341 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 0.1500ms | 24.4888μs | 40.8350 KOps/s | 48.2346 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1925ms | 0.1311ms | 7.6274 KOps/s | 7.5678 KOps/s | |
test_compile_indexing[int-pytree-eager] | 54.2010μs | 20.7069μs | 48.2930 KOps/s | 47.8557 KOps/s | |
test_mod_add[eager] | 71.6910μs | 31.2504μs | 31.9996 KOps/s | 29.0603 KOps/s | |
test_mod_add[compile] | 0.1106ms | 69.0467μs | 14.4829 KOps/s | 14.0093 KOps/s | |
test_mod_add[compile-overhead] | 0.2648ms | 0.1415ms | 7.0647 KOps/s | 7.0641 KOps/s | |
test_mod_wrap[eager] | 0.3329ms | 0.2395ms | 4.1755 KOps/s | 3.8779 KOps/s | |
test_mod_wrap[compile] | 0.4440ms | 0.2823ms | 3.5423 KOps/s | 3.3413 KOps/s | |
test_mod_wrap[compile-overhead] | 7.3511ms | 3.8992ms | 256.4627 Ops/s | 246.2545 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.4533ms | 1.3410ms | 745.7356 Ops/s | 697.8158 Ops/s | |
test_mod_wrap_and_backward[compile] | 2.7359ms | 1.3076ms | 764.7573 Ops/s | 699.5514 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.2757ms | 0.8850ms | 1.1299 KOps/s | 1.0141 KOps/s | |
test_seq_add[eager] | 0.1530ms | 95.3208μs | 10.4909 KOps/s | 9.7707 KOps/s | |
test_seq_add[compile] | 0.3126ms | 80.9966μs | 12.3462 KOps/s | 12.2702 KOps/s | |
test_seq_add[compile-overhead] | 0.1514ms | 0.1141ms | 8.7625 KOps/s | 8.7715 KOps/s | |
test_seq_wrap[eager] | 0.4708ms | 0.3729ms | 2.6819 KOps/s | 2.5493 KOps/s | |
test_seq_wrap[compile] | 0.3657ms | 0.3013ms | 3.3194 KOps/s | 3.2154 KOps/s | |
test_seq_wrap[compile-overhead] | 0.2897ms | 0.2087ms | 4.7906 KOps/s | 4.5678 KOps/s | |
test_func_call_runtime[False-eager] | 0.7998ms | 0.7324ms | 1.3654 KOps/s | 1.3429 KOps/s | |
test_func_call_runtime[False-compile] | 1.0383ms | 0.7784ms | 1.2848 KOps/s | 1.2668 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.4025ms | 0.3462ms | 2.8885 KOps/s | 2.8134 KOps/s | |
test_func_call_runtime[True-eager] | 1.0934ms | 0.8906ms | 1.1229 KOps/s | 1.0862 KOps/s | |
test_func_call_runtime[True-compile] | 0.8982ms | 0.8114ms | 1.2324 KOps/s | 1.1464 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.4374ms | 0.3809ms | 2.6252 KOps/s | 2.5873 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.8022ms | 0.7300ms | 1.3698 KOps/s | 1.2721 KOps/s | |
test_func_call_cm_runtime[False-compile] | 1.0469ms | 0.7775ms | 1.2862 KOps/s | 1.1832 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.4026ms | 0.3497ms | 2.8593 KOps/s | 2.8106 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.0697ms | 0.9853ms | 1.0150 KOps/s | 995.0649 Ops/s | |
test_func_call_cm_runtime[True-compile] | 0.8840ms | 0.8370ms | 1.1948 KOps/s | 1.1715 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.5006ms | 0.4055ms | 2.4663 KOps/s | 2.4266 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.5596ms | 2.0650ms | 484.2544 Ops/s | 478.7228 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.9254ms | 0.8587ms | 1.1645 KOps/s | 1.1465 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.4985ms | 0.4098ms | 2.4403 KOps/s | 2.3994 KOps/s | |
test_distributed | 1.9164ms | 0.1724ms | 5.7990 KOps/s | 8.6855 KOps/s | |
test_tdmodule | 42.8300μs | 14.2050μs | 70.3977 KOps/s | 62.9597 KOps/s | |
test_tdmodule_dispatch | 56.7410μs | 28.4838μs | 35.1077 KOps/s | 32.8340 KOps/s | |
test_tdseq | 22.1100μs | 14.8570μs | 67.3083 KOps/s | 62.2291 KOps/s | |
test_tdseq_dispatch | 50.8710μs | 30.7098μs | 32.5629 KOps/s | 30.2979 KOps/s | |
test_instantiation_functorch | 2.0181ms | 1.8397ms | 543.5619 Ops/s | 532.9209 Ops/s | |
test_instantiation_td | 1.7883ms | 1.1937ms | 837.7189 Ops/s | 828.2105 Ops/s | |
test_exec_functorch | 0.2912ms | 0.2209ms | 4.5260 KOps/s | 4.7320 KOps/s | |
test_exec_functional_call | 0.3208ms | 0.2064ms | 4.8438 KOps/s | 4.7233 KOps/s | |
test_exec_td | 0.2746ms | 0.2135ms | 4.6842 KOps/s | 4.6944 KOps/s | |
test_exec_td_decorator | 0.5882ms | 0.2534ms | 3.9469 KOps/s | 3.7262 KOps/s | |
test_vmap_mlp_speed[True-True] | 0.8492ms | 0.6882ms | 1.4531 KOps/s | 1.4473 KOps/s | |
test_vmap_mlp_speed[True-False] | 0.7311ms | 0.6812ms | 1.4679 KOps/s | 1.4593 KOps/s | |
test_vmap_mlp_speed[False-True] | 0.6179ms | 0.5714ms | 1.7500 KOps/s | 1.6600 KOps/s | |
test_vmap_mlp_speed[False-False] | 0.6150ms | 0.5726ms | 1.7463 KOps/s | 1.6999 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.7861ms | 0.6655ms | 1.5026 KOps/s | 1.4802 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.8538ms | 0.6665ms | 1.5003 KOps/s | 1.4825 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7006ms | 0.5880ms | 1.7008 KOps/s | 1.6985 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7143ms | 0.5873ms | 1.7028 KOps/s | 1.6981 KOps/s | |
test_vmap_transformer_speed[True-True] | 8.3888ms | 8.3053ms | 120.4055 Ops/s | 119.4687 Ops/s | |
test_vmap_transformer_speed[True-False] | 8.2966ms | 8.2468ms | 121.2587 Ops/s | 119.6583 Ops/s | |
test_vmap_transformer_speed[False-True] | 8.1725ms | 8.0684ms | 123.9406 Ops/s | 122.8932 Ops/s | |
test_vmap_transformer_speed[False-False] | 8.1694ms | 8.0932ms | 123.5605 Ops/s | 122.8291 Ops/s | |
test_vmap_transformer_speed_decorator[True-True] | 19.9789ms | 19.3177ms | 51.7659 Ops/s | 51.5586 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 19.4196ms | 19.3385ms | 51.7103 Ops/s | 51.5378 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 19.3169ms | 19.2166ms | 52.0384 Ops/s | 51.8863 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 19.2779ms | 19.1888ms | 52.1138 Ops/s | 51.9142 Ops/s | |
test_to_module_speed[True] | 1.4368ms | 0.9399ms | 1.0639 KOps/s | 1.0635 KOps/s | |
test_to_module_speed[False] | 1.2903ms | 0.9185ms | 1.0887 KOps/s | 1.0811 KOps/s | |
test_tc_init | 71.2910μs | 34.3988μs | 29.0708 KOps/s | 27.7800 KOps/s | |
test_tc_init_nested | 0.1066ms | 71.6526μs | 13.9562 KOps/s | 13.4348 KOps/s | |
test_tc_first_layer_tensor | 4.7030μs | 0.6835μs | 1.4632 MOps/s | 1.4461 MOps/s | |
test_tc_first_layer_nontensor | 28.6810μs | 2.2416μs | 446.1033 KOps/s | 448.2151 KOps/s | |
test_tc_second_layer_tensor | 7.2950μs | 1.3839μs | 722.5981 KOps/s | 729.3432 KOps/s | |
test_tc_second_layer_nontensor | 22.8600μs | 2.9543μs | 338.4893 KOps/s | 340.2674 KOps/s | |
test_unbind | 0.1919s | 11.9371ms | 83.7721 Ops/s | 93.2137 Ops/s | |
test_full_like | 0.6579ms | 0.5739ms | 1.7425 KOps/s | 1.7373 KOps/s | |
test_zeros_like | 0.2692ms | 0.1979ms | 5.0520 KOps/s | 5.0525 KOps/s | |
test_ones_like | 0.2575ms | 0.1977ms | 5.0586 KOps/s | 5.0532 KOps/s | |
test_clone | 0.4453ms | 0.4131ms | 2.4206 KOps/s | 2.4105 KOps/s | |
test_squeeze | 35.3200μs | 9.7534μs | 102.5283 KOps/s | 100.9646 KOps/s | |
test_unsqueeze | 0.2205ms | 74.0326μs | 13.5076 KOps/s | 13.3637 KOps/s | |
test_split | 0.4135ms | 0.1569ms | 6.3730 KOps/s | 6.3295 KOps/s | |
test_permute | 0.2803ms | 0.1800ms | 5.5558 KOps/s | 5.3402 KOps/s | |
test_stack | 1.2561ms | 0.8742ms | 1.1439 KOps/s | 1.1964 KOps/s | |
test_cat | 1.2701ms | 1.2322ms | 811.5606 Ops/s | 811.7761 Ops/s |
vmoens
added a commit
that referenced
this pull request
Sep 9, 2024
ghstack-source-id: c7737107beff69fa6f5438067c795d2e1a98d45f Pull Request resolved: #953
vmoens
added a commit
that referenced
this pull request
Sep 10, 2024
ghstack-source-id: c7737107beff69fa6f5438067c795d2e1a98d45f Pull Request resolved: #953
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
enhancement
New feature or request
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Stack from ghstack (oldest at bottom):