-
Notifications
You must be signed in to change notification settings - Fork 79
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Feature] Make NonTensorData a callable #939
Merged
Merged
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 40.7050μs | 21.5569μs | 46.3888 KOps/s | 44.5308 KOps/s | |
test_plain_set_stack_nested | 49.4720μs | 21.8434μs | 45.7805 KOps/s | 44.8696 KOps/s | |
test_plain_set_nested_inplace | 76.0110μs | 23.4607μs | 42.6244 KOps/s | 41.5698 KOps/s | |
test_plain_set_stack_nested_inplace | 79.0170μs | 23.5227μs | 42.5121 KOps/s | 41.7377 KOps/s | |
test_items | 24.5160μs | 2.6080μs | 383.4344 KOps/s | 373.7472 KOps/s | |
test_items_nested | 1.4068ms | 0.3437ms | 2.9091 KOps/s | 2.7550 KOps/s | |
test_items_nested_locked | 0.6308ms | 0.3445ms | 2.9025 KOps/s | 2.9709 KOps/s | |
test_items_nested_leaf | 0.2522ms | 85.7327μs | 11.6642 KOps/s | 11.8759 KOps/s | |
test_items_stack_nested | 0.6691ms | 0.3414ms | 2.9292 KOps/s | 2.9189 KOps/s | |
test_items_stack_nested_leaf | 0.1599ms | 87.1862μs | 11.4697 KOps/s | 11.5437 KOps/s | |
test_items_stack_nested_locked | 0.5431ms | 0.3426ms | 2.9191 KOps/s | 2.9372 KOps/s | |
test_keys | 22.8420μs | 3.8433μs | 260.1937 KOps/s | 256.1236 KOps/s | |
test_keys_nested | 0.2347ms | 0.1420ms | 7.0439 KOps/s | 7.1024 KOps/s | |
test_keys_nested_locked | 0.7234ms | 0.1480ms | 6.7552 KOps/s | 6.8627 KOps/s | |
test_keys_nested_leaf | 0.2118ms | 0.1226ms | 8.1544 KOps/s | 8.3422 KOps/s | |
test_keys_stack_nested | 0.2332ms | 0.1420ms | 7.0409 KOps/s | 7.0232 KOps/s | |
test_keys_stack_nested_leaf | 0.2105ms | 0.1213ms | 8.2464 KOps/s | 8.2418 KOps/s | |
test_keys_stack_nested_locked | 0.2932ms | 0.1480ms | 6.7547 KOps/s | 6.6883 KOps/s | |
test_values | 7.5265μs | 1.1748μs | 851.2171 KOps/s | 840.9549 KOps/s | |
test_values_nested | 87.7140μs | 50.7283μs | 19.7129 KOps/s | 20.2829 KOps/s | |
test_values_nested_locked | 93.5150μs | 50.2023μs | 19.9194 KOps/s | 20.1608 KOps/s | |
test_values_nested_leaf | 90.9200μs | 45.7578μs | 21.8542 KOps/s | 22.1688 KOps/s | |
test_values_stack_nested | 97.3020μs | 51.3790μs | 19.4632 KOps/s | 19.6685 KOps/s | |
test_values_stack_nested_leaf | 89.9480μs | 45.0938μs | 22.1760 KOps/s | 22.3456 KOps/s | |
test_values_stack_nested_locked | 94.0060μs | 51.0928μs | 19.5722 KOps/s | 19.5722 KOps/s | |
test_membership | 13.5350μs | 0.9097μs | 1.0993 MOps/s | 1.3366 MOps/s | |
test_membership_nested | 23.1730μs | 2.6309μs | 380.0954 KOps/s | 387.1768 KOps/s | |
test_membership_nested_leaf | 29.5150μs | 2.6098μs | 383.1758 KOps/s | 384.0617 KOps/s | |
test_membership_stacked_nested | 27.6120μs | 2.6059μs | 383.7463 KOps/s | 389.5553 KOps/s | |
test_membership_stacked_nested_leaf | 33.9540μs | 2.6203μs | 381.6329 KOps/s | 382.5286 KOps/s | |
test_membership_nested_last | 25.7380μs | 3.8811μs | 257.6610 KOps/s | 258.5849 KOps/s | |
test_membership_nested_leaf_last | 35.8070μs | 3.9145μs | 255.4578 KOps/s | 258.5955 KOps/s | |
test_membership_stacked_nested_last | 33.2320μs | 3.8920μs | 256.9380 KOps/s | 221.2117 KOps/s | |
test_membership_stacked_nested_leaf_last | 33.3830μs | 3.8919μs | 256.9432 KOps/s | 222.8119 KOps/s | |
test_nested_getleaf | 42.0280μs | 10.6290μs | 94.0822 KOps/s | 97.7887 KOps/s | |
test_nested_get | 34.8050μs | 10.0270μs | 99.7304 KOps/s | 102.9391 KOps/s | |
test_stacked_getleaf | 34.4550μs | 10.5017μs | 95.2227 KOps/s | 97.6393 KOps/s | |
test_stacked_get | 44.0620μs | 9.9060μs | 100.9484 KOps/s | 103.5080 KOps/s | |
test_nested_getitemleaf | 62.2090μs | 10.6189μs | 94.1716 KOps/s | 92.0862 KOps/s | |
test_nested_getitem | 56.3260μs | 10.0932μs | 99.0769 KOps/s | 100.3691 KOps/s | |
test_stacked_getitemleaf | 53.0590μs | 10.8761μs | 91.9445 KOps/s | 92.8782 KOps/s | |
test_stacked_getitem | 57.3470μs | 10.0391μs | 99.6110 KOps/s | 99.9008 KOps/s | |
test_lock_nested | 7.3772ms | 0.5048ms | 1.9811 KOps/s | 1.9993 KOps/s | |
test_lock_stack_nested | 0.7058ms | 0.4688ms | 2.1331 KOps/s | 2.1834 KOps/s | |
test_unlock_nested | 0.9448ms | 0.4193ms | 2.3852 KOps/s | 2.4381 KOps/s | |
test_unlock_stack_nested | 0.5884ms | 0.3830ms | 2.6113 KOps/s | 2.6894 KOps/s | |
test_flatten_speed | 0.5809ms | 0.1046ms | 9.5629 KOps/s | 9.6274 KOps/s | |
test_unflatten_speed | 0.5141ms | 0.4269ms | 2.3427 KOps/s | 2.3776 KOps/s | |
test_common_ops | 4.3303ms | 1.1097ms | 901.1588 Ops/s | 908.6517 Ops/s | |
test_creation | 47.4190μs | 2.0174μs | 495.6851 KOps/s | 502.3599 KOps/s | |
test_creation_empty | 45.6260μs | 17.6914μs | 56.5247 KOps/s | 53.0766 KOps/s | |
test_creation_nested_1 | 67.1260μs | 21.0252μs | 47.5619 KOps/s | 44.3145 KOps/s | |
test_creation_nested_2 | 0.1083ms | 24.2483μs | 41.2399 KOps/s | 39.0474 KOps/s | |
test_clone | 81.0220μs | 17.5005μs | 57.1413 KOps/s | 60.1686 KOps/s | |
test_getitem[int] | 1.2456ms | 16.7161μs | 59.8225 KOps/s | 61.0441 KOps/s | |
test_getitem[slice_int] | 0.1335ms | 31.7061μs | 31.5397 KOps/s | 31.8432 KOps/s | |
test_getitem[range] | 0.1634ms | 58.6001μs | 17.0648 KOps/s | 17.6165 KOps/s | |
test_getitem[tuple] | 0.1263ms | 25.2573μs | 39.5926 KOps/s | 39.8977 KOps/s | |
test_getitem[list] | 0.3276ms | 53.2255μs | 18.7880 KOps/s | 19.3241 KOps/s | |
test_setitem_dim[int] | 0.1102ms | 39.8117μs | 25.1183 KOps/s | 24.2489 KOps/s | |
test_setitem_dim[slice_int] | 0.1124ms | 70.8192μs | 14.1205 KOps/s | 13.6698 KOps/s | |
test_setitem_dim[range] | 0.1391ms | 93.1622μs | 10.7340 KOps/s | 10.6612 KOps/s | |
test_setitem_dim[tuple] | 99.8970μs | 56.9996μs | 17.5440 KOps/s | 17.0988 KOps/s | |
test_setitem | 0.1863ms | 29.6104μs | 33.7719 KOps/s | 33.5519 KOps/s | |
test_set | 0.1537ms | 28.7456μs | 34.7879 KOps/s | 34.3686 KOps/s | |
test_set_shared | 3.9911ms | 0.2205ms | 4.5352 KOps/s | 4.6910 KOps/s | |
test_update | 0.1878ms | 35.6252μs | 28.0700 KOps/s | 27.3880 KOps/s | |
test_update_nested | 0.1880ms | 45.5272μs | 21.9649 KOps/s | 21.8979 KOps/s | |
test_update__nested | 0.1262ms | 35.2422μs | 28.3750 KOps/s | 30.0324 KOps/s | |
test_set_nested | 0.1106ms | 31.0913μs | 32.1633 KOps/s | 32.4269 KOps/s | |
test_set_nested_new | 0.1624ms | 35.5549μs | 28.1256 KOps/s | 27.7518 KOps/s | |
test_select | 0.1706ms | 52.7070μs | 18.9728 KOps/s | 17.5439 KOps/s | |
test_select_nested | 0.1716ms | 58.8530μs | 16.9915 KOps/s | 17.3405 KOps/s | |
test_exclude_nested | 0.1464ms | 76.7688μs | 13.0261 KOps/s | 13.3284 KOps/s | |
test_empty[True] | 0.7413ms | 0.3272ms | 3.0567 KOps/s | 3.1591 KOps/s | |
test_empty[False] | 9.8885μs | 1.1701μs | 854.6266 KOps/s | 859.7462 KOps/s | |
test_unbind_speed | 0.6118ms | 0.3159ms | 3.1658 KOps/s | 3.2817 KOps/s | |
test_unbind_speed_stack0 | 0.3810ms | 0.3027ms | 3.3032 KOps/s | 3.3831 KOps/s | |
test_unbind_speed_stack1 | 86.7007ms | 0.7915ms | 1.2634 KOps/s | 1.3898 KOps/s | |
test_split | 90.4262ms | 2.2140ms | 451.6738 Ops/s | 457.7297 Ops/s | |
test_chunk | 89.3407ms | 2.2061ms | 453.2922 Ops/s | 464.2747 Ops/s | |
test_creation[device0] | 0.2259ms | 0.1195ms | 8.3699 KOps/s | 8.5950 KOps/s | |
test_creation_from_tensor | 4.9890ms | 0.1228ms | 8.1438 KOps/s | 8.3873 KOps/s | |
test_add_one[memmap_tensor0] | 0.2520ms | 8.0000μs | 125.0000 KOps/s | 134.1687 KOps/s | |
test_contiguous[memmap_tensor0] | 25.5280μs | 2.0347μs | 491.4848 KOps/s | 503.0307 KOps/s | |
test_stack[memmap_tensor0] | 51.4470μs | 5.8791μs | 170.0945 KOps/s | 180.4658 KOps/s | |
test_memmaptd_index | 1.2247ms | 0.4170ms | 2.3981 KOps/s | 2.4931 KOps/s | |
test_memmaptd_index_astensor | 1.0196ms | 0.4953ms | 2.0191 KOps/s | 2.0827 KOps/s | |
test_memmaptd_index_op | 1.5783ms | 1.0493ms | 953.0145 Ops/s | 940.6712 Ops/s | |
test_serialize_model | 0.1296s | 0.1207s | 8.2816 Ops/s | 8.4174 Ops/s | |
test_serialize_model_pickle | 0.4512s | 0.3984s | 2.5098 Ops/s | 2.4946 Ops/s | |
test_serialize_weights | 0.1262s | 0.1174s | 8.5199 Ops/s | 8.2753 Ops/s | |
test_serialize_weights_returnearly | 0.1761s | 0.1592s | 6.2822 Ops/s | 6.2068 Ops/s | |
test_serialize_weights_pickle | 0.4748s | 0.4149s | 2.4103 Ops/s | 2.4107 Ops/s | |
test_serialize_weights_filesystem | 0.1512s | 0.1443s | 6.9284 Ops/s | 6.9315 Ops/s | |
test_serialize_model_filesystem | 0.1587s | 0.1511s | 6.6193 Ops/s | 5.9606 Ops/s | |
test_reshape_pytree | 0.1256ms | 40.8835μs | 24.4597 KOps/s | 25.0678 KOps/s | |
test_reshape_td | 0.1287ms | 48.3593μs | 20.6785 KOps/s | 21.5215 KOps/s | |
test_view_pytree | 88.8060μs | 40.1958μs | 24.8782 KOps/s | 25.3787 KOps/s | |
test_view_td | 0.1273ms | 56.0395μs | 17.8445 KOps/s | 18.3612 KOps/s | |
test_unbind_pytree | 93.8550μs | 37.3476μs | 26.7755 KOps/s | 27.3981 KOps/s | |
test_unbind_td | 0.3891ms | 46.9443μs | 21.3018 KOps/s | 22.0642 KOps/s | |
test_split_pytree | 0.1018ms | 41.5156μs | 24.0873 KOps/s | 26.1249 KOps/s | |
test_split_td | 96.8688ms | 70.6164μs | 14.1610 KOps/s | 16.7641 KOps/s | |
test_add_pytree | 0.1233ms | 47.5035μs | 21.0511 KOps/s | 21.3560 KOps/s | |
test_add_td | 0.2916ms | 84.0921μs | 11.8917 KOps/s | 11.4911 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1305ms | 54.3348μs | 18.4044 KOps/s | 18.5606 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.4051ms | 0.1920ms | 5.2089 KOps/s | 5.2792 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1313ms | 53.9153μs | 18.5476 KOps/s | 18.1840 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2892ms | 0.1474ms | 6.7826 KOps/s | 6.9429 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 62.4570μs | 20.1888μs | 49.5323 KOps/s | 49.3478 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1394ms | 64.6130μs | 15.4768 KOps/s | 15.7144 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1812ms | 78.4430μs | 12.7481 KOps/s | 12.8168 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1506ms | 70.5486μs | 14.1746 KOps/s | 14.3762 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.2651ms | 0.1754ms | 5.7014 KOps/s | 5.7318 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.2702ms | 0.1972ms | 5.0707 KOps/s | 5.1257 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1003ms | 38.8485μs | 25.7410 KOps/s | 25.7172 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.5232ms | 70.8336μs | 14.1176 KOps/s | 14.0872 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.3407ms | 0.1716ms | 5.8290 KOps/s | 5.7484 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.5674ms | 0.3027ms | 3.3031 KOps/s | 3.3904 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 6.3838ms | 0.2130ms | 4.6943 KOps/s | 4.8521 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.3644ms | 0.1784ms | 5.6063 KOps/s | 5.6191 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.2084ms | 62.8483μs | 15.9113 KOps/s | 15.7077 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1267ms | 40.3882μs | 24.7597 KOps/s | 25.6440 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.4512ms | 0.2477ms | 4.0363 KOps/s | 4.2010 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.3299ms | 0.1750ms | 5.7138 KOps/s | 5.7750 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 0.2191ms | 0.1086ms | 9.2106 KOps/s | 9.2989 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1241ms | 56.1217μs | 17.8184 KOps/s | 17.7553 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1747ms | 80.7593μs | 12.3825 KOps/s | 12.5978 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1820ms | 69.9831μs | 14.2892 KOps/s | 14.1623 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.3736ms | 0.1904ms | 5.2514 KOps/s | 5.2074 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.0403ms | 1.7141ms | 583.3964 Ops/s | 590.1051 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.3569ms | 0.1881ms | 5.3150 KOps/s | 5.2458 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 1.3869ms | 1.1266ms | 887.6533 Ops/s | 917.2488 Ops/s | |
test_compile_assign_and_add_stack[compile] | 0.8776ms | 0.4150ms | 2.4094 KOps/s | 2.3903 KOps/s | |
test_compile_assign_and_add_stack[eager] | 5.8708ms | 3.8980ms | 256.5420 Ops/s | 251.6770 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 99.4360μs | 33.2194μs | 30.1029 KOps/s | 31.2415 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 1.1343ms | 49.5521μs | 20.1808 KOps/s | 20.3900 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 90.4990μs | 29.0471μs | 34.4269 KOps/s | 32.3230 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.7140ms | 30.6675μs | 32.6078 KOps/s | 32.6198 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 93.2950μs | 28.4875μs | 35.1031 KOps/s | 34.5012 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.1073ms | 30.7802μs | 32.4885 KOps/s | 32.8464 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1451ms | 72.8632μs | 13.7243 KOps/s | 13.5732 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.6333ms | 28.4422μs | 35.1590 KOps/s | 35.3381 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1472ms | 67.0904μs | 14.9053 KOps/s | 14.2523 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 73.4370μs | 24.3223μs | 41.1145 KOps/s | 41.5409 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1388ms | 66.6636μs | 15.0007 KOps/s | 14.5385 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 71.2940μs | 24.2202μs | 41.2879 KOps/s | 41.7347 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1503ms | 72.3581μs | 13.8202 KOps/s | 13.6199 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.6815ms | 27.9952μs | 35.7204 KOps/s | 35.3895 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1465ms | 66.6252μs | 15.0093 KOps/s | 14.6943 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 82.1640μs | 23.8756μs | 41.8838 KOps/s | 42.2386 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1405ms | 67.1260μs | 14.8974 KOps/s | 14.6162 KOps/s | |
test_compile_indexing[int-pytree-eager] | 67.6060μs | 24.3808μs | 41.0159 KOps/s | 42.6611 KOps/s | |
test_mod_add[eager] | 89.0560μs | 24.7645μs | 40.3803 KOps/s | 40.3881 KOps/s | |
test_mod_add[compile] | 84.7890μs | 37.6297μs | 26.5748 KOps/s | 27.8011 KOps/s | |
test_mod_add[compile-overhead] | 82.6750μs | 36.8976μs | 27.1021 KOps/s | 27.6141 KOps/s | |
test_mod_wrap[eager] | 0.4130ms | 0.2071ms | 4.8297 KOps/s | 4.9370 KOps/s | |
test_mod_wrap[compile] | 1.7839ms | 0.2317ms | 4.3168 KOps/s | 4.3673 KOps/s | |
test_mod_wrap[compile-overhead] | 0.3686ms | 0.2271ms | 4.4030 KOps/s | 4.4364 KOps/s | |
test_mod_wrap_and_backward[eager] | 14.6133ms | 11.8606ms | 84.3129 Ops/s | 82.8495 Ops/s | |
test_mod_wrap_and_backward[compile] | 20.8235ms | 12.5109ms | 79.9303 Ops/s | 80.5995 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 12.2360ms | 11.0317ms | 90.6481 Ops/s | 83.8290 Ops/s | |
test_seq_add[eager] | 0.1442ms | 88.8205μs | 11.2587 KOps/s | 11.3980 KOps/s | |
test_seq_add[compile] | 0.1561ms | 61.3212μs | 16.3076 KOps/s | 16.3142 KOps/s | |
test_seq_add[compile-overhead] | 0.1508ms | 58.5460μs | 17.0806 KOps/s | 17.1085 KOps/s | |
test_seq_wrap[eager] | 0.5941ms | 0.3661ms | 2.7312 KOps/s | 2.6613 KOps/s | |
test_seq_wrap[compile] | 0.6478ms | 0.2654ms | 3.7674 KOps/s | 3.8032 KOps/s | |
test_seq_wrap[compile-overhead] | 0.5013ms | 0.2663ms | 3.7547 KOps/s | 3.8044 KOps/s | |
test_func_call_runtime[False-eager] | 1.0041ms | 0.5475ms | 1.8265 KOps/s | 1.9463 KOps/s | |
test_func_call_runtime[False-compile] | 0.8290ms | 0.4984ms | 2.0064 KOps/s | 2.0215 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.7181ms | 0.4987ms | 2.0052 KOps/s | 2.0335 KOps/s | |
test_func_call_runtime[True-eager] | 1.1129ms | 0.7680ms | 1.3021 KOps/s | 1.3480 KOps/s | |
test_func_call_runtime[True-compile] | 0.9340ms | 0.5180ms | 1.9304 KOps/s | 1.9645 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.7107ms | 0.5136ms | 1.9469 KOps/s | 1.9706 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.8946ms | 0.5382ms | 1.8581 KOps/s | 1.9538 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.6810ms | 0.4945ms | 2.0222 KOps/s | 2.0328 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.9222ms | 0.5044ms | 1.9825 KOps/s | 2.0226 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.0974ms | 0.8962ms | 1.1158 KOps/s | 1.1499 KOps/s | |
test_func_call_cm_runtime[True-compile] | 0.9551ms | 0.8462ms | 1.1817 KOps/s | 1.2110 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 1.1926ms | 0.8474ms | 1.1801 KOps/s | 1.2123 KOps/s | |
test_distributed | 0.2703ms | 0.1327ms | 7.5331 KOps/s | 7.4536 KOps/s | |
test_tdmodule | 32.5300μs | 17.3746μs | 57.5552 KOps/s | 54.9233 KOps/s | |
test_tdmodule_dispatch | 65.5130μs | 36.3523μs | 27.5086 KOps/s | 26.1153 KOps/s | |
test_tdseq | 53.5500μs | 19.4651μs | 51.3740 KOps/s | 49.8866 KOps/s | |
test_tdseq_dispatch | 71.5830μs | 40.5816μs | 24.6417 KOps/s | 23.7677 KOps/s | |
test_instantiation_functorch | 2.0274ms | 1.6682ms | 599.4592 Ops/s | 613.3389 Ops/s | |
test_instantiation_td | 2.2676ms | 1.1987ms | 834.2380 Ops/s | 846.2210 Ops/s | |
test_exec_functorch | 0.2849ms | 0.1843ms | 5.4274 KOps/s | 5.7553 KOps/s | |
test_exec_functional_call | 0.3115ms | 0.1760ms | 5.6811 KOps/s | 5.9405 KOps/s | |
test_exec_td | 0.3125ms | 0.1813ms | 5.5164 KOps/s | 5.9209 KOps/s | |
test_exec_td_decorator | 1.0112ms | 0.2311ms | 4.3268 KOps/s | 4.5336 KOps/s | |
test_vmap_mlp_speed[True-True] | 0.8991ms | 0.5777ms | 1.7311 KOps/s | 1.7608 KOps/s | |
test_vmap_mlp_speed[True-False] | 1.0702ms | 0.5709ms | 1.7517 KOps/s | 1.7893 KOps/s | |
test_vmap_mlp_speed[False-True] | 0.6545ms | 0.4718ms | 2.1197 KOps/s | 2.1217 KOps/s | |
test_vmap_mlp_speed[False-False] | 0.8501ms | 0.4781ms | 2.0915 KOps/s | 2.1408 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 1.0166ms | 0.6286ms | 1.5907 KOps/s | 1.6108 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.9544ms | 0.6238ms | 1.6030 KOps/s | 1.6255 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.8601ms | 0.5172ms | 1.9335 KOps/s | 1.9566 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.9105ms | 0.5188ms | 1.9275 KOps/s | 1.9620 KOps/s | |
test_to_module_speed[True] | 1.6293ms | 1.3267ms | 753.7726 Ops/s | 764.6880 Ops/s | |
test_to_module_speed[False] | 1.7291ms | 1.2978ms | 770.5110 Ops/s | 780.4181 Ops/s | |
test_tc_init | 97.6330μs | 46.0590μs | 21.7113 KOps/s | 21.8749 KOps/s | |
test_tc_init_nested | 0.1814ms | 92.5699μs | 10.8026 KOps/s | 11.0323 KOps/s | |
test_tc_first_layer_tensor | 35.2860μs | 1.4577μs | 686.0063 KOps/s | 713.7629 KOps/s | |
test_tc_first_layer_nontensor | 38.2420μs | 4.2698μs | 234.2012 KOps/s | 236.5951 KOps/s | |
test_tc_second_layer_tensor | 28.2030μs | 2.7424μs | 364.6457 KOps/s | 384.7982 KOps/s | |
test_tc_second_layer_nontensor | 33.0610μs | 5.5571μs | 179.9511 KOps/s | 186.0553 KOps/s | |
test_unbind | 0.4735s | 14.4470ms | 69.2183 Ops/s | 64.9068 Ops/s | |
test_full_like | 14.2006ms | 8.5700ms | 116.6857 Ops/s | 120.7441 Ops/s | |
test_zeros_like | 11.6672ms | 7.0305ms | 142.2375 Ops/s | 131.7384 Ops/s | |
test_ones_like | 13.2507ms | 7.5986ms | 131.6033 Ops/s | 124.0464 Ops/s | |
test_clone | 19.8587ms | 9.2960ms | 107.5737 Ops/s | 102.2571 Ops/s | |
test_squeeze | 97.1620μs | 12.7882μs | 78.1970 KOps/s | 76.9457 KOps/s | |
test_unsqueeze | 0.1526ms | 93.5205μs | 10.6928 KOps/s | 10.7456 KOps/s | |
test_split | 0.3560ms | 0.2015ms | 4.9628 KOps/s | 5.0547 KOps/s | |
test_permute | 0.3158ms | 0.2187ms | 4.5716 KOps/s | 4.6183 KOps/s | |
test_stack | 33.3918ms | 25.8647ms | 38.6628 Ops/s | 39.0705 Ops/s | |
test_cat | 29.9482ms | 25.5082ms | 39.2031 Ops/s | 38.7598 Ops/s |
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 0.1452ms | 17.1688μs | 58.2453 KOps/s | 60.5768 KOps/s | |
test_plain_set_stack_nested | 35.1900μs | 17.3523μs | 57.6291 KOps/s | 60.5483 KOps/s | |
test_plain_set_nested_inplace | 38.7710μs | 18.2326μs | 54.8468 KOps/s | 56.5368 KOps/s | |
test_plain_set_stack_nested_inplace | 36.9800μs | 18.2983μs | 54.6498 KOps/s | 56.8306 KOps/s | |
test_items | 31.5100μs | 4.6626μs | 214.4743 KOps/s | 215.5523 KOps/s | |
test_items_nested | 0.4119ms | 0.3618ms | 2.7638 KOps/s | 2.7175 KOps/s | |
test_items_nested_locked | 0.4196ms | 0.3652ms | 2.7381 KOps/s | 2.6799 KOps/s | |
test_items_nested_leaf | 0.1096ms | 84.0798μs | 11.8935 KOps/s | 11.9255 KOps/s | |
test_items_stack_nested | 0.4204ms | 0.3677ms | 2.7194 KOps/s | 2.7285 KOps/s | |
test_items_stack_nested_leaf | 0.1087ms | 85.7896μs | 11.6564 KOps/s | 11.8423 KOps/s | |
test_items_stack_nested_locked | 0.4244ms | 0.3694ms | 2.7069 KOps/s | 2.7554 KOps/s | |
test_keys | 18.7900μs | 4.3641μs | 229.1444 KOps/s | 229.4179 KOps/s | |
test_keys_nested | 88.6710μs | 65.7711μs | 15.2042 KOps/s | 14.9951 KOps/s | |
test_keys_nested_locked | 0.6789ms | 72.7758μs | 13.7408 KOps/s | 13.8304 KOps/s | |
test_keys_nested_leaf | 79.2110μs | 55.8647μs | 17.9004 KOps/s | 18.0039 KOps/s | |
test_keys_stack_nested | 85.6910μs | 66.1650μs | 15.1137 KOps/s | 15.1377 KOps/s | |
test_keys_stack_nested_leaf | 76.2710μs | 58.0801μs | 17.2176 KOps/s | 17.3717 KOps/s | |
test_keys_stack_nested_locked | 0.1029ms | 72.1056μs | 13.8686 KOps/s | 14.0025 KOps/s | |
test_values | 9.3637μs | 1.8029μs | 554.6688 KOps/s | 567.9603 KOps/s | |
test_values_nested | 52.3810μs | 33.7206μs | 29.6554 KOps/s | 29.9826 KOps/s | |
test_values_nested_locked | 53.2410μs | 35.9476μs | 27.8182 KOps/s | 28.3084 KOps/s | |
test_values_nested_leaf | 54.1910μs | 30.0964μs | 33.2265 KOps/s | 33.5133 KOps/s | |
test_values_stack_nested | 53.8810μs | 34.3808μs | 29.0860 KOps/s | 29.1728 KOps/s | |
test_values_stack_nested_leaf | 54.4600μs | 30.8498μs | 32.4151 KOps/s | 32.5653 KOps/s | |
test_values_stack_nested_locked | 53.9110μs | 36.3649μs | 27.4991 KOps/s | 27.5257 KOps/s | |
test_membership | 1.3156μs | 0.5560μs | 1.7986 MOps/s | 1.8573 MOps/s | |
test_membership_nested | 9.3405μs | 1.9382μs | 515.9308 KOps/s | 500.8193 KOps/s | |
test_membership_nested_leaf | 9.3205μs | 1.9364μs | 516.4120 KOps/s | 516.4297 KOps/s | |
test_membership_stacked_nested | 25.3700μs | 2.0017μs | 499.5769 KOps/s | 505.3808 KOps/s | |
test_membership_stacked_nested_leaf | 20.7600μs | 2.0029μs | 499.2725 KOps/s | 499.8722 KOps/s | |
test_membership_nested_last | 16.4900μs | 2.9111μs | 343.5124 KOps/s | 351.5697 KOps/s | |
test_membership_nested_leaf_last | 19.4500μs | 2.8770μs | 347.5890 KOps/s | 344.0058 KOps/s | |
test_membership_stacked_nested_last | 21.2900μs | 2.9297μs | 341.3327 KOps/s | 182.8060 KOps/s | |
test_membership_stacked_nested_leaf_last | 17.1600μs | 2.9429μs | 339.8044 KOps/s | 183.2134 KOps/s | |
test_nested_getleaf | 21.4000μs | 8.0410μs | 124.3630 KOps/s | 125.6175 KOps/s | |
test_nested_get | 22.2900μs | 7.4337μs | 134.5232 KOps/s | 134.0402 KOps/s | |
test_stacked_getleaf | 29.7700μs | 7.9738μs | 125.4112 KOps/s | 124.6191 KOps/s | |
test_stacked_get | 23.3910μs | 7.4892μs | 133.5263 KOps/s | 133.9974 KOps/s | |
test_nested_getitemleaf | 22.5200μs | 8.1951μs | 122.0246 KOps/s | 123.3105 KOps/s | |
test_nested_getitem | 22.5500μs | 7.6398μs | 130.8928 KOps/s | 131.1404 KOps/s | |
test_stacked_getitemleaf | 24.6010μs | 8.1110μs | 123.2894 KOps/s | 122.2887 KOps/s | |
test_stacked_getitem | 22.2300μs | 7.6494μs | 130.7296 KOps/s | 131.2759 KOps/s | |
test_lock_nested | 9.9408ms | 0.4823ms | 2.0736 KOps/s | 2.1431 KOps/s | |
test_lock_stack_nested | 0.4812ms | 0.4294ms | 2.3287 KOps/s | 2.3852 KOps/s | |
test_unlock_nested | 0.8714ms | 0.3910ms | 2.5577 KOps/s | 2.5755 KOps/s | |
test_unlock_stack_nested | 0.4137ms | 0.3483ms | 2.8707 KOps/s | 2.9523 KOps/s | |
test_flatten_speed | 0.5281ms | 0.1055ms | 9.4752 KOps/s | 9.5622 KOps/s | |
test_unflatten_speed | 0.3544ms | 0.2877ms | 3.4763 KOps/s | 3.4636 KOps/s | |
test_common_ops | 1.6404ms | 1.3127ms | 761.8032 Ops/s | 743.8186 Ops/s | |
test_creation | 17.6310μs | 1.6418μs | 609.0933 KOps/s | 612.3384 KOps/s | |
test_creation_empty | 36.1610μs | 17.5815μs | 56.8779 KOps/s | 61.6590 KOps/s | |
test_creation_nested_1 | 43.7700μs | 19.6216μs | 50.9642 KOps/s | 54.4367 KOps/s | |
test_creation_nested_2 | 42.9600μs | 22.2310μs | 44.9822 KOps/s | 49.0208 KOps/s | |
test_clone | 58.4410μs | 30.1915μs | 33.1219 KOps/s | 32.0291 KOps/s | |
test_getitem[int] | 1.0026ms | 17.3358μs | 57.6841 KOps/s | 60.6296 KOps/s | |
test_getitem[slice_int] | 0.1686ms | 29.7400μs | 33.6248 KOps/s | 35.6359 KOps/s | |
test_getitem[range] | 0.2937ms | 0.1148ms | 8.7083 KOps/s | 8.9239 KOps/s | |
test_getitem[tuple] | 0.1556ms | 25.9135μs | 38.5899 KOps/s | 40.7794 KOps/s | |
test_getitem[list] | 91.7430ms | 0.1289ms | 7.7563 KOps/s | 9.2480 KOps/s | |
test_setitem_dim[int] | 74.9510μs | 54.0628μs | 18.4970 KOps/s | 18.4409 KOps/s | |
test_setitem_dim[slice_int] | 98.5310μs | 79.3509μs | 12.6023 KOps/s | 13.4630 KOps/s | |
test_setitem_dim[range] | 0.1739ms | 0.1437ms | 6.9588 KOps/s | 7.3433 KOps/s | |
test_setitem_dim[tuple] | 93.8910μs | 71.6000μs | 13.9665 KOps/s | 14.9486 KOps/s | |
test_setitem | 85.2110μs | 43.4212μs | 23.0303 KOps/s | 23.6018 KOps/s | |
test_set | 73.8710μs | 42.8305μs | 23.3478 KOps/s | 22.6652 KOps/s | |
test_set_shared | 0.3711ms | 53.8839μs | 18.5584 KOps/s | 18.8704 KOps/s | |
test_update | 88.8310μs | 52.6576μs | 18.9906 KOps/s | 20.6831 KOps/s | |
test_update_nested | 84.1110μs | 60.1112μs | 16.6358 KOps/s | 16.7196 KOps/s | |
test_update__nested | 94.9720μs | 62.8184μs | 15.9189 KOps/s | 15.4769 KOps/s | |
test_set_nested | 67.3210μs | 44.8438μs | 22.2996 KOps/s | 21.1556 KOps/s | |
test_set_nested_new | 96.5020μs | 49.0605μs | 20.3830 KOps/s | 19.7921 KOps/s | |
test_select | 85.8020μs | 64.9340μs | 15.4003 KOps/s | 15.1343 KOps/s | |
test_select_nested | 71.8010μs | 53.2929μs | 18.7642 KOps/s | 19.2987 KOps/s | |
test_exclude_nested | 0.1090ms | 70.5635μs | 14.1716 KOps/s | 14.2565 KOps/s | |
test_empty[True] | 0.3372ms | 0.2860ms | 3.4967 KOps/s | 3.5757 KOps/s | |
test_empty[False] | 2.3511μs | 0.8547μs | 1.1699 MOps/s | 1.1451 MOps/s | |
test_to | 66.8010μs | 39.8922μs | 25.0675 KOps/s | 25.2802 KOps/s | |
test_to_nonblocking | 47.5610μs | 25.7135μs | 38.8900 KOps/s | 40.8593 KOps/s | |
test_unbind_speed | 1.4269ms | 0.3046ms | 3.2825 KOps/s | 3.3723 KOps/s | |
test_unbind_speed_stack0 | 0.3527ms | 0.3019ms | 3.3129 KOps/s | 3.3891 KOps/s | |
test_unbind_speed_stack1 | 90.7687ms | 0.7689ms | 1.3005 KOps/s | 1.3249 KOps/s | |
test_split | 91.4186ms | 2.3634ms | 423.1141 Ops/s | 444.1235 Ops/s | |
test_chunk | 2.3244ms | 2.1654ms | 461.8020 Ops/s | 441.4737 Ops/s | |
test_creation[device0] | 0.1559ms | 0.1028ms | 9.7245 KOps/s | 9.6952 KOps/s | |
test_creation_from_tensor | 0.1610ms | 0.1012ms | 9.8813 KOps/s | 9.4207 KOps/s | |
test_add_one[memmap_tensor0] | 78.7910μs | 8.6094μs | 116.1524 KOps/s | 113.3005 KOps/s | |
test_contiguous[memmap_tensor0] | 27.4610μs | 2.1585μs | 463.2862 KOps/s | 462.7545 KOps/s | |
test_stack[memmap_tensor0] | 31.0100μs | 6.5435μs | 152.8236 KOps/s | 156.6894 KOps/s | |
test_memmaptd_index | 1.0657ms | 0.4296ms | 2.3276 KOps/s | 2.3722 KOps/s | |
test_memmaptd_index_astensor | 0.8477ms | 0.4968ms | 2.0130 KOps/s | 2.0503 KOps/s | |
test_memmaptd_index_op | 1.4515ms | 1.0597ms | 943.7055 Ops/s | 1.0060 KOps/s | |
test_serialize_model | 93.3712ms | 89.3092ms | 11.1971 Ops/s | 10.8638 Ops/s | |
test_serialize_model_pickle | 1.3540s | 1.2363s | 0.8089 Ops/s | 0.8084 Ops/s | |
test_serialize_weights | 89.8706ms | 85.9446ms | 11.6354 Ops/s | 11.0624 Ops/s | |
test_serialize_weights_returnearly | 0.2098s | 65.1776ms | 15.3427 Ops/s | 19.1596 Ops/s | |
test_serialize_weights_pickle | 1.4079s | 1.2439s | 0.8039 Ops/s | 0.8082 Ops/s | |
test_reshape_pytree | 0.2151ms | 38.9845μs | 25.6512 KOps/s | 26.4096 KOps/s | |
test_reshape_td | 78.8610μs | 44.0584μs | 22.6972 KOps/s | 23.4596 KOps/s | |
test_view_pytree | 56.1000μs | 37.6972μs | 26.5271 KOps/s | 26.9682 KOps/s | |
test_view_td | 78.1220μs | 50.5385μs | 19.7869 KOps/s | 19.9865 KOps/s | |
test_unbind_pytree | 70.7710μs | 36.8133μs | 27.1641 KOps/s | 27.6955 KOps/s | |
test_unbind_td | 0.4215ms | 46.0442μs | 21.7183 KOps/s | 22.8011 KOps/s | |
test_split_pytree | 82.9420μs | 51.9336μs | 19.2554 KOps/s | 19.7798 KOps/s | |
test_split_td | 92.0757ms | 69.9481μs | 14.2963 KOps/s | 17.5668 KOps/s | |
test_add_pytree | 0.1119ms | 60.2328μs | 16.6023 KOps/s | 17.2957 KOps/s | |
test_add_td | 0.1401ms | 97.9020μs | 10.2143 KOps/s | 10.6486 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.4190ms | 0.2179ms | 4.5886 KOps/s | 4.6433 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.2619ms | 0.1731ms | 5.7764 KOps/s | 5.7550 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1918ms | 0.1483ms | 6.7411 KOps/s | 6.7852 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2497ms | 0.1978ms | 5.0557 KOps/s | 5.2897 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 49.1510μs | 22.6222μs | 44.2044 KOps/s | 44.0308 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 74.4010μs | 47.8051μs | 20.9183 KOps/s | 20.9825 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1297ms | 75.0283μs | 13.3283 KOps/s | 13.3850 KOps/s | |
test_compile_copy_nested[pytree-eager] | 84.1410μs | 60.1435μs | 16.6269 KOps/s | 16.7399 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.4261ms | 0.3355ms | 2.9808 KOps/s | 3.0255 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.2977ms | 0.2236ms | 4.4718 KOps/s | 4.5507 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.2074ms | 0.1332ms | 7.5090 KOps/s | 7.3362 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1363ms | 62.5719μs | 15.9816 KOps/s | 15.7176 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.4132ms | 0.3318ms | 3.0137 KOps/s | 3.0406 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.7074ms | 0.6445ms | 1.5516 KOps/s | 1.5560 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3208ms | 0.2702ms | 3.7006 KOps/s | 3.7477 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.3764ms | 0.3347ms | 2.9874 KOps/s | 3.0114 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1628ms | 74.8124μs | 13.3668 KOps/s | 13.5527 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1833ms | 0.1321ms | 7.5705 KOps/s | 7.3801 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.6300ms | 0.5494ms | 1.8200 KOps/s | 1.9179 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.3947ms | 0.3318ms | 3.0137 KOps/s | 3.0455 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 45.7710μs | 19.2900μs | 51.8403 KOps/s | 54.6969 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 57.9210μs | 32.3982μs | 30.8659 KOps/s | 30.7354 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1037ms | 76.7806μs | 13.0241 KOps/s | 13.0837 KOps/s | |
test_compile_copy_flat[pytree-eager] | 83.6120μs | 60.7289μs | 16.4666 KOps/s | 16.6099 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 2.4802ms | 0.8560ms | 1.1682 KOps/s | 1.1048 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 3.8207ms | 3.3642ms | 297.2489 Ops/s | 314.9497 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 2.4269ms | 0.8429ms | 1.1863 KOps/s | 1.1109 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 3.7679ms | 3.4242ms | 292.0373 Ops/s | 316.4868 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.2566ms | 0.1151ms | 8.6916 KOps/s | 8.8621 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.2245ms | 61.1816μs | 16.3448 KOps/s | 15.1674 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.1569ms | 0.1044ms | 9.5797 KOps/s | 9.6138 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 86.4910μs | 48.3259μs | 20.6928 KOps/s | 22.2454 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1753ms | 0.1091ms | 9.1622 KOps/s | 9.6090 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 76.4610μs | 45.5733μs | 21.9427 KOps/s | 22.2738 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1839ms | 0.1409ms | 7.0961 KOps/s | 7.1595 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1807ms | 25.8464μs | 38.6900 KOps/s | 39.5489 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1699ms | 0.1320ms | 7.5758 KOps/s | 7.5693 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 45.2510μs | 22.7165μs | 44.0209 KOps/s | 45.6780 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1703ms | 0.1322ms | 7.5663 KOps/s | 7.5829 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 48.4900μs | 22.5793μs | 44.2884 KOps/s | 45.1927 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.2229ms | 0.1402ms | 7.1342 KOps/s | 7.2107 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.4835ms | 28.0589μs | 35.6393 KOps/s | 39.6466 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1721ms | 0.1320ms | 7.5753 KOps/s | 7.5761 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 52.3110μs | 22.2452μs | 44.9536 KOps/s | 45.3700 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1673ms | 0.1320ms | 7.5785 KOps/s | 7.6427 KOps/s | |
test_compile_indexing[int-pytree-eager] | 44.7910μs | 22.4287μs | 44.5858 KOps/s | 45.6774 KOps/s | |
test_mod_add[eager] | 74.8610μs | 38.6019μs | 25.9054 KOps/s | 27.3775 KOps/s | |
test_mod_add[compile] | 0.1125ms | 73.7329μs | 13.5625 KOps/s | 14.4370 KOps/s | |
test_mod_add[compile-overhead] | 0.2817ms | 0.1382ms | 7.2375 KOps/s | 6.5637 KOps/s | |
test_mod_wrap[eager] | 0.3290ms | 0.2509ms | 3.9852 KOps/s | 3.9837 KOps/s | |
test_mod_wrap[compile] | 1.0594ms | 0.2931ms | 3.4115 KOps/s | 3.3005 KOps/s | |
test_mod_wrap[compile-overhead] | 8.2427ms | 4.3623ms | 229.2383 Ops/s | 232.0157 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.4371ms | 1.3303ms | 751.7312 Ops/s | 703.4139 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.4543ms | 1.3319ms | 750.8101 Ops/s | 693.6235 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.3245ms | 0.8977ms | 1.1140 KOps/s | 994.6725 Ops/s | |
test_seq_add[eager] | 0.1650ms | 0.1116ms | 8.9587 KOps/s | 9.1435 KOps/s | |
test_seq_add[compile] | 0.1528ms | 88.8043μs | 11.2607 KOps/s | 11.6042 KOps/s | |
test_seq_add[compile-overhead] | 0.1626ms | 0.1236ms | 8.0938 KOps/s | 8.0839 KOps/s | |
test_seq_wrap[eager] | 0.5089ms | 0.4365ms | 2.2911 KOps/s | 2.4010 KOps/s | |
test_seq_wrap[compile] | 0.3933ms | 0.3304ms | 3.0267 KOps/s | 3.0466 KOps/s | |
test_seq_wrap[compile-overhead] | 0.1926s | 89.5843ms | 11.1627 Ops/s | 7.9022 Ops/s | |
test_func_call_runtime[False-eager] | 0.8144ms | 0.7446ms | 1.3430 KOps/s | 1.3081 KOps/s | |
test_func_call_runtime[False-compile] | 0.9099ms | 0.8126ms | 1.2307 KOps/s | 1.2464 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.4215ms | 0.3687ms | 2.7120 KOps/s | 2.7149 KOps/s | |
test_func_call_runtime[True-eager] | 1.0939ms | 0.9495ms | 1.0532 KOps/s | 1.0845 KOps/s | |
test_func_call_runtime[True-compile] | 0.9740ms | 0.8561ms | 1.1681 KOps/s | 1.1792 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.4605ms | 0.4166ms | 2.4001 KOps/s | 2.4202 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.8703ms | 0.7974ms | 1.2541 KOps/s | 1.2613 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.9059ms | 0.8401ms | 1.1904 KOps/s | 1.2320 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.5535ms | 0.3708ms | 2.6968 KOps/s | 2.7328 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.1769ms | 1.0550ms | 947.8695 Ops/s | 963.8055 Ops/s | |
test_func_call_cm_runtime[True-compile] | 1.1360ms | 1.0408ms | 960.8086 Ops/s | 987.6768 Ops/s | |
test_func_call_cm_runtime[True-compile-overhead] | 1.1253ms | 1.0187ms | 981.6905 Ops/s | 989.1326 Ops/s | |
test_distributed | 1.1791ms | 72.9342μs | 13.7110 KOps/s | 14.2549 KOps/s | |
test_tdmodule | 32.2200μs | 16.6801μs | 59.9515 KOps/s | 64.7004 KOps/s | |
test_tdmodule_dispatch | 49.9810μs | 33.8524μs | 29.5400 KOps/s | 31.8952 KOps/s | |
test_tdseq | 33.5210μs | 16.9521μs | 58.9898 KOps/s | 62.5089 KOps/s | |
test_tdseq_dispatch | 52.9400μs | 35.4881μs | 28.1785 KOps/s | 29.9008 KOps/s | |
test_instantiation_functorch | 2.1209ms | 2.0238ms | 494.1125 Ops/s | 496.1680 Ops/s | |
test_instantiation_td | 2.0000ms | 1.3180ms | 758.7345 Ops/s | 768.0839 Ops/s | |
test_exec_functorch | 0.2801ms | 0.2200ms | 4.5452 KOps/s | 4.6882 KOps/s | |
test_exec_functional_call | 0.2976ms | 0.2247ms | 4.4512 KOps/s | 4.7883 KOps/s | |
test_exec_td | 0.2864ms | 0.2257ms | 4.4314 KOps/s | 4.6417 KOps/s | |
test_exec_td_decorator | 0.4238ms | 0.2700ms | 3.7036 KOps/s | 3.7312 KOps/s | |
test_vmap_mlp_speed[True-True] | 1.1925ms | 0.6431ms | 1.5549 KOps/s | 1.5603 KOps/s | |
test_vmap_mlp_speed[True-False] | 0.7164ms | 0.6383ms | 1.5666 KOps/s | 1.5901 KOps/s | |
test_vmap_mlp_speed[False-True] | 0.6816ms | 0.5727ms | 1.7462 KOps/s | 1.7924 KOps/s | |
test_vmap_mlp_speed[False-False] | 0.6498ms | 0.5757ms | 1.7370 KOps/s | 1.7493 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 1.2953ms | 0.6850ms | 1.4599 KOps/s | 1.4680 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.7845ms | 0.6856ms | 1.4586 KOps/s | 1.4627 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7534ms | 0.6025ms | 1.6596 KOps/s | 1.6794 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7386ms | 0.6032ms | 1.6578 KOps/s | 1.6720 KOps/s | |
test_vmap_transformer_speed[True-True] | 8.8291ms | 8.5372ms | 117.1349 Ops/s | 117.8211 Ops/s | |
test_vmap_transformer_speed[True-False] | 9.9652ms | 8.5758ms | 116.6072 Ops/s | 117.7688 Ops/s | |
test_vmap_transformer_speed[False-True] | 8.8983ms | 8.5014ms | 117.6283 Ops/s | 119.4049 Ops/s | |
test_vmap_transformer_speed[False-False] | 8.9407ms | 8.4480ms | 118.3708 Ops/s | 119.1965 Ops/s | |
test_vmap_transformer_speed_decorator[True-True] | 21.0236ms | 20.1986ms | 49.5083 Ops/s | 49.9248 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 21.0009ms | 20.1243ms | 49.6912 Ops/s | 50.0024 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 21.4750ms | 20.0143ms | 49.9642 Ops/s | 50.4852 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 20.8289ms | 19.9782ms | 50.0546 Ops/s | 50.4436 Ops/s | |
test_to_module_speed[True] | 1.2859ms | 1.1592ms | 862.6663 Ops/s | 872.9088 Ops/s | |
test_to_module_speed[False] | 1.2239ms | 1.1246ms | 889.1840 Ops/s | 877.9578 Ops/s | |
test_tc_init | 59.7210μs | 39.7704μs | 25.1443 KOps/s | 27.0998 KOps/s | |
test_tc_init_nested | 0.1442ms | 81.1498μs | 12.3229 KOps/s | 12.9413 KOps/s | |
test_tc_first_layer_tensor | 3.4167μs | 0.8096μs | 1.2352 MOps/s | 1.2645 MOps/s | |
test_tc_first_layer_nontensor | 18.2210μs | 2.5377μs | 394.0649 KOps/s | 391.9895 KOps/s | |
test_tc_second_layer_tensor | 6.8467μs | 1.5967μs | 626.2874 KOps/s | 620.2630 KOps/s | |
test_tc_second_layer_nontensor | 15.5910μs | 3.3882μs | 295.1440 KOps/s | 298.8527 KOps/s | |
test_unbind | 0.1814s | 12.0960ms | 82.6717 Ops/s | 83.5464 Ops/s | |
test_full_like | 0.6566ms | 0.5768ms | 1.7336 KOps/s | 1.7359 KOps/s | |
test_zeros_like | 0.2727ms | 0.1978ms | 5.0559 KOps/s | 5.0568 KOps/s | |
test_ones_like | 0.2312ms | 0.1976ms | 5.0601 KOps/s | 5.0627 KOps/s | |
test_clone | 0.4455ms | 0.4154ms | 2.4076 KOps/s | 2.4160 KOps/s | |
test_squeeze | 27.3500μs | 10.5843μs | 94.4799 KOps/s | 90.0719 KOps/s | |
test_unsqueeze | 0.2478ms | 80.5709μs | 12.4114 KOps/s | 12.4533 KOps/s | |
test_split | 0.4673ms | 0.1770ms | 5.6485 KOps/s | 5.7955 KOps/s | |
test_permute | 0.2439ms | 0.1904ms | 5.2522 KOps/s | 5.3502 KOps/s | |
test_stack | 1.2557ms | 0.8936ms | 1.1191 KOps/s | 1.1069 KOps/s | |
test_cat | 1.2622ms | 1.2312ms | 812.1852 Ops/s | 812.2604 Ops/s |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
enhancement
New feature or request
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Closes #937