[CI] Upgrade 3.8 workflows #967
Merged
vmoens
commented
Aug 14, 2024
Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_plain_set_nested | 45.0240μs | 20.8286μs | 48.0109 KOps/s | 47.5752 KOps/s | |
test_plain_set_stack_nested | 0.2192ms | 21.1602μs | 47.2585 KOps/s | 47.5871 KOps/s | |
test_plain_set_nested_inplace | 0.2240ms | 23.9364μs | 41.7774 KOps/s | 44.4752 KOps/s | |
test_plain_set_stack_nested_inplace | 58.8500μs | 22.4350μs | 44.5733 KOps/s | 44.6596 KOps/s | |
test_items | 34.2840μs | 4.1140μs | 243.0752 KOps/s | 247.4519 KOps/s | |
test_items_nested | 0.5291ms | 0.3396ms | 2.9444 KOps/s | 3.0647 KOps/s | |
test_items_nested_locked | 0.6994ms | 0.3347ms | 2.9874 KOps/s | 3.0723 KOps/s | |
test_items_nested_leaf | 0.6381ms | 84.5754μs | 11.8238 KOps/s | 11.8112 KOps/s | |
test_items_stack_nested | 0.5263ms | 0.3381ms | 2.9573 KOps/s | 3.0475 KOps/s | |
test_items_stack_nested_leaf | 0.1560ms | 85.1096μs | 11.7496 KOps/s | 12.1864 KOps/s | |
test_items_stack_nested_locked | 0.8984ms | 0.3374ms | 2.9637 KOps/s | 3.0283 KOps/s | |
test_keys | 29.3750μs | 3.5219μs | 283.9364 KOps/s | 286.9335 KOps/s | |
test_keys_nested | 0.1844ms | 95.2101μs | 10.5031 KOps/s | 10.6934 KOps/s | |
test_keys_nested_locked | 1.8373ms | 99.9169μs | 10.0083 KOps/s | 10.1085 KOps/s | |
test_keys_nested_leaf | 0.1794ms | 79.0450μs | 12.6510 KOps/s | 12.6957 KOps/s | |
test_keys_stack_nested | 0.1831ms | 96.3952μs | 10.3740 KOps/s | 10.6162 KOps/s | |
test_keys_stack_nested_leaf | 0.1465ms | 82.0355μs | 12.1898 KOps/s | 12.8814 KOps/s | |
test_keys_stack_nested_locked | 0.1937ms | 99.6604μs | 10.0341 KOps/s | 10.2441 KOps/s | |
test_values | 6.5102μs | 1.0984μs | 910.4481 KOps/s | 914.3132 KOps/s | |
test_values_nested | 0.1070ms | 47.3352μs | 21.1259 KOps/s | 20.6304 KOps/s | |
test_values_nested_locked | 92.4930μs | 47.2916μs | 21.1454 KOps/s | 21.2119 KOps/s | |
test_values_nested_leaf | 99.6070μs | 42.2209μs | 23.6850 KOps/s | 23.1309 KOps/s | |
test_values_stack_nested | 0.1284ms | 47.9068μs | 20.8738 KOps/s | 20.9615 KOps/s | |
test_values_stack_nested_leaf | 75.6620μs | 42.5015μs | 23.5286 KOps/s | 24.8209 KOps/s | |
test_values_stack_nested_locked | 0.1014ms | 47.1571μs | 21.2057 KOps/s | 20.8773 KOps/s | |
test_membership | 7.0274μs | 0.6980μs | 1.4328 MOps/s | 1.4139 MOps/s | |
test_membership_nested | 41.9790μs | 2.5825μs | 387.2292 KOps/s | 381.9263 KOps/s | |
test_membership_nested_leaf | 25.3180μs | 2.5884μs | 386.3423 KOps/s | 378.7590 KOps/s | |
test_membership_stacked_nested | 16.9220μs | 2.5853μs | 386.8036 KOps/s | 392.0336 KOps/s | |
test_membership_stacked_nested_leaf | 38.8530μs | 2.6126μs | 382.7604 KOps/s | 389.4675 KOps/s | |
test_membership_nested_last | 26.7390μs | 3.7837μs | 264.2935 KOps/s | 262.5016 KOps/s | |
test_membership_nested_leaf_last | 26.2390μs | 3.8046μs | 262.8394 KOps/s | 267.3219 KOps/s | |
test_membership_stacked_nested_last | 22.5820μs | 3.7830μs | 264.3420 KOps/s | 79.2814 KOps/s | |
test_membership_stacked_nested_leaf_last | 20.8900μs | 3.8377μs | 260.5744 KOps/s | 79.3066 KOps/s | |
test_nested_getleaf | 55.4040μs | 10.6115μs | 94.2371 KOps/s | 96.9997 KOps/s | |
test_nested_get | 50.2140μs | 10.1057μs | 98.9541 KOps/s | 100.1833 KOps/s | |
test_stacked_getleaf | 37.7510μs | 10.5489μs | 94.7963 KOps/s | 95.0697 KOps/s | |
test_stacked_get | 35.7970μs | 9.9475μs | 100.5280 KOps/s | 101.1910 KOps/s | |
test_nested_getitemleaf | 55.1330μs | 11.1078μs | 90.0268 KOps/s | 85.1776 KOps/s | |
test_nested_getitem | 36.2680μs | 10.5045μs | 95.1975 KOps/s | 99.6539 KOps/s | |
test_stacked_getitemleaf | 50.3640μs | 10.9022μs | 91.7248 KOps/s | 92.2595 KOps/s | |
test_stacked_getitem | 55.2530μs | 10.1780μs | 98.2513 KOps/s | 100.0537 KOps/s | |
test_lock_nested | 83.4372ms | 0.5732ms | 1.7444 KOps/s | 2.1454 KOps/s | |
test_lock_stack_nested | 0.8165ms | 0.4471ms | 2.2365 KOps/s | 2.3764 KOps/s | |
test_unlock_nested | 84.6834ms | 0.4840ms | 2.0663 KOps/s | 2.5358 KOps/s | |
test_unlock_stack_nested | 0.5939ms | 0.3645ms | 2.7434 KOps/s | 2.9252 KOps/s | |
test_flatten_speed | 0.1980ms | 0.1059ms | 9.4414 KOps/s | 9.6200 KOps/s | |
test_unflatten_speed | 0.9657ms | 0.4551ms | 2.1972 KOps/s | 2.1964 KOps/s | |
test_common_ops | 3.8951ms | 1.1358ms | 880.4697 Ops/s | 905.7957 Ops/s | |
test_creation | 58.4110μs | 1.9938μs | 501.5672 KOps/s | 506.8917 KOps/s | |
test_creation_empty | 53.0400μs | 18.4342μs | 54.2469 KOps/s | 52.0952 KOps/s | |
test_creation_nested_1 | 54.4120μs | 21.8160μs | 45.8380 KOps/s | 45.1053 KOps/s | |
test_creation_nested_2 | 68.6490μs | 25.6778μs | 38.9442 KOps/s | 38.2512 KOps/s | |
test_clone | 64.3010μs | 16.8857μs | 59.2218 KOps/s | 61.7598 KOps/s | |
test_getitem[int] | 0.9066ms | 16.1607μs | 61.8783 KOps/s | 63.3963 KOps/s | |
test_getitem[slice_int] | 0.1405ms | 29.9678μs | 33.3692 KOps/s | 34.4393 KOps/s | |
test_getitem[range] | 0.1722ms | 57.3686μs | 17.4311 KOps/s | 18.0528 KOps/s | |
test_getitem[tuple] | 0.1284ms | 24.0288μs | 41.6168 KOps/s | 41.8146 KOps/s | |
test_getitem[list] | 0.1713ms | 52.6654μs | 18.9878 KOps/s | 19.6746 KOps/s | |
test_setitem_dim[int] | 79.4390μs | 40.7921μs | 24.5146 KOps/s | 24.2646 KOps/s | |
test_setitem_dim[slice_int] | 0.1192ms | 69.0664μs | 14.4788 KOps/s | 14.0726 KOps/s | |
test_setitem_dim[range] | 0.1894ms | 93.6443μs | 10.6787 KOps/s | 10.7347 KOps/s | |
test_setitem_dim[tuple] | 93.9660μs | 57.3813μs | 17.4273 KOps/s | 16.9642 KOps/s | |
test_setitem | 93.6560μs | 29.5602μs | 33.8293 KOps/s | 34.3955 KOps/s | |
test_set | 98.3950μs | 28.7940μs | 34.7294 KOps/s | 34.6598 KOps/s | |
test_set_shared | 1.1760ms | 0.2139ms | 4.6753 KOps/s | 4.7739 KOps/s | |
test_update | 0.1508ms | 35.5650μs | 28.1176 KOps/s | 27.4073 KOps/s | |
test_update_nested | 0.1305ms | 44.7472μs | 22.3478 KOps/s | 21.2921 KOps/s | |
test_update__nested | 87.6040μs | 33.8960μs | 29.5020 KOps/s | 30.0153 KOps/s | |
test_set_nested | 82.7750μs | 30.8485μs | 32.4165 KOps/s | 32.3066 KOps/s | |
test_set_nested_new | 80.9820μs | 35.6462μs | 28.0535 KOps/s | 27.3042 KOps/s | |
test_select | 0.1182ms | 52.8186μs | 18.9327 KOps/s | 18.8631 KOps/s | |
test_select_nested | 0.1038ms | 59.2129μs | 16.8882 KOps/s | 17.1489 KOps/s | |
test_exclude_nested | 0.1525ms | 74.0032μs | 13.5129 KOps/s | 13.5541 KOps/s | |
test_empty[True] | 0.4855ms | 0.3097ms | 3.2290 KOps/s | 3.1690 KOps/s | |
test_empty[False] | 6.5482μs | 1.1700μs | 854.6847 KOps/s | 854.1090 KOps/s | |
test_unbind_speed | 0.5038ms | 0.2931ms | 3.4119 KOps/s | 3.4898 KOps/s | |
test_unbind_speed_stack0 | 0.4303ms | 0.2942ms | 3.3993 KOps/s | 3.6181 KOps/s | |
test_unbind_speed_stack1 | 87.2939ms | 0.7824ms | 1.2781 KOps/s | 1.4640 KOps/s | |
test_split | 81.1937ms | 2.1440ms | 466.4250 Ops/s | 480.8048 Ops/s | |
test_chunk | 2.2334ms | 2.0082ms | 497.9475 Ops/s | 478.9987 Ops/s | |
test_creation[device0] | 0.2110ms | 0.1155ms | 8.6610 KOps/s | 8.6535 KOps/s | |
test_creation_from_tensor | 3.1036ms | 0.1172ms | 8.5323 KOps/s | 8.6536 KOps/s | |
test_add_one[memmap_tensor0] | 0.2403ms | 7.4117μs | 134.9214 KOps/s | 140.5479 KOps/s | |
test_contiguous[memmap_tensor0] | 22.2120μs | 1.9018μs | 525.8099 KOps/s | 525.9348 KOps/s | |
test_stack[memmap_tensor0] | 33.2420μs | 5.5274μs | 180.9177 KOps/s | 184.2769 KOps/s | |
test_memmaptd_index | 1.0336ms | 0.3926ms | 2.5471 KOps/s | 2.5692 KOps/s | |
test_memmaptd_index_astensor | 1.1173ms | 0.4731ms | 2.1135 KOps/s | 2.1400 KOps/s | |
test_memmaptd_index_op | 1.6158ms | 1.0068ms | 993.2760 Ops/s | 983.3592 Ops/s | |
test_serialize_model | 0.1191s | 0.1158s | 8.6325 Ops/s | 8.4923 Ops/s | |
test_serialize_model_pickle | 0.4368s | 0.3970s | 2.5190 Ops/s | 2.4966 Ops/s | |
test_serialize_weights | 0.1251s | 0.1156s | 8.6530 Ops/s | 7.7801 Ops/s | |
test_serialize_weights_returnearly | 0.1698s | 0.1602s | 6.2433 Ops/s | 6.3717 Ops/s | |
test_serialize_weights_pickle | 0.4610s | 0.4117s | 2.4290 Ops/s | 2.5215 Ops/s | |
test_serialize_weights_filesystem | 0.1455s | 0.1433s | 6.9795 Ops/s | 7.1427 Ops/s | |
test_serialize_model_filesystem | 0.1526s | 0.1458s | 6.8601 Ops/s | 6.6938 Ops/s | |
test_reshape_pytree | 80.5410μs | 38.5313μs | 25.9529 KOps/s | 25.2671 KOps/s | |
test_reshape_td | 0.1257ms | 45.2376μs | 22.1055 KOps/s | 21.9392 KOps/s | |
test_view_pytree | 79.6190μs | 38.4985μs | 25.9750 KOps/s | 25.4783 KOps/s | |
test_view_td | 0.1045ms | 49.9949μs | 20.0020 KOps/s | 19.3429 KOps/s | |
test_unbind_pytree | 93.0450μs | 35.1287μs | 28.4668 KOps/s | 28.3895 KOps/s | |
test_unbind_td | 0.2906ms | 43.7322μs | 22.8665 KOps/s | 23.6589 KOps/s | |
test_split_pytree | 90.4700μs | 37.3888μs | 26.7460 KOps/s | 26.6617 KOps/s | |
test_split_td | 0.2181ms | 56.5194μs | 17.6930 KOps/s | 18.2140 KOps/s | |
test_add_pytree | 0.1133ms | 43.7612μs | 22.8513 KOps/s | 23.2468 KOps/s | |
test_add_td | 0.1644ms | 80.7427μs | 12.3850 KOps/s | 12.5283 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1374ms | 56.0073μs | 17.8548 KOps/s | 17.7204 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.9061ms | 0.1867ms | 5.3561 KOps/s | 5.4749 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1553ms | 55.2004μs | 18.1158 KOps/s | 17.9423 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2395ms | 0.1398ms | 7.1512 KOps/s | 7.3746 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 62.3670μs | 21.1752μs | 47.2251 KOps/s | 48.8970 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1077ms | 65.7221μs | 15.2156 KOps/s | 15.7746 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1718ms | 75.7423μs | 13.2027 KOps/s | 13.3876 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1081ms | 67.9940μs | 14.7072 KOps/s | 14.7979 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.4144ms | 0.1722ms | 5.8072 KOps/s | 5.7889 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.4126ms | 0.1910ms | 5.2343 KOps/s | 5.2202 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 93.9060μs | 40.6978μs | 24.5713 KOps/s | 23.9347 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.4248ms | 67.5979μs | 14.7934 KOps/s | 14.2535 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2951ms | 0.1752ms | 5.7068 KOps/s | 5.7491 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.6350ms | 0.2949ms | 3.3908 KOps/s | 3.5201 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.4691ms | 0.2017ms | 4.9588 KOps/s | 5.0064 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2773ms | 0.1725ms | 5.7977 KOps/s | 5.7579 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1358ms | 61.8806μs | 16.1601 KOps/s | 16.2491 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1078ms | 41.4191μs | 24.1434 KOps/s | 24.0598 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.5169ms | 0.2411ms | 4.1473 KOps/s | 4.2950 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.4032ms | 0.1729ms | 5.7840 KOps/s | 5.7188 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 0.2372ms | 0.1015ms | 9.8533 KOps/s | 9.6032 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1439ms | 65.2584μs | 15.3237 KOps/s | 17.5895 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1705ms | 76.0642μs | 13.1468 KOps/s | 12.8118 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1408ms | 68.9153μs | 14.5106 KOps/s | 14.0611 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.3764ms | 0.1947ms | 5.1360 KOps/s | 5.0596 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.3360ms | 1.6247ms | 615.4887 Ops/s | 618.0939 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.3186ms | 0.1916ms | 5.2193 KOps/s | 5.2369 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 1.7524ms | 1.1236ms | 889.9824 Ops/s | 931.3883 Ops/s | |
test_compile_assign_and_add_stack[compile] | 0.7262ms | 0.4159ms | 2.4044 KOps/s | 2.4072 KOps/s | |
test_compile_assign_and_add_stack[eager] | 4.0359ms | 3.7293ms | 268.1476 Ops/s | 266.7290 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 98.9450μs | 34.2959μs | 29.1580 KOps/s | 28.7952 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 1.0588ms | 47.1907μs | 21.1906 KOps/s | 21.3987 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 87.5140μs | 28.5896μs | 34.9778 KOps/s | 33.7380 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 96.3500μs | 28.8904μs | 34.6136 KOps/s | 34.8714 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1087ms | 28.6833μs | 34.8635 KOps/s | 33.5466 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 83.1650μs | 28.9435μs | 34.5501 KOps/s | 34.9980 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1782ms | 73.7378μs | 13.5616 KOps/s | 13.6065 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.3624ms | 27.1790μs | 36.7931 KOps/s | 37.3740 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1607ms | 68.0682μs | 14.6912 KOps/s | 14.7371 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 70.9730μs | 22.7820μs | 43.8943 KOps/s | 43.7318 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1637ms | 67.8313μs | 14.7425 KOps/s | 14.7721 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 73.6070μs | 22.8934μs | 43.6807 KOps/s | 43.5041 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1590ms | 73.7442μs | 13.5604 KOps/s | 13.7664 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.8583ms | 26.9823μs | 37.0613 KOps/s | 37.8694 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1351ms | 67.5911μs | 14.7948 KOps/s | 14.8135 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 64.9820μs | 22.8951μs | 43.6774 KOps/s | 44.0567 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1494ms | 67.8625μs | 14.7357 KOps/s | 14.9905 KOps/s | |
test_compile_indexing[int-pytree-eager] | 0.4151ms | 22.8507μs | 43.7623 KOps/s | 44.5870 KOps/s | |
test_mod_add[eager] | 83.8780μs | 23.0498μs | 43.3843 KOps/s | 40.2659 KOps/s | |
test_mod_add[compile] | 99.7060μs | 39.5707μs | 25.2712 KOps/s | 26.1402 KOps/s | |
test_mod_add[compile-overhead] | 92.8340μs | 39.4371μs | 25.3568 KOps/s | 25.5147 KOps/s | |
test_mod_wrap[eager] | 0.3650ms | 0.2059ms | 4.8559 KOps/s | 4.8779 KOps/s | |
test_mod_wrap[compile] | 0.3596ms | 0.2283ms | 4.3795 KOps/s | 4.4137 KOps/s | |
test_mod_wrap[compile-overhead] | 0.4580ms | 0.2281ms | 4.3840 KOps/s | 4.4051 KOps/s | |
test_mod_wrap_and_backward[eager] | 11.6613ms | 10.6209ms | 94.1543 Ops/s | 93.6631 Ops/s | |
test_mod_wrap_and_backward[compile] | 15.5988ms | 11.2414ms | 88.9573 Ops/s | 89.1148 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 14.8293ms | 10.7656ms | 92.8882 Ops/s | 89.0869 Ops/s | |
test_seq_add[eager] | 0.1698ms | 86.9752μs | 11.4975 KOps/s | 11.5238 KOps/s | |
test_seq_add[compile] | 0.1539ms | 62.4735μs | 16.0068 KOps/s | 15.6189 KOps/s | |
test_seq_add[compile-overhead] | 0.1317ms | 62.5140μs | 15.9964 KOps/s | 16.0669 KOps/s | |
test_seq_wrap[eager] | 0.5754ms | 0.3751ms | 2.6660 KOps/s | 2.6555 KOps/s | |
test_seq_wrap[compile] | 2.7406ms | 0.2656ms | 3.7645 KOps/s | 3.7522 KOps/s | |
test_seq_wrap[compile-overhead] | 0.4478ms | 0.2666ms | 3.7509 KOps/s | 3.7905 KOps/s | |
test_func_call_runtime[False-eager] | 0.6281ms | 0.5235ms | 1.9101 KOps/s | 1.9274 KOps/s | |
test_func_call_runtime[False-compile] | 1.0518ms | 0.5013ms | 1.9948 KOps/s | 2.0214 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.8612ms | 0.4995ms | 2.0022 KOps/s | 2.0156 KOps/s | |
test_func_call_runtime[True-eager] | 0.8832ms | 0.7328ms | 1.3646 KOps/s | 1.3546 KOps/s | |
test_func_call_runtime[True-compile] | 0.6113ms | 0.5087ms | 1.9657 KOps/s | 1.9718 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.6233ms | 0.5135ms | 1.9476 KOps/s | 1.9870 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.6597ms | 0.5191ms | 1.9265 KOps/s | 1.9183 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.6022ms | 0.4999ms | 2.0006 KOps/s | 2.0205 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.5811ms | 0.4984ms | 2.0063 KOps/s | 2.0127 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.7708ms | 0.8738ms | 1.1444 KOps/s | 1.1541 KOps/s | |
test_func_call_cm_runtime[True-compile] | 1.0138ms | 0.8331ms | 1.2003 KOps/s | 1.2302 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.9921ms | 0.8323ms | 1.2015 KOps/s | 1.2248 KOps/s | |
test_distributed | 0.2974ms | 0.1228ms | 8.1462 KOps/s | 8.0015 KOps/s | |
test_tdmodule | 34.6040μs | 17.3135μs | 57.7585 KOps/s | 54.8413 KOps/s | |
test_tdmodule_dispatch | 57.3770μs | 35.4057μs | 28.2440 KOps/s | 26.9752 KOps/s | |
test_tdseq | 45.6850μs | 20.0785μs | 49.8044 KOps/s | 49.5287 KOps/s | |
test_tdseq_dispatch | 68.0070μs | 39.9799μs | 25.0125 KOps/s | 23.8253 KOps/s | |
test_instantiation_functorch | 1.7236ms | 1.5528ms | 643.9941 Ops/s | 638.0167 Ops/s | |
test_instantiation_td | 1.7680ms | 1.1334ms | 882.3300 Ops/s | 867.6383 Ops/s | |
test_exec_functorch | 0.4337ms | 0.1880ms | 5.3182 KOps/s | 5.4059 KOps/s | |
test_exec_functional_call | 0.4235ms | 0.1747ms | 5.7246 KOps/s | 5.6565 KOps/s | |
test_exec_td | 0.3736ms | 0.1663ms | 6.0135 KOps/s | 5.8589 KOps/s | |
test_exec_td_decorator | 0.6445ms | 0.2257ms | 4.4306 KOps/s | 4.4550 KOps/s | |
test_vmap_mlp_speed[True-True] | 0.9123ms | 0.6455ms | 1.5492 KOps/s | 1.5684 KOps/s | |
test_vmap_mlp_speed[True-False] | 1.1210ms | 0.6441ms | 1.5526 KOps/s | 1.5781 KOps/s | |
test_vmap_mlp_speed[False-True] | 0.7880ms | 0.4950ms | 2.0204 KOps/s | 2.0549 KOps/s | |
test_vmap_mlp_speed[False-False] | 0.6370ms | 0.4944ms | 2.0226 KOps/s | 2.0400 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 1.4872ms | 0.6228ms | 1.6057 KOps/s | 1.6010 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.8721ms | 0.6239ms | 1.6029 KOps/s | 1.6174 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.6620ms | 0.5100ms | 1.9609 KOps/s | 1.9638 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.8716ms | 0.5131ms | 1.9489 KOps/s | 1.9694 KOps/s | |
test_to_module_speed[True] | 1.5832ms | 1.2996ms | 769.4405 Ops/s | 765.8495 Ops/s | |
test_to_module_speed[False] | 1.4478ms | 1.2693ms | 787.8581 Ops/s | 763.8009 Ops/s | |
test_tc_init | 83.7170μs | 44.0159μs | 22.7190 KOps/s | 21.6217 KOps/s | |
test_tc_init_nested | 0.1875ms | 88.8350μs | 11.2568 KOps/s | 10.8313 KOps/s | |
test_tc_first_layer_tensor | 44.5640μs | 1.5475μs | 646.2226 KOps/s | 644.9108 KOps/s | |
test_tc_first_layer_nontensor | 28.5840μs | 4.8416μs | 206.5414 KOps/s | 212.0964 KOps/s | |
test_tc_second_layer_tensor | 29.1140μs | 2.8945μs | 345.4850 KOps/s | 348.5042 KOps/s | |
test_tc_second_layer_nontensor | 46.0270μs | 6.2338μs | 160.4155 KOps/s | 165.2339 KOps/s | |
test_unbind | 0.4447s | 13.6934ms | 73.0278 Ops/s | 76.9758 Ops/s | |
test_full_like | 7.7645ms | 6.8530ms | 145.9225 Ops/s | 89.7463 Ops/s | |
test_zeros_like | 12.7001ms | 6.5365ms | 152.9871 Ops/s | 144.1904 Ops/s | |
test_ones_like | 16.5187ms | 7.4677ms | 133.9102 Ops/s | 136.1463 Ops/s | |
test_clone | 15.3266ms | 9.0290ms | 110.7540 Ops/s | 110.7059 Ops/s | |
test_squeeze | 0.1044ms | 12.3128μs | 81.2166 KOps/s | 81.2278 KOps/s | |
test_unsqueeze | 0.1725ms | 92.3541μs | 10.8279 KOps/s | 11.5813 KOps/s | |
test_split | 0.3913ms | 0.1969ms | 5.0780 KOps/s | 5.2747 KOps/s | |
test_permute | 0.3387ms | 0.2151ms | 4.6497 KOps/s | 4.7178 KOps/s | |
test_stack | 34.1006ms | 24.3381ms | 41.0878 Ops/s | 40.1934 Ops/s | |
test_cat | 30.0687ms | 24.0422ms | 41.5936 Ops/s | 40.2355 Ops/s |
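
The Change column in these tables is empty. Below is a minimal sketch of how the relative difference between the Ops and Ops on Repo HEAD columns could be computed from a single row. It assumes the pipe-separated layout and the unit suffixes shown here (`Ops/s`, `KOps/s`, `MOps/s`). The helper names `parse_ops` and `relative_change` are illustrative and are not part of the benchmark tooling.

```python
from typing import Tuple

# Multipliers for the throughput units that appear in the tables.
UNIT_SCALE = {"Ops/s": 1.0, "KOps/s": 1e3, "MOps/s": 1e6}


def parse_ops(cell: str) -> float:
    """Convert a cell such as '47.5752 KOps/s' to operations per second."""
    value, unit = cell.split()
    return float(value) * UNIT_SCALE[unit]


def relative_change(row: str) -> Tuple[str, float]:
    """Return (test name, fractional change of the PR ops versus HEAD ops).

    Assumes the column order Name | Max | Mean | Ops | Ops on Repo HEAD | Change.
    """
    cells = [c.strip() for c in row.split("|")]
    name = cells[0]
    ops_pr = parse_ops(cells[3])
    ops_head = parse_ops(cells[4])
    return name, (ops_pr - ops_head) / ops_head


if __name__ == "__main__":
    row = ("test_membership_stacked_nested_last | 22.5820μs | 3.7830μs | "
           "264.3420 KOps/s | 79.2814 KOps/s | |")
    name, change = relative_change(row)
    print(f"{name}: {change:+.1%}")
```

Applied to the `test_membership_stacked_nested_last` row above, this gives a positive change of a little over 2.3, meaning the PR run reports more than three times the HEAD throughput for that test. Normalising the units first matters for rows such as `test_memmaptd_index_op`, where one column is in KOps/s and the other in Ops/s.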

Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_plain_set_nested | 0.5905ms | 13.0822μs | 76.4395 KOps/s | 66.3868 KOps/s | |
test_plain_set_stack_nested | 39.0410μs | 13.1315μs | 76.1525 KOps/s | 66.4459 KOps/s | |
test_plain_set_nested_inplace | 53.3310μs | 13.9883μs | 71.4881 KOps/s | 62.4137 KOps/s | |
test_plain_set_stack_nested_inplace | 0.1937ms | 14.0721μs | 71.0627 KOps/s | 62.6144 KOps/s | |
test_items | 0.2021ms | 2.9099μs | 343.6598 KOps/s | 348.0250 KOps/s | |
test_items_nested | 0.3771ms | 0.3125ms | 3.2000 KOps/s | 3.1692 KOps/s | |
test_items_nested_locked | 0.4812ms | 0.3186ms | 3.1389 KOps/s | 3.1384 KOps/s | |
test_items_nested_leaf | 0.2580ms | 63.4420μs | 15.7624 KOps/s | 15.7566 KOps/s | |
test_items_stack_nested | 0.5007ms | 0.3187ms | 3.1382 KOps/s | 3.1399 KOps/s | |
test_items_stack_nested_leaf | 0.1013ms | 65.1108μs | 15.3584 KOps/s | 15.6821 KOps/s | |
test_items_stack_nested_locked | 0.4038ms | 0.3175ms | 3.1494 KOps/s | 3.1374 KOps/s | |
test_keys | 43.3110μs | 3.4172μs | 292.6350 KOps/s | 291.3669 KOps/s | |
test_keys_nested | 92.3220μs | 55.6115μs | 17.9819 KOps/s | 17.8342 KOps/s | |
test_keys_nested_locked | 2.1667ms | 59.9636μs | 16.6768 KOps/s | 16.4961 KOps/s | |
test_keys_nested_leaf | 99.1730μs | 45.1236μs | 22.1613 KOps/s | 21.1169 KOps/s | |
test_keys_stack_nested | 81.9420μs | 55.1571μs | 18.1300 KOps/s | 18.0799 KOps/s | |
test_keys_stack_nested_leaf | 80.7720μs | 47.1693μs | 21.2002 KOps/s | 20.8170 KOps/s | |
test_keys_stack_nested_locked | 87.3520μs | 60.7525μs | 16.4602 KOps/s | 16.7534 KOps/s | |
test_values | 6.4385μs | 0.8067μs | 1.2396 MOps/s | 1.2446 MOps/s | |
test_values_nested | 50.0510μs | 27.3910μs | 36.5084 KOps/s | 36.5746 KOps/s | |
test_values_nested_locked | 70.9420μs | 29.1434μs | 34.3131 KOps/s | 34.0696 KOps/s | |
test_values_nested_leaf | 58.5920μs | 24.0046μs | 41.6587 KOps/s | 41.3760 KOps/s | |
test_values_stack_nested | 61.1220μs | 28.4365μs | 35.1661 KOps/s | 35.2287 KOps/s | |
test_values_stack_nested_leaf | 59.8010μs | 24.8755μs | 40.2002 KOps/s | 39.8430 KOps/s | |
test_values_stack_nested_locked | 60.6120μs | 30.0943μs | 33.2289 KOps/s | 32.9391 KOps/s | |
test_membership | 1.9250μs | 0.5210μs | 1.9196 MOps/s | 1.9175 MOps/s | |
test_membership_nested | 36.8305μs | 1.7504μs | 571.3079 KOps/s | 565.4850 KOps/s | |
test_membership_nested_leaf | 16.8937μs | 1.7497μs | 571.5120 KOps/s | 572.8176 KOps/s | |
test_membership_stacked_nested | 31.0100μs | 1.8127μs | 551.6701 KOps/s | 547.7579 KOps/s | |
test_membership_stacked_nested_leaf | 31.4710μs | 1.7801μs | 561.7804 KOps/s | 553.0812 KOps/s | |
test_membership_nested_last | 52.4420μs | 2.5996μs | 384.6720 KOps/s | 381.6916 KOps/s | |
test_membership_nested_leaf_last | 33.0300μs | 2.6428μs | 378.3882 KOps/s | 378.6362 KOps/s | |
test_membership_stacked_nested_last | 39.4510μs | 3.2893μs | 304.0193 KOps/s | 131.3519 KOps/s | |
test_membership_stacked_nested_leaf_last | 29.1010μs | 3.2928μs | 303.6967 KOps/s | 130.3803 KOps/s | |
test_nested_getleaf | 41.0810μs | 6.1211μs | 163.3697 KOps/s | 163.9610 KOps/s | |
test_nested_get | 30.3510μs | 5.7843μs | 172.8814 KOps/s | 173.7044 KOps/s | |
test_stacked_getleaf | 43.9510μs | 6.0802μs | 164.4676 KOps/s | 165.2808 KOps/s | |
test_stacked_get | 24.5700μs | 5.6894μs | 175.7641 KOps/s | 177.3956 KOps/s | |
test_nested_getitemleaf | 38.7710μs | 6.1043μs | 163.8178 KOps/s | 165.9108 KOps/s | |
test_nested_getitem | 26.7810μs | 5.7414μs | 174.1734 KOps/s | 174.9613 KOps/s | |
test_stacked_getitemleaf | 42.6610μs | 6.1476μs | 162.6655 KOps/s | 164.9819 KOps/s | |
test_stacked_getitem | 40.2310μs | 5.6624μs | 176.6047 KOps/s | 176.6609 KOps/s | |
test_lock_nested | 3.1586ms | 0.4114ms | 2.4310 KOps/s | 2.4111 KOps/s | |
test_lock_stack_nested | 0.4881ms | 0.3766ms | 2.6550 KOps/s | 2.7474 KOps/s | |
test_unlock_nested | 0.7649ms | 0.3494ms | 2.8622 KOps/s | 2.8548 KOps/s | |
test_unlock_stack_nested | 0.4197ms | 0.3160ms | 3.1643 KOps/s | 3.3014 KOps/s | |
test_flatten_speed | 0.1788ms | 80.0474μs | 12.4926 KOps/s | 12.6529 KOps/s | |
test_unflatten_speed | 0.3583ms | 0.2827ms | 3.5371 KOps/s | 3.5768 KOps/s | |
test_common_ops | 1.6234ms | 1.2191ms | 820.2813 Ops/s | 767.5275 Ops/s | |
test_creation | 16.2610μs | 1.4570μs | 686.3244 KOps/s | 680.3163 KOps/s | |
test_creation_empty | 64.2410μs | 14.0120μs | 71.3674 KOps/s | 56.8107 KOps/s | |
test_creation_nested_1 | 87.7420μs | 16.0799μs | 62.1893 KOps/s | 52.0037 KOps/s | |
test_creation_nested_2 | 49.4210μs | 18.4838μs | 54.1015 KOps/s | 45.2766 KOps/s | |
test_clone | 0.1774ms | 29.0287μs | 34.4487 KOps/s | 34.2805 KOps/s | |
test_getitem[int] | 1.1198ms | 15.4785μs | 64.6057 KOps/s | 63.4782 KOps/s | |
test_getitem[slice_int] | 0.1300ms | 27.2544μs | 36.6913 KOps/s | 36.0007 KOps/s | |
test_getitem[range] | 0.1537ms | 0.1095ms | 9.1352 KOps/s | 9.1875 KOps/s | |
test_getitem[tuple] | 93.1100ms | 29.8582μs | 33.4916 KOps/s | 42.4557 KOps/s | |
test_getitem[list] | 0.2612ms | 98.8386μs | 10.1175 KOps/s | 9.8381 KOps/s | |
test_setitem_dim[int] | 80.1520μs | 48.7103μs | 20.5295 KOps/s | 19.0859 KOps/s | |
test_setitem_dim[slice_int] | 0.2217ms | 73.4933μs | 13.6067 KOps/s | 13.2420 KOps/s | |
test_setitem_dim[range] | 0.2902ms | 0.1334ms | 7.4954 KOps/s | 7.2231 KOps/s | |
test_setitem_dim[tuple] | 0.2136ms | 66.1258μs | 15.1227 KOps/s | 14.5165 KOps/s | |
test_setitem | 0.1939ms | 41.1621μs | 24.2942 KOps/s | 23.1842 KOps/s | |
test_set | 0.1875ms | 40.1320μs | 24.9178 KOps/s | 23.5690 KOps/s | |
test_set_shared | 0.3428ms | 51.2393μs | 19.5163 KOps/s | 19.0723 KOps/s | |
test_update | 0.1973ms | 47.6492μs | 20.9867 KOps/s | 19.2709 KOps/s | |
test_update_nested | 0.2066ms | 54.4515μs | 18.3650 KOps/s | 17.0188 KOps/s | |
test_update__nested | 0.2108ms | 58.9759μs | 16.9561 KOps/s | 17.2212 KOps/s | |
test_set_nested | 0.1900ms | 42.6384μs | 23.4530 KOps/s | 22.4635 KOps/s | |
test_set_nested_new | 0.1977ms | 45.9749μs | 21.7510 KOps/s | 20.6715 KOps/s | |
test_select | 0.2137ms | 59.5237μs | 16.8000 KOps/s | 16.2749 KOps/s | |
test_select_nested | 0.5119ms | 42.4314μs | 23.5674 KOps/s | 23.4476 KOps/s | |
test_exclude_nested | 0.1079ms | 59.8719μs | 16.7023 KOps/s | 16.6126 KOps/s | |
test_empty[True] | 0.2982ms | 0.2448ms | 4.0851 KOps/s | 4.0524 KOps/s | |
test_empty[False] | 3.7811μs | 0.7185μs | 1.3918 MOps/s | 1.3733 MOps/s | |
test_to | 0.1465ms | 25.4819μs | 39.2436 KOps/s | 39.0808 KOps/s | |
test_to_nonblocking | 0.1462ms | 23.7531μs | 42.0998 KOps/s | 41.3957 KOps/s | |
test_unbind_speed | 0.3338ms | 0.2759ms | 3.6246 KOps/s | 3.6758 KOps/s | |
test_unbind_speed_stack0 | 0.3516ms | 0.2740ms | 3.6498 KOps/s | 3.8100 KOps/s | |
test_unbind_speed_stack1 | 91.8051ms | 0.6856ms | 1.4587 KOps/s | 1.4770 KOps/s | |
test_split | 93.2259ms | 2.1504ms | 465.0384 Ops/s | 465.4738 Ops/s | |
test_chunk | 95.6504ms | 2.1764ms | 459.4769 Ops/s | 466.3240 Ops/s | |
test_creation[device0] | 0.4262ms | 0.1274ms | 7.8470 KOps/s | 7.8456 KOps/s | |
test_creation_from_tensor | 0.3751ms | 0.1295ms | 7.7198 KOps/s | 7.6386 KOps/s | |
test_add_one[memmap_tensor0] | 0.1921ms | 8.7148μs | 114.7476 KOps/s | 119.3826 KOps/s | |
test_contiguous[memmap_tensor0] | 35.3000μs | 2.1697μs | 460.8959 KOps/s | 451.7734 KOps/s | |
test_stack[memmap_tensor0] | 34.8810μs | 6.5507μs | 152.6566 KOps/s | 153.2425 KOps/s | |
test_memmaptd_index | 1.0611ms | 0.4154ms | 2.4072 KOps/s | 2.3192 KOps/s | |
test_memmaptd_index_astensor | 0.7431ms | 0.4742ms | 2.1086 KOps/s | 2.0573 KOps/s | |
test_memmaptd_index_op | 1.3759ms | 0.9913ms | 1.0087 KOps/s | 942.0577 Ops/s | |
test_serialize_model | 0.1310s | 0.1294s | 7.7260 Ops/s | 7.7343 Ops/s | |
test_serialize_model_pickle | 1.4450s | 1.2341s | 0.8103 Ops/s | 0.8250 Ops/s | |
test_serialize_weights | 0.1309s | 0.1293s | 7.7344 Ops/s | 7.0203 Ops/s | |
test_serialize_weights_returnearly | 0.2323s | 56.2324ms | 17.7833 Ops/s | 17.3754 Ops/s | |
test_serialize_weights_pickle | 1.3729s | 1.2159s | 0.8224 Ops/s | 0.8224 Ops/s | |
test_reshape_pytree | 0.2185ms | 34.9123μs | 28.6432 KOps/s | 27.2187 KOps/s | |
test_reshape_td | 0.2429ms | 43.3115μs | 23.0886 KOps/s | 23.7909 KOps/s | |
test_view_pytree | 0.2382ms | 37.3207μs | 26.7948 KOps/s | 27.2828 KOps/s | |
test_view_td | 0.2236ms | 48.7312μs | 20.5207 KOps/s | 20.8188 KOps/s | |
test_unbind_pytree | 0.1563ms | 33.7293μs | 29.6478 KOps/s | 29.2518 KOps/s | |
test_unbind_td | 0.4457ms | 42.1510μs | 23.7242 KOps/s | 24.0710 KOps/s | |
test_split_pytree | 0.1743ms | 46.6069μs | 21.4560 KOps/s | 21.6325 KOps/s | |
test_split_td | 3.4695ms | 55.6964μs | 17.9545 KOps/s | 17.9393 KOps/s | |
test_add_pytree | 0.2516ms | 58.0311μs | 17.2321 KOps/s | 17.4036 KOps/s | |
test_add_td | 0.2528ms | 89.3290μs | 11.1946 KOps/s | 10.6651 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.4086ms | 0.2034ms | 4.9171 KOps/s | 4.8637 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.3080ms | 0.1565ms | 6.3879 KOps/s | 6.3625 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.2717ms | 0.1470ms | 6.8027 KOps/s | 6.8874 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.3530ms | 0.1821ms | 5.4914 KOps/s | 5.3923 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 0.1649ms | 17.9210μs | 55.8006 KOps/s | 53.1694 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1076ms | 42.5883μs | 23.4806 KOps/s | 23.2097 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1126ms | 64.0143μs | 15.6215 KOps/s | 15.6622 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1109ms | 49.4606μs | 20.2181 KOps/s | 20.2807 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.4366ms | 0.3178ms | 3.1468 KOps/s | 3.1422 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3475ms | 0.2087ms | 4.7921 KOps/s | 4.7391 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.2742ms | 0.1274ms | 7.8517 KOps/s | 7.8094 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.2103ms | 59.7139μs | 16.7465 KOps/s | 16.6070 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.5585ms | 0.3328ms | 3.0046 KOps/s | 3.1273 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.9355ms | 0.6685ms | 1.4958 KOps/s | 1.5738 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.4143ms | 0.2552ms | 3.9181 KOps/s | 3.9713 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.5120ms | 0.3252ms | 3.0753 KOps/s | 3.1200 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.2802ms | 73.8587μs | 13.5394 KOps/s | 14.0888 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.3028ms | 0.1361ms | 7.3491 KOps/s | 7.7066 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.7988ms | 0.5675ms | 1.7621 KOps/s | 1.8600 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.5005ms | 0.3240ms | 3.0866 KOps/s | 3.1232 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 0.2008ms | 19.6256μs | 50.9537 KOps/s | 53.9161 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.2254ms | 27.1798μs | 36.7921 KOps/s | 35.3561 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1672ms | 69.3508μs | 14.4194 KOps/s | 14.2245 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1571ms | 51.4033μs | 19.4540 KOps/s | 19.4517 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 2.3225ms | 0.8130ms | 1.2301 KOps/s | 1.1242 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 3.8109ms | 3.2284ms | 309.7479 Ops/s | 309.6613 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 2.2555ms | 0.7953ms | 1.2574 KOps/s | 1.1351 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 3.4601ms | 3.1534ms | 317.1147 Ops/s | 309.4029 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.2571ms | 0.1088ms | 9.1942 KOps/s | 9.0882 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.2650ms | 59.4848μs | 16.8110 KOps/s | 16.3264 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.2513ms | 0.1034ms | 9.6694 KOps/s | 9.5497 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.1986ms | 42.2631μs | 23.6613 KOps/s | 22.9177 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.2521ms | 0.1029ms | 9.7166 KOps/s | 9.6230 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.1885ms | 42.1106μs | 23.7470 KOps/s | 22.8771 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.3206ms | 0.1365ms | 7.3257 KOps/s | 7.3178 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1758ms | 24.9589μs | 40.0659 KOps/s | 39.6769 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.2787ms | 0.1314ms | 7.6126 KOps/s | 7.6178 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 0.1385ms | 20.3700μs | 49.0918 KOps/s | 47.7158 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.3506ms | 0.1307ms | 7.6496 KOps/s | 7.6125 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 0.2053ms | 20.8152μs | 48.0419 KOps/s | 48.1706 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.3057ms | 0.1373ms | 7.2848 KOps/s | 7.1513 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.5326ms | 24.8184μs | 40.2927 KOps/s | 39.4701 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.2922ms | 0.1310ms | 7.6342 KOps/s | 7.6263 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 0.2092ms | 20.6628μs | 48.3963 KOps/s | 47.9962 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.2755ms | 0.1309ms | 7.6370 KOps/s | 7.6195 KOps/s | |
test_compile_indexing[int-pytree-eager] | 0.3691ms | 20.8621μs | 47.9337 KOps/s | 48.4291 KOps/s | |
test_mod_add[eager] | 0.1823ms | 30.4556μs | 32.8347 KOps/s | 30.1591 KOps/s | |
test_mod_add[compile] | 0.2409ms | 70.9650μs | 14.0914 KOps/s | 13.9418 KOps/s | |
test_mod_add[compile-overhead] | 0.2669ms | 0.1363ms | 7.3393 KOps/s | 6.7923 KOps/s | |
test_mod_wrap[eager] | 0.4463ms | 0.2525ms | 3.9609 KOps/s | 4.0911 KOps/s | |
test_mod_wrap[compile] | 1.0416ms | 0.3097ms | 3.2293 KOps/s | 3.3597 KOps/s | |
test_mod_wrap[compile-overhead] | 7.5649ms | 4.0632ms | 246.1091 Ops/s | 245.4454 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.6123ms | 1.3066ms | 765.3695 Ops/s | 712.9189 Ops/s | |
test_mod_wrap_and_backward[compile] | 2.2562ms | 1.3169ms | 759.3558 Ops/s | 686.6591 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.2940ms | 0.8785ms | 1.1383 KOps/s | 1.0170 KOps/s | |
test_seq_add[eager] | 0.2801ms | 98.9902μs | 10.1020 KOps/s | 10.1310 KOps/s | |
test_seq_add[compile] | 0.3685ms | 86.7142μs | 11.5321 KOps/s | 12.0494 KOps/s | |
test_seq_add[compile-overhead] | 0.3014ms | 0.1214ms | 8.2340 KOps/s | 8.6942 KOps/s | |
test_seq_wrap[eager] | 0.5895ms | 0.3907ms | 2.5592 KOps/s | 2.5803 KOps/s | |
test_seq_wrap[compile] | 0.5168ms | 0.3289ms | 3.0404 KOps/s | 3.1619 KOps/s | |
test_seq_wrap[compile-overhead] | 0.3639ms | 0.2155ms | 4.6408 KOps/s | 4.5665 KOps/s | |
test_func_call_runtime[False-eager] | 0.9006ms | 0.7186ms | 1.3915 KOps/s | 1.3704 KOps/s | |
test_func_call_runtime[False-compile] | 0.9619ms | 0.7888ms | 1.2677 KOps/s | 1.2396 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.4976ms | 0.3528ms | 2.8345 KOps/s | 2.7862 KOps/s | |
test_func_call_runtime[True-eager] | 1.0311ms | 0.8801ms | 1.1363 KOps/s | 1.1128 KOps/s | |
test_func_call_runtime[True-compile] | 0.9953ms | 0.8290ms | 1.2062 KOps/s | 1.1939 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.5343ms | 0.3891ms | 2.5700 KOps/s | 2.5388 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.8625ms | 0.7110ms | 1.4064 KOps/s | 1.3749 KOps/s | |
test_func_call_cm_runtime[False-compile] | 1.0161ms | 0.8118ms | 1.2319 KOps/s | 1.2366 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.4928ms | 0.3543ms | 2.8222 KOps/s | 2.7738 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.1357ms | 0.9725ms | 1.0283 KOps/s | 1.0024 KOps/s | |
test_func_call_cm_runtime[True-compile] | 1.1457ms | 0.9629ms | 1.0385 KOps/s | 1.0175 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 1.1223ms | 0.9604ms | 1.0412 KOps/s | 1.0125 KOps/s | |
test_distributed | 2.7370ms | 0.1961ms | 5.1005 KOps/s | 8.2032 KOps/s | |
test_tdmodule | 48.8610μs | 14.1947μs | 70.4487 KOps/s | 64.2355 KOps/s | |
test_tdmodule_dispatch | 48.5310μs | 28.0814μs | 35.6107 KOps/s | 31.1391 KOps/s | |
test_tdseq | 0.1585ms | 14.7112μs | 67.9752 KOps/s | 60.7658 KOps/s | |
test_tdseq_dispatch | 71.1120μs | 30.3779μs | 32.9187 KOps/s | 29.3141 KOps/s | |
test_instantiation_functorch | 2.1128ms | 1.9102ms | 523.5069 Ops/s | 520.6638 Ops/s | |
test_instantiation_td | 0.1717s | 1.4276ms | 700.4783 Ops/s | 816.5792 Ops/s | |
test_exec_functorch | 0.4062ms | 0.2053ms | 4.8700 KOps/s | 4.7647 KOps/s | |
test_exec_functional_call | 0.3552ms | 0.2050ms | 4.8778 KOps/s | 4.8403 KOps/s | |
test_exec_td | 0.3624ms | 0.2077ms | 4.8147 KOps/s | 4.6938 KOps/s | |
test_exec_td_decorator | 0.4746ms | 0.2538ms | 3.9400 KOps/s | 3.8806 KOps/s | |
test_vmap_mlp_speed[True-True] | 0.8592ms | 0.6666ms | 1.5000 KOps/s | 1.4667 KOps/s | |
test_vmap_mlp_speed[True-False] | 0.8574ms | 0.6638ms | 1.5064 KOps/s | 1.4801 KOps/s | |
test_vmap_mlp_speed[False-True] | 0.7434ms | 0.5585ms | 1.7907 KOps/s | 1.7751 KOps/s | |
test_vmap_mlp_speed[False-False] | 0.7301ms | 0.5588ms | 1.7895 KOps/s | 1.7756 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 1.2411ms | 0.6515ms | 1.5350 KOps/s | 1.5085 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.8042ms | 0.6523ms | 1.5329 KOps/s | 1.4991 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7304ms | 0.5734ms | 1.7439 KOps/s | 1.7278 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7462ms | 0.5740ms | 1.7420 KOps/s | 1.7262 KOps/s | |
test_vmap_transformer_speed[True-True] | 8.4135ms | 8.1249ms | 123.0789 Ops/s | 122.7586 Ops/s | |
test_vmap_transformer_speed[True-False] | 8.3761ms | 8.0995ms | 123.4638 Ops/s | 123.0035 Ops/s | |
test_vmap_transformer_speed[False-True] | 8.2238ms | 7.9409ms | 125.9306 Ops/s | 126.2935 Ops/s | |
test_vmap_transformer_speed[False-False] | 8.0747ms | 7.8854ms | 126.8173 Ops/s | 126.5878 Ops/s | |
test_vmap_transformer_speed_decorator[True-True] | 19.2551ms | 18.9583ms | 52.7474 Ops/s | 52.6895 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 19.8476ms | 18.9174ms | 52.8614 Ops/s | 52.6462 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 19.1637ms | 18.8469ms | 53.0591 Ops/s | 52.9516 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 20.0260ms | 18.8440ms | 53.0674 Ops/s | 53.1964 Ops/s | |
test_to_module_speed[True] | 1.3716ms | 0.9384ms | 1.0657 KOps/s | 1.0687 KOps/s | |
test_to_module_speed[False] | 1.2996ms | 0.9089ms | 1.1002 KOps/s | 1.0865 KOps/s | |
test_tc_init | 63.5310μs | 32.1857μs | 31.0697 KOps/s | 28.2231 KOps/s | |
test_tc_init_nested | 95.7820μs | 64.9236μs | 15.4027 KOps/s | 14.0798 KOps/s | |
test_tc_first_layer_tensor | 5.8959μs | 0.6919μs | 1.4453 MOps/s | 1.4727 MOps/s | |
test_tc_first_layer_nontensor | 25.7310μs | 2.2410μs | 446.2292 KOps/s | 451.6087 KOps/s | |
test_tc_second_layer_tensor | 44.1477μs | 1.3874μs | 720.7621 KOps/s | 729.6250 KOps/s | |
test_tc_second_layer_nontensor | 0.1757ms | 2.9496μs | 339.0270 KOps/s | 344.2895 KOps/s | |
test_unbind | 0.1826s | 11.7867ms | 84.8412 Ops/s | 69.1249 Ops/s | |
test_full_like | 0.7574ms | 0.5760ms | 1.7360 KOps/s | 1.7344 KOps/s | |
test_zeros_like | 0.3866ms | 0.1984ms | 5.0411 KOps/s | 5.0409 KOps/s | |
test_ones_like | 0.3808ms | 0.1982ms | 5.0442 KOps/s | 5.0465 KOps/s | |
test_clone | 0.5990ms | 0.4149ms | 2.4103 KOps/s | 2.4082 KOps/s | |
test_squeeze | 0.1487ms | 9.7961μs | 102.0816 KOps/s | 100.5913 KOps/s | |
test_unsqueeze | 0.2178ms | 71.6736μs | 13.9521 KOps/s | 13.8100 KOps/s | |
test_split | 0.4081ms | 0.1566ms | 6.3838 KOps/s | 6.2313 KOps/s | |
test_permute | 0.3832ms | 0.1787ms | 5.5974 KOps/s | 5.6417 KOps/s | |
test_stack | 1.3713ms | 0.8793ms | 1.1373 KOps/s | 1.1429 KOps/s | |
test_cat | 1.3865ms | 1.2319ms | 811.7541 Ops/s | 811.3640 Ops/s |
Labels
CI
CLA Signed
No more Python 3.8 support in torch 2.5