[CI, BugFix] Fix nightly build #941
Merged
Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_plain_set_nested | 48.7410μs | 22.2076μs | 45.0297 KOps/s | 45.7287 KOps/s | |
test_plain_set_stack_nested | 70.5830μs | 22.2475μs | 44.9489 KOps/s | 45.8845 KOps/s | |
test_plain_set_nested_inplace | 65.7830μs | 24.2168μs | 41.2937 KOps/s | 41.9751 KOps/s | |
test_plain_set_stack_nested_inplace | 78.3070μs | 24.1786μs | 41.3588 KOps/s | 42.4353 KOps/s | |
test_items | 22.3320μs | 2.6000μs | 384.6204 KOps/s | 373.8731 KOps/s | |
test_items_nested | 2.2619ms | 0.3370ms | 2.9670 KOps/s | 2.8069 KOps/s | |
test_items_nested_locked | 0.5967ms | 0.3357ms | 2.9785 KOps/s | 2.9033 KOps/s | |
test_items_nested_leaf | 0.1547ms | 88.0496μs | 11.3572 KOps/s | 11.6840 KOps/s | |
test_items_stack_nested | 0.7177ms | 0.3357ms | 2.9790 KOps/s | 2.8708 KOps/s | |
test_items_stack_nested_leaf | 0.1579ms | 88.0857μs | 11.3526 KOps/s | 11.6958 KOps/s | |
test_items_stack_nested_locked | 0.6054ms | 0.3371ms | 2.9665 KOps/s | 2.9254 KOps/s | |
test_keys | 40.6870μs | 3.9168μs | 255.3126 KOps/s | 258.4810 KOps/s | |
test_keys_nested | 0.2511ms | 0.1436ms | 6.9630 KOps/s | 6.9851 KOps/s | |
test_keys_nested_locked | 0.7713ms | 0.1510ms | 6.6238 KOps/s | 6.7126 KOps/s | |
test_keys_nested_leaf | 0.2099ms | 0.1243ms | 8.0430 KOps/s | 7.9940 KOps/s | |
test_keys_stack_nested | 0.2119ms | 0.1437ms | 6.9581 KOps/s | 6.9328 KOps/s | |
test_keys_stack_nested_leaf | 0.2182ms | 0.1225ms | 8.1630 KOps/s | 8.1085 KOps/s | |
test_keys_stack_nested_locked | 0.2791ms | 0.1482ms | 6.7457 KOps/s | 6.7439 KOps/s | |
test_values | 9.2614μs | 1.1672μs | 856.7690 KOps/s | 832.1699 KOps/s | |
test_values_nested | 0.1001ms | 50.5029μs | 19.8008 KOps/s | 19.9662 KOps/s | |
test_values_nested_locked | 0.1046ms | 50.4155μs | 19.8352 KOps/s | 19.8582 KOps/s | |
test_values_nested_leaf | 96.1110μs | 45.6743μs | 21.8942 KOps/s | 22.0892 KOps/s | |
test_values_stack_nested | 97.1510μs | 50.8726μs | 19.6570 KOps/s | 18.6483 KOps/s | |
test_values_stack_nested_leaf | 85.4800μs | 45.8056μs | 21.8314 KOps/s | 21.3124 KOps/s | |
test_values_stack_nested_locked | 0.1012ms | 50.3292μs | 19.8692 KOps/s | 19.8908 KOps/s | |
test_membership | 6.1487μs | 0.7668μs | 1.3041 MOps/s | 1.3833 MOps/s | |
test_membership_nested | 45.8260μs | 2.6020μs | 384.3235 KOps/s | 372.0986 KOps/s | |
test_membership_nested_leaf | 24.5460μs | 2.6098μs | 383.1712 KOps/s | 366.4230 KOps/s | |
test_membership_stacked_nested | 42.4090μs | 2.5961μs | 385.1907 KOps/s | 377.0582 KOps/s | |
test_membership_stacked_nested_leaf | 30.5570μs | 2.5982μs | 384.8794 KOps/s | 375.7077 KOps/s | |
test_membership_nested_last | 47.0980μs | 3.8467μs | 259.9626 KOps/s | 255.5220 KOps/s | |
test_membership_nested_leaf_last | 41.6380μs | 3.9005μs | 256.3769 KOps/s | 249.4111 KOps/s | |
test_membership_stacked_nested_last | 43.8220μs | 3.8540μs | 259.4714 KOps/s | 255.3657 KOps/s | |
test_membership_stacked_nested_leaf_last | 18.4540μs | 3.8520μs | 259.6086 KOps/s | 252.1086 KOps/s | |
test_nested_getleaf | 50.6950μs | 10.3730μs | 96.4041 KOps/s | 95.8314 KOps/s | |
test_nested_get | 57.1770μs | 9.8300μs | 101.7298 KOps/s | 101.1306 KOps/s | |
test_stacked_getleaf | 27.9420μs | 10.2996μs | 97.0916 KOps/s | 96.1467 KOps/s | |
test_stacked_get | 47.0580μs | 9.7709μs | 102.3448 KOps/s | 101.0593 KOps/s | |
test_nested_getitemleaf | 47.7000μs | 11.0832μs | 90.2264 KOps/s | 90.7704 KOps/s | |
test_nested_getitem | 36.6990μs | 10.1440μs | 98.5804 KOps/s | 99.3550 KOps/s | |
test_stacked_getitemleaf | 53.5910μs | 10.9077μs | 91.6785 KOps/s | 92.6485 KOps/s | |
test_stacked_getitem | 53.6300μs | 10.0742μs | 99.2634 KOps/s | 99.6595 KOps/s | |
test_lock_nested | 79.2502ms | 0.5753ms | 1.7382 KOps/s | 2.0192 KOps/s | |
test_lock_stack_nested | 0.8142ms | 0.4707ms | 2.1244 KOps/s | 2.1237 KOps/s | |
test_unlock_nested | 81.3517ms | 0.4934ms | 2.0269 KOps/s | 2.4224 KOps/s | |
test_unlock_stack_nested | 0.5931ms | 0.3844ms | 2.6018 KOps/s | 2.5779 KOps/s | |
test_flatten_speed | 0.2205ms | 0.1092ms | 9.1582 KOps/s | 9.6471 KOps/s | |
test_unflatten_speed | 0.7479ms | 0.4319ms | 2.3153 KOps/s | 2.3183 KOps/s | |
test_common_ops | 1.7206ms | 1.0803ms | 925.6308 Ops/s | 922.6507 Ops/s | |
test_creation | 38.0010μs | 2.1401μs | 467.2682 KOps/s | 472.5831 KOps/s | |
test_creation_empty | 43.4610μs | 19.0268μs | 52.5574 KOps/s | 56.7088 KOps/s | |
test_creation_nested_1 | 54.8730μs | 22.4571μs | 44.5293 KOps/s | 47.2810 KOps/s | |
test_creation_nested_2 | 86.3920μs | 25.9905μs | 38.4756 KOps/s | 41.1829 KOps/s | |
test_clone | 59.1510μs | 16.4439μs | 60.8130 KOps/s | 60.7347 KOps/s | |
test_getitem[int] | 1.2887ms | 16.7205μs | 59.8069 KOps/s | 58.5678 KOps/s | |
test_getitem[slice_int] | 0.1440ms | 31.1754μs | 32.0766 KOps/s | 31.7103 KOps/s | |
test_getitem[range] | 0.2071ms | 57.4251μs | 17.4140 KOps/s | 17.6855 KOps/s | |
test_getitem[tuple] | 0.1197ms | 25.2111μs | 39.6651 KOps/s | 39.4648 KOps/s | |
test_getitem[list] | 0.1852ms | 50.4685μs | 19.8143 KOps/s | 19.3043 KOps/s | |
test_setitem_dim[int] | 93.8860μs | 41.8765μs | 23.8798 KOps/s | 25.8692 KOps/s | |
test_setitem_dim[slice_int] | 0.1336ms | 70.6524μs | 14.1538 KOps/s | 14.3467 KOps/s | |
test_setitem_dim[range] | 0.1488ms | 92.5367μs | 10.8065 KOps/s | 11.1403 KOps/s | |
test_setitem_dim[tuple] | 0.1203ms | 58.4637μs | 17.1046 KOps/s | 17.7374 KOps/s | |
test_setitem | 0.1060ms | 29.9550μs | 33.3834 KOps/s | 34.4743 KOps/s | |
test_set | 86.8320μs | 29.5647μs | 33.8241 KOps/s | 35.6680 KOps/s | |
test_set_shared | 4.9505ms | 0.2178ms | 4.5921 KOps/s | 4.6904 KOps/s | |
test_update | 0.1461ms | 37.0315μs | 27.0041 KOps/s | 27.8876 KOps/s | |
test_update_nested | 0.1091ms | 46.4496μs | 21.5287 KOps/s | 21.7670 KOps/s | |
test_update__nested | 0.1078ms | 33.9825μs | 29.4269 KOps/s | 29.0984 KOps/s | |
test_set_nested | 94.8570μs | 31.6663μs | 31.5793 KOps/s | 32.2122 KOps/s | |
test_set_nested_new | 89.3070μs | 36.2904μs | 27.5555 KOps/s | 27.5123 KOps/s | |
test_select | 0.1252ms | 52.6309μs | 19.0002 KOps/s | 18.9433 KOps/s | |
test_select_nested | 0.1186ms | 59.1122μs | 16.9170 KOps/s | 16.7880 KOps/s | |
test_exclude_nested | 0.1624ms | 75.9299μs | 13.1700 KOps/s | 12.9420 KOps/s | |
test_empty[True] | 0.5544ms | 0.3233ms | 3.0931 KOps/s | 3.0550 KOps/s | |
test_empty[False] | 13.5427μs | 1.1818μs | 846.1437 KOps/s | 797.3978 KOps/s | |
test_unbind_speed | 0.5401ms | 0.3042ms | 3.2873 KOps/s | 3.2287 KOps/s | |
test_unbind_speed_stack0 | 0.6490ms | 0.3039ms | 3.2909 KOps/s | 3.2703 KOps/s | |
test_unbind_speed_stack1 | 82.4114ms | 0.8684ms | 1.1515 KOps/s | 1.3648 KOps/s | |
test_split | 86.1528ms | 2.1396ms | 467.3865 Ops/s | 471.9992 Ops/s | |
test_chunk | 80.6426ms | 2.1262ms | 470.3183 Ops/s | 469.1196 Ops/s | |
test_creation[device0] | 0.2503ms | 0.1170ms | 8.5447 KOps/s | 8.6171 KOps/s | |
test_creation_from_tensor | 4.6553ms | 0.1191ms | 8.3944 KOps/s | 8.3960 KOps/s | |
test_add_one[memmap_tensor0] | 0.1970ms | 7.2523μs | 137.8867 KOps/s | 139.9371 KOps/s | |
test_contiguous[memmap_tensor0] | 25.4070μs | 1.9763μs | 505.9877 KOps/s | 496.2598 KOps/s | |
test_stack[memmap_tensor0] | 44.7740μs | 5.5144μs | 181.3444 KOps/s | 176.7963 KOps/s | |
test_memmaptd_index | 1.1161ms | 0.4152ms | 2.4087 KOps/s | 2.4765 KOps/s | |
test_memmaptd_index_astensor | 1.0164ms | 0.4916ms | 2.0341 KOps/s | 2.0339 KOps/s | |
test_memmaptd_index_op | 1.4954ms | 1.0381ms | 963.3410 Ops/s | 997.4427 Ops/s | |
test_serialize_model | 0.1271s | 0.1190s | 8.4022 Ops/s | 7.6308 Ops/s | |
test_serialize_model_pickle | 0.5028s | 0.4067s | 2.4588 Ops/s | 2.5392 Ops/s | |
test_serialize_weights | 0.1198s | 0.1155s | 8.6545 Ops/s | 8.5452 Ops/s | |
test_serialize_weights_returnearly | 0.1693s | 0.1573s | 6.3555 Ops/s | 6.2051 Ops/s | |
test_serialize_weights_pickle | 1.2486s | 0.7384s | 1.3542 Ops/s | 2.4470 Ops/s | |
test_serialize_weights_filesystem | 0.1493s | 0.1389s | 7.1991 Ops/s | 6.3908 Ops/s | |
test_serialize_model_filesystem | 0.1570s | 0.1406s | 7.1140 Ops/s | 6.5466 Ops/s | |
test_reshape_pytree | 84.4380μs | 38.8453μs | 25.7432 KOps/s | 24.6978 KOps/s | |
test_reshape_td | 0.1152ms | 46.0336μs | 21.7233 KOps/s | 20.3408 KOps/s | |
test_view_pytree | 0.1193ms | 39.7681μs | 25.1458 KOps/s | 24.9593 KOps/s | |
test_view_td | 0.1324ms | 53.4729μs | 18.7011 KOps/s | 17.8409 KOps/s | |
test_unbind_pytree | 85.3300μs | 37.1684μs | 26.9046 KOps/s | 26.7874 KOps/s | |
test_unbind_td | 0.3225ms | 45.7484μs | 21.8587 KOps/s | 20.9262 KOps/s | |
test_split_pytree | 0.1061ms | 39.8030μs | 25.1237 KOps/s | 24.4822 KOps/s | |
test_split_td | 0.6651ms | 58.7250μs | 17.0285 KOps/s | 16.5328 KOps/s | |
test_add_pytree | 0.1169ms | 46.2152μs | 21.6379 KOps/s | 22.2655 KOps/s | |
test_add_td | 0.1812ms | 82.3966μs | 12.1364 KOps/s | 12.5262 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1136ms | 53.6985μs | 18.6225 KOps/s | 18.8208 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.3466ms | 0.1894ms | 5.2800 KOps/s | 5.1293 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1793ms | 54.3642μs | 18.3944 KOps/s | 18.2195 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2741ms | 0.1420ms | 7.0403 KOps/s | 7.0596 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 55.4340μs | 19.5870μs | 51.0544 KOps/s | 48.0219 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1113ms | 64.0485μs | 15.6132 KOps/s | 15.4286 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1654ms | 80.3127μs | 12.4513 KOps/s | 12.6232 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1183ms | 73.1777μs | 13.6654 KOps/s | 13.9514 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.2527ms | 0.1735ms | 5.7633 KOps/s | 5.8086 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.2734ms | 0.1931ms | 5.1782 KOps/s | 5.0981 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 81.0420μs | 37.8999μs | 26.3853 KOps/s | 26.0258 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 1.3994ms | 69.5391μs | 14.3804 KOps/s | 14.2820 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2756ms | 0.1710ms | 5.8491 KOps/s | 5.7659 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.5011ms | 0.2855ms | 3.5024 KOps/s | 3.5018 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3829ms | 0.2068ms | 4.8346 KOps/s | 4.7505 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.3134ms | 0.1760ms | 5.6805 KOps/s | 5.8046 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.6774ms | 63.7368μs | 15.6895 KOps/s | 15.7748 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 94.8170μs | 40.7617μs | 24.5328 KOps/s | 25.8851 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.4704ms | 0.2362ms | 4.2330 KOps/s | 4.2709 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.3427ms | 0.1729ms | 5.7851 KOps/s | 5.8381 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 0.2733ms | 0.1070ms | 9.3493 KOps/s | 9.3955 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1320ms | 57.5896μs | 17.3643 KOps/s | 17.5197 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1560ms | 79.2611μs | 12.6165 KOps/s | 12.4535 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1544ms | 71.8092μs | 13.9258 KOps/s | 14.0233 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.2748ms | 0.1921ms | 5.2063 KOps/s | 5.1836 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.0621ms | 1.6364ms | 611.1057 Ops/s | 603.6447 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.2982ms | 0.1907ms | 5.2444 KOps/s | 5.3421 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 1.6485ms | 1.0576ms | 945.5316 Ops/s | 943.9400 Ops/s | |
test_compile_assign_and_add_stack[compile] | 0.6050ms | 0.4153ms | 2.4080 KOps/s | 2.4069 KOps/s | |
test_compile_assign_and_add_stack[eager] | 4.4043ms | 3.8194ms | 261.8182 Ops/s | 268.0960 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 94.1860μs | 32.7914μs | 30.4958 KOps/s | 30.7494 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 1.4231ms | 47.9040μs | 20.8751 KOps/s | 20.2404 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.1065ms | 28.7809μs | 34.7452 KOps/s | 35.5427 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 77.8260μs | 30.6230μs | 32.6552 KOps/s | 33.7146 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1053ms | 28.4374μs | 35.1650 KOps/s | 35.6051 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 85.8210μs | 30.5512μs | 32.7319 KOps/s | 34.1236 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1577ms | 71.9305μs | 13.9023 KOps/s | 13.5980 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.4713ms | 28.0860μs | 35.6049 KOps/s | 34.5568 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1315ms | 67.6093μs | 14.7909 KOps/s | 14.7650 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 86.7030μs | 25.5521μs | 39.1358 KOps/s | 39.9936 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1359ms | 67.6848μs | 14.7744 KOps/s | 14.8122 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 85.3090μs | 24.7611μs | 40.3859 KOps/s | 41.0122 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1389ms | 71.7688μs | 13.9336 KOps/s | 13.7631 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.9970ms | 27.8806μs | 35.8673 KOps/s | 35.0342 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1540ms | 67.0496μs | 14.9143 KOps/s | 14.7886 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 0.1025ms | 24.5593μs | 40.7178 KOps/s | 40.4907 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1605ms | 67.1759μs | 14.8863 KOps/s | 14.8001 KOps/s | |
test_compile_indexing[int-pytree-eager] | 85.4700μs | 24.9413μs | 40.0942 KOps/s | 40.6482 KOps/s | |
test_mod_add[eager] | 75.1100μs | 25.2680μs | 39.5757 KOps/s | 42.4028 KOps/s | |
test_mod_add[compile] | 72.4360μs | 35.7119μs | 28.0018 KOps/s | 27.3013 KOps/s | |
test_mod_add[compile-overhead] | 77.3850μs | 36.4443μs | 27.4391 KOps/s | 28.0408 KOps/s | |
test_mod_wrap[eager] | 0.3172ms | 0.2001ms | 4.9982 KOps/s | 4.9076 KOps/s | |
test_mod_wrap[compile] | 1.3128ms | 0.2217ms | 4.5108 KOps/s | 4.4500 KOps/s | |
test_mod_wrap[compile-overhead] | 0.3692ms | 0.2193ms | 4.5599 KOps/s | 4.5062 KOps/s | |
test_mod_wrap_and_backward[eager] | 14.8968ms | 11.7605ms | 85.0307 Ops/s | 81.1895 Ops/s | |
test_mod_wrap_and_backward[compile] | 21.2121ms | 12.1574ms | 82.2547 Ops/s | 80.7052 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 15.6574ms | 11.4881ms | 87.0463 Ops/s | 80.8864 Ops/s | |
test_seq_add[eager] | 0.1689ms | 87.3312μs | 11.4507 KOps/s | 11.6064 KOps/s | |
test_seq_add[compile] | 0.1547ms | 57.9801μs | 17.2473 KOps/s | 16.5044 KOps/s | |
test_seq_add[compile-overhead] | 0.1436ms | 58.5864μs | 17.0688 KOps/s | 17.0935 KOps/s | |
test_seq_wrap[eager] | 0.5812ms | 0.3683ms | 2.7153 KOps/s | 2.7140 KOps/s | |
test_seq_wrap[compile] | 0.4203ms | 0.2544ms | 3.9312 KOps/s | 3.8474 KOps/s | |
test_seq_wrap[compile-overhead] | 0.3791ms | 0.2526ms | 3.9595 KOps/s | 3.8816 KOps/s | |
test_func_call_runtime[False-eager] | 0.6413ms | 0.4984ms | 2.0064 KOps/s | 1.9585 KOps/s | |
test_func_call_runtime[False-compile] | 0.6093ms | 0.4809ms | 2.0796 KOps/s | 2.0378 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.6280ms | 0.4806ms | 2.0809 KOps/s | 2.0255 KOps/s | |
test_func_call_runtime[True-eager] | 1.0373ms | 0.7239ms | 1.3813 KOps/s | 1.3367 KOps/s | |
test_func_call_runtime[True-compile] | 0.9156ms | 0.4977ms | 2.0093 KOps/s | 1.9642 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.6668ms | 0.4965ms | 2.0142 KOps/s | 1.9519 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.7760ms | 0.4993ms | 2.0027 KOps/s | 1.9411 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.7505ms | 0.4798ms | 2.0843 KOps/s | 2.0209 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.5618ms | 0.4820ms | 2.0745 KOps/s | 2.0469 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.1988ms | 0.8609ms | 1.1616 KOps/s | 1.1320 KOps/s | |
test_func_call_cm_runtime[True-compile] | 0.9772ms | 0.8159ms | 1.2256 KOps/s | 1.2117 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.9945ms | 0.8239ms | 1.2137 KOps/s | 1.2084 KOps/s | |
test_distributed | 0.2436ms | 0.1287ms | 7.7717 KOps/s | 7.5992 KOps/s | |
test_tdmodule | 34.3940μs | 18.0083μs | 55.5300 KOps/s | 58.2156 KOps/s | |
test_tdmodule_dispatch | 82.3540μs | 39.0932μs | 25.5799 KOps/s | 28.3192 KOps/s | |
test_tdseq | 34.7350μs | 19.9648μs | 50.0881 KOps/s | 52.7680 KOps/s | |
test_tdseq_dispatch | 62.1460μs | 40.9146μs | 24.4411 KOps/s | 25.3909 KOps/s | |
test_instantiation_functorch | 1.9547ms | 1.6185ms | 617.8741 Ops/s | 613.9686 Ops/s | |
test_instantiation_td | 2.1579ms | 1.1846ms | 844.1771 Ops/s | 850.8921 Ops/s | |
test_exec_functorch | 0.3366ms | 0.1790ms | 5.5859 KOps/s | 5.4754 KOps/s | |
test_exec_functional_call | 0.3173ms | 0.1656ms | 6.0398 KOps/s | 5.8065 KOps/s | |
test_exec_td | 0.2534ms | 0.1691ms | 5.9123 KOps/s | 5.8786 KOps/s | |
test_exec_td_decorator | 0.3760ms | 0.2246ms | 4.4519 KOps/s | 4.3910 KOps/s | |
test_vmap_mlp_speed[True-True] | 0.9991ms | 0.5662ms | 1.7662 KOps/s | 1.7358 KOps/s | |
test_vmap_mlp_speed[True-False] | 0.8765ms | 0.5600ms | 1.7856 KOps/s | 1.7601 KOps/s | |
test_vmap_mlp_speed[False-True] | 0.7042ms | 0.4602ms | 2.1728 KOps/s | 2.1232 KOps/s | |
test_vmap_mlp_speed[False-False] | 0.7427ms | 0.4663ms | 2.1446 KOps/s | 2.1314 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 1.2817ms | 0.6177ms | 1.6189 KOps/s | 1.5916 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.8947ms | 0.6230ms | 1.6050 KOps/s | 1.5902 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.6971ms | 0.5122ms | 1.9523 KOps/s | 1.9259 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.8457ms | 0.5150ms | 1.9418 KOps/s | 1.9208 KOps/s | |
test_to_module_speed[True] | 1.7330ms | 1.3387ms | 747.0105 Ops/s | 744.7494 Ops/s | |
test_to_module_speed[False] | 1.8002ms | 1.3041ms | 766.7920 Ops/s | 758.8848 Ops/s | |
test_tc_init | 82.7540μs | 43.7076μs | 22.8793 KOps/s | 23.2240 KOps/s | |
test_tc_init_nested | 0.1866ms | 89.1815μs | 11.2131 KOps/s | 11.5048 KOps/s | |
test_tc_first_layer_tensor | 14.3970μs | 1.4765μs | 677.2779 KOps/s | 695.7294 KOps/s | |
test_tc_first_layer_nontensor | 26.3290μs | 4.3396μs | 230.4356 KOps/s | 231.7536 KOps/s | |
test_tc_second_layer_tensor | 20.3180μs | 2.7258μs | 366.8621 KOps/s | 361.2106 KOps/s | |
test_tc_second_layer_nontensor | 42.0380μs | 5.5257μs | 180.9730 KOps/s | 176.7716 KOps/s | |
test_unbind | 0.4508s | 13.4383ms | 74.4142 Ops/s | 75.8915 Ops/s | |
test_full_like | 18.4100ms | 12.2898ms | 81.3685 Ops/s | 137.2619 Ops/s | |
test_zeros_like | 14.0514ms | 7.4003ms | 135.1296 Ops/s | 132.3437 Ops/s | |
test_ones_like | 14.0036ms | 7.4606ms | 134.0373 Ops/s | 132.4858 Ops/s | |
test_clone | 15.3970ms | 9.0125ms | 110.9573 Ops/s | 107.5982 Ops/s | |
test_squeeze | 81.3330μs | 12.8241μs | 77.9781 KOps/s | 75.9953 KOps/s | |
test_unsqueeze | 0.1642ms | 94.0647μs | 10.6310 KOps/s | 10.4970 KOps/s | |
test_split | 0.4707ms | 0.1968ms | 5.0812 KOps/s | 5.0316 KOps/s | |
test_permute | 0.4388ms | 0.2197ms | 4.5517 KOps/s | 4.6811 KOps/s | |
test_stack | 32.7249ms | 24.5103ms | 40.7991 Ops/s | 39.6124 Ops/s | |
test_cat | 29.4476ms | 24.5634ms | 40.7110 Ops/s | 39.3405 Ops/s |
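
The Change column is empty in this capture, so the comparison between Ops and Ops on Repo HEAD has to be read off by hand. A minimal sketch of one way to do that follows, assuming Change is meant as the relative difference of Ops against Ops on Repo HEAD; the helper names `parse_ops` and `relative_change` are hypothetical and not part of the benchmark tooling.

```python
import re
from typing import Tuple

# Multipliers for the throughput units that appear in the tables.
UNIT_SCALE = {"Ops/s": 1.0, "KOps/s": 1e3, "MOps/s": 1e6}


def parse_ops(cell: str) -> float:
    """Convert a cell such as '45.0297 KOps/s' into operations per second."""
    match = re.fullmatch(r"\s*([0-9.]+)\s*([KM]?Ops/s)\s*", cell)
    if match is None:
        raise ValueError(f"unrecognised throughput cell: {cell!r}")
    value, unit = match.groups()
    return float(value) * UNIT_SCALE[unit]


def relative_change(row: str) -> Tuple[str, float]:
    """Return (benchmark name, percent change of Ops versus Ops on Repo HEAD).

    Assumes the row layout used above: Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
    """
    cells = [c.strip() for c in row.split("|")]
    name = cells[0]
    ops = parse_ops(cells[3])
    ops_head = parse_ops(cells[4])
    return name, 100.0 * (ops - ops_head) / ops_head


# One row copied from the table above.
row = "test_full_like | 18.4100ms | 12.2898ms | 81.3685 Ops/s | 137.2619 Ops/s | |"
name, change = relative_change(row)
print(f"{name}: {change:+.1f}%")
```

A negative value means the PR branch ran fewer operations per second than Repo HEAD for that benchmark. Single-run differences of a few percent are likely within noise on shared CI runners, so only large gaps are worth a closer look.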

Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_plain_set_nested | 33.7710μs | 15.1642μs | 65.9446 KOps/s | 65.8002 KOps/s | |
test_plain_set_stack_nested | 33.0810μs | 15.2933μs | 65.3881 KOps/s | 65.7361 KOps/s | |
test_plain_set_nested_inplace | 32.1400μs | 16.1655μs | 61.8600 KOps/s | 61.0175 KOps/s | |
test_plain_set_stack_nested_inplace | 46.7310μs | 16.1811μs | 61.8006 KOps/s | 61.5738 KOps/s | |
test_items | 25.5000μs | 4.7578μs | 210.1793 KOps/s | 215.7483 KOps/s | |
test_items_nested | 0.4639ms | 0.3600ms | 2.7775 KOps/s | 2.7754 KOps/s | |
test_items_nested_locked | 0.3899ms | 0.3626ms | 2.7582 KOps/s | 2.7585 KOps/s | |
test_items_nested_leaf | 0.1160ms | 83.2309μs | 12.0148 KOps/s | 11.9199 KOps/s | |
test_items_stack_nested | 0.4363ms | 0.3636ms | 2.7503 KOps/s | 2.7554 KOps/s | |
test_items_stack_nested_leaf | 0.1222ms | 85.8609μs | 11.6467 KOps/s | 11.6383 KOps/s | |
test_items_stack_nested_locked | 0.4570ms | 0.3629ms | 2.7556 KOps/s | 2.7462 KOps/s | |
test_keys | 23.8910μs | 4.4121μs | 226.6496 KOps/s | 229.1277 KOps/s | |
test_keys_nested | 89.0220μs | 66.9639μs | 14.9334 KOps/s | 14.5480 KOps/s | |
test_keys_nested_locked | 0.6642ms | 72.5709μs | 13.7796 KOps/s | 13.6188 KOps/s | |
test_keys_nested_leaf | 85.0820μs | 56.3472μs | 17.7471 KOps/s | 16.9311 KOps/s | |
test_keys_stack_nested | 0.1055ms | 66.0209μs | 15.1467 KOps/s | 14.5592 KOps/s | |
test_keys_stack_nested_leaf | 90.7220μs | 57.4345μs | 17.4111 KOps/s | 16.7796 KOps/s | |
test_keys_stack_nested_locked | 0.1145ms | 71.6764μs | 13.9516 KOps/s | 13.5990 KOps/s | |
test_values | 7.3737μs | 1.7837μs | 560.6348 KOps/s | 565.1609 KOps/s | |
test_values_nested | 64.2410μs | 33.7969μs | 29.5885 KOps/s | 29.5891 KOps/s | |
test_values_nested_locked | 54.2310μs | 35.8871μs | 27.8651 KOps/s | 27.8623 KOps/s | |
test_values_nested_leaf | 52.0010μs | 30.0714μs | 33.2542 KOps/s | 33.0100 KOps/s | |
test_values_stack_nested | 58.7610μs | 34.4483μs | 29.0290 KOps/s | 28.8443 KOps/s | |
test_values_stack_nested_leaf | 57.9210μs | 30.6533μs | 32.6229 KOps/s | 32.3516 KOps/s | |
test_values_stack_nested_locked | 87.3820μs | 36.5638μs | 27.3495 KOps/s | 27.2190 KOps/s | |
test_membership | 1.4425μs | 0.5392μs | 1.8546 MOps/s | 1.8284 MOps/s | |
test_membership_nested | 9.9955μs | 1.8773μs | 532.6763 KOps/s | 516.1628 KOps/s | |
test_membership_nested_leaf | 9.9550μs | 1.8905μs | 528.9589 KOps/s | 511.4412 KOps/s | |
test_membership_stacked_nested | 30.3100μs | 1.9720μs | 507.1065 KOps/s | 506.9058 KOps/s | |
test_membership_stacked_nested_leaf | 16.2100μs | 1.9405μs | 515.3211 KOps/s | 506.4896 KOps/s | |
test_membership_nested_last | 26.3310μs | 2.8792μs | 347.3166 KOps/s | 349.7469 KOps/s | |
test_membership_nested_leaf_last | 49.9310μs | 2.8880μs | 346.2640 KOps/s | 350.0967 KOps/s | |
test_membership_stacked_nested_last | 23.6010μs | 2.8926μs | 345.7154 KOps/s | 306.6164 KOps/s | |
test_membership_stacked_nested_leaf_last | 32.9600μs | 2.9061μs | 344.0996 KOps/s | 300.2066 KOps/s | |
test_nested_getleaf | 23.5500μs | 7.8981μs | 126.6121 KOps/s | 127.4555 KOps/s | |
test_nested_get | 31.1900μs | 7.4851μs | 133.5990 KOps/s | 134.6472 KOps/s | |
test_stacked_getleaf | 37.6700μs | 7.9496μs | 125.7931 KOps/s | 126.9880 KOps/s | |
test_stacked_get | 28.4210μs | 7.4922μs | 133.4729 KOps/s | 134.1943 KOps/s | |
test_nested_getitemleaf | 41.0410μs | 8.0742μs | 123.8512 KOps/s | 124.2527 KOps/s | |
test_nested_getitem | 22.8910μs | 7.6166μs | 131.2926 KOps/s | 132.0077 KOps/s | |
test_stacked_getitemleaf | 35.5210μs | 8.0581μs | 124.0995 KOps/s | 123.6688 KOps/s | |
test_stacked_getitem | 33.2110μs | 7.6104μs | 131.3992 KOps/s | 131.1118 KOps/s | |
test_lock_nested | 9.7916ms | 0.4772ms | 2.0956 KOps/s | 2.1284 KOps/s | |
test_lock_stack_nested | 0.4994ms | 0.4331ms | 2.3091 KOps/s | 2.3176 KOps/s | |
test_unlock_nested | 0.8683ms | 0.3864ms | 2.5880 KOps/s | 2.5701 KOps/s | |
test_unlock_stack_nested | 0.3864ms | 0.3514ms | 2.8458 KOps/s | 2.8454 KOps/s | |
test_flatten_speed | 0.4700ms | 0.1024ms | 9.7637 KOps/s | 9.5597 KOps/s | |
test_unflatten_speed | 0.3755ms | 0.2854ms | 3.5035 KOps/s | 3.5190 KOps/s | |
test_common_ops | 1.4353ms | 1.1975ms | 835.1060 Ops/s | 811.2792 Ops/s | |
test_creation | 22.3800μs | 1.6410μs | 609.3810 KOps/s | 617.6266 KOps/s | |
test_creation_empty | 33.6410μs | 13.6707μs | 73.1490 KOps/s | 73.2252 KOps/s | |
test_creation_nested_1 | 40.9400μs | 15.2057μs | 65.7649 KOps/s | 64.6091 KOps/s | |
test_creation_nested_2 | 43.2000μs | 17.5991μs | 56.8210 KOps/s | 55.7680 KOps/s | |
test_clone | 71.9610μs | 28.5363μs | 35.0430 KOps/s | 35.0257 KOps/s | |
test_getitem[int] | 1.1593ms | 16.3557μs | 61.1409 KOps/s | 60.3366 KOps/s | |
test_getitem[slice_int] | 0.1525ms | 28.2301μs | 35.4232 KOps/s | 34.6158 KOps/s | |
test_getitem[range] | 0.2482ms | 0.1145ms | 8.7363 KOps/s | 8.8328 KOps/s | |
test_getitem[tuple] | 0.1526ms | 24.1991μs | 41.3239 KOps/s | 40.9742 KOps/s | |
test_getitem[list] | 90.8483ms | 0.1170ms | 8.5460 KOps/s | 9.6735 KOps/s | |
test_setitem_dim[int] | 68.3120μs | 48.9478μs | 20.4299 KOps/s | 20.1102 KOps/s | |
test_setitem_dim[slice_int] | 92.9920μs | 73.5000μs | 13.6054 KOps/s | 13.4262 KOps/s | |
test_setitem_dim[range] | 0.1769ms | 0.1402ms | 7.1303 KOps/s | 7.1815 KOps/s | |
test_setitem_dim[tuple] | 0.1126ms | 66.6689μs | 14.9995 KOps/s | 13.9729 KOps/s | |
test_setitem | 75.5220μs | 39.1764μs | 25.5256 KOps/s | 23.5829 KOps/s | |
test_set | 79.0020μs | 38.2370μs | 26.1527 KOps/s | 25.7231 KOps/s | |
test_set_shared | 0.3744ms | 52.0963μs | 19.1952 KOps/s | 19.0007 KOps/s | |
test_update | 96.2320μs | 45.5249μs | 21.9660 KOps/s | 21.3593 KOps/s | |
test_update_nested | 92.8020μs | 52.7920μs | 18.9423 KOps/s | 17.7085 KOps/s | |
test_update__nested | 0.1177ms | 60.9913μs | 16.3958 KOps/s | 16.4539 KOps/s | |
test_set_nested | 92.2520μs | 40.6614μs | 24.5934 KOps/s | 22.2266 KOps/s | |
test_set_nested_new | 75.4720μs | 43.7539μs | 22.8551 KOps/s | 19.9248 KOps/s | |
test_select | 0.1119ms | 61.3843μs | 16.2908 KOps/s | 16.5123 KOps/s | |
test_select_nested | 0.5267ms | 51.0669μs | 19.5822 KOps/s | 19.5578 KOps/s | |
test_exclude_nested | 99.1020μs | 67.5861μs | 14.7959 KOps/s | 14.2246 KOps/s | |
test_empty[True] | 0.3090ms | 0.2795ms | 3.5773 KOps/s | 3.4982 KOps/s | |
test_empty[False] | 2.5840μs | 0.8747μs | 1.1433 MOps/s | 1.1598 MOps/s | |
test_to | 65.4710μs | 39.3835μs | 25.3914 KOps/s | 25.2775 KOps/s | |
test_to_nonblocking | 52.8110μs | 24.6223μs | 40.6136 KOps/s | 39.7406 KOps/s | |
test_unbind_speed | 1.3305ms | 0.2969ms | 3.3678 KOps/s | 3.3266 KOps/s | |
test_unbind_speed_stack0 | 0.3685ms | 0.2923ms | 3.4208 KOps/s | 3.3362 KOps/s | |
test_unbind_speed_stack1 | 90.1569ms | 0.7640ms | 1.3089 KOps/s | 1.2841 KOps/s | |
test_split | 91.5046ms | 2.2953ms | 435.6802 Ops/s | 426.6472 Ops/s | |
test_chunk | 93.4434ms | 2.2987ms | 435.0284 Ops/s | 468.8501 Ops/s | |
test_creation[device0] | 0.1583ms | 0.1025ms | 9.7562 KOps/s | 9.6399 KOps/s | |
test_creation_from_tensor | 0.1762ms | 0.1015ms | 9.8571 KOps/s | 9.8701 KOps/s | |
test_add_one[memmap_tensor0] | 61.1510μs | 8.5232μs | 117.3266 KOps/s | 118.2933 KOps/s | |
test_contiguous[memmap_tensor0] | 16.1600μs | 2.1284μs | 469.8368 KOps/s | 463.7732 KOps/s | |
test_stack[memmap_tensor0] | 33.4310μs | 6.4536μs | 154.9522 KOps/s | 156.3741 KOps/s | |
test_memmaptd_index | 1.0639ms | 0.4159ms | 2.4047 KOps/s | 2.0640 KOps/s | |
test_memmaptd_index_astensor | 0.7521ms | 0.4814ms | 2.0775 KOps/s | 2.0698 KOps/s | |
test_memmaptd_index_op | 1.3893ms | 0.9700ms | 1.0310 KOps/s | 1.0072 KOps/s | |
test_serialize_model | 92.3385ms | 88.5473ms | 11.2934 Ops/s | 10.9429 Ops/s | |
test_serialize_model_pickle | 1.3497s | 1.2360s | 0.8091 Ops/s | 0.8060 Ops/s | |
test_serialize_weights | 89.2402ms | 85.0294ms | 11.7606 Ops/s | 11.1357 Ops/s | |
test_serialize_weights_returnearly | 68.3841ms | 55.9381ms | 17.8769 Ops/s | 16.3226 Ops/s | |
test_serialize_weights_pickle | 1.3525s | 1.2436s | 0.8041 Ops/s | 0.8088 Ops/s | |
test_reshape_pytree | 61.1020μs | 37.7820μs | 26.4676 KOps/s | 26.5904 KOps/s | |
test_reshape_td | 86.6120μs | 44.5253μs | 22.4591 KOps/s | 22.8184 KOps/s | |
test_view_pytree | 60.2310μs | 37.3790μs | 26.7530 KOps/s | 26.8878 KOps/s | |
test_view_td | 0.1899ms | 50.0664μs | 19.9735 KOps/s | 18.9659 KOps/s | |
test_unbind_pytree | 70.3910μs | 36.3276μs | 27.5273 KOps/s | 27.3880 KOps/s | |
test_unbind_td | 0.4162ms | 44.4902μs | 22.4768 KOps/s | 22.3247 KOps/s | |
test_split_pytree | 80.6220μs | 50.4949μs | 19.8040 KOps/s | 19.3331 KOps/s | |
test_split_td | 90.8013ms | 68.0545μs | 14.6941 KOps/s | 14.3888 KOps/s | |
test_add_pytree | 94.3620μs | 60.2902μs | 16.5864 KOps/s | 17.2579 KOps/s | |
test_add_td | 0.1375ms | 90.0165μs | 11.1091 KOps/s | 11.4760 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.4074ms | 0.2145ms | 4.6628 KOps/s | 4.6187 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.2650ms | 0.1733ms | 5.7710 KOps/s | 5.7676 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1908ms | 0.1455ms | 6.8726 KOps/s | 6.8091 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.3891ms | 0.1882ms | 5.3123 KOps/s | 4.9900 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 0.2159ms | 22.3557μs | 44.7312 KOps/s | 43.7541 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 71.9410μs | 47.4851μs | 21.0593 KOps/s | 20.8090 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.2642ms | 73.2573μs | 13.6505 KOps/s | 13.6641 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.2418ms | 59.3420μs | 16.8515 KOps/s | 16.7999 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.5258ms | 0.3253ms | 3.0739 KOps/s | 3.0200 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.4139ms | 0.2227ms | 4.4900 KOps/s | 4.4873 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.3373ms | 0.1297ms | 7.7129 KOps/s | 7.6847 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.2674ms | 62.4366μs | 16.0162 KOps/s | 16.2140 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.5167ms | 0.3244ms | 3.0827 KOps/s | 3.0408 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.8247ms | 0.6184ms | 1.6170 KOps/s | 1.6070 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.4638ms | 0.2704ms | 3.6986 KOps/s | 3.7178 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.3874ms | 0.3258ms | 3.0691 KOps/s | 3.0041 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.2908ms | 74.2269μs | 13.4722 KOps/s | 13.4405 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1731ms | 0.1309ms | 7.6412 KOps/s | 7.5873 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.7197ms | 0.5254ms | 1.9032 KOps/s | 1.8726 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.4005ms | 0.3239ms | 3.0871 KOps/s | 3.0420 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 0.2129ms | 18.8900μs | 52.9382 KOps/s | 52.9673 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 93.7410μs | 32.4024μs | 30.8619 KOps/s | 30.8663 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.2654ms | 77.3413μs | 12.9297 KOps/s | 13.2891 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.2469ms | 60.7254μs | 16.4676 KOps/s | 16.6730 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 2.4131ms | 0.8296ms | 1.2054 KOps/s | 1.0992 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 3.3746ms | 3.2502ms | 307.6722 Ops/s | 296.3224 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 2.3634ms | 0.8184ms | 1.2219 KOps/s | 1.0974 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 3.3678ms | 3.2123ms | 311.3030 Ops/s | 298.7932 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.1713ms | 0.1109ms | 9.0196 KOps/s | 9.0417 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.2075ms | 64.6476μs | 15.4685 KOps/s | 15.9725 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.1841ms | 0.1035ms | 9.6594 KOps/s | 9.6654 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 93.0020μs | 46.8123μs | 21.3619 KOps/s | 22.0817 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1554ms | 0.1067ms | 9.3723 KOps/s | 9.3614 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 94.0120μs | 46.9904μs | 21.2810 KOps/s | 22.1343 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1904ms | 0.1453ms | 6.8822 KOps/s | 7.1852 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1859ms | 25.5737μs | 39.1026 KOps/s | 38.6252 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1922ms | 0.1339ms | 7.4680 KOps/s | 7.6093 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 56.4410μs | 21.6964μs | 46.0906 KOps/s | 45.4898 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1920ms | 0.1347ms | 7.4224 KOps/s | 7.6482 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 61.4920μs | 22.6882μs | 44.0758 KOps/s | 45.1930 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1971ms | 0.1450ms | 6.8968 KOps/s | 7.2256 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.4882ms | 26.2412μs | 38.1081 KOps/s | 38.1751 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.2198ms | 0.1356ms | 7.3755 KOps/s | 7.3033 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 60.5610μs | 21.9510μs | 45.5561 KOps/s | 45.7568 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.2251ms | 0.1327ms | 7.5356 KOps/s | 7.6164 KOps/s | |
test_compile_indexing[int-pytree-eager] | 56.6700μs | 22.1536μs | 45.1394 KOps/s | 44.2791 KOps/s | |
test_mod_add[eager] | 79.5910μs | 37.2313μs | 26.8591 KOps/s | 28.1555 KOps/s | |
test_mod_add[compile] | 0.1434ms | 72.4547μs | 13.8017 KOps/s | 13.7232 KOps/s | |
test_mod_add[compile-overhead] | 0.2594ms | 0.1352ms | 7.3965 KOps/s | 6.6423 KOps/s | |
test_mod_wrap[eager] | 0.3430ms | 0.2524ms | 3.9616 KOps/s | 3.9746 KOps/s | |
test_mod_wrap[compile] | 1.0530ms | 0.2905ms | 3.4426 KOps/s | 3.3781 KOps/s | |
test_mod_wrap[compile-overhead] | 8.1901ms | 4.3179ms | 231.5943 Ops/s | 225.0962 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.5689ms | 1.4721ms | 679.2886 Ops/s | 690.5134 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.5760ms | 1.4485ms | 690.3739 Ops/s | 691.4017 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.4117ms | 0.9805ms | 1.0198 KOps/s | 996.1003 Ops/s | |
test_seq_add[eager] | 0.1642ms | 0.1032ms | 9.6870 KOps/s | 8.9700 KOps/s | |
test_seq_add[compile] | 0.1332ms | 85.5141μs | 11.6940 KOps/s | 11.0150 KOps/s | |
test_seq_add[compile-overhead] | 0.1578ms | 0.1220ms | 8.1999 KOps/s | 8.1317 KOps/s | |
test_seq_wrap[eager] | 0.4672ms | 0.3995ms | 2.5029 KOps/s | 2.3031 KOps/s | |
test_seq_wrap[compile] | 0.4045ms | 0.3220ms | 3.1057 KOps/s | 2.9623 KOps/s | |
test_seq_wrap[compile-overhead] | 0.1897s | 88.1411ms | 11.3454 Ops/s | 7.9383 Ops/s | |
test_func_call_runtime[False-eager] | 0.8404ms | 0.7336ms | 1.3631 KOps/s | 1.2783 KOps/s | |
test_func_call_runtime[False-compile] | 0.9102ms | 0.8008ms | 1.2488 KOps/s | 1.2404 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.4807ms | 0.3644ms | 2.7442 KOps/s | 2.7360 KOps/s | |
test_func_call_runtime[True-eager] | 1.0598ms | 0.9300ms | 1.0752 KOps/s | 1.0688 KOps/s | |
test_func_call_runtime[True-compile] | 0.9025ms | 0.8370ms | 1.1948 KOps/s | 1.1776 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.5313ms | 0.4116ms | 2.4297 KOps/s | 2.4197 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.8109ms | 0.7336ms | 1.3632 KOps/s | 1.2643 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.8697ms | 0.7960ms | 1.2562 KOps/s | 1.2239 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.4867ms | 0.3682ms | 2.7156 KOps/s | 2.7067 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.1337ms | 1.0417ms | 959.9859 Ops/s | 947.1744 Ops/s | |
test_func_call_cm_runtime[True-compile] | 1.1305ms | 1.0108ms | 989.3480 Ops/s | 975.1401 Ops/s | |
test_func_call_cm_runtime[True-compile-overhead] | 1.1521ms | 1.0138ms | 986.3978 Ops/s | 969.3104 Ops/s | |
test_distributed | 0.1884ms | 68.8366μs | 14.5272 KOps/s | 13.9687 KOps/s | |
test_tdmodule | 0.1275ms | 14.0750μs | 71.0477 KOps/s | 67.6518 KOps/s | |
test_tdmodule_dispatch | 43.6510μs | 27.7043μs | 36.0954 KOps/s | 33.8146 KOps/s | |
test_tdseq | 30.4910μs | 14.2442μs | 70.2040 KOps/s | 65.0054 KOps/s | |
test_tdseq_dispatch | 49.9510μs | 29.8386μs | 33.5136 KOps/s | 33.2853 KOps/s | |
test_instantiation_functorch | 2.0911ms | 1.9913ms | 502.1799 Ops/s | 496.2445 Ops/s | |
test_instantiation_td | 2.0026ms | 1.2966ms | 771.2383 Ops/s | 767.9728 Ops/s | |
test_exec_functorch | 0.3157ms | 0.2098ms | 4.7670 KOps/s | 4.6839 KOps/s | |
test_exec_functional_call | 0.2887ms | 0.2063ms | 4.8463 KOps/s | 4.8348 KOps/s | |
test_exec_td | 0.2474ms | 0.2134ms | 4.6850 KOps/s | 4.5748 KOps/s | |
test_exec_td_decorator | 0.8163ms | 0.2653ms | 3.7693 KOps/s | 3.6638 KOps/s | |
test_vmap_mlp_speed[True-True] | 0.8378ms | 0.6441ms | 1.5525 KOps/s | 1.4797 KOps/s | |
test_vmap_mlp_speed[True-False] | 0.6994ms | 0.6436ms | 1.5537 KOps/s | 1.4916 KOps/s | |
test_vmap_mlp_speed[False-True] | 0.6930ms | 0.5749ms | 1.7393 KOps/s | 1.6970 KOps/s | |
test_vmap_mlp_speed[False-False] | 0.6936ms | 0.5672ms | 1.7630 KOps/s | 1.6904 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.8321ms | 0.6884ms | 1.4526 KOps/s | 1.4221 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 1.1500ms | 0.6906ms | 1.4480 KOps/s | 1.4029 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7373ms | 0.6083ms | 1.6440 KOps/s | 1.5765 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7441ms | 0.6092ms | 1.6415 KOps/s | 1.6093 KOps/s | |
test_vmap_transformer_speed[True-True] | 9.0567ms | 8.7127ms | 114.7743 Ops/s | 113.4839 Ops/s | |
test_vmap_transformer_speed[True-False] | 8.9229ms | 8.6584ms | 115.4941 Ops/s | 114.4248 Ops/s | |
test_vmap_transformer_speed[False-True] | 8.8066ms | 8.6065ms | 116.1910 Ops/s | 116.0338 Ops/s | |
test_vmap_transformer_speed[False-False] | 8.9356ms | 8.6103ms | 116.1400 Ops/s | 115.7930 Ops/s | |
test_vmap_transformer_speed_decorator[True-True] | 20.6377ms | 20.5295ms | 48.7103 Ops/s | 48.7575 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 20.9383ms | 20.6645ms | 48.3921 Ops/s | 48.4391 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 20.5459ms | 20.4489ms | 48.9023 Ops/s | 49.1037 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 21.2606ms | 20.4523ms | 48.8943 Ops/s | 49.0013 Ops/s | |
test_to_module_speed[True] | 1.2487ms | 1.1408ms | 876.5931 Ops/s | 874.4783 Ops/s | |
test_to_module_speed[False] | 1.2210ms | 1.1120ms | 899.2816 Ops/s | 893.8686 Ops/s | |
test_tc_init | 57.0610μs | 35.9613μs | 27.8077 KOps/s | 27.7600 KOps/s | |
test_tc_init_nested | 98.6120μs | 73.1218μs | 13.6758 KOps/s | 13.8414 KOps/s | |
test_tc_first_layer_tensor | 3.4468μs | 0.7870μs | 1.2707 MOps/s | 1.2639 MOps/s | |
test_tc_first_layer_nontensor | 0.1110ms | 2.5363μs | 394.2685 KOps/s | 395.7986 KOps/s | |
test_tc_second_layer_tensor | 9.4370μs | 1.6278μs | 614.3206 KOps/s | 617.1205 KOps/s | |
test_tc_second_layer_nontensor | 27.7310μs | 3.4078μs | 293.4439 KOps/s | 292.8752 KOps/s | |
test_unbind | 0.1806s | 10.6358ms | 94.0223 Ops/s | 63.6344 Ops/s | |
test_full_like | 0.6578ms | 0.5775ms | 1.7316 KOps/s | 1.7268 KOps/s | |
test_zeros_like | 0.2705ms | 0.1978ms | 5.0551 KOps/s | 5.0584 KOps/s | |
test_ones_like | 0.2535ms | 0.1976ms | 5.0595 KOps/s | 5.0624 KOps/s | |
test_clone | 0.4408ms | 0.4137ms | 2.4170 KOps/s | 2.4099 KOps/s | |
test_squeeze | 28.6010μs | 10.7029μs | 93.4327 KOps/s | 87.7992 KOps/s | |
test_unsqueeze | 0.2459ms | 78.4766μs | 12.7426 KOps/s | 12.7325 KOps/s | |
test_split | 0.4220ms | 0.1685ms | 5.9332 KOps/s | 5.7889 KOps/s | |
test_permute | 0.2204ms | 0.1836ms | 5.4480 KOps/s | 5.4807 KOps/s | |
test_stack | 1.2531ms | 0.8907ms | 1.1227 KOps/s | 1.1085 KOps/s | |
test_cat | 1.2513ms | 1.2314ms | 812.1022 Ops/s | 811.9221 Ops/s |
Labels: bug, CI, CLA Signed