-
Notifications
You must be signed in to change notification settings - Fork 79
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BugFix] Add nullbyte in memmap files to make fbcode happy #943
Merged
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 47.5180μs | 20.4252μs | 48.9591 KOps/s | 49.1269 KOps/s | |
test_plain_set_stack_nested | 44.4330μs | 20.2615μs | 49.3547 KOps/s | 48.2675 KOps/s | |
test_plain_set_nested_inplace | 0.2770ms | 22.4885μs | 44.4672 KOps/s | 44.2703 KOps/s | |
test_plain_set_stack_nested_inplace | 55.1820μs | 22.1829μs | 45.0798 KOps/s | 45.0476 KOps/s | |
test_items | 33.1020μs | 2.6106μs | 383.0575 KOps/s | 379.8742 KOps/s | |
test_items_nested | 0.4596ms | 0.3373ms | 2.9651 KOps/s | 2.9498 KOps/s | |
test_items_nested_locked | 2.1734ms | 0.3401ms | 2.9400 KOps/s | 2.9596 KOps/s | |
test_items_nested_leaf | 0.1455ms | 82.9099μs | 12.0613 KOps/s | 11.4838 KOps/s | |
test_items_stack_nested | 0.5554ms | 0.3390ms | 2.9495 KOps/s | 2.9340 KOps/s | |
test_items_stack_nested_leaf | 0.1630ms | 82.9109μs | 12.0611 KOps/s | 11.2692 KOps/s | |
test_items_stack_nested_locked | 0.4829ms | 0.3370ms | 2.9676 KOps/s | 2.9039 KOps/s | |
test_keys | 30.0870μs | 3.9391μs | 253.8623 KOps/s | 253.6656 KOps/s | |
test_keys_nested | 0.2424ms | 0.1438ms | 6.9520 KOps/s | 6.8745 KOps/s | |
test_keys_nested_locked | 0.6685ms | 0.1484ms | 6.7383 KOps/s | 6.6169 KOps/s | |
test_keys_nested_leaf | 0.2054ms | 0.1235ms | 8.0988 KOps/s | 8.0410 KOps/s | |
test_keys_stack_nested | 0.2069ms | 0.1418ms | 7.0507 KOps/s | 6.8822 KOps/s | |
test_keys_stack_nested_leaf | 0.2381ms | 0.1221ms | 8.1900 KOps/s | 8.0496 KOps/s | |
test_keys_stack_nested_locked | 0.2563ms | 0.1481ms | 6.7539 KOps/s | 6.6314 KOps/s | |
test_values | 5.9538μs | 1.3351μs | 749.0208 KOps/s | 753.8268 KOps/s | |
test_values_nested | 0.1289ms | 49.2112μs | 20.3206 KOps/s | 19.6692 KOps/s | |
test_values_nested_locked | 91.2390μs | 49.4623μs | 20.2174 KOps/s | 19.7492 KOps/s | |
test_values_nested_leaf | 88.8460μs | 43.9368μs | 22.7599 KOps/s | 22.0577 KOps/s | |
test_values_stack_nested | 98.0430μs | 49.3352μs | 20.2695 KOps/s | 18.9223 KOps/s | |
test_values_stack_nested_leaf | 90.1880μs | 44.3762μs | 22.5346 KOps/s | 21.9465 KOps/s | |
test_values_stack_nested_locked | 98.3530μs | 49.3106μs | 20.2796 KOps/s | 19.4930 KOps/s | |
test_membership | 11.8220μs | 0.8969μs | 1.1150 MOps/s | 1.3392 MOps/s | |
test_membership_nested | 21.3090μs | 2.5870μs | 386.5429 KOps/s | 380.3550 KOps/s | |
test_membership_nested_leaf | 43.0500μs | 2.5870μs | 386.5545 KOps/s | 383.8465 KOps/s | |
test_membership_stacked_nested | 28.5730μs | 2.6147μs | 382.4535 KOps/s | 385.0209 KOps/s | |
test_membership_stacked_nested_leaf | 20.5790μs | 2.6374μs | 379.1610 KOps/s | 381.3179 KOps/s | |
test_membership_nested_last | 27.4110μs | 3.8532μs | 259.5262 KOps/s | 249.7755 KOps/s | |
test_membership_nested_leaf_last | 28.6430μs | 3.8446μs | 260.1041 KOps/s | 256.9428 KOps/s | |
test_membership_stacked_nested_last | 25.3870μs | 3.8709μs | 258.3370 KOps/s | 260.1293 KOps/s | |
test_membership_stacked_nested_leaf_last | 37.9300μs | 3.8292μs | 261.1543 KOps/s | 257.3620 KOps/s | |
test_nested_getleaf | 34.0230μs | 10.4361μs | 95.8212 KOps/s | 93.9306 KOps/s | |
test_nested_get | 29.2840μs | 9.8981μs | 101.0291 KOps/s | 99.6341 KOps/s | |
test_stacked_getleaf | 39.5640μs | 10.4320μs | 95.8589 KOps/s | 95.3772 KOps/s | |
test_stacked_get | 33.2620μs | 9.8058μs | 101.9802 KOps/s | 100.9706 KOps/s | |
test_nested_getitemleaf | 27.8520μs | 11.0132μs | 90.8002 KOps/s | 91.3758 KOps/s | |
test_nested_getitem | 38.0510μs | 10.0539μs | 99.4639 KOps/s | 99.0487 KOps/s | |
test_stacked_getitemleaf | 34.4340μs | 10.8736μs | 91.9659 KOps/s | 91.9859 KOps/s | |
test_stacked_getitem | 35.9470μs | 9.9730μs | 100.2709 KOps/s | 100.2604 KOps/s | |
test_lock_nested | 79.6474ms | 0.5784ms | 1.7289 KOps/s | 1.9919 KOps/s | |
test_lock_stack_nested | 0.7077ms | 0.4663ms | 2.1443 KOps/s | 2.1380 KOps/s | |
test_unlock_nested | 83.2649ms | 0.4938ms | 2.0250 KOps/s | 2.4111 KOps/s | |
test_unlock_stack_nested | 0.7431ms | 0.3813ms | 2.6228 KOps/s | 2.6048 KOps/s | |
test_flatten_speed | 0.1957ms | 0.1035ms | 9.6634 KOps/s | 9.3750 KOps/s | |
test_unflatten_speed | 0.9304ms | 0.4242ms | 2.3576 KOps/s | 2.3167 KOps/s | |
test_common_ops | 2.6075ms | 1.0384ms | 963.0419 Ops/s | 968.2762 Ops/s | |
test_creation | 24.4250μs | 2.0076μs | 498.1150 KOps/s | 488.9825 KOps/s | |
test_creation_empty | 40.1650μs | 15.6323μs | 63.9700 KOps/s | 62.3655 KOps/s | |
test_creation_nested_1 | 64.2190μs | 18.6676μs | 53.5687 KOps/s | 52.0736 KOps/s | |
test_creation_nested_2 | 69.5590μs | 22.3265μs | 44.7899 KOps/s | 44.0674 KOps/s | |
test_clone | 75.4100μs | 16.3420μs | 61.1920 KOps/s | 60.0139 KOps/s | |
test_getitem[int] | 1.3032ms | 16.2551μs | 61.5191 KOps/s | 59.9661 KOps/s | |
test_getitem[slice_int] | 0.1330ms | 31.5116μs | 31.7343 KOps/s | 32.1255 KOps/s | |
test_getitem[range] | 0.1577ms | 54.6468μs | 18.2993 KOps/s | 17.4924 KOps/s | |
test_getitem[tuple] | 0.1183ms | 24.6651μs | 40.5431 KOps/s | 39.9031 KOps/s | |
test_getitem[list] | 0.2560ms | 50.6009μs | 19.7625 KOps/s | 19.2919 KOps/s | |
test_setitem_dim[int] | 70.0300μs | 35.4312μs | 28.2237 KOps/s | 25.2617 KOps/s | |
test_setitem_dim[slice_int] | 0.1049ms | 64.5284μs | 15.4971 KOps/s | 14.0445 KOps/s | |
test_setitem_dim[range] | 0.1467ms | 85.0245μs | 11.7613 KOps/s | 11.0098 KOps/s | |
test_setitem_dim[tuple] | 96.3790μs | 52.8612μs | 18.9175 KOps/s | 17.1994 KOps/s | |
test_setitem | 0.1306ms | 26.9738μs | 37.0730 KOps/s | 35.9744 KOps/s | |
test_set | 85.2580μs | 25.7554μs | 38.8269 KOps/s | 37.1597 KOps/s | |
test_set_shared | 2.1288ms | 0.2113ms | 4.7325 KOps/s | 4.6515 KOps/s | |
test_update | 0.1882ms | 31.4229μs | 31.8239 KOps/s | 30.2613 KOps/s | |
test_update_nested | 0.1284ms | 40.9487μs | 24.4208 KOps/s | 23.0297 KOps/s | |
test_update__nested | 0.1304ms | 33.0857μs | 30.2245 KOps/s | 29.1443 KOps/s | |
test_set_nested | 0.1605ms | 28.3674μs | 35.2518 KOps/s | 34.0096 KOps/s | |
test_set_nested_new | 98.3940μs | 33.1902μs | 30.1294 KOps/s | 28.9543 KOps/s | |
test_select | 0.2049ms | 49.5622μs | 20.1767 KOps/s | 19.6597 KOps/s | |
test_select_nested | 0.1159ms | 58.8670μs | 16.9875 KOps/s | 16.9068 KOps/s | |
test_exclude_nested | 0.1616ms | 76.9814μs | 12.9902 KOps/s | 12.9944 KOps/s | |
test_empty[True] | 1.1894ms | 0.3249ms | 3.0774 KOps/s | 3.0607 KOps/s | |
test_empty[False] | 6.5195μs | 1.1622μs | 860.4562 KOps/s | 857.6151 KOps/s | |
test_unbind_speed | 0.3951ms | 0.3085ms | 3.2411 KOps/s | 3.2302 KOps/s | |
test_unbind_speed_stack0 | 0.5287ms | 0.3037ms | 3.2931 KOps/s | 3.3160 KOps/s | |
test_unbind_speed_stack1 | 94.1494ms | 0.8062ms | 1.2403 KOps/s | 1.3511 KOps/s | |
test_split | 94.8275ms | 2.1376ms | 467.8198 Ops/s | 470.1770 Ops/s | |
test_chunk | 93.8632ms | 2.1354ms | 468.3061 Ops/s | 467.8018 Ops/s | |
test_creation[device0] | 3.0790ms | 0.1211ms | 8.2602 KOps/s | 8.3815 KOps/s | |
test_creation_from_tensor | 0.2962ms | 0.1191ms | 8.3934 KOps/s | 8.2956 KOps/s | |
test_add_one[memmap_tensor0] | 0.4837ms | 7.1086μs | 140.6755 KOps/s | 126.5449 KOps/s | |
test_contiguous[memmap_tensor0] | 21.8910μs | 1.9938μs | 501.5652 KOps/s | 496.4822 KOps/s | |
test_stack[memmap_tensor0] | 91.8610μs | 5.4343μs | 184.0175 KOps/s | 178.8234 KOps/s | |
test_memmaptd_index | 1.2305ms | 0.4052ms | 2.4679 KOps/s | 2.4915 KOps/s | |
test_memmaptd_index_astensor | 0.9711ms | 0.4835ms | 2.0683 KOps/s | 2.0851 KOps/s | |
test_memmaptd_index_op | 1.3163ms | 0.9608ms | 1.0408 KOps/s | 989.3251 Ops/s | |
test_serialize_model | 0.1270s | 0.1181s | 8.4710 Ops/s | 7.2913 Ops/s | |
test_serialize_model_pickle | 0.4757s | 0.3941s | 2.5371 Ops/s | 2.4681 Ops/s | |
test_serialize_weights | 0.1251s | 0.1159s | 8.6249 Ops/s | 8.6371 Ops/s | |
test_serialize_weights_returnearly | 0.1794s | 0.1609s | 6.2159 Ops/s | 6.2482 Ops/s | |
test_serialize_weights_pickle | 0.4689s | 0.4117s | 2.4288 Ops/s | 2.3751 Ops/s | |
test_serialize_weights_filesystem | 0.1458s | 0.1424s | 7.0220 Ops/s | 6.8833 Ops/s | |
test_serialize_model_filesystem | 0.2351s | 0.1609s | 6.2143 Ops/s | 6.7351 Ops/s | |
test_reshape_pytree | 85.9810μs | 39.9217μs | 25.0490 KOps/s | 25.0491 KOps/s | |
test_reshape_td | 0.1383ms | 46.5276μs | 21.4926 KOps/s | 20.6270 KOps/s | |
test_view_pytree | 86.4610μs | 39.7338μs | 25.1675 KOps/s | 24.6848 KOps/s | |
test_view_td | 0.1054ms | 52.7407μs | 18.9607 KOps/s | 17.8704 KOps/s | |
test_unbind_pytree | 81.7220μs | 37.2253μs | 26.8635 KOps/s | 27.1370 KOps/s | |
test_unbind_td | 0.3616ms | 45.5707μs | 21.9439 KOps/s | 21.8590 KOps/s | |
test_split_pytree | 87.6630μs | 40.5622μs | 24.6535 KOps/s | 24.7759 KOps/s | |
test_split_td | 0.5110ms | 58.1456μs | 17.1982 KOps/s | 16.8647 KOps/s | |
test_add_pytree | 0.2881ms | 46.5439μs | 21.4851 KOps/s | 21.0810 KOps/s | |
test_add_td | 0.1470ms | 77.0844μs | 12.9728 KOps/s | 12.0621 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1114ms | 53.0097μs | 18.8645 KOps/s | 18.2048 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.4070ms | 0.1929ms | 5.1842 KOps/s | 5.1662 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1198ms | 53.0554μs | 18.8482 KOps/s | 18.1283 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.3134ms | 0.1422ms | 7.0342 KOps/s | 6.8189 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 68.7480μs | 19.8579μs | 50.3577 KOps/s | 49.5994 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1419ms | 63.9781μs | 15.6303 KOps/s | 15.1354 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1540ms | 81.1686μs | 12.3200 KOps/s | 12.4757 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.3103ms | 72.8448μs | 13.7278 KOps/s | 13.8594 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.2837ms | 0.1745ms | 5.7307 KOps/s | 5.6778 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.2645ms | 0.1942ms | 5.1486 KOps/s | 5.1206 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 96.4100μs | 36.9954μs | 27.0304 KOps/s | 25.2048 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.5355ms | 68.8860μs | 14.5167 KOps/s | 13.8709 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2492ms | 0.1727ms | 5.7892 KOps/s | 5.6034 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.3656ms | 0.2866ms | 3.4887 KOps/s | 3.4509 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3742ms | 0.2089ms | 4.7874 KOps/s | 4.7596 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.3478ms | 0.1832ms | 5.4575 KOps/s | 5.6932 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.2322ms | 63.7478μs | 15.6868 KOps/s | 15.6174 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 83.6860μs | 39.2521μs | 25.4764 KOps/s | 25.0537 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.4920ms | 0.2364ms | 4.2299 KOps/s | 4.2532 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.3250ms | 0.1725ms | 5.7973 KOps/s | 5.6918 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 0.1847ms | 0.1086ms | 9.2055 KOps/s | 8.9107 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1216ms | 55.6985μs | 17.9538 KOps/s | 17.5668 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1682ms | 81.0364μs | 12.3401 KOps/s | 12.5973 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1579ms | 71.8199μs | 13.9237 KOps/s | 13.9080 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.3798ms | 0.1887ms | 5.2993 KOps/s | 5.1806 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.7983ms | 1.5994ms | 625.2153 Ops/s | 608.9088 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.2880ms | 0.1843ms | 5.4261 KOps/s | 5.2612 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 1.3085ms | 1.0547ms | 948.1061 Ops/s | 939.2934 Ops/s | |
test_compile_assign_and_add_stack[compile] | 0.5107ms | 0.4079ms | 2.4517 KOps/s | 2.3933 KOps/s | |
test_compile_assign_and_add_stack[eager] | 3.6952ms | 3.5545ms | 281.3341 Ops/s | 269.1580 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 89.8780μs | 31.4701μs | 31.7762 KOps/s | 30.6450 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 1.4135ms | 46.9430μs | 21.3024 KOps/s | 20.8278 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 82.9450μs | 27.4972μs | 36.3673 KOps/s | 34.0077 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 95.4480μs | 29.8581μs | 33.4917 KOps/s | 32.5561 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1657ms | 27.6216μs | 36.2036 KOps/s | 34.4247 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.1032ms | 30.0150μs | 33.3167 KOps/s | 32.7187 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1681ms | 71.7498μs | 13.9373 KOps/s | 13.5968 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.5391ms | 26.8254μs | 37.2781 KOps/s | 35.1307 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1648ms | 66.8711μs | 14.9541 KOps/s | 14.8050 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 82.4040μs | 24.3304μs | 41.1009 KOps/s | 42.2426 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1475ms | 66.1017μs | 15.1282 KOps/s | 14.8498 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 74.7690μs | 23.9356μs | 41.7787 KOps/s | 42.4166 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1458ms | 69.5337μs | 14.3815 KOps/s | 13.8463 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.9895ms | 26.3312μs | 37.9778 KOps/s | 35.5851 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1589ms | 65.6857μs | 15.2240 KOps/s | 14.6972 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 73.5170μs | 23.8598μs | 41.9115 KOps/s | 41.5889 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1377ms | 66.0738μs | 15.1346 KOps/s | 14.8952 KOps/s | |
test_compile_indexing[int-pytree-eager] | 96.6000μs | 24.3466μs | 41.0735 KOps/s | 41.8134 KOps/s | |
test_mod_add[eager] | 85.6690μs | 21.5758μs | 46.3482 KOps/s | 41.4414 KOps/s | |
test_mod_add[compile] | 85.4600μs | 36.1508μs | 27.6619 KOps/s | 27.0020 KOps/s | |
test_mod_add[compile-overhead] | 98.7140μs | 36.7148μs | 27.2370 KOps/s | 26.3961 KOps/s | |
test_mod_wrap[eager] | 0.3270ms | 0.1957ms | 5.1093 KOps/s | 4.8519 KOps/s | |
test_mod_wrap[compile] | 1.5143ms | 0.2227ms | 4.4905 KOps/s | 4.3101 KOps/s | |
test_mod_wrap[compile-overhead] | 0.3584ms | 0.2210ms | 4.5257 KOps/s | 4.3530 KOps/s | |
test_mod_wrap_and_backward[eager] | 13.3398ms | 11.2727ms | 88.7099 Ops/s | 94.1428 Ops/s | |
test_mod_wrap_and_backward[compile] | 15.7336ms | 11.5718ms | 86.4167 Ops/s | 91.4778 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 18.1440ms | 11.6138ms | 86.1045 Ops/s | 89.5947 Ops/s | |
test_seq_add[eager] | 0.1875ms | 81.8656μs | 12.2151 KOps/s | 11.8254 KOps/s | |
test_seq_add[compile] | 0.1529ms | 59.1138μs | 16.9165 KOps/s | 16.1241 KOps/s | |
test_seq_add[compile-overhead] | 0.1438ms | 59.3491μs | 16.8495 KOps/s | 16.3359 KOps/s | |
test_seq_wrap[eager] | 0.4834ms | 0.3486ms | 2.8684 KOps/s | 2.7494 KOps/s | |
test_seq_wrap[compile] | 0.5025ms | 0.2582ms | 3.8723 KOps/s | 3.8219 KOps/s | |
test_seq_wrap[compile-overhead] | 0.4099ms | 0.2538ms | 3.9398 KOps/s | 3.8379 KOps/s | |
test_func_call_runtime[False-eager] | 0.7903ms | 0.5102ms | 1.9601 KOps/s | 1.8912 KOps/s | |
test_func_call_runtime[False-compile] | 0.9608ms | 0.4847ms | 2.0630 KOps/s | 2.0068 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.5843ms | 0.4809ms | 2.0793 KOps/s | 1.9892 KOps/s | |
test_func_call_runtime[True-eager] | 0.8322ms | 0.7254ms | 1.3786 KOps/s | 1.3143 KOps/s | |
test_func_call_runtime[True-compile] | 0.6060ms | 0.4910ms | 2.0365 KOps/s | 1.9481 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.8569ms | 0.4994ms | 2.0024 KOps/s | 1.9514 KOps/s | |
test_func_call_cm_runtime[False-eager] | 1.0807ms | 0.5066ms | 1.9738 KOps/s | 1.8667 KOps/s | |
test_func_call_cm_runtime[False-compile] | 1.0413ms | 0.4824ms | 2.0729 KOps/s | 1.9977 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.5772ms | 0.4809ms | 2.0795 KOps/s | 1.9820 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.5959ms | 0.8576ms | 1.1661 KOps/s | 1.1178 KOps/s | |
test_func_call_cm_runtime[True-compile] | 1.3041ms | 0.8162ms | 1.2251 KOps/s | 1.1910 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 1.3044ms | 0.8141ms | 1.2284 KOps/s | 1.1794 KOps/s | |
test_distributed | 0.2549ms | 0.1313ms | 7.6168 KOps/s | 7.5178 KOps/s | |
test_tdmodule | 90.9190μs | 15.7949μs | 63.3115 KOps/s | 63.3770 KOps/s | |
test_tdmodule_dispatch | 64.8300μs | 32.6713μs | 30.6079 KOps/s | 30.0948 KOps/s | |
test_tdseq | 34.2750μs | 17.3109μs | 57.7669 KOps/s | 56.8100 KOps/s | |
test_tdseq_dispatch | 54.2820μs | 35.7636μs | 27.9614 KOps/s | 27.6473 KOps/s | |
test_instantiation_functorch | 1.7683ms | 1.6069ms | 622.3324 Ops/s | 619.9336 Ops/s | |
test_instantiation_td | 1.8616ms | 1.1643ms | 858.8928 Ops/s | 837.9331 Ops/s | |
test_exec_functorch | 0.4100ms | 0.1762ms | 5.6740 KOps/s | 5.5899 KOps/s | |
test_exec_functional_call | 0.4234ms | 0.1641ms | 6.0956 KOps/s | 5.8454 KOps/s | |
test_exec_td | 0.3127ms | 0.1670ms | 5.9892 KOps/s | 5.7687 KOps/s | |
test_exec_td_decorator | 1.2372ms | 0.2205ms | 4.5344 KOps/s | 4.4646 KOps/s | |
test_vmap_mlp_speed[True-True] | 0.7809ms | 0.5555ms | 1.8000 KOps/s | 1.7680 KOps/s | |
test_vmap_mlp_speed[True-False] | 0.7621ms | 0.5488ms | 1.8221 KOps/s | 1.7051 KOps/s | |
test_vmap_mlp_speed[False-True] | 0.6990ms | 0.4644ms | 2.1532 KOps/s | 2.1218 KOps/s | |
test_vmap_mlp_speed[False-False] | 0.8259ms | 0.4645ms | 2.1530 KOps/s | 2.1346 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 1.3038ms | 0.6149ms | 1.6263 KOps/s | 1.6099 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.9458ms | 0.6137ms | 1.6296 KOps/s | 1.5964 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7567ms | 0.5148ms | 1.9425 KOps/s | 1.9232 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7280ms | 0.5043ms | 1.9828 KOps/s | 1.9336 KOps/s | |
test_to_module_speed[True] | 2.1446ms | 1.3423ms | 744.9705 Ops/s | 757.0304 Ops/s | |
test_to_module_speed[False] | 1.8951ms | 1.3100ms | 763.3824 Ops/s | 771.2599 Ops/s | |
test_tc_init | 88.2840μs | 42.0844μs | 23.7617 KOps/s | 24.0986 KOps/s | |
test_tc_init_nested | 0.1588ms | 85.8257μs | 11.6515 KOps/s | 12.2669 KOps/s | |
test_tc_first_layer_tensor | 23.1130μs | 1.4670μs | 681.6585 KOps/s | 675.4683 KOps/s | |
test_tc_first_layer_nontensor | 35.6360μs | 4.2674μs | 234.3370 KOps/s | 232.1522 KOps/s | |
test_tc_second_layer_tensor | 45.1740μs | 2.7109μs | 368.8792 KOps/s | 369.1176 KOps/s | |
test_tc_second_layer_nontensor | 34.8950μs | 5.6277μs | 177.6937 KOps/s | 180.0833 KOps/s | |
test_unbind | 0.4535s | 13.5792ms | 73.6422 Ops/s | 72.7396 Ops/s | |
test_full_like | 8.6073ms | 7.7454ms | 129.1084 Ops/s | 80.1716 Ops/s | |
test_zeros_like | 11.1150ms | 6.7949ms | 147.1694 Ops/s | 132.6194 Ops/s | |
test_ones_like | 11.1298ms | 7.6936ms | 129.9788 Ops/s | 125.3410 Ops/s | |
test_clone | 16.3592ms | 9.3495ms | 106.9573 Ops/s | 105.5093 Ops/s | |
test_squeeze | 73.7870μs | 12.7735μs | 78.2871 KOps/s | 75.3323 KOps/s | |
test_unsqueeze | 0.3532ms | 96.0177μs | 10.4147 KOps/s | 10.1351 KOps/s | |
test_split | 0.3283ms | 0.1969ms | 5.0793 KOps/s | 5.0023 KOps/s | |
test_permute | 0.3226ms | 0.2189ms | 4.5679 KOps/s | 4.5075 KOps/s | |
test_stack | 30.5686ms | 25.9463ms | 38.5411 Ops/s | 39.4029 Ops/s | |
test_cat | 31.9834ms | 25.7512ms | 38.8331 Ops/s | 40.4656 Ops/s |
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 36.6010μs | 16.4971μs | 60.6168 KOps/s | 59.7101 KOps/s | |
test_plain_set_stack_nested | 38.1110μs | 16.7352μs | 59.7544 KOps/s | 59.7838 KOps/s | |
test_plain_set_nested_inplace | 0.1741ms | 17.7313μs | 56.3976 KOps/s | 56.2001 KOps/s | |
test_plain_set_stack_nested_inplace | 41.9500μs | 17.7437μs | 56.3579 KOps/s | 56.3573 KOps/s | |
test_items | 26.7300μs | 4.7070μs | 212.4501 KOps/s | 211.8214 KOps/s | |
test_items_nested | 0.3949ms | 0.3623ms | 2.7602 KOps/s | 2.7917 KOps/s | |
test_items_nested_locked | 0.5331ms | 0.3656ms | 2.7354 KOps/s | 2.7594 KOps/s | |
test_items_nested_leaf | 0.2632ms | 83.8436μs | 11.9270 KOps/s | 11.9399 KOps/s | |
test_items_stack_nested | 0.5264ms | 0.3669ms | 2.7257 KOps/s | 2.7364 KOps/s | |
test_items_stack_nested_leaf | 0.1002ms | 83.5765μs | 11.9651 KOps/s | 11.8734 KOps/s | |
test_items_stack_nested_locked | 0.3997ms | 0.3714ms | 2.6923 KOps/s | 2.7548 KOps/s | |
test_keys | 18.4000μs | 4.3646μs | 229.1148 KOps/s | 228.5419 KOps/s | |
test_keys_nested | 82.2120μs | 65.6090μs | 15.2418 KOps/s | 14.9448 KOps/s | |
test_keys_nested_locked | 0.7033ms | 72.3577μs | 13.8202 KOps/s | 13.7625 KOps/s | |
test_keys_nested_leaf | 74.3620μs | 55.7043μs | 17.9519 KOps/s | 17.6620 KOps/s | |
test_keys_stack_nested | 91.5020μs | 66.2325μs | 15.0983 KOps/s | 14.9112 KOps/s | |
test_keys_stack_nested_leaf | 84.2220μs | 57.1286μs | 17.5044 KOps/s | 17.7723 KOps/s | |
test_keys_stack_nested_locked | 95.8530μs | 71.0071μs | 14.0831 KOps/s | 13.9527 KOps/s | |
test_values | 8.2173μs | 1.7668μs | 566.0088 KOps/s | 566.9316 KOps/s | |
test_values_nested | 57.6720μs | 33.7654μs | 29.6161 KOps/s | 29.7275 KOps/s | |
test_values_nested_locked | 0.1532ms | 35.5547μs | 28.1256 KOps/s | 28.0318 KOps/s | |
test_values_nested_leaf | 44.4710μs | 29.9245μs | 33.4174 KOps/s | 33.3347 KOps/s | |
test_values_stack_nested | 65.7710μs | 34.5094μs | 28.9776 KOps/s | 29.5530 KOps/s | |
test_values_stack_nested_leaf | 0.1565ms | 30.6306μs | 32.6471 KOps/s | 33.3919 KOps/s | |
test_values_stack_nested_locked | 0.1759ms | 36.3237μs | 27.5302 KOps/s | 28.0034 KOps/s | |
test_membership | 1.2721μs | 0.5491μs | 1.8210 MOps/s | 1.8320 MOps/s | |
test_membership_nested | 15.1305μs | 1.9141μs | 522.4263 KOps/s | 514.0748 KOps/s | |
test_membership_nested_leaf | 10.6300μs | 1.9201μs | 520.8052 KOps/s | 515.2729 KOps/s | |
test_membership_stacked_nested | 22.0300μs | 1.9575μs | 510.8587 KOps/s | 503.0912 KOps/s | |
test_membership_stacked_nested_leaf | 22.0900μs | 1.9557μs | 511.3326 KOps/s | 500.2502 KOps/s | |
test_membership_nested_last | 16.1510μs | 2.9068μs | 344.0211 KOps/s | 346.2356 KOps/s | |
test_membership_nested_leaf_last | 36.2610μs | 2.9623μs | 337.5741 KOps/s | 338.4817 KOps/s | |
test_membership_stacked_nested_last | 26.7810μs | 9.1394μs | 109.4160 KOps/s | 344.2190 KOps/s | |
test_membership_stacked_nested_leaf_last | 28.0610μs | 9.1049μs | 109.8304 KOps/s | 341.1993 KOps/s | |
test_nested_getleaf | 23.0000μs | 7.9286μs | 126.1259 KOps/s | 125.1038 KOps/s | |
test_nested_get | 28.6800μs | 7.4245μs | 134.6900 KOps/s | 132.8815 KOps/s | |
test_stacked_getleaf | 26.3400μs | 7.9006μs | 126.5720 KOps/s | 124.5675 KOps/s | |
test_stacked_get | 23.3610μs | 7.4808μs | 133.6755 KOps/s | 133.0894 KOps/s | |
test_nested_getitemleaf | 21.3700μs | 8.0301μs | 124.5314 KOps/s | 123.0597 KOps/s | |
test_nested_getitem | 23.1810μs | 7.5949μs | 131.6667 KOps/s | 130.3500 KOps/s | |
test_stacked_getitemleaf | 27.0900μs | 8.1135μs | 123.2509 KOps/s | 122.1309 KOps/s | |
test_stacked_getitem | 28.8010μs | 7.6199μs | 131.2361 KOps/s | 129.8339 KOps/s | |
test_lock_nested | 9.7260ms | 0.4760ms | 2.1009 KOps/s | 2.1455 KOps/s | |
test_lock_stack_nested | 0.5950ms | 0.4176ms | 2.3948 KOps/s | 2.3488 KOps/s | |
test_unlock_nested | 0.8560ms | 0.3809ms | 2.6254 KOps/s | 2.6376 KOps/s | |
test_unlock_stack_nested | 0.3645ms | 0.3361ms | 2.9749 KOps/s | 2.8982 KOps/s | |
test_flatten_speed | 0.4767ms | 0.1050ms | 9.5273 KOps/s | 9.5579 KOps/s | |
test_unflatten_speed | 0.3119ms | 0.2855ms | 3.5032 KOps/s | 3.5351 KOps/s | |
test_common_ops | 1.5018ms | 1.2494ms | 800.3650 Ops/s | 768.2089 Ops/s | |
test_creation | 14.4810μs | 1.6225μs | 616.3379 KOps/s | 608.3310 KOps/s | |
test_creation_empty | 59.1710μs | 16.9009μs | 59.1685 KOps/s | 60.0262 KOps/s | |
test_creation_nested_1 | 42.5310μs | 18.5966μs | 53.7732 KOps/s | 53.2637 KOps/s | |
test_creation_nested_2 | 41.3600μs | 21.0865μs | 47.4237 KOps/s | 47.5061 KOps/s | |
test_clone | 0.1799ms | 28.9382μs | 34.5564 KOps/s | 32.8585 KOps/s | |
test_getitem[int] | 1.3883ms | 16.4911μs | 60.6386 KOps/s | 60.3071 KOps/s | |
test_getitem[slice_int] | 0.1525ms | 27.6337μs | 36.1877 KOps/s | 35.2303 KOps/s | |
test_getitem[range] | 0.2851ms | 0.1099ms | 9.0976 KOps/s | 9.0641 KOps/s | |
test_getitem[tuple] | 0.1532ms | 24.0444μs | 41.5897 KOps/s | 40.8207 KOps/s | |
test_getitem[list] | 92.4063ms | 0.1129ms | 8.8606 KOps/s | 9.5352 KOps/s | |
test_setitem_dim[int] | 71.7120μs | 52.9654μs | 18.8802 KOps/s | 18.5476 KOps/s | |
test_setitem_dim[slice_int] | 0.1009ms | 73.9262μs | 13.5270 KOps/s | 13.3355 KOps/s | |
test_setitem_dim[range] | 0.2589ms | 0.1344ms | 7.4404 KOps/s | 7.0434 KOps/s | |
test_setitem_dim[tuple] | 0.1782ms | 66.9995μs | 14.9255 KOps/s | 14.0949 KOps/s | |
test_setitem | 0.1947ms | 41.6912μs | 23.9859 KOps/s | 24.2881 KOps/s | |
test_set | 0.2007ms | 40.4755μs | 24.7063 KOps/s | 24.9811 KOps/s | |
test_set_shared | 0.3639ms | 52.2383μs | 19.1430 KOps/s | 19.4885 KOps/s | |
test_update | 0.2045ms | 49.5820μs | 20.1686 KOps/s | 19.7474 KOps/s | |
test_update_nested | 0.2095ms | 57.0298μs | 17.5347 KOps/s | 17.1757 KOps/s | |
test_update__nested | 0.2109ms | 58.6161μs | 17.0602 KOps/s | 16.8366 KOps/s | |
test_set_nested | 0.1963ms | 44.1578μs | 22.6460 KOps/s | 23.0112 KOps/s | |
test_set_nested_new | 0.1959ms | 47.3473μs | 21.1205 KOps/s | 20.3237 KOps/s | |
test_select | 0.2135ms | 62.3578μs | 16.0365 KOps/s | 15.8064 KOps/s | |
test_select_nested | 0.4896ms | 52.9069μs | 18.9011 KOps/s | 19.7000 KOps/s | |
test_exclude_nested | 93.5220μs | 69.4747μs | 14.3937 KOps/s | 14.4857 KOps/s | |
test_empty[True] | 0.2961ms | 0.2772ms | 3.6077 KOps/s | 3.5563 KOps/s | |
test_empty[False] | 2.2840μs | 0.8731μs | 1.1453 MOps/s | 1.1260 MOps/s | |
test_to | 69.0920μs | 39.0089μs | 25.6352 KOps/s | 25.7848 KOps/s | |
test_to_nonblocking | 54.5520μs | 24.8193μs | 40.2912 KOps/s | 41.2840 KOps/s | |
test_unbind_speed | 1.4801ms | 0.2960ms | 3.3785 KOps/s | 3.3608 KOps/s | |
test_unbind_speed_stack0 | 0.3074ms | 0.2861ms | 3.4958 KOps/s | 3.4318 KOps/s | |
test_unbind_speed_stack1 | 91.9804ms | 0.7414ms | 1.3488 KOps/s | 1.2922 KOps/s | |
test_split | 96.0544ms | 2.2437ms | 445.6935 Ops/s | 438.2914 Ops/s | |
test_chunk | 95.2673ms | 2.2253ms | 449.3717 Ops/s | 438.4833 Ops/s | |
test_creation[device0] | 0.2275ms | 0.1007ms | 9.9260 KOps/s | 9.7982 KOps/s | |
test_creation_from_tensor | 0.2656ms | 0.1026ms | 9.7477 KOps/s | 9.6553 KOps/s | |
test_add_one[memmap_tensor0] | 0.1462ms | 8.3393μs | 119.9142 KOps/s | 117.5063 KOps/s | |
test_contiguous[memmap_tensor0] | 18.6410μs | 2.0593μs | 485.6012 KOps/s | 485.4960 KOps/s | |
test_stack[memmap_tensor0] | 33.0500μs | 6.3678μs | 157.0396 KOps/s | 152.4853 KOps/s | |
test_memmaptd_index | 1.2515ms | 0.4124ms | 2.4249 KOps/s | 2.4156 KOps/s | |
test_memmaptd_index_astensor | 0.7771ms | 0.4745ms | 2.1076 KOps/s | 2.0946 KOps/s | |
test_memmaptd_index_op | 1.4192ms | 1.0043ms | 995.7381 Ops/s | 999.4401 Ops/s | |
test_serialize_model | 93.5574ms | 88.9010ms | 11.2485 Ops/s | 10.9255 Ops/s | |
test_serialize_model_pickle | 1.3520s | 1.2367s | 0.8086 Ops/s | 0.8083 Ops/s | |
test_serialize_weights | 0.1817s | 96.4812ms | 10.3647 Ops/s | 9.7251 Ops/s | |
test_serialize_weights_returnearly | 0.2725s | 66.6711ms | 14.9990 Ops/s | 15.4246 Ops/s | |
test_serialize_weights_pickle | 1.3490s | 1.2363s | 0.8089 Ops/s | 0.8037 Ops/s | |
test_reshape_pytree | 0.1986ms | 40.4146μs | 24.7435 KOps/s | 26.8904 KOps/s | |
test_reshape_td | 0.1723ms | 43.6916μs | 22.8877 KOps/s | 24.4182 KOps/s | |
test_view_pytree | 0.1825ms | 36.0635μs | 27.7289 KOps/s | 27.7337 KOps/s | |
test_view_td | 74.3820μs | 47.6721μs | 20.9766 KOps/s | 20.2618 KOps/s | |
test_unbind_pytree | 0.1172ms | 34.9553μs | 28.6080 KOps/s | 28.2139 KOps/s | |
test_unbind_td | 0.4168ms | 43.6529μs | 22.9080 KOps/s | 22.9332 KOps/s | |
test_split_pytree | 0.1453ms | 48.0957μs | 20.7919 KOps/s | 19.8341 KOps/s | |
test_split_td | 0.5065ms | 58.3369μs | 17.1418 KOps/s | 17.6174 KOps/s | |
test_add_pytree | 0.2142ms | 63.2571μs | 15.8085 KOps/s | 16.0847 KOps/s | |
test_add_td | 0.2506ms | 96.6940μs | 10.3419 KOps/s | 10.8263 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.4028ms | 0.2085ms | 4.7970 KOps/s | 4.8113 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.3203ms | 0.1728ms | 5.7865 KOps/s | 5.8360 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.3285ms | 0.1434ms | 6.9731 KOps/s | 6.8927 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.3480ms | 0.1913ms | 5.2275 KOps/s | 5.1282 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 0.1556ms | 21.4879μs | 46.5378 KOps/s | 46.5656 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1843ms | 46.8819μs | 21.3302 KOps/s | 21.4920 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1310ms | 72.9581μs | 13.7065 KOps/s | 13.7538 KOps/s | |
test_compile_copy_nested[pytree-eager] | 85.4120μs | 59.0532μs | 16.9339 KOps/s | 16.8318 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.4516ms | 0.3246ms | 3.0810 KOps/s | 3.1098 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3518ms | 0.2212ms | 4.5216 KOps/s | 4.5139 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.3148ms | 0.1296ms | 7.7180 KOps/s | 7.5151 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.2385ms | 62.4771μs | 16.0059 KOps/s | 15.4867 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.4972ms | 0.3210ms | 3.1153 KOps/s | 3.1048 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.7754ms | 0.6168ms | 1.6214 KOps/s | 1.5601 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.4131ms | 0.2680ms | 3.7307 KOps/s | 3.7282 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.4834ms | 0.3256ms | 3.0715 KOps/s | 3.0693 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.2155ms | 74.9756μs | 13.3377 KOps/s | 13.1056 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.2762ms | 0.1338ms | 7.4755 KOps/s | 7.6085 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.6774ms | 0.5234ms | 1.9104 KOps/s | 1.8135 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.4658ms | 0.3240ms | 3.0863 KOps/s | 3.0989 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 0.2062ms | 19.9446μs | 50.1388 KOps/s | 53.9427 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.2058ms | 32.8435μs | 30.4474 KOps/s | 31.5314 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1942ms | 76.4800μs | 13.0753 KOps/s | 13.1057 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.2526ms | 60.0940μs | 16.6406 KOps/s | 16.5955 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 2.4295ms | 0.8427ms | 1.1866 KOps/s | 1.1177 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 3.7833ms | 3.2966ms | 303.3417 Ops/s | 316.8400 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 2.3677ms | 0.8090ms | 1.2361 KOps/s | 1.1429 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 3.3589ms | 3.1372ms | 318.7549 Ops/s | 315.6469 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.2636ms | 0.1102ms | 9.0726 KOps/s | 9.1125 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.2438ms | 61.8987μs | 16.1554 KOps/s | 16.4150 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.2514ms | 0.1026ms | 9.7512 KOps/s | 9.6530 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.2121ms | 43.6895μs | 22.8888 KOps/s | 22.7502 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.2533ms | 0.1019ms | 9.8136 KOps/s | 9.6918 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.2264ms | 43.8104μs | 22.8256 KOps/s | 22.6793 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.2835ms | 0.1364ms | 7.3290 KOps/s | 7.2840 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.2288ms | 24.3366μs | 41.0904 KOps/s | 39.5458 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.2769ms | 0.1282ms | 7.8033 KOps/s | 7.7265 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 52.5700μs | 21.3903μs | 46.7501 KOps/s | 45.2785 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.2900ms | 0.1279ms | 7.8168 KOps/s | 7.7252 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 56.6720μs | 21.2483μs | 47.0625 KOps/s | 46.3219 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.3011ms | 0.1361ms | 7.3474 KOps/s | 7.3243 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.5057ms | 23.8863μs | 41.8649 KOps/s | 39.5577 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.2884ms | 0.1277ms | 7.8282 KOps/s | 7.7660 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 0.1597ms | 21.6749μs | 46.1362 KOps/s | 46.2651 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.3289ms | 0.1278ms | 7.8237 KOps/s | 7.7329 KOps/s | |
test_compile_indexing[int-pytree-eager] | 0.1934ms | 21.6925μs | 46.0989 KOps/s | 46.1516 KOps/s | |
test_mod_add[eager] | 0.2062ms | 36.6531μs | 27.2828 KOps/s | 27.1037 KOps/s | |
test_mod_add[compile] | 0.2563ms | 68.7253μs | 14.5507 KOps/s | 14.7887 KOps/s | |
test_mod_add[compile-overhead] | 0.2632ms | 0.1342ms | 7.4527 KOps/s | 6.7242 KOps/s | |
test_mod_wrap[eager] | 0.3867ms | 0.2407ms | 4.1552 KOps/s | 3.8976 KOps/s | |
test_mod_wrap[compile] | 1.1374ms | 0.2886ms | 3.4652 KOps/s | 3.3509 KOps/s | |
test_mod_wrap[compile-overhead] | 8.5790ms | 4.4740ms | 223.5159 Ops/s | 223.7609 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.6112ms | 1.4287ms | 699.9598 Ops/s | 761.3435 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.5017ms | 1.3152ms | 760.3418 Ops/s | 697.0903 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.3085ms | 0.8871ms | 1.1273 KOps/s | 1.0094 KOps/s | |
test_seq_add[eager] | 0.2601ms | 0.1052ms | 9.5056 KOps/s | 9.4233 KOps/s | |
test_seq_add[compile] | 0.2340ms | 87.0952μs | 11.4817 KOps/s | 11.8015 KOps/s | |
test_seq_add[compile-overhead] | 0.2585ms | 0.1215ms | 8.2336 KOps/s | 8.2193 KOps/s | |
test_seq_wrap[eager] | 0.5488ms | 0.4014ms | 2.4910 KOps/s | 2.4113 KOps/s | |
test_seq_wrap[compile] | 0.4908ms | 0.3309ms | 3.0225 KOps/s | 3.0999 KOps/s | |
test_seq_wrap[compile-overhead] | 0.1966s | 91.5288ms | 10.9255 Ops/s | 7.7447 Ops/s | |
test_func_call_runtime[False-eager] | 0.8737ms | 0.7103ms | 1.4078 KOps/s | 1.3730 KOps/s | |
test_func_call_runtime[False-compile] | 0.9440ms | 0.7874ms | 1.2700 KOps/s | 1.2397 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.5094ms | 0.3593ms | 2.7829 KOps/s | 2.7662 KOps/s | |
test_func_call_runtime[True-eager] | 1.0741ms | 0.8977ms | 1.1140 KOps/s | 1.0956 KOps/s | |
test_func_call_runtime[True-compile] | 1.0162ms | 0.8296ms | 1.2054 KOps/s | 1.2024 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.5448ms | 0.4050ms | 2.4689 KOps/s | 2.4695 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.8654ms | 0.7043ms | 1.4199 KOps/s | 1.3243 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.9504ms | 0.7910ms | 1.2642 KOps/s | 1.2576 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.5055ms | 0.3614ms | 2.7671 KOps/s | 2.7630 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.1781ms | 1.0105ms | 989.5698 Ops/s | 975.0893 Ops/s | |
test_func_call_cm_runtime[True-compile] | 1.1317ms | 0.9696ms | 1.0313 KOps/s | 1.0043 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 1.1570ms | 0.9848ms | 1.0154 KOps/s | 1.0079 KOps/s | |
test_distributed | 0.9392ms | 73.0238μs | 13.6942 KOps/s | 13.7449 KOps/s | |
test_tdmodule | 38.5220μs | 15.5769μs | 64.1976 KOps/s | 63.0602 KOps/s | |
test_tdmodule_dispatch | 53.6520μs | 32.6956μs | 30.5852 KOps/s | 31.2732 KOps/s | |
test_tdseq | 35.9620μs | 16.3403μs | 61.1983 KOps/s | 60.3314 KOps/s | |
test_tdseq_dispatch | 50.9810μs | 33.8300μs | 29.5595 KOps/s | 29.3084 KOps/s | |
test_instantiation_functorch | 2.1055ms | 1.9314ms | 517.7639 Ops/s | 503.3573 Ops/s | |
test_instantiation_td | 2.0223ms | 1.2740ms | 784.9192 Ops/s | 777.0998 Ops/s | |
test_exec_functorch | 0.3561ms | 0.2058ms | 4.8584 KOps/s | 4.4321 KOps/s | |
test_exec_functional_call | 0.3538ms | 0.2028ms | 4.9310 KOps/s | 4.6397 KOps/s | |
test_exec_td | 0.3562ms | 0.2066ms | 4.8414 KOps/s | 4.4938 KOps/s | |
test_exec_td_decorator | 0.4072ms | 0.2592ms | 3.8584 KOps/s | 3.5882 KOps/s | |
test_vmap_mlp_speed[True-True] | 1.0034ms | 0.6419ms | 1.5578 KOps/s | 1.5259 KOps/s | |
test_vmap_mlp_speed[True-False] | 0.8075ms | 0.6310ms | 1.5848 KOps/s | 1.5316 KOps/s | |
test_vmap_mlp_speed[False-True] | 0.7162ms | 0.5406ms | 1.8498 KOps/s | 1.7439 KOps/s | |
test_vmap_mlp_speed[False-False] | 0.7311ms | 0.5620ms | 1.7794 KOps/s | 1.7175 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 1.2591ms | 0.6716ms | 1.4889 KOps/s | 1.4078 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.8340ms | 0.6695ms | 1.4937 KOps/s | 1.4156 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7906ms | 0.6063ms | 1.6494 KOps/s | 1.6221 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.8148ms | 0.5850ms | 1.7093 KOps/s | 1.6202 KOps/s | |
test_vmap_transformer_speed[True-True] | 8.8062ms | 8.3946ms | 119.1246 Ops/s | 114.4845 Ops/s | |
test_vmap_transformer_speed[True-False] | 8.6109ms | 8.3505ms | 119.7534 Ops/s | 116.1456 Ops/s | |
test_vmap_transformer_speed[False-True] | 8.7354ms | 8.3072ms | 120.3772 Ops/s | 117.7926 Ops/s | |
test_vmap_transformer_speed[False-False] | 8.6474ms | 8.2820ms | 120.7438 Ops/s | 118.4181 Ops/s | |
test_vmap_transformer_speed_decorator[True-True] | 20.0985ms | 19.8570ms | 50.3601 Ops/s | 49.9682 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 20.6228ms | 19.8421ms | 50.3979 Ops/s | 49.3809 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 19.9706ms | 19.7098ms | 50.7362 Ops/s | 50.1365 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 20.6208ms | 19.7353ms | 50.6706 Ops/s | 50.6893 Ops/s | |
test_to_module_speed[True] | 1.3421ms | 1.1380ms | 878.7506 Ops/s | 878.0708 Ops/s | |
test_to_module_speed[False] | 1.2410ms | 1.1223ms | 891.0592 Ops/s | 904.8376 Ops/s | |
test_tc_init | 68.1210μs | 38.5526μs | 25.9386 KOps/s | 27.3580 KOps/s | |
test_tc_init_nested | 0.1303ms | 80.5596μs | 12.4132 KOps/s | 13.3106 KOps/s | |
test_tc_first_layer_tensor | 16.9655μs | 0.7862μs | 1.2719 MOps/s | 1.2569 MOps/s | |
test_tc_first_layer_nontensor | 15.6110μs | 2.5443μs | 393.0284 KOps/s | 390.7425 KOps/s | |
test_tc_second_layer_tensor | 9.1933μs | 1.6399μs | 609.7857 KOps/s | 611.1844 KOps/s | |
test_tc_second_layer_nontensor | 23.7810μs | 3.3913μs | 294.8691 KOps/s | 293.0295 KOps/s | |
test_unbind | 0.1854s | 10.7100ms | 93.3710 Ops/s | 63.7103 Ops/s | |
test_full_like | 0.7711ms | 0.5774ms | 1.7320 KOps/s | 1.7525 KOps/s | |
test_zeros_like | 0.3450ms | 0.1979ms | 5.0520 KOps/s | 5.0499 KOps/s | |
test_ones_like | 0.3414ms | 0.1979ms | 5.0522 KOps/s | 5.0560 KOps/s | |
test_clone | 0.5699ms | 0.4139ms | 2.4162 KOps/s | 2.4130 KOps/s | |
test_squeeze | 32.4600μs | 10.6518μs | 93.8811 KOps/s | 93.1318 KOps/s | |
test_unsqueeze | 0.2419ms | 78.0804μs | 12.8073 KOps/s | 12.5945 KOps/s | |
test_split | 0.4189ms | 0.1656ms | 6.0383 KOps/s | 5.6752 KOps/s | |
test_permute | 0.3045ms | 0.1836ms | 5.4477 KOps/s | 5.1406 KOps/s | |
test_stack | 1.2963ms | 0.9038ms | 1.1064 KOps/s | 1.1263 KOps/s | |
test_cat | 1.3609ms | 1.2322ms | 811.5489 Ops/s | 811.7279 Ops/s |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
bug
Something isn't working
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
No description provided.