-
Notifications
You must be signed in to change notification settings - Fork 79
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BugFix] Allow any tensorclass to have a data field #906
Merged
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 75.1210μs | 24.1447μs | 41.4169 KOps/s | 46.3057 KOps/s | |
test_plain_set_stack_nested | 54.7530μs | 23.2648μs | 42.9835 KOps/s | 45.6048 KOps/s | |
test_plain_set_nested_inplace | 75.8220μs | 25.4850μs | 39.2388 KOps/s | 41.9465 KOps/s | |
test_plain_set_stack_nested_inplace | 58.8500μs | 25.6942μs | 38.9193 KOps/s | 42.3824 KOps/s | |
test_items | 16.4310μs | 2.6399μs | 378.8059 KOps/s | 379.0794 KOps/s | |
test_items_nested | 0.6079ms | 0.3660ms | 2.7324 KOps/s | 2.7174 KOps/s | |
test_items_nested_locked | 1.6325ms | 0.3678ms | 2.7186 KOps/s | 2.6815 KOps/s | |
test_items_nested_leaf | 0.1576ms | 88.2295μs | 11.3341 KOps/s | 11.5881 KOps/s | |
test_items_stack_nested | 0.6051ms | 0.3703ms | 2.7003 KOps/s | 2.7180 KOps/s | |
test_items_stack_nested_leaf | 0.2429ms | 92.6227μs | 10.7965 KOps/s | 11.5589 KOps/s | |
test_items_stack_nested_locked | 2.3419ms | 0.3772ms | 2.6514 KOps/s | 2.7093 KOps/s | |
test_keys | 19.1660μs | 3.9427μs | 253.6345 KOps/s | 259.7874 KOps/s | |
test_keys_nested | 0.2193ms | 0.1460ms | 6.8488 KOps/s | 6.9243 KOps/s | |
test_keys_nested_locked | 0.7524ms | 0.1515ms | 6.6019 KOps/s | 6.6416 KOps/s | |
test_keys_nested_leaf | 0.2120ms | 0.1247ms | 8.0191 KOps/s | 8.0248 KOps/s | |
test_keys_stack_nested | 0.2458ms | 0.1462ms | 6.8377 KOps/s | 6.9208 KOps/s | |
test_keys_stack_nested_leaf | 0.2821ms | 0.1255ms | 7.9707 KOps/s | 8.1381 KOps/s | |
test_keys_stack_nested_locked | 0.2960ms | 0.1526ms | 6.5526 KOps/s | 6.6890 KOps/s | |
test_values | 8.0728μs | 1.1766μs | 849.8810 KOps/s | 893.4795 KOps/s | |
test_values_nested | 99.3950μs | 51.3007μs | 19.4929 KOps/s | 19.8551 KOps/s | |
test_values_nested_locked | 89.1760μs | 50.8439μs | 19.6680 KOps/s | 19.7445 KOps/s | |
test_values_nested_leaf | 0.1413ms | 45.7151μs | 21.8746 KOps/s | 22.3410 KOps/s | |
test_values_stack_nested | 84.0480μs | 52.0252μs | 19.2214 KOps/s | 19.9332 KOps/s | |
test_values_stack_nested_leaf | 83.4370μs | 46.0154μs | 21.7319 KOps/s | 22.6590 KOps/s | |
test_values_stack_nested_locked | 97.4330μs | 51.3049μs | 19.4913 KOps/s | 19.9427 KOps/s | |
test_membership | 3.9387μs | 0.7491μs | 1.3349 MOps/s | 1.0903 MOps/s | |
test_membership_nested | 22.2120μs | 2.7746μs | 360.4091 KOps/s | 369.1447 KOps/s | |
test_membership_nested_leaf | 28.0220μs | 2.8173μs | 354.9490 KOps/s | 365.7796 KOps/s | |
test_membership_stacked_nested | 21.3900μs | 2.7659μs | 361.5404 KOps/s | 373.1302 KOps/s | |
test_membership_stacked_nested_leaf | 26.9400μs | 2.8409μs | 352.0006 KOps/s | 367.6752 KOps/s | |
test_membership_nested_last | 48.6140μs | 4.1599μs | 240.3918 KOps/s | 246.3561 KOps/s | |
test_membership_nested_leaf_last | 30.4170μs | 4.2074μs | 237.6792 KOps/s | 243.7546 KOps/s | |
test_membership_stacked_nested_last | 40.2660μs | 5.3338μs | 187.4843 KOps/s | 247.2349 KOps/s | |
test_membership_stacked_nested_leaf_last | 28.1620μs | 5.3935μs | 185.4096 KOps/s | 244.3903 KOps/s | |
test_nested_getleaf | 38.7930μs | 10.9383μs | 91.4216 KOps/s | 90.1078 KOps/s | |
test_nested_get | 33.1520μs | 10.4247μs | 95.9260 KOps/s | 97.4124 KOps/s | |
test_stacked_getleaf | 55.8820μs | 10.9857μs | 91.0274 KOps/s | 91.3636 KOps/s | |
test_stacked_get | 33.2430μs | 10.3711μs | 96.4214 KOps/s | 96.7201 KOps/s | |
test_nested_getitemleaf | 50.9260μs | 11.3535μs | 88.0785 KOps/s | 88.0886 KOps/s | |
test_nested_getitem | 42.6110μs | 10.5825μs | 94.4953 KOps/s | 95.9017 KOps/s | |
test_stacked_getitemleaf | 50.8490μs | 11.4646μs | 87.2250 KOps/s | 89.7708 KOps/s | |
test_stacked_getitem | 40.9370μs | 10.3848μs | 96.2942 KOps/s | 96.6552 KOps/s | |
test_lock_nested | 0.9993ms | 0.5298ms | 1.8875 KOps/s | 1.6580 KOps/s | |
test_lock_stack_nested | 0.7641ms | 0.4976ms | 2.0098 KOps/s | 2.0220 KOps/s | |
test_unlock_nested | 0.9754ms | 0.4477ms | 2.2337 KOps/s | 2.2924 KOps/s | |
test_unlock_stack_nested | 0.5058ms | 0.4117ms | 2.4292 KOps/s | 2.4565 KOps/s | |
test_flatten_speed | 0.6104ms | 0.1077ms | 9.2847 KOps/s | 9.5295 KOps/s | |
test_unflatten_speed | 0.9707ms | 0.4510ms | 2.2171 KOps/s | 2.2130 KOps/s | |
test_common_ops | 6.3796ms | 1.2049ms | 829.9519 Ops/s | 859.0020 Ops/s | |
test_creation | 19.6170μs | 2.4630μs | 406.0016 KOps/s | 398.4733 KOps/s | |
test_creation_empty | 66.5740μs | 21.2279μs | 47.1078 KOps/s | 53.4715 KOps/s | |
test_creation_nested_1 | 87.2040μs | 24.9258μs | 40.1191 KOps/s | 44.4607 KOps/s | |
test_creation_nested_2 | 71.7440μs | 28.9752μs | 34.5123 KOps/s | 38.0885 KOps/s | |
test_clone | 89.6880μs | 18.1327μs | 55.1490 KOps/s | 54.9509 KOps/s | |
test_getitem[int] | 0.9285ms | 13.3032μs | 75.1701 KOps/s | 75.1025 KOps/s | |
test_getitem[slice_int] | 0.1679ms | 33.1909μs | 30.1288 KOps/s | 28.7917 KOps/s | |
test_getitem[range] | 0.2150ms | 59.1132μs | 16.9167 KOps/s | 17.0207 KOps/s | |
test_getitem[tuple] | 0.1413ms | 27.0761μs | 36.9329 KOps/s | 35.7911 KOps/s | |
test_getitem[list] | 0.2890ms | 53.9488μs | 18.5361 KOps/s | 18.7747 KOps/s | |
test_setitem_dim[int] | 85.3400μs | 37.2496μs | 26.8459 KOps/s | 29.4039 KOps/s | |
test_setitem_dim[slice_int] | 0.1118ms | 75.7156μs | 13.2073 KOps/s | 13.5263 KOps/s | |
test_setitem_dim[range] | 0.1781ms | 97.9818μs | 10.2060 KOps/s | 10.5785 KOps/s | |
test_setitem_dim[tuple] | 98.5450μs | 62.0693μs | 16.1110 KOps/s | 16.2254 KOps/s | |
test_setitem | 0.1276ms | 31.9563μs | 31.2928 KOps/s | 32.5069 KOps/s | |
test_set | 0.1067ms | 31.2686μs | 31.9810 KOps/s | 33.4120 KOps/s | |
test_set_shared | 2.9041ms | 0.2203ms | 4.5395 KOps/s | 4.5501 KOps/s | |
test_update | 0.1787ms | 39.7797μs | 25.1385 KOps/s | 26.4736 KOps/s | |
test_update_nested | 0.1354ms | 49.8414μs | 20.0636 KOps/s | 20.6678 KOps/s | |
test_update__nested | 0.1192ms | 36.3824μs | 27.4858 KOps/s | 27.3257 KOps/s | |
test_set_nested | 98.4350μs | 33.9878μs | 29.4223 KOps/s | 30.5709 KOps/s | |
test_set_nested_new | 0.1540ms | 39.5077μs | 25.3115 KOps/s | 26.0610 KOps/s | |
test_select | 0.1366ms | 56.4041μs | 17.7292 KOps/s | 17.9889 KOps/s | |
test_select_nested | 0.1285ms | 60.7845μs | 16.4516 KOps/s | 16.4611 KOps/s | |
test_exclude_nested | 0.1645ms | 80.9896μs | 12.3473 KOps/s | 12.4711 KOps/s | |
test_empty[True] | 0.5351ms | 0.3406ms | 2.9359 KOps/s | 2.9248 KOps/s | |
test_empty[False] | 6.8202μs | 1.2264μs | 815.4039 KOps/s | 778.5207 KOps/s | |
test_unbind_speed | 0.4759ms | 0.3360ms | 2.9764 KOps/s | 3.0045 KOps/s | |
test_unbind_speed_stack0 | 0.4589ms | 0.3317ms | 3.0144 KOps/s | 3.0458 KOps/s | |
test_unbind_speed_stack1 | 76.2018ms | 0.8364ms | 1.1955 KOps/s | 1.1767 KOps/s | |
test_split | 81.3536ms | 2.3494ms | 425.6480 Ops/s | 463.9704 Ops/s | |
test_chunk | 78.2386ms | 2.3437ms | 426.6777 Ops/s | 398.2070 Ops/s | |
test_creation[device0] | 3.7940ms | 0.1264ms | 7.9136 KOps/s | 8.1993 KOps/s | |
test_creation_from_tensor | 0.2911ms | 0.1220ms | 8.1959 KOps/s | 8.0943 KOps/s | |
test_add_one[memmap_tensor0] | 0.2217ms | 8.1530μs | 122.6545 KOps/s | 121.6949 KOps/s | |
test_contiguous[memmap_tensor0] | 22.1520μs | 2.2755μs | 439.4612 KOps/s | 460.8909 KOps/s | |
test_stack[memmap_tensor0] | 59.9930μs | 6.1353μs | 162.9918 KOps/s | 162.5652 KOps/s | |
test_memmaptd_index | 1.2245ms | 0.4545ms | 2.2002 KOps/s | 2.2415 KOps/s | |
test_memmaptd_index_astensor | 1.1235ms | 0.5417ms | 1.8459 KOps/s | 1.9079 KOps/s | |
test_memmaptd_index_op | 1.5185ms | 1.1349ms | 881.1704 Ops/s | 923.7391 Ops/s | |
test_serialize_model | 0.1992s | 0.1393s | 7.1805 Ops/s | 7.4404 Ops/s | |
test_serialize_model_pickle | 0.4404s | 0.3922s | 2.5498 Ops/s | 2.4975 Ops/s | |
test_serialize_weights | 0.1303s | 0.1249s | 8.0050 Ops/s | 7.0313 Ops/s | |
test_serialize_weights_returnearly | 0.1848s | 0.1714s | 5.8328 Ops/s | 6.2086 Ops/s | |
test_serialize_weights_pickle | 0.4779s | 0.4147s | 2.4117 Ops/s | 2.5462 Ops/s | |
test_serialize_weights_filesystem | 0.2264s | 0.1565s | 6.3909 Ops/s | 6.8290 Ops/s | |
test_serialize_model_filesystem | 0.1542s | 0.1490s | 6.7100 Ops/s | 5.9527 Ops/s | |
test_reshape_pytree | 96.1900μs | 41.5289μs | 24.0796 KOps/s | 25.1604 KOps/s | |
test_reshape_td | 0.1103ms | 50.2262μs | 19.9099 KOps/s | 20.1817 KOps/s | |
test_view_pytree | 96.5010μs | 40.4680μs | 24.7109 KOps/s | 25.4993 KOps/s | |
test_view_td | 0.1168ms | 57.2649μs | 17.4627 KOps/s | 17.7476 KOps/s | |
test_unbind_pytree | 83.5170μs | 37.3991μs | 26.7386 KOps/s | 27.6140 KOps/s | |
test_unbind_td | 0.3673ms | 50.2633μs | 19.8952 KOps/s | 20.7545 KOps/s | |
test_split_pytree | 0.1011ms | 40.2486μs | 24.8456 KOps/s | 25.4193 KOps/s | |
test_split_td | 0.5391ms | 64.4650μs | 15.5123 KOps/s | 15.6725 KOps/s | |
test_add_pytree | 96.1710μs | 44.8067μs | 22.3181 KOps/s | 22.0231 KOps/s | |
test_add_td | 0.1734ms | 88.2813μs | 11.3274 KOps/s | 11.6069 KOps/s | |
test_distributed | 0.2292ms | 0.1312ms | 7.6245 KOps/s | 7.4982 KOps/s | |
test_tdmodule | 44.1830μs | 18.8988μs | 52.9135 KOps/s | 61.0421 KOps/s | |
test_tdmodule_dispatch | 55.8850μs | 38.5754μs | 25.9232 KOps/s | 28.8591 KOps/s | |
test_tdseq | 50.3340μs | 20.2496μs | 49.3838 KOps/s | 55.1012 KOps/s | |
test_tdseq_dispatch | 63.8500μs | 42.7170μs | 23.4099 KOps/s | 25.7866 KOps/s | |
test_instantiation_functorch | 2.3998ms | 1.6582ms | 603.0483 Ops/s | 610.5732 Ops/s | |
test_instantiation_td | 2.2135ms | 1.1534ms | 866.9947 Ops/s | 850.9578 Ops/s | |
test_exec_functorch | 0.3916ms | 0.1867ms | 5.3554 KOps/s | 5.2513 KOps/s | |
test_exec_functional_call | 0.2993ms | 0.1787ms | 5.5975 KOps/s | 5.6177 KOps/s | |
test_exec_td | 0.2815ms | 0.1768ms | 5.6570 KOps/s | 5.5807 KOps/s | |
test_exec_td_decorator | 0.5711ms | 0.2630ms | 3.8023 KOps/s | 3.7864 KOps/s | |
test_vmap_mlp_speed[True-True] | 0.8998ms | 0.6260ms | 1.5973 KOps/s | 1.6007 KOps/s | |
test_vmap_mlp_speed[True-False] | 0.9255ms | 0.6222ms | 1.6073 KOps/s | 1.6034 KOps/s | |
test_vmap_mlp_speed[False-True] | 0.7069ms | 0.5130ms | 1.9495 KOps/s | 1.9343 KOps/s | |
test_vmap_mlp_speed[False-False] | 0.8019ms | 0.5162ms | 1.9374 KOps/s | 1.9509 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 1.3257ms | 0.7207ms | 1.3876 KOps/s | 1.3920 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 1.5553ms | 0.7194ms | 1.3901 KOps/s | 1.3994 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.8875ms | 0.5911ms | 1.6918 KOps/s | 1.6835 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.9378ms | 0.5919ms | 1.6894 KOps/s | 1.6760 KOps/s | |
test_to_module_speed[True] | 2.2201ms | 1.8277ms | 547.1224 Ops/s | 537.5509 Ops/s | |
test_to_module_speed[False] | 1.9580ms | 1.7944ms | 557.2767 Ops/s | 546.5006 Ops/s | |
test_tc_init | 0.1061ms | 47.8902μs | 20.8811 KOps/s | 22.1792 KOps/s | |
test_tc_init_nested | 0.1685ms | 95.4204μs | 10.4799 KOps/s | 10.7490 KOps/s | |
test_tc_first_layer_tensor | 39.2030μs | 9.3121μs | 107.3876 KOps/s | 108.8043 KOps/s | |
test_tc_first_layer_nontensor | 53.2800μs | 9.2950μs | 107.5852 KOps/s | 109.1078 KOps/s | |
test_tc_second_layer_tensor | 25.4270μs | 2.7810μs | 359.5831 KOps/s | 354.9288 KOps/s | |
test_tc_second_layer_nontensor | 39.0430μs | 10.5601μs | 94.6960 KOps/s | 97.3652 KOps/s | |
test_unbind | 8.8957ms | 8.6921ms | 115.0473 Ops/s | 73.6357 Ops/s | |
test_full_like | 8.0755ms | 7.1390ms | 140.0765 Ops/s | 126.4530 Ops/s | |
test_zeros_like | 11.2823ms | 6.4729ms | 154.4907 Ops/s | 132.4493 Ops/s | |
test_ones_like | 14.4500ms | 7.5467ms | 132.5082 Ops/s | 130.1348 Ops/s | |
test_clone | 18.6204ms | 9.1407ms | 109.4007 Ops/s | 106.4577 Ops/s | |
test_squeeze | 64.2710μs | 15.3785μs | 65.0257 KOps/s | 65.6813 KOps/s | |
test_unsqueeze | 0.2662ms | 97.3906μs | 10.2679 KOps/s | 10.3030 KOps/s | |
test_split | 0.4151ms | 0.2087ms | 4.7926 KOps/s | 4.7657 KOps/s | |
test_permute | 0.3503ms | 0.2269ms | 4.4069 KOps/s | 4.4287 KOps/s | |
test_stack | 34.4359ms | 24.7380ms | 40.4237 Ops/s | 39.8593 Ops/s | |
test_cat | 28.7904ms | 24.3533ms | 41.0622 Ops/s | 39.9918 Ops/s |
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 0.4894ms | 17.9837μs | 55.6058 KOps/s | 58.4476 KOps/s | |
test_plain_set_stack_nested | 1.7104ms | 18.3041μs | 54.6327 KOps/s | 58.2828 KOps/s | |
test_plain_set_nested_inplace | 38.3010μs | 18.9222μs | 52.8480 KOps/s | 54.4586 KOps/s | |
test_plain_set_stack_nested_inplace | 36.6310μs | 18.8991μs | 52.9125 KOps/s | 54.8865 KOps/s | |
test_items | 26.3100μs | 4.7548μs | 210.3157 KOps/s | 211.5822 KOps/s | |
test_items_nested | 0.4177ms | 0.3955ms | 2.5283 KOps/s | 2.5269 KOps/s | |
test_items_nested_locked | 0.4258ms | 0.3995ms | 2.5033 KOps/s | 2.5464 KOps/s | |
test_items_nested_leaf | 0.1151ms | 87.3588μs | 11.4470 KOps/s | 11.4210 KOps/s | |
test_items_stack_nested | 0.4503ms | 0.4000ms | 2.4998 KOps/s | 2.5338 KOps/s | |
test_items_stack_nested_leaf | 0.1148ms | 87.7003μs | 11.4025 KOps/s | 11.6322 KOps/s | |
test_items_stack_nested_locked | 0.4405ms | 0.4003ms | 2.4978 KOps/s | 2.5573 KOps/s | |
test_keys | 19.5910μs | 4.4008μs | 227.2327 KOps/s | 228.3211 KOps/s | |
test_keys_nested | 97.6910μs | 68.8970μs | 14.5144 KOps/s | 15.1835 KOps/s | |
test_keys_nested_locked | 0.9564ms | 74.1480μs | 13.4865 KOps/s | 13.5830 KOps/s | |
test_keys_nested_leaf | 80.9020μs | 56.7034μs | 17.6356 KOps/s | 17.8677 KOps/s | |
test_keys_stack_nested | 0.1013ms | 68.8646μs | 14.5213 KOps/s | 15.3932 KOps/s | |
test_keys_stack_nested_leaf | 77.0010μs | 58.5157μs | 17.0894 KOps/s | 17.3584 KOps/s | |
test_keys_stack_nested_locked | 93.5310μs | 73.9529μs | 13.5221 KOps/s | 13.9643 KOps/s | |
test_values | 9.1603μs | 1.7604μs | 568.0576 KOps/s | 564.3839 KOps/s | |
test_values_nested | 52.8810μs | 34.1994μs | 29.2403 KOps/s | 29.3087 KOps/s | |
test_values_nested_locked | 55.9910μs | 36.1005μs | 27.7004 KOps/s | 27.5063 KOps/s | |
test_values_nested_leaf | 52.2510μs | 30.3854μs | 32.9106 KOps/s | 33.1151 KOps/s | |
test_values_stack_nested | 68.5510μs | 34.7177μs | 28.8038 KOps/s | 28.6930 KOps/s | |
test_values_stack_nested_leaf | 54.0010μs | 30.8477μs | 32.4173 KOps/s | 32.1246 KOps/s | |
test_values_stack_nested_locked | 57.7510μs | 36.5486μs | 27.3608 KOps/s | 27.0902 KOps/s | |
test_membership | 1.2810μs | 0.5399μs | 1.8522 MOps/s | 1.8510 MOps/s | |
test_membership_nested | 15.7300μs | 2.0917μs | 478.0689 KOps/s | 473.8803 KOps/s | |
test_membership_nested_leaf | 10.2600μs | 2.0553μs | 486.5387 KOps/s | 483.5920 KOps/s | |
test_membership_stacked_nested | 23.6200μs | 2.1061μs | 474.8214 KOps/s | 485.5792 KOps/s | |
test_membership_stacked_nested_leaf | 16.1300μs | 2.0659μs | 484.0405 KOps/s | 481.9617 KOps/s | |
test_membership_nested_last | 20.8110μs | 3.0544μs | 327.4009 KOps/s | 331.2761 KOps/s | |
test_membership_nested_leaf_last | 18.6900μs | 3.0542μs | 327.4224 KOps/s | 330.0220 KOps/s | |
test_membership_stacked_nested_last | 36.1810μs | 3.4774μs | 287.5738 KOps/s | 107.8330 KOps/s | |
test_membership_stacked_nested_leaf_last | 20.1210μs | 3.4779μs | 287.5283 KOps/s | 108.1986 KOps/s | |
test_nested_getleaf | 25.9810μs | 8.0492μs | 124.2353 KOps/s | 123.0623 KOps/s | |
test_nested_get | 23.3700μs | 7.6059μs | 131.4761 KOps/s | 131.2801 KOps/s | |
test_stacked_getleaf | 24.8100μs | 8.0530μs | 124.1780 KOps/s | 124.2933 KOps/s | |
test_stacked_get | 22.4800μs | 7.5775μs | 131.9696 KOps/s | 132.4825 KOps/s | |
test_nested_getitemleaf | 21.5310μs | 8.2009μs | 121.9375 KOps/s | 122.0549 KOps/s | |
test_nested_getitem | 25.0700μs | 7.7418μs | 129.1694 KOps/s | 129.1027 KOps/s | |
test_stacked_getitemleaf | 27.3810μs | 8.2530μs | 121.1688 KOps/s | 121.4940 KOps/s | |
test_stacked_getitem | 23.5400μs | 7.7710μs | 128.6832 KOps/s | 128.7931 KOps/s | |
test_lock_nested | 1.3000ms | 0.4818ms | 2.0756 KOps/s | 2.0723 KOps/s | |
test_lock_stack_nested | 0.4955ms | 0.4423ms | 2.2608 KOps/s | 2.2985 KOps/s | |
test_unlock_nested | 0.8256ms | 0.3957ms | 2.5269 KOps/s | 2.4633 KOps/s | |
test_unlock_stack_nested | 0.5009ms | 0.3584ms | 2.7902 KOps/s | 2.8156 KOps/s | |
test_flatten_speed | 0.2057ms | 0.1067ms | 9.3733 KOps/s | 9.3557 KOps/s | |
test_unflatten_speed | 0.3327ms | 0.2955ms | 3.3843 KOps/s | 3.3822 KOps/s | |
test_common_ops | 1.7973ms | 1.4351ms | 696.8128 Ops/s | 732.8676 Ops/s | |
test_creation | 17.6100μs | 2.0410μs | 489.9464 KOps/s | 487.3441 KOps/s | |
test_creation_empty | 41.8400μs | 19.2156μs | 52.0410 KOps/s | 55.9380 KOps/s | |
test_creation_nested_1 | 38.7810μs | 21.2844μs | 46.9828 KOps/s | 50.6431 KOps/s | |
test_creation_nested_2 | 0.1928ms | 24.0049μs | 41.6581 KOps/s | 44.5547 KOps/s | |
test_clone | 63.4110μs | 34.0724μs | 29.3493 KOps/s | 30.3786 KOps/s | |
test_getitem[int] | 1.1611ms | 17.6809μs | 56.5584 KOps/s | 56.6221 KOps/s | |
test_getitem[slice_int] | 0.1570ms | 30.4462μs | 32.8448 KOps/s | 32.9723 KOps/s | |
test_getitem[range] | 0.2945ms | 0.1209ms | 8.2741 KOps/s | 8.3908 KOps/s | |
test_getitem[tuple] | 0.1528ms | 26.3236μs | 37.9887 KOps/s | 37.9445 KOps/s | |
test_getitem[list] | 0.2372ms | 0.1097ms | 9.1140 KOps/s | 8.9997 KOps/s | |
test_setitem_dim[int] | 77.9120μs | 57.2353μs | 17.4717 KOps/s | 17.0231 KOps/s | |
test_setitem_dim[slice_int] | 0.1127ms | 82.6383μs | 12.1009 KOps/s | 12.3045 KOps/s | |
test_setitem_dim[range] | 0.1752ms | 0.1467ms | 6.8183 KOps/s | 6.5660 KOps/s | |
test_setitem_dim[tuple] | 0.1013ms | 75.4410μs | 13.2554 KOps/s | 13.4929 KOps/s | |
test_setitem | 94.2320μs | 46.6644μs | 21.4296 KOps/s | 21.0788 KOps/s | |
test_set | 70.1410μs | 45.1199μs | 22.1632 KOps/s | 21.8592 KOps/s | |
test_set_shared | 0.3868ms | 55.8149μs | 17.9164 KOps/s | 17.6462 KOps/s | |
test_update | 98.9230μs | 54.9657μs | 18.1932 KOps/s | 18.4083 KOps/s | |
test_update_nested | 95.7810μs | 66.8269μs | 14.9640 KOps/s | 16.0038 KOps/s | |
test_update__nested | 0.1075ms | 69.9336μs | 14.2993 KOps/s | 15.2341 KOps/s | |
test_set_nested | 0.4695ms | 51.1338μs | 19.5565 KOps/s | 20.7837 KOps/s | |
test_set_nested_new | 93.0520μs | 55.1227μs | 18.1414 KOps/s | 18.7293 KOps/s | |
test_select | 0.1047ms | 70.4632μs | 14.1918 KOps/s | 14.1451 KOps/s | |
test_select_nested | 77.8010μs | 54.3300μs | 18.4060 KOps/s | 18.9318 KOps/s | |
test_exclude_nested | 92.5710μs | 72.3394μs | 13.8237 KOps/s | 13.7927 KOps/s | |
test_empty[True] | 0.3524ms | 0.3015ms | 3.3166 KOps/s | 3.3462 KOps/s | |
test_empty[False] | 2.4291μs | 0.9309μs | 1.0742 MOps/s | 1.0747 MOps/s | |
test_to | 65.5320μs | 38.1935μs | 26.1825 KOps/s | 26.4149 KOps/s | |
test_to_nonblocking | 51.8310μs | 24.2673μs | 41.2076 KOps/s | 42.0147 KOps/s | |
test_unbind_speed | 0.3590ms | 0.3112ms | 3.2134 KOps/s | 3.2261 KOps/s | |
test_unbind_speed_stack0 | 0.3618ms | 0.3080ms | 3.2463 KOps/s | 3.2611 KOps/s | |
test_unbind_speed_stack1 | 86.9174ms | 0.7888ms | 1.2678 KOps/s | 1.2790 KOps/s | |
test_split | 89.0993ms | 2.3581ms | 424.0649 Ops/s | 416.3638 Ops/s | |
test_chunk | 2.3252ms | 2.1887ms | 456.8836 Ops/s | 416.3445 Ops/s | |
test_creation[device0] | 0.1569ms | 0.1062ms | 9.4174 KOps/s | 8.9936 KOps/s | |
test_creation_from_tensor | 0.1604ms | 0.1032ms | 9.6936 KOps/s | 9.2338 KOps/s | |
test_add_one[memmap_tensor0] | 22.2210μs | 9.3923μs | 106.4705 KOps/s | 96.6671 KOps/s | |
test_contiguous[memmap_tensor0] | 20.3110μs | 2.2575μs | 442.9765 KOps/s | 437.2904 KOps/s | |
test_stack[memmap_tensor0] | 31.9110μs | 6.7117μs | 148.9945 KOps/s | 141.8527 KOps/s | |
test_memmaptd_index | 90.2815ms | 0.4998ms | 2.0007 KOps/s | 2.2001 KOps/s | |
test_memmaptd_index_astensor | 0.7773ms | 0.5062ms | 1.9755 KOps/s | 1.9230 KOps/s | |
test_memmaptd_index_op | 1.5167ms | 1.1004ms | 908.7743 Ops/s | 894.3267 Ops/s | |
test_serialize_model | 0.1007s | 96.7170ms | 10.3394 Ops/s | 10.1723 Ops/s | |
test_serialize_model_pickle | 1.3684s | 1.2364s | 0.8088 Ops/s | 0.7464 Ops/s | |
test_serialize_weights | 0.1878s | 0.1024s | 9.7669 Ops/s | 9.2226 Ops/s | |
test_serialize_weights_returnearly | 82.7812ms | 72.3278ms | 13.8259 Ops/s | 11.2790 Ops/s | |
test_serialize_weights_pickle | 1.3471s | 1.2360s | 0.8091 Ops/s | 0.8035 Ops/s | |
test_reshape_pytree | 73.3210μs | 39.1696μs | 25.5300 KOps/s | 25.3131 KOps/s | |
test_reshape_td | 88.8620μs | 44.9046μs | 22.2694 KOps/s | 22.3750 KOps/s | |
test_view_pytree | 64.0620μs | 38.8112μs | 25.7658 KOps/s | 25.1216 KOps/s | |
test_view_td | 82.0920μs | 51.0189μs | 19.6006 KOps/s | 19.2695 KOps/s | |
test_unbind_pytree | 0.1904ms | 38.4662μs | 25.9968 KOps/s | 26.2086 KOps/s | |
test_unbind_td | 0.4763ms | 47.7490μs | 20.9429 KOps/s | 20.6963 KOps/s | |
test_split_pytree | 81.8120μs | 51.6126μs | 19.3751 KOps/s | 19.1219 KOps/s | |
test_split_td | 88.5572ms | 69.5585μs | 14.3764 KOps/s | 15.8405 KOps/s | |
test_add_pytree | 0.1018ms | 60.9340μs | 16.4112 KOps/s | 16.0067 KOps/s | |
test_add_td | 0.1598ms | 0.1020ms | 9.8075 KOps/s | 10.1099 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.4160ms | 0.2122ms | 4.7118 KOps/s | 4.6636 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.2620ms | 0.1778ms | 5.6249 KOps/s | 5.5824 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1809ms | 0.1477ms | 6.7722 KOps/s | 6.7829 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2655ms | 0.1964ms | 5.0926 KOps/s | 4.9974 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 92.9120μs | 22.0486μs | 45.3545 KOps/s | 45.0575 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 67.9610μs | 50.1598μs | 19.9363 KOps/s | 20.3097 KOps/s | |
test_compile_copy_nested[pytree-compile] | 99.9720μs | 72.8188μs | 13.7327 KOps/s | 14.0106 KOps/s | |
test_compile_copy_nested[pytree-eager] | 80.0210μs | 60.1926μs | 16.6133 KOps/s | 16.7698 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.3746ms | 0.3345ms | 2.9899 KOps/s | 3.0085 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3238ms | 0.2241ms | 4.4616 KOps/s | 4.5343 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1832ms | 0.1333ms | 7.5014 KOps/s | 7.3633 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1309ms | 64.7912μs | 15.4342 KOps/s | 14.6652 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.3872ms | 0.3333ms | 3.0001 KOps/s | 2.9980 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.7064ms | 0.6444ms | 1.5518 KOps/s | 1.5329 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3456ms | 0.2728ms | 3.6653 KOps/s | 3.7000 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.3738ms | 0.3378ms | 2.9599 KOps/s | 2.9750 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1611ms | 77.3605μs | 12.9265 KOps/s | 12.8175 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.2811ms | 0.1333ms | 7.5000 KOps/s | 7.4314 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.6103ms | 0.5503ms | 1.8173 KOps/s | 1.7576 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.3683ms | 0.3323ms | 3.0091 KOps/s | 3.0088 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 36.4710μs | 19.3224μs | 51.7535 KOps/s | 52.3361 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 51.8210μs | 31.7711μs | 31.4751 KOps/s | 31.0560 KOps/s | |
test_compile_copy_flat[pytree-compile] | 98.5510μs | 75.6689μs | 13.2155 KOps/s | 13.1901 KOps/s | |
test_compile_copy_flat[pytree-eager] | 78.1020μs | 60.8864μs | 16.4240 KOps/s | 16.4987 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 2.5584ms | 0.9482ms | 1.0546 KOps/s | 1.0566 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 3.8813ms | 3.4869ms | 286.7854 Ops/s | 287.4858 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 2.5300ms | 0.9370ms | 1.0672 KOps/s | 1.0742 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 3.5384ms | 3.4700ms | 288.1825 Ops/s | 287.0704 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.1474ms | 0.1131ms | 8.8401 KOps/s | 8.8150 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.2385ms | 66.7049μs | 14.9914 KOps/s | 14.7177 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.1506ms | 0.1059ms | 9.4391 KOps/s | 9.4435 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 73.0820μs | 47.2233μs | 21.1760 KOps/s | 20.8144 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1500ms | 0.1047ms | 9.5511 KOps/s | 9.0179 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.1171ms | 47.1000μs | 21.2314 KOps/s | 19.4722 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1837ms | 0.1425ms | 7.0182 KOps/s | 6.9865 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1959ms | 27.0429μs | 36.9783 KOps/s | 35.5292 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.2136ms | 0.1341ms | 7.4548 KOps/s | 7.1832 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 53.1510μs | 23.0247μs | 43.4316 KOps/s | 42.1337 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1852ms | 0.1374ms | 7.2792 KOps/s | 7.2638 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 47.5600μs | 22.6176μs | 44.2134 KOps/s | 42.5948 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.2041ms | 0.1417ms | 7.0588 KOps/s | 7.0131 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.5501ms | 26.6618μs | 37.5068 KOps/s | 35.5832 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1594ms | 0.1331ms | 7.5107 KOps/s | 7.4530 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 48.9910μs | 22.7280μs | 43.9986 KOps/s | 42.5840 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1835ms | 0.1334ms | 7.4961 KOps/s | 7.3896 KOps/s | |
test_compile_indexing[int-pytree-eager] | 46.0610μs | 22.7864μs | 43.8858 KOps/s | 42.5553 KOps/s | |
test_mod_add[eager] | 71.1410μs | 39.7918μs | 25.1308 KOps/s | 25.1097 KOps/s | |
test_mod_add[compile] | 0.1154ms | 69.4774μs | 14.3932 KOps/s | 14.3967 KOps/s | |
test_mod_add[compile-overhead] | 0.2630ms | 0.1491ms | 6.7071 KOps/s | 6.7164 KOps/s | |
test_mod_wrap[eager] | 0.3498ms | 0.2641ms | 3.7859 KOps/s | 3.6592 KOps/s | |
test_mod_wrap[compile] | 1.2029ms | 0.2993ms | 3.3412 KOps/s | 3.3055 KOps/s | |
test_mod_wrap[compile-overhead] | 7.7715ms | 4.1275ms | 242.2792 Ops/s | 236.0045 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.5865ms | 1.4740ms | 678.4348 Ops/s | 725.7881 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.5682ms | 1.4797ms | 675.8340 Ops/s | 670.6832 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.4544ms | 1.0019ms | 998.0865 Ops/s | 999.8928 Ops/s | |
test_seq_add[eager] | 0.1834ms | 0.1168ms | 8.5625 KOps/s | 8.4198 KOps/s | |
test_seq_add[compile] | 0.2130ms | 86.9830μs | 11.4965 KOps/s | 11.2143 KOps/s | |
test_seq_add[compile-overhead] | 0.1605ms | 0.1254ms | 7.9726 KOps/s | 8.0322 KOps/s | |
test_seq_wrap[eager] | 0.5105ms | 0.4439ms | 2.2528 KOps/s | 2.2826 KOps/s | |
test_seq_wrap[compile] | 1.4858ms | 0.3319ms | 3.0132 KOps/s | 2.9896 KOps/s | |
test_seq_wrap[compile-overhead] | 0.3072s | 0.1439s | 6.9493 Ops/s | 6.9115 Ops/s | |
test_func_call_runtime[False-eager] | 0.8706ms | 0.7722ms | 1.2951 KOps/s | 1.3070 KOps/s | |
test_func_call_runtime[False-compile] | 0.9434ms | 0.8435ms | 1.1856 KOps/s | 1.2046 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.4557ms | 0.3673ms | 2.7225 KOps/s | 2.7019 KOps/s | |
test_func_call_runtime[True-eager] | 1.1821ms | 1.0239ms | 976.6486 Ops/s | 973.9417 Ops/s | |
test_func_call_runtime[True-compile] | 0.9562ms | 0.8716ms | 1.1473 KOps/s | 1.1471 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.4877ms | 0.4080ms | 2.4509 KOps/s | 2.4432 KOps/s | |
test_distributed | 0.8512ms | 70.8252μs | 14.1193 KOps/s | 14.6739 KOps/s | |
test_tdmodule | 33.5110μs | 17.0287μs | 58.7242 KOps/s | 60.7354 KOps/s | |
test_tdmodule_dispatch | 52.8400μs | 34.6784μs | 28.8364 KOps/s | 30.1436 KOps/s | |
test_tdseq | 33.3400μs | 17.7097μs | 56.4661 KOps/s | 58.6927 KOps/s | |
test_tdseq_dispatch | 53.3410μs | 36.7850μs | 27.1850 KOps/s | 28.3537 KOps/s | |
test_instantiation_functorch | 2.1093ms | 2.0245ms | 493.9424 Ops/s | 491.7527 Ops/s | |
test_instantiation_td | 2.0532ms | 1.3169ms | 759.3381 Ops/s | 754.6258 Ops/s | |
test_exec_functorch | 0.3382ms | 0.2288ms | 4.3716 KOps/s | 4.3864 KOps/s | |
test_exec_functional_call | 0.2678ms | 0.2288ms | 4.3698 KOps/s | 4.4382 KOps/s | |
test_exec_td | 0.2721ms | 0.2264ms | 4.4165 KOps/s | 4.4334 KOps/s | |
test_exec_td_decorator | 0.5252ms | 0.3046ms | 3.2834 KOps/s | 3.3261 KOps/s | |
test_vmap_mlp_speed[True-True] | 0.7418ms | 0.6882ms | 1.4531 KOps/s | 1.4575 KOps/s | |
test_vmap_mlp_speed[True-False] | 0.7644ms | 0.6846ms | 1.4607 KOps/s | 1.4622 KOps/s | |
test_vmap_mlp_speed[False-True] | 0.6524ms | 0.5997ms | 1.6676 KOps/s | 1.6775 KOps/s | |
test_vmap_mlp_speed[False-False] | 0.6385ms | 0.5998ms | 1.6672 KOps/s | 1.6730 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 1.3071ms | 0.7672ms | 1.3034 KOps/s | 1.3102 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.9152ms | 0.7656ms | 1.3062 KOps/s | 1.3177 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.8842ms | 0.6671ms | 1.4991 KOps/s | 1.5113 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.8055ms | 0.6654ms | 1.5028 KOps/s | 1.5155 KOps/s | |
test_vmap_transformer_speed[True-True] | 9.1757ms | 8.9715ms | 111.4642 Ops/s | 111.5347 Ops/s | |
test_vmap_transformer_speed[True-False] | 9.3763ms | 9.0226ms | 110.8328 Ops/s | 110.9224 Ops/s | |
test_vmap_transformer_speed[False-True] | 9.0375ms | 8.8897ms | 112.4896 Ops/s | 112.7141 Ops/s | |
test_vmap_transformer_speed[False-False] | 8.9739ms | 8.8903ms | 112.4815 Ops/s | 112.9106 Ops/s | |
test_vmap_transformer_speed_decorator[True-True] | 22.4365ms | 21.4286ms | 46.6667 Ops/s | 46.9578 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 21.4672ms | 21.3936ms | 46.7430 Ops/s | 46.9427 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 21.2463ms | 21.1746ms | 47.2264 Ops/s | 47.4754 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 21.3033ms | 21.2345ms | 47.0932 Ops/s | 47.3390 Ops/s | |
test_to_module_speed[True] | 1.6134ms | 1.4939ms | 669.4010 Ops/s | 675.5912 Ops/s | |
test_to_module_speed[False] | 1.6195ms | 1.4799ms | 675.7418 Ops/s | 681.5817 Ops/s | |
test_tc_init | 62.6410μs | 40.4992μs | 24.6918 KOps/s | 26.8175 KOps/s | |
test_tc_init_nested | 0.1642ms | 85.1739μs | 11.7407 KOps/s | 13.3440 KOps/s | |
test_tc_first_layer_tensor | 19.4900μs | 3.9863μs | 250.8596 KOps/s | 253.6091 KOps/s | |
test_tc_first_layer_nontensor | 19.2500μs | 4.0667μs | 245.9016 KOps/s | 251.2515 KOps/s | |
test_tc_second_layer_tensor | 4.8952μs | 1.2866μs | 777.2181 KOps/s | 768.9088 KOps/s | |
test_tc_second_layer_nontensor | 22.4010μs | 4.6432μs | 215.3710 KOps/s | 219.2738 KOps/s | |
test_unbind | 0.3159s | 13.1512ms | 76.0386 Ops/s | 82.7279 Ops/s | |
test_full_like | 0.6570ms | 0.5780ms | 1.7300 KOps/s | 1.7303 KOps/s | |
test_zeros_like | 0.2592ms | 0.1979ms | 5.0535 KOps/s | 5.0582 KOps/s | |
test_ones_like | 0.3464ms | 0.1978ms | 5.0559 KOps/s | 5.0618 KOps/s | |
test_clone | 0.4369ms | 0.4138ms | 2.4168 KOps/s | 2.4153 KOps/s | |
test_squeeze | 28.6510μs | 11.8294μs | 84.5354 KOps/s | 84.3698 KOps/s | |
test_unsqueeze | 0.2980ms | 84.6329μs | 11.8157 KOps/s | 11.2204 KOps/s | |
test_split | 0.4748ms | 0.1858ms | 5.3813 KOps/s | 5.5506 KOps/s | |
test_permute | 0.2332ms | 0.1958ms | 5.1061 KOps/s | 5.2228 KOps/s | |
test_stack | 1.2505ms | 0.9192ms | 1.0879 KOps/s | 1.1145 KOps/s | |
test_cat | 1.2489ms | 1.2317ms | 811.9126 Ops/s | 811.8387 Ops/s |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
bug
Something isn't working
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
No description provided.