Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add MANIFEST.in #581

Merged
merged 1 commit into from
Nov 27, 2023
Merged

Add MANIFEST.in #581

merged 1 commit into from
Nov 27, 2023

Conversation

vmoens
Copy link
Contributor

@vmoens vmoens commented Nov 27, 2023

No description provided.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Nov 27, 2023
@vmoens vmoens added the CI label Nov 27, 2023
@vmoens vmoens merged commit e71c70c into main Nov 27, 2023
@vmoens vmoens deleted the add-manifest branch November 27, 2023 17:40
Copy link

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 113. Improved: $\large\color{#35bf28}3$. Worsened: $\large\color{#d91a1a}3$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 35.8770μs 15.7386μs 63.5380 KOps/s 63.7009 KOps/s $\color{#d91a1a}-0.26\%$
test_plain_set_stack_nested 0.1731ms 0.1420ms 7.0431 KOps/s 7.0004 KOps/s $\color{#35bf28}+0.61\%$
test_plain_set_nested_inplace 55.3630μs 19.2939μs 51.8299 KOps/s 52.3124 KOps/s $\color{#d91a1a}-0.92\%$
test_plain_set_stack_nested_inplace 0.2325ms 0.1727ms 5.7918 KOps/s 5.7124 KOps/s $\color{#35bf28}+1.39\%$
test_items 26.1890μs 2.4315μs 411.2681 KOps/s 409.4023 KOps/s $\color{#35bf28}+0.46\%$
test_items_nested 0.9992ms 0.2772ms 3.6073 KOps/s 3.6889 KOps/s $\color{#d91a1a}-2.21\%$
test_items_nested_locked 0.3647ms 0.2757ms 3.6276 KOps/s 3.6902 KOps/s $\color{#d91a1a}-1.70\%$
test_items_nested_leaf 0.5568ms 0.1702ms 5.8764 KOps/s 5.9866 KOps/s $\color{#d91a1a}-1.84\%$
test_items_stack_nested 3.0859ms 1.5495ms 645.3719 Ops/s 661.4887 Ops/s $\color{#d91a1a}-2.44\%$
test_items_stack_nested_leaf 1.4532ms 1.3641ms 733.0749 Ops/s 740.8767 Ops/s $\color{#d91a1a}-1.05\%$
test_items_stack_nested_locked 0.8672ms 0.7820ms 1.2788 KOps/s 1.3217 KOps/s $\color{#d91a1a}-3.24\%$
test_keys 16.4300μs 3.8289μs 261.1689 KOps/s 247.4038 KOps/s $\textbf{\color{#35bf28}+5.56\%}$
test_keys_nested 3.3153ms 0.1412ms 7.0811 KOps/s 6.7386 KOps/s $\textbf{\color{#35bf28}+5.08\%}$
test_keys_nested_locked 0.1950ms 0.1406ms 7.1126 KOps/s 7.1163 KOps/s $\color{#d91a1a}-0.05\%$
test_keys_nested_leaf 0.3925ms 0.1400ms 7.1403 KOps/s 7.1588 KOps/s $\color{#d91a1a}-0.26\%$
test_keys_stack_nested 1.6432ms 1.4124ms 707.9989 Ops/s 712.5023 Ops/s $\color{#d91a1a}-0.63\%$
test_keys_stack_nested_leaf 1.7874ms 1.4128ms 707.8219 Ops/s 719.6576 Ops/s $\color{#d91a1a}-1.64\%$
test_keys_stack_nested_locked 1.2617ms 0.6857ms 1.4584 KOps/s 1.5015 KOps/s $\color{#d91a1a}-2.87\%$
test_values 5.9910μs 1.1287μs 885.9715 KOps/s 865.8198 KOps/s $\color{#35bf28}+2.33\%$
test_values_nested 98.8630μs 49.4565μs 20.2198 KOps/s 20.0849 KOps/s $\color{#35bf28}+0.67\%$
test_values_nested_locked 81.9220μs 48.5822μs 20.5837 KOps/s 20.0429 KOps/s $\color{#35bf28}+2.70\%$
test_values_nested_leaf 69.1780μs 44.2146μs 22.6170 KOps/s 22.4174 KOps/s $\color{#35bf28}+0.89\%$
test_values_stack_nested 1.3794ms 1.2154ms 822.7882 Ops/s 835.3024 Ops/s $\color{#d91a1a}-1.50\%$
test_values_stack_nested_leaf 2.0021ms 1.2042ms 830.4538 Ops/s 847.9740 Ops/s $\color{#d91a1a}-2.07\%$
test_values_stack_nested_locked 1.1013ms 0.5143ms 1.9445 KOps/s 1.9906 KOps/s $\color{#d91a1a}-2.32\%$
test_membership 28.7440μs 1.3971μs 715.7927 KOps/s 723.8941 KOps/s $\color{#d91a1a}-1.12\%$
test_membership_nested 14.1470μs 2.8067μs 356.2872 KOps/s 358.2731 KOps/s $\color{#d91a1a}-0.55\%$
test_membership_nested_leaf 32.3900μs 2.8173μs 354.9496 KOps/s 356.0319 KOps/s $\color{#d91a1a}-0.30\%$
test_membership_stacked_nested 43.1110μs 11.7230μs 85.3021 KOps/s 85.2604 KOps/s $\color{#35bf28}+0.05\%$
test_membership_stacked_nested_leaf 54.2510μs 11.8497μs 84.3905 KOps/s 85.0740 KOps/s $\color{#d91a1a}-0.80\%$
test_membership_nested_last 34.5540μs 6.0899μs 164.2069 KOps/s 161.8374 KOps/s $\color{#35bf28}+1.46\%$
test_membership_nested_leaf_last 27.0500μs 5.9627μs 167.7084 KOps/s 168.2228 KOps/s $\color{#d91a1a}-0.31\%$
test_membership_stacked_nested_last 0.3166ms 0.1709ms 5.8525 KOps/s 5.9502 KOps/s $\color{#d91a1a}-1.64\%$
test_membership_stacked_nested_leaf_last 44.4320μs 13.8321μs 72.2954 KOps/s 72.3379 KOps/s $\color{#d91a1a}-0.06\%$
test_nested_getleaf 39.7540μs 11.0299μs 90.6628 KOps/s 92.1093 KOps/s $\color{#d91a1a}-1.57\%$
test_nested_get 31.5590μs 10.5192μs 95.0646 KOps/s 97.4312 KOps/s $\color{#d91a1a}-2.43\%$
test_stacked_getleaf 0.7560ms 0.6465ms 1.5468 KOps/s 1.5743 KOps/s $\color{#d91a1a}-1.75\%$
test_stacked_get 0.7613ms 0.6139ms 1.6288 KOps/s 1.6469 KOps/s $\color{#d91a1a}-1.10\%$
test_nested_getitemleaf 41.7270μs 10.8481μs 92.1823 KOps/s 91.5349 KOps/s $\color{#35bf28}+0.71\%$
test_nested_getitem 36.7890μs 10.2838μs 97.2401 KOps/s 97.0495 KOps/s $\color{#35bf28}+0.20\%$
test_stacked_getitemleaf 1.2634ms 0.6496ms 1.5394 KOps/s 1.5716 KOps/s $\color{#d91a1a}-2.05\%$
test_stacked_getitem 0.7127ms 0.6215ms 1.6090 KOps/s 1.6433 KOps/s $\color{#d91a1a}-2.09\%$
test_lock_nested 7.4052ms 0.5683ms 1.7596 KOps/s 1.7961 KOps/s $\color{#d91a1a}-2.03\%$
test_lock_stack_nested 9.0480ms 5.0702ms 197.2292 Ops/s 199.8912 Ops/s $\color{#d91a1a}-1.33\%$
test_unlock_nested 61.2789ms 0.5016ms 1.9934 KOps/s 2.2802 KOps/s $\textbf{\color{#d91a1a}-12.58\%}$
test_unlock_stack_nested 59.5659ms 6.4375ms 155.3406 Ops/s 156.7571 Ops/s $\color{#d91a1a}-0.90\%$
test_flatten_speed 0.5663ms 0.2698ms 3.7064 KOps/s 3.7386 KOps/s $\color{#d91a1a}-0.86\%$
test_unflatten_speed 0.5637ms 0.4611ms 2.1687 KOps/s 2.1932 KOps/s $\color{#d91a1a}-1.12\%$
test_common_ops 4.1444ms 0.6882ms 1.4531 KOps/s 1.5044 KOps/s $\color{#d91a1a}-3.41\%$
test_creation 22.5120μs 2.5038μs 399.3873 KOps/s 404.3706 KOps/s $\color{#d91a1a}-1.23\%$
test_creation_empty 41.7170μs 8.7075μs 114.8430 KOps/s 116.6533 KOps/s $\color{#d91a1a}-1.55\%$
test_creation_nested_1 42.5390μs 11.8185μs 84.6134 KOps/s 84.7489 KOps/s $\color{#d91a1a}-0.16\%$
test_creation_nested_2 67.4350μs 15.4115μs 64.8864 KOps/s 65.6014 KOps/s $\color{#d91a1a}-1.09\%$
test_clone 62.4060μs 13.4144μs 74.5466 KOps/s 74.6971 KOps/s $\color{#d91a1a}-0.20\%$
test_getitem[int] 47.3780μs 13.0415μs 76.6784 KOps/s 75.1494 KOps/s $\color{#35bf28}+2.03\%$
test_getitem[slice_int] 58.0980μs 25.2349μs 39.6276 KOps/s 41.0498 KOps/s $\color{#d91a1a}-3.46\%$
test_getitem[range] 83.2550μs 44.1666μs 22.6416 KOps/s 22.7287 KOps/s $\color{#d91a1a}-0.38\%$
test_getitem[tuple] 56.3650μs 20.4057μs 49.0060 KOps/s 50.1890 KOps/s $\color{#d91a1a}-2.36\%$
test_getitem[list] 0.2135ms 39.0726μs 25.5934 KOps/s 25.9581 KOps/s $\color{#d91a1a}-1.40\%$
test_setitem_dim[int] 49.0910μs 28.9716μs 34.5165 KOps/s 35.2080 KOps/s $\color{#d91a1a}-1.96\%$
test_setitem_dim[slice_int] 0.1228ms 54.3436μs 18.4014 KOps/s 19.2216 KOps/s $\color{#d91a1a}-4.27\%$
test_setitem_dim[range] 0.1602ms 73.1754μs 13.6658 KOps/s 13.9912 KOps/s $\color{#d91a1a}-2.33\%$
test_setitem_dim[tuple] 0.1125ms 42.9834μs 23.2648 KOps/s 23.5894 KOps/s $\color{#d91a1a}-1.38\%$
test_setitem 74.4980μs 18.9801μs 52.6868 KOps/s 53.8139 KOps/s $\color{#d91a1a}-2.09\%$
test_set 62.2350μs 18.3436μs 54.5149 KOps/s 55.6817 KOps/s $\color{#d91a1a}-2.10\%$
test_set_shared 0.9212ms 0.1408ms 7.1020 KOps/s 7.2416 KOps/s $\color{#d91a1a}-1.93\%$
test_update 0.1010ms 20.2684μs 49.3379 KOps/s 51.1685 KOps/s $\color{#d91a1a}-3.58\%$
test_update_nested 72.6450μs 27.8602μs 35.8935 KOps/s 36.9105 KOps/s $\color{#d91a1a}-2.76\%$
test_set_nested 57.1550μs 20.4176μs 48.9774 KOps/s 50.4847 KOps/s $\color{#d91a1a}-2.99\%$
test_set_nested_new 87.0520μs 25.4039μs 39.3640 KOps/s 39.7398 KOps/s $\color{#d91a1a}-0.95\%$
test_select 0.1253ms 50.3522μs 19.8601 KOps/s 20.2019 KOps/s $\color{#d91a1a}-1.69\%$
test_unbind_speed 0.4288ms 0.3708ms 2.6967 KOps/s 2.6696 KOps/s $\color{#35bf28}+1.01\%$
test_unbind_speed_stack0 64.9872ms 4.5964ms 217.5623 Ops/s 225.7987 Ops/s $\color{#d91a1a}-3.65\%$
test_unbind_speed_stack1 2.4010μs 0.6261μs 1.5973 MOps/s 1.5595 MOps/s $\color{#35bf28}+2.42\%$
test_split 55.7101ms 1.7637ms 566.9790 Ops/s 612.8878 Ops/s $\textbf{\color{#d91a1a}-7.49\%}$
test_chunk 53.8269ms 1.7329ms 577.0556 Ops/s 579.8264 Ops/s $\color{#d91a1a}-0.48\%$
test_creation[device0] 3.2595ms 0.2972ms 3.3643 KOps/s 3.3657 KOps/s $\color{#d91a1a}-0.04\%$
test_creation_from_tensor 0.6862ms 0.3267ms 3.0612 KOps/s 2.8311 KOps/s $\textbf{\color{#35bf28}+8.13\%}$
test_add_one[memmap_tensor0] 75.0590μs 25.7310μs 38.8636 KOps/s 40.4548 KOps/s $\color{#d91a1a}-3.93\%$
test_contiguous[memmap_tensor0] 39.6640μs 5.6940μs 175.6228 KOps/s 177.2072 KOps/s $\color{#d91a1a}-0.89\%$
test_stack[memmap_tensor0] 78.1050μs 18.8849μs 52.9522 KOps/s 51.9138 KOps/s $\color{#35bf28}+2.00\%$
test_memmaptd_index 0.7649ms 0.4021ms 2.4868 KOps/s 2.5521 KOps/s $\color{#d91a1a}-2.56\%$
test_memmaptd_index_astensor 0.6736ms 0.4588ms 2.1794 KOps/s 2.1980 KOps/s $\color{#d91a1a}-0.84\%$
test_memmaptd_index_op 1.2242ms 0.7239ms 1.3815 KOps/s 1.4020 KOps/s $\color{#d91a1a}-1.46\%$
test_reshape_pytree 59.6610μs 23.5482μs 42.4661 KOps/s 43.4880 KOps/s $\color{#d91a1a}-2.35\%$
test_reshape_td 82.5840μs 31.2926μs 31.9565 KOps/s 31.2947 KOps/s $\color{#35bf28}+2.11\%$
test_view_pytree 57.9280μs 23.2588μs 42.9944 KOps/s 43.0014 KOps/s $\color{#d91a1a}-0.02\%$
test_view_td 19.0660μs 4.9528μs 201.9067 KOps/s 203.5120 KOps/s $\color{#d91a1a}-0.79\%$
test_unbind_pytree 0.5803ms 26.3563μs 37.9416 KOps/s 38.2739 KOps/s $\color{#d91a1a}-0.87\%$
test_unbind_td 0.1418ms 59.0658μs 16.9303 KOps/s 17.2783 KOps/s $\color{#d91a1a}-2.01\%$
test_split_pytree 97.3500μs 26.6343μs 37.5455 KOps/s 38.3588 KOps/s $\color{#d91a1a}-2.12\%$
test_split_td 0.1272ms 45.8667μs 21.8023 KOps/s 21.3837 KOps/s $\color{#35bf28}+1.96\%$
test_add_pytree 85.6880μs 31.9994μs 31.2506 KOps/s 31.6528 KOps/s $\color{#d91a1a}-1.27\%$
test_add_td 0.1530ms 44.5013μs 22.4713 KOps/s 22.6994 KOps/s $\color{#d91a1a}-1.00\%$
test_distributed 27.0400μs 6.1663μs 162.1706 KOps/s 163.9924 KOps/s $\color{#d91a1a}-1.11\%$
test_tdmodule 1.6499ms 22.2181μs 45.0084 KOps/s 43.3685 KOps/s $\color{#35bf28}+3.78\%$
test_tdmodule_dispatch 0.1815ms 38.4627μs 25.9992 KOps/s 25.6826 KOps/s $\color{#35bf28}+1.23\%$
test_tdseq 0.1238ms 24.9528μs 40.0757 KOps/s 39.8983 KOps/s $\color{#35bf28}+0.44\%$
test_tdseq_dispatch 0.1453ms 44.5149μs 22.4644 KOps/s 22.9573 KOps/s $\color{#d91a1a}-2.15\%$
test_instantiation_functorch 1.4259ms 1.3232ms 755.7609 Ops/s 771.3441 Ops/s $\color{#d91a1a}-2.02\%$
test_instantiation_td 62.8780ms 1.1007ms 908.5212 Ops/s 992.7101 Ops/s $\textbf{\color{#d91a1a}-8.48\%}$
test_exec_functorch 0.3530ms 0.1605ms 6.2296 KOps/s 6.2917 KOps/s $\color{#d91a1a}-0.99\%$
test_exec_functional_call 0.2303ms 0.1500ms 6.6682 KOps/s 6.8911 KOps/s $\color{#d91a1a}-3.23\%$
test_exec_td 0.2220ms 0.1437ms 6.9574 KOps/s 6.9663 KOps/s $\color{#d91a1a}-0.13\%$
test_exec_td_decorator 2.7541ms 0.1802ms 5.5502 KOps/s 5.6700 KOps/s $\color{#d91a1a}-2.11\%$
test_vmap_mlp_speed[True-True] 0.9608ms 0.8851ms 1.1298 KOps/s 1.1354 KOps/s $\color{#d91a1a}-0.49\%$
test_vmap_mlp_speed[True-False] 0.8202ms 0.4665ms 2.1437 KOps/s 2.1925 KOps/s $\color{#d91a1a}-2.23\%$
test_vmap_mlp_speed[False-True] 0.9262ms 0.7614ms 1.3134 KOps/s 1.3149 KOps/s $\color{#d91a1a}-0.12\%$
test_vmap_mlp_speed[False-False] 0.7864ms 0.3892ms 2.5693 KOps/s 2.6536 KOps/s $\color{#d91a1a}-3.18\%$
test_vmap_mlp_speed_decorator[True-True] 2.6242ms 1.7555ms 569.6464 Ops/s 563.3469 Ops/s $\color{#35bf28}+1.12\%$
test_vmap_mlp_speed_decorator[True-False] 0.9946ms 0.5169ms 1.9348 KOps/s 1.9816 KOps/s $\color{#d91a1a}-2.36\%$
test_vmap_mlp_speed_decorator[False-True] 2.0037ms 1.4596ms 685.1333 Ops/s 670.2108 Ops/s $\color{#35bf28}+2.23\%$
test_vmap_mlp_speed_decorator[False-False] 0.9243ms 0.3936ms 2.5406 KOps/s 2.5788 KOps/s $\color{#d91a1a}-1.48\%$

Copy link

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 127. Improved: $\large\color{#35bf28}2$. Worsened: $\large\color{#d91a1a}4$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 0.5319ms 12.8497μs 77.8228 KOps/s 80.4023 KOps/s $\color{#d91a1a}-3.21\%$
test_plain_set_stack_nested 0.1788ms 0.1153ms 8.6746 KOps/s 8.7655 KOps/s $\color{#d91a1a}-1.04\%$
test_plain_set_nested_inplace 32.3010μs 14.9934μs 66.6959 KOps/s 67.3808 KOps/s $\color{#d91a1a}-1.02\%$
test_plain_set_stack_nested_inplace 0.1741ms 0.1393ms 7.1801 KOps/s 7.1217 KOps/s $\color{#35bf28}+0.82\%$
test_items 25.2620μs 4.6729μs 214.0018 KOps/s 212.6173 KOps/s $\color{#35bf28}+0.65\%$
test_items_nested 0.3906ms 0.3370ms 2.9672 KOps/s 2.9519 KOps/s $\color{#35bf28}+0.52\%$
test_items_nested_locked 0.3942ms 0.3402ms 2.9392 KOps/s 2.9358 KOps/s $\color{#35bf28}+0.12\%$
test_items_nested_leaf 0.2468ms 0.1987ms 5.0332 KOps/s 4.9980 KOps/s $\color{#35bf28}+0.71\%$
test_items_stack_nested 1.5395ms 1.4770ms 677.0696 Ops/s 677.5519 Ops/s $\color{#d91a1a}-0.07\%$
test_items_stack_nested_leaf 1.3939ms 1.2950ms 772.1728 Ops/s 765.9871 Ops/s $\color{#35bf28}+0.81\%$
test_items_stack_nested_locked 0.8810ms 0.8157ms 1.2260 KOps/s 1.2256 KOps/s $\color{#35bf28}+0.03\%$
test_keys 21.6010μs 4.5863μs 218.0415 KOps/s 217.3706 KOps/s $\color{#35bf28}+0.31\%$
test_keys_nested 3.2661ms 90.7603μs 11.0180 KOps/s 10.9825 KOps/s $\color{#35bf28}+0.32\%$
test_keys_nested_locked 0.1152ms 90.4675μs 11.0537 KOps/s 10.9216 KOps/s $\color{#35bf28}+1.21\%$
test_keys_nested_leaf 40.6923ms 87.1717μs 11.4716 KOps/s 12.0539 KOps/s $\color{#d91a1a}-4.83\%$
test_keys_stack_nested 1.3467ms 1.2860ms 777.5781 Ops/s 780.7488 Ops/s $\color{#d91a1a}-0.41\%$
test_keys_stack_nested_leaf 1.3361ms 1.2713ms 786.5730 Ops/s 783.2756 Ops/s $\color{#35bf28}+0.42\%$
test_keys_stack_nested_locked 0.7006ms 0.6138ms 1.6293 KOps/s 1.6174 KOps/s $\color{#35bf28}+0.73\%$
test_values 6.7203μs 1.8848μs 530.5518 KOps/s 528.3976 KOps/s $\color{#35bf28}+0.41\%$
test_values_nested 66.5530μs 43.0135μs 23.2485 KOps/s 23.5067 KOps/s $\color{#d91a1a}-1.10\%$
test_values_nested_locked 76.5930μs 45.3226μs 22.0641 KOps/s 22.3072 KOps/s $\color{#d91a1a}-1.09\%$
test_values_nested_leaf 55.4820μs 37.3800μs 26.7523 KOps/s 26.8626 KOps/s $\color{#d91a1a}-0.41\%$
test_values_stack_nested 1.2257ms 1.1246ms 889.2390 Ops/s 882.6323 Ops/s $\color{#35bf28}+0.75\%$
test_values_stack_nested_leaf 1.2304ms 1.1208ms 892.2480 Ops/s 899.6676 Ops/s $\color{#d91a1a}-0.82\%$
test_values_stack_nested_locked 0.5587ms 0.4945ms 2.0222 KOps/s 2.0053 KOps/s $\color{#35bf28}+0.84\%$
test_membership 3.8120μs 0.9478μs 1.0551 MOps/s 1.0594 MOps/s $\color{#d91a1a}-0.41\%$
test_membership_nested 12.3755μs 2.1139μs 473.0604 KOps/s 452.2113 KOps/s $\color{#35bf28}+4.61\%$
test_membership_nested_leaf 7.2903μs 2.0774μs 481.3693 KOps/s 463.8621 KOps/s $\color{#35bf28}+3.77\%$
test_membership_stacked_nested 39.6820μs 11.1313μs 89.8367 KOps/s 91.6158 KOps/s $\color{#d91a1a}-1.94\%$
test_membership_stacked_nested_leaf 34.0410μs 10.9596μs 91.2439 KOps/s 91.8516 KOps/s $\color{#d91a1a}-0.66\%$
test_membership_nested_last 19.2910μs 4.6191μs 216.4936 KOps/s 217.6529 KOps/s $\color{#d91a1a}-0.53\%$
test_membership_nested_leaf_last 19.8910μs 4.5862μs 218.0440 KOps/s 218.1135 KOps/s $\color{#d91a1a}-0.03\%$
test_membership_stacked_nested_last 0.1648ms 0.1330ms 7.5208 KOps/s 7.3571 KOps/s $\color{#35bf28}+2.22\%$
test_membership_stacked_nested_leaf_last 31.0610μs 12.7900μs 78.1864 KOps/s 77.0567 KOps/s $\color{#35bf28}+1.47\%$
test_nested_getleaf 23.6010μs 8.4415μs 118.4625 KOps/s 118.4879 KOps/s $\color{#d91a1a}-0.02\%$
test_nested_get 22.9410μs 7.9575μs 125.6671 KOps/s 125.5276 KOps/s $\color{#35bf28}+0.11\%$
test_stacked_getleaf 0.6351ms 0.5617ms 1.7802 KOps/s 1.7941 KOps/s $\color{#d91a1a}-0.77\%$
test_stacked_get 0.6041ms 0.5326ms 1.8776 KOps/s 1.8780 KOps/s $\color{#d91a1a}-0.02\%$
test_nested_getitemleaf 23.1810μs 8.5374μs 117.1311 KOps/s 118.2361 KOps/s $\color{#d91a1a}-0.93\%$
test_nested_getitem 28.8520μs 8.1111μs 123.2877 KOps/s 125.4005 KOps/s $\color{#d91a1a}-1.68\%$
test_stacked_getitemleaf 0.6663ms 0.5619ms 1.7797 KOps/s 1.7809 KOps/s $\color{#d91a1a}-0.06\%$
test_stacked_getitem 0.5866ms 0.5337ms 1.8736 KOps/s 1.8922 KOps/s $\color{#d91a1a}-0.99\%$
test_lock_nested 3.1154ms 0.5478ms 1.8256 KOps/s 1.8268 KOps/s $\color{#d91a1a}-0.06\%$
test_lock_stack_nested 80.0883ms 7.1097ms 140.6535 Ops/s 139.0486 Ops/s $\color{#35bf28}+1.15\%$
test_unlock_nested 2.3938ms 0.4253ms 2.3511 KOps/s 2.3518 KOps/s $\color{#d91a1a}-0.03\%$
test_unlock_stack_nested 66.1275ms 6.1495ms 162.6145 Ops/s 164.4199 Ops/s $\color{#d91a1a}-1.10\%$
test_flatten_speed 0.2401ms 0.1877ms 5.3268 KOps/s 5.3728 KOps/s $\color{#d91a1a}-0.85\%$
test_unflatten_speed 0.4182ms 0.3647ms 2.7417 KOps/s 2.7523 KOps/s $\color{#d91a1a}-0.38\%$
test_common_ops 1.0794ms 0.5927ms 1.6873 KOps/s 1.7089 KOps/s $\color{#d91a1a}-1.26\%$
test_creation 32.0110μs 2.0634μs 484.6409 KOps/s 480.2556 KOps/s $\color{#35bf28}+0.91\%$
test_creation_empty 26.3010μs 7.0864μs 141.1154 KOps/s 149.2067 KOps/s $\textbf{\color{#d91a1a}-5.42\%}$
test_creation_nested_1 23.7310μs 9.4097μs 106.2734 KOps/s 111.5821 KOps/s $\color{#d91a1a}-4.76\%$
test_creation_nested_2 32.0910μs 12.1115μs 82.5663 KOps/s 86.0150 KOps/s $\color{#d91a1a}-4.01\%$
test_clone 88.5140μs 14.2506μs 70.1727 KOps/s 70.4835 KOps/s $\color{#d91a1a}-0.44\%$
test_getitem[int] 38.1820μs 12.2078μs 81.9151 KOps/s 82.4855 KOps/s $\color{#d91a1a}-0.69\%$
test_getitem[slice_int] 54.3330μs 23.6333μs 42.3131 KOps/s 42.5580 KOps/s $\color{#d91a1a}-0.58\%$
test_getitem[range] 64.6330μs 38.2597μs 26.1372 KOps/s 26.4033 KOps/s $\color{#d91a1a}-1.01\%$
test_getitem[tuple] 39.1210μs 19.7542μs 50.6222 KOps/s 50.5163 KOps/s $\color{#35bf28}+0.21\%$
test_getitem[list] 0.2654ms 35.2282μs 28.3863 KOps/s 28.7170 KOps/s $\color{#d91a1a}-1.15\%$
test_setitem_dim[int] 41.3120μs 25.5177μs 39.1884 KOps/s 39.9527 KOps/s $\color{#d91a1a}-1.91\%$
test_setitem_dim[slice_int] 63.3430μs 45.9760μs 21.7505 KOps/s 22.2667 KOps/s $\color{#d91a1a}-2.32\%$
test_setitem_dim[range] 93.7640μs 61.5254μs 16.2535 KOps/s 16.2183 KOps/s $\color{#35bf28}+0.22\%$
test_setitem_dim[tuple] 57.8630μs 39.0335μs 25.6190 KOps/s 27.2944 KOps/s $\textbf{\color{#d91a1a}-6.14\%}$
test_setitem 97.6450μs 17.7943μs 56.1977 KOps/s 56.3709 KOps/s $\color{#d91a1a}-0.31\%$
test_set 98.5840μs 17.3698μs 57.5711 KOps/s 58.1349 KOps/s $\color{#d91a1a}-0.97\%$
test_set_shared 2.8683ms 0.1021ms 9.7969 KOps/s 8.9584 KOps/s $\textbf{\color{#35bf28}+9.36\%}$
test_update 81.7540μs 18.4498μs 54.2011 KOps/s 55.3208 KOps/s $\color{#d91a1a}-2.02\%$
test_update_nested 97.2840μs 25.3058μs 39.5166 KOps/s 40.0163 KOps/s $\color{#d91a1a}-1.25\%$
test_set_nested 81.5740μs 18.1781μs 55.0113 KOps/s 54.4779 KOps/s $\color{#35bf28}+0.98\%$
test_set_nested_new 82.1040μs 23.2266μs 43.0541 KOps/s 44.8126 KOps/s $\color{#d91a1a}-3.92\%$
test_select 0.1138ms 45.7058μs 21.8791 KOps/s 22.0489 KOps/s $\color{#d91a1a}-0.77\%$
test_to 73.2930μs 51.0534μs 19.5873 KOps/s 20.4516 KOps/s $\color{#d91a1a}-4.23\%$
test_to_nonblocking 75.1730μs 33.7076μs 29.6669 KOps/s 29.9411 KOps/s $\color{#d91a1a}-0.92\%$
test_unbind_speed 0.4079ms 0.3553ms 2.8148 KOps/s 2.8834 KOps/s $\color{#d91a1a}-2.38\%$
test_unbind_speed_stack0 61.9050ms 4.2090ms 237.5876 Ops/s 254.7034 Ops/s $\textbf{\color{#d91a1a}-6.72\%}$
test_unbind_speed_stack1 2.0051μs 0.5347μs 1.8701 MOps/s 1.9061 MOps/s $\color{#d91a1a}-1.89\%$
test_split 53.2220ms 1.7971ms 556.4542 Ops/s 558.5781 Ops/s $\color{#d91a1a}-0.38\%$
test_chunk 53.4228ms 1.7805ms 561.6359 Ops/s 565.1907 Ops/s $\color{#d91a1a}-0.63\%$
test_creation[device0] 0.3615ms 0.3051ms 3.2774 KOps/s 3.2724 KOps/s $\color{#35bf28}+0.15\%$
test_creation[device1] 0.6528ms 0.3080ms 3.2470 KOps/s 3.2533 KOps/s $\color{#d91a1a}-0.19\%$
test_creation_from_tensor 0.5477ms 0.3329ms 3.0035 KOps/s 2.7830 KOps/s $\textbf{\color{#35bf28}+7.92\%}$
test_add_one[memmap_tensor0] 65.9230μs 22.9666μs 43.5415 KOps/s 44.2103 KOps/s $\color{#d91a1a}-1.51\%$
test_add_one[memmap_tensor1] 0.2092ms 72.0485μs 13.8795 KOps/s 13.7544 KOps/s $\color{#35bf28}+0.91\%$
test_contiguous[memmap_tensor0] 34.3020μs 5.7136μs 175.0200 KOps/s 181.9139 KOps/s $\color{#d91a1a}-3.79\%$
test_contiguous[memmap_tensor1] 52.4920μs 21.4097μs 46.7078 KOps/s 47.0811 KOps/s $\color{#d91a1a}-0.79\%$
test_stack[memmap_tensor0] 47.3220μs 18.6513μs 53.6156 KOps/s 54.7953 KOps/s $\color{#d91a1a}-2.15\%$
test_stack[memmap_tensor1] 0.1513ms 72.9761μs 13.7031 KOps/s 13.6919 KOps/s $\color{#35bf28}+0.08\%$
test_memmaptd_index 0.4772ms 0.4139ms 2.4163 KOps/s 2.4301 KOps/s $\color{#d91a1a}-0.57\%$
test_memmaptd_index_astensor 0.5240ms 0.4672ms 2.1402 KOps/s 2.1367 KOps/s $\color{#35bf28}+0.16\%$
test_memmaptd_index_op 0.8170ms 0.7337ms 1.3629 KOps/s 1.3763 KOps/s $\color{#d91a1a}-0.98\%$
test_reshape_pytree 37.0320μs 20.8830μs 47.8858 KOps/s 48.9060 KOps/s $\color{#d91a1a}-2.09\%$
test_reshape_td 60.7330μs 28.8759μs 34.6309 KOps/s 33.7333 KOps/s $\color{#35bf28}+2.66\%$
test_view_pytree 35.4610μs 20.6329μs 48.4663 KOps/s 49.4063 KOps/s $\color{#d91a1a}-1.90\%$
test_view_td 19.2310μs 4.0790μs 245.1606 KOps/s 246.1200 KOps/s $\color{#d91a1a}-0.39\%$
test_unbind_pytree 47.7520μs 25.6750μs 38.9485 KOps/s 38.8313 KOps/s $\color{#35bf28}+0.30\%$
test_unbind_td 83.8240μs 55.4312μs 18.0404 KOps/s 18.1667 KOps/s $\color{#d91a1a}-0.70\%$
test_split_pytree 41.4920μs 24.0066μs 41.6553 KOps/s 42.1297 KOps/s $\color{#d91a1a}-1.13\%$
test_split_td 80.3940μs 43.4156μs 23.0332 KOps/s 23.0529 KOps/s $\color{#d91a1a}-0.09\%$
test_add_pytree 56.7220μs 31.4505μs 31.7960 KOps/s 32.1166 KOps/s $\color{#d91a1a}-1.00\%$
test_add_td 87.3740μs 43.1911μs 23.1529 KOps/s 23.9871 KOps/s $\color{#d91a1a}-3.48\%$
test_distributed 20.9000μs 5.4662μs 182.9431 KOps/s 185.6659 KOps/s $\color{#d91a1a}-1.47\%$
test_tdmodule 31.4010μs 16.5408μs 60.4566 KOps/s 61.0602 KOps/s $\color{#d91a1a}-0.99\%$
test_tdmodule_dispatch 0.1968ms 32.7364μs 30.5471 KOps/s 30.9013 KOps/s $\color{#d91a1a}-1.15\%$
test_tdseq 35.3910μs 19.4908μs 51.3062 KOps/s 52.3397 KOps/s $\color{#d91a1a}-1.97\%$
test_tdseq_dispatch 53.0030μs 35.5214μs 28.1521 KOps/s 28.6427 KOps/s $\color{#d91a1a}-1.71\%$
test_instantiation_functorch 1.7789ms 1.6691ms 599.1128 Ops/s 604.5131 Ops/s $\color{#d91a1a}-0.89\%$
test_instantiation_td 1.6673ms 1.1761ms 850.2658 Ops/s 857.2851 Ops/s $\color{#d91a1a}-0.82\%$
test_exec_functorch 0.2112ms 0.1544ms 6.4767 KOps/s 6.3638 KOps/s $\color{#35bf28}+1.77\%$
test_exec_functional_call 0.2099ms 0.1534ms 6.5180 KOps/s 6.4560 KOps/s $\color{#35bf28}+0.96\%$
test_exec_td 0.1901ms 0.1432ms 6.9852 KOps/s 6.7670 KOps/s $\color{#35bf28}+3.23\%$
test_exec_td_decorator 0.8968ms 0.1834ms 5.4519 KOps/s 5.3844 KOps/s $\color{#35bf28}+1.25\%$
test_vmap_mlp_speed[True-True] 1.1484ms 1.0743ms 930.8516 Ops/s 929.8253 Ops/s $\color{#35bf28}+0.11\%$
test_vmap_mlp_speed[True-False] 0.7312ms 0.6377ms 1.5682 KOps/s 1.6112 KOps/s $\color{#d91a1a}-2.67\%$
test_vmap_mlp_speed[False-True] 1.1044ms 1.0352ms 966.0106 Ops/s 1.0201 KOps/s $\textbf{\color{#d91a1a}-5.30\%}$
test_vmap_mlp_speed[False-False] 0.6493ms 0.5794ms 1.7259 KOps/s 1.8150 KOps/s $\color{#d91a1a}-4.91\%$
test_vmap_mlp_speed_decorator[True-True] 2.9828ms 2.0845ms 479.7378 Ops/s 496.1421 Ops/s $\color{#d91a1a}-3.31\%$
test_vmap_mlp_speed_decorator[True-False] 1.1524ms 0.6639ms 1.5063 KOps/s 1.5041 KOps/s $\color{#35bf28}+0.14\%$
test_vmap_mlp_speed_decorator[False-True] 2.1819ms 1.7588ms 568.5724 Ops/s 570.3082 Ops/s $\color{#d91a1a}-0.30\%$
test_vmap_mlp_speed_decorator[False-False] 1.0327ms 0.5637ms 1.7739 KOps/s 1.7661 KOps/s $\color{#35bf28}+0.44\%$
test_vmap_transformer_speed[True-True] 12.6923ms 12.5325ms 79.7927 Ops/s 79.5522 Ops/s $\color{#35bf28}+0.30\%$
test_vmap_transformer_speed[True-False] 8.5163ms 8.2935ms 120.5769 Ops/s 119.9966 Ops/s $\color{#35bf28}+0.48\%$
test_vmap_transformer_speed[False-True] 12.7090ms 12.4082ms 80.5917 Ops/s 80.3348 Ops/s $\color{#35bf28}+0.32\%$
test_vmap_transformer_speed[False-False] 8.4369ms 8.2633ms 121.0165 Ops/s 121.0566 Ops/s $\color{#d91a1a}-0.03\%$
test_vmap_transformer_speed_decorator[True-True] 64.6832ms 63.6251ms 15.7171 Ops/s 15.5907 Ops/s $\color{#35bf28}+0.81\%$
test_vmap_transformer_speed_decorator[True-False] 22.3721ms 20.1138ms 49.7170 Ops/s 49.7247 Ops/s $\color{#d91a1a}-0.02\%$
test_vmap_transformer_speed_decorator[False-True] 0.1360s 62.3159ms 16.0473 Ops/s 15.7020 Ops/s $\color{#35bf28}+2.20\%$
test_vmap_transformer_speed_decorator[False-False] 21.8274ms 19.7435ms 50.6496 Ops/s 50.3983 Ops/s $\color{#35bf28}+0.50\%$

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CI CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants