[Collage] CollagePartition pass #12086

mbs-octoml · 2022-07-13T23:58:39Z

See https://github.com/apache/tvm-rfcs/blob/main/rfcs/0062-collage.md.

This adds the main CollagePartition pass, which:

Inspects all the targets in the CompilationConfig and builds
PartitionSpecs describing how to generate speculative CandidatePartitions
for them.
Runs the above rules on the model to collect all the candidates.
Eliminates candidates whose target contradicts any constraints already
imposed by, eg, device planning.
Eagerly estimates the cost of each candidate.
Performs a shortest path search to chose an 'optimal' set of candidate
partitions so as to minimize estimated model latency, such that every sub-expression
node is contained in exactly one candidate partition.
Coalesces adjacent optimal candidates which ended up on the same target.
Rewrites the model according to the chosen optimal partitioning.

As for the existing partition_for_ methods, the result of
CollagePartition can then be built using regular TVM.

Very special thanks to @mbaret for authoring test_pass_collage_partition.py.

Logic to prune the candidates after step 3 will be in a follow up PR since it
deserves its own testing. A demonstration driver will also come as a follow up.

mbs-octoml · 2022-07-13T23:58:58Z

@mbaret here's the big one!

@mbaret

See https://github.com/apache/tvm-rfcs/blob/main/rfcs/0062-collage.md. This adds the main CollagePartition pass, which: 1. Inspects all the targets in the CompilationConfig and builds PartitionSpecs describing how to generate speculative CandidatePartitions for them. 2. Runs the above rules on the model to collect all the candidates. 3. Eliminates candidates whose target contradicts any constraints already imposed by, eg, device planning. 4. Eagerly estimates the cost of each candidate. 5. Performs a shortest path search to chose an 'optimal' set of candidate partitions so as to minimize estimated model latency, such that every sub-expression node is contained in exactly one candidate partition. 6. Coalesces adjacent optimal candidates which ended up on the same target. 7. Rewrites the model according to the chosen optimal partitioning. As for the existing partition_for_<external codegen name> methods, the result of CollagePartition can then be built using regular TVM. Very special thanks to @mbaret for authoring test_pass_collage_partition.py. Logic to prune the candidates after step 3 will be in a follow up PR since it deserves its own testing. A demonstration driver will also come as a follow up.

mbs-octoml · 2022-07-14T12:52:00Z

Rebased onto main.

mbaret

lgtm

@mbaret

* [Collage] CollagePartition pass See https://github.com/apache/tvm-rfcs/blob/main/rfcs/0062-collage.md. This adds the main CollagePartition pass, which: 1. Inspects all the targets in the CompilationConfig and builds PartitionSpecs describing how to generate speculative CandidatePartitions for them. 2. Runs the above rules on the model to collect all the candidates. 3. Eliminates candidates whose target contradicts any constraints already imposed by, eg, device planning. 4. Eagerly estimates the cost of each candidate. 5. Performs a shortest path search to chose an 'optimal' set of candidate partitions so as to minimize estimated model latency, such that every sub-expression node is contained in exactly one candidate partition. 6. Coalesces adjacent optimal candidates which ended up on the same target. 7. Rewrites the model according to the chosen optimal partitioning. As for the existing partition_for_<external codegen name> methods, the result of CollagePartition can then be built using regular TVM. Very special thanks to @mbaret for authoring test_pass_collage_partition.py. Logic to prune the candidates after step 3 will be in a follow up PR since it deserves its own testing. A demonstration driver will also come as a follow up. * - lints * - more lints * - use the _ffi_api properly

@mbaret

* [Collage] CollagePartition pass See https://github.com/apache/tvm-rfcs/blob/main/rfcs/0062-collage.md. This adds the main CollagePartition pass, which: 1. Inspects all the targets in the CompilationConfig and builds PartitionSpecs describing how to generate speculative CandidatePartitions for them. 2. Runs the above rules on the model to collect all the candidates. 3. Eliminates candidates whose target contradicts any constraints already imposed by, eg, device planning. 4. Eagerly estimates the cost of each candidate. 5. Performs a shortest path search to chose an 'optimal' set of candidate partitions so as to minimize estimated model latency, such that every sub-expression node is contained in exactly one candidate partition. 6. Coalesces adjacent optimal candidates which ended up on the same target. 7. Rewrites the model according to the chosen optimal partitioning. As for the existing partition_for_<external codegen name> methods, the result of CollagePartition can then be built using regular TVM. Very special thanks to @mbaret for authoring test_pass_collage_partition.py. Logic to prune the candidates after step 3 will be in a follow up PR since it deserves its own testing. A demonstration driver will also come as a follow up. * - lints * - more lints * - use the _ffi_api properly

mbs-octoml added 2 commits July 14, 2022 05:46

- lints

5fcbe37

mbs-octoml force-pushed the mbs-collage-partitioner branch from 96631de to 5fcbe37 Compare July 14, 2022 12:51

mbs-octoml added 2 commits July 14, 2022 05:52

- more lints

1fd0cea

- use the _ffi_api properly

6ea257c

mbaret approved these changes Jul 14, 2022

View reviewed changes

mbaret merged commit 7661ba8 into apache:main Jul 14, 2022

mbs-octoml deleted the mbs-collage-partitioner branch July 14, 2022 21:56

AndrewZhaoLuo mentioned this pull request Oct 4, 2022

TVM v0.10.0.rc0 Release Candidate Notes #12979

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Collage] CollagePartition pass #12086

[Collage] CollagePartition pass #12086

mbs-octoml commented Jul 13, 2022 •

edited

Loading

mbs-octoml commented Jul 13, 2022

mbs-octoml commented Jul 14, 2022

mbaret left a comment

[Collage] CollagePartition pass #12086

[Collage] CollagePartition pass #12086

Conversation

mbs-octoml commented Jul 13, 2022 • edited Loading

mbs-octoml commented Jul 13, 2022

mbs-octoml commented Jul 14, 2022

mbaret left a comment

Choose a reason for hiding this comment

mbs-octoml commented Jul 13, 2022 •

edited

Loading