Implemented basic pipeline for Refitting #2886
Conversation
def refit_trt_engine_from_module(
    exported_program: ExportedProgram,
Remove the settings that don't do anything for refit.
def refit_trt_engine_from_module(
    exported_program: ExportedProgram,
    inputs: Tuple[Any, ...],
    engine: object,
Eventually this will become the compiled exported program.
@@ -609,3 +610,126 @@ def convert_module_to_trt_engine(
    engine_bytearray = engine_bytes.getvalue()

    return engine_bytearray


def refit_trt_engine_from_module(
Eventually, something like:

def refit_module_weights(
    compiled_module: ExportedProgram,
    new_weight_module: ExportedProgram,
) -> torch.fx.GraphModule:
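As a sketch of how that API might eventually be used (hypothetical: MyModel and new_state_dict are placeholders, and refit_module_weights does not exist in this PR yet):

import torch
import torch_tensorrt

inputs = (torch.randn(1, 3, 224, 224).cuda(),)

model = MyModel().eval().cuda()  # placeholder model
exp_program = torch.export.export(model, inputs)
compiled_module = torch_tensorrt.dynamo.compile(exp_program, inputs=list(inputs))

# Later, after the weights change (new checkpoint, fine-tuning, ...):
model.load_state_dict(new_state_dict)  # placeholder state dict
new_weight_module = torch.export.export(model, inputs)

# Proposed entry point: swap the weights inside the existing TRT engines
# without re-running the full conversion.
refitted_gm = refit_module_weights(compiled_module, new_weight_module)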
enabled_precisions = {dtype._from(e) for e in enabled_precisions}

compilation_options = {
We can store the compilation settings as metadata in the returned graph; then we can just read the compiled program to fill these settings in and match the original lowering.
Ask Dheeraj where to put the metadata for lowering.
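A minimal sketch of that metadata idea, assuming the settings land in the returned GraphModule's meta dict (the key name "_torch_tensorrt_settings" is an assumption, pending the question above):

import dataclasses

import torch

def attach_compile_settings(gm: torch.fx.GraphModule, settings) -> None:
    # Stash the CompilationSettings used at compile time so refit can
    # reproduce the original lowering without re-asking the user.
    gm.meta["_torch_tensorrt_settings"] = dataclasses.asdict(settings)

def read_compile_settings(gm: torch.fx.GraphModule) -> dict:
    return gm.meta["_torch_tensorrt_settings"]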
mapping = get_refit_mapping(gm, input_list, settings)

TRT_LOGGER = trt.Logger(trt.Logger.ERROR)
Let's move this stuff into a submodule or other file (torch_tensorrt/dynamo/_refit.py).
Reuse the global logger:

TensorRT/py/torch_tensorrt/logging.py, line 33 in 3422c41:
TRT_LOGGER = _TRTLogger()
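i.e., roughly (assuming the module-level logger shown in the permalink is importable as-is):

import tensorrt as trt
from torch_tensorrt.logging import TRT_LOGGER  # reuse instead of trt.Logger(trt.Logger.ERROR)

refitter = trt.Refitter(engine, TRT_LOGGER)  # engine: the built ICudaEngine to refit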
@@ -88,6 +89,61 @@ def interpret_module_to_result(
    return interpreter_result


def get_refit_mapping(
Move this to the refit file
def construct_refit_weight_mapping(
    new_weights_mod: torch.fx.GraphModule,
    compile_settings: CompilationSettings,  # info from the metadata of the compiled module
):
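A rough sketch of a body under that signature; TRTInterpreter, input_specs, and the (name, role) -> weights mapping shape are assumptions drawn from this PR's conversion code, not a settled design:

import tensorrt as trt
import torch

def construct_refit_weight_mapping(
    new_weights_mod: torch.fx.GraphModule,
    compile_settings,  # CompilationSettings read from the compiled module's metadata
    input_specs,       # assumed: the same input specs used at compile time
) -> dict:
    # Re-interpret the new-weights graph into a TRT network definition
    # (without building an engine), using the original compile settings.
    interpreter = TRTInterpreter(
        new_weights_mod, input_specs, compilation_settings=compile_settings
    )
    interpreter._construct_trt_network_def()  # name proposed later in this review
    net = interpreter.ctx.net

    # Pair each refittable layer with its fresh weights.
    mapping = {}
    for i in range(net.num_layers):
        layer = net.get_layer(i)
        if layer.type == trt.LayerType.CONVOLUTION:
            layer.__class__ = trt.IConvolutionLayer  # expose kernel/bias accessors
            mapping[(layer.name, trt.WeightsRole.KERNEL)] = layer.kernel
            mapping[(layer.name, trt.WeightsRole.BIAS)] = layer.bias
        # ... other weighted layer types (deconvolution, constant, ...) would
        # be handled similarly
    return mapping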
    serialized_engine, self._input_names, self._output_names, serialized_cache
)

def get_network_to_refit(
Let's call this something like:

def _construct_trt_network_def()
The user would do something like:

interpreter._construct_trt_network_def()
net = interpreter.ctx.net
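Combined with TensorRT's refit API, applying a (name, role) -> weights mapping like the one sketched earlier might then look like this (a sketch; engine is the already-built ICudaEngine from the compiled module):

import tensorrt as trt

interpreter._construct_trt_network_def()  # build the network definition only
net = interpreter.ctx.net                 # e.g. to harvest fresh weights from it

# Push the harvested weights into the existing engine in place.
refitter = trt.Refitter(engine, TRT_LOGGER)
for (layer_name, role), weights in mapping.items():
    refitter.set_weights(layer_name, role, weights)

# refit_cuda_engine() returns False if any refittable weights were not supplied.
assert refitter.refit_cuda_engine()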
AI: replace all uses of the locally created TRT_LOGGER with the global one. cc: @peri044
…efit. Support setting loading
py/torch_tensorrt/dynamo/_refit.py (outdated)
compiled_module = copy.copy(compiled_module)
# Iterate over all components that can be accelerated
# Generate the corresponding TRT Module for those
for name, _ in partitioned_module.named_children():
We need to ensure that the new module's partition is the same as the compiled module's partition.
- We can check the number of subgraphs, and perhaps the names as well (if deterministic).
- At compile time, compute the hash of the source fx graph (https://github.com/pytorch/pytorch/blob/fba21edf5b9aa14babb9c0bc860dc9c597eb8010/torch/_inductor/codecache.py#L670) and store it as an attribute on the TRTModule. Then compare the hash of the new graph to the one stored in the compiled subgraph module (a minimal sketch follows this list).
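A minimal sketch of the hash option, assuming the graph's printed form is stable enough to key on (the linked _inductor codecache uses a richer pickle-based key, and the attribute name here is made up):

import hashlib

import torch

def fx_graph_hash(gm: torch.fx.GraphModule) -> str:
    # Hash a canonical text rendering of the graph topology.
    return hashlib.sha256(str(gm.graph).encode()).hexdigest()

# At compile time, stash the hash on each TRT submodule:
trt_module.source_fx_graph_hash = fx_graph_hash(subgraph)  # attribute name assumed

# At refit time, verify the new partition matches before touching any weights:
if fx_graph_hash(new_subgraph) != trt_module.source_fx_graph_hash:
    raise RuntimeError("New module's partition does not match the compiled module's")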
@zewenli98 You might be interested in reusing this part for engine caching
LGTM.