
[Torch] Add initial control flow support #4964

Merged: 15 commits into apache:master from the torch-control-flow branch, Mar 10, 2020

Conversation

@masahi (Member) commented Feb 28, 2020

This adds support for parsing TorchScript prim::If and prim::Loop nodes. It is also the first attempt at translating from the output of torch.jit.script(...). See the test cases for the currently supported Python constructs.

The related discussion (with an example IR dump): https://discuss.tvm.ai/t/discuss-adding-a-pytorch-frontend/5026/24

The CI is blocked by an unrelated Sphinx issue, but the PR is ready for review.

cc @zhiics @icemelon9 @wweic @jroesch @MarisaKirisame @alexwong @tqchen @junrushao1994 @ajtulloch @yinghai
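
For a quick end-to-end try-out, here is a minimal sketch of driving the frontend with a scripted module containing a conditional. The Gate module, input name, and shape are illustrative assumptions, and the from_pytorch signature has varied across TVM versions:

import torch
import tvm
from tvm import relay


class Gate(torch.nn.Module):
    def forward(self, x):
        # Scripting this if/else produces a prim::If node in the graph.
        if bool(x.sum() > 0.0):
            out = x * 2.0
        else:
            out = x - 1.0
        return out


scripted = torch.jit.script(Gate().eval())
# Convert the TorchScript graph, including prim::If, to a Relay module.
mod, params = relay.frontend.from_pytorch(scripted, [("input0", (4,))])
print(mod["main"])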

@zhiics (Member) commented Feb 28, 2020

@masahi Thank you very much for the nice work! I am familiar with TF control flow, and am reading up on PyTorch's control-flow constructs. I will do a careful review by tomorrow.

@alexwong (Contributor) left a comment


Some small comments, also reading up on control flow in Torch.

Review thread on python/tvm/relay/frontend/pytorch.py (outdated, resolved)
@MarisaKirisame (Contributor) commented

Can you not call it "parse"? Parsing converts a string to an AST; converting an AST to another AST is a converter.

Nice work otherwise.

@masahi (Member, Author) commented Feb 29, 2020

Tests passed!!

Can you not call it "parse"? Parsing converts a string to an AST; converting an AST to another AST is a converter.

No problem, will do.

@alexwong @zhiics A good way to get familiar with TorchScript is to try and tweak simple examples. For example,

import torch


class SimpleLoop(torch.nn.Module):
    def forward(self, inp):
        a = inp
        for i in range(10):
            a += i
        return a


class SimpleWhileLoop(torch.nn.Module):
    def forward(self, inp):
        a = inp
        i = 0
        while i < 10:
            a += i
            i += 1
        return a


print(torch.jit.script(SimpleLoop()).graph)
print(torch.jit.script(SimpleWhileLoop()).graph)

would print

graph(%self : __torch__.SimpleLoop,
      %inp.1 : Tensor):
  %10 : int = prim::Constant[value=1]()
  %6 : bool = prim::Constant[value=1]() # test.py:7:8
  %3 : int = prim::Constant[value=10]() # test.py:7:23
  %a : Tensor = prim::Loop(%3, %6, %inp.1) # test.py:7:8
    block0(%i.1 : int, %a.5 : Tensor):
      %a.2 : Tensor = aten::add_(%a.5, %i.1, %10) # test.py:8:12
      -> (%6, %a.2)
  return (%a)

graph(%self : __torch__.SimpleWhileLoop,
      %inp.1 : Tensor):
  %i.1 : int = prim::Constant[value=0]() # test.py:15:12
  %4 : int = prim::Constant[value=9223372036854775807]() # test.py:16:8
  %6 : int = prim::Constant[value=10]() # test.py:16:18
  %16 : int = prim::Constant[value=1]() # test.py:18:17
  %39 : bool = prim::Constant[value=1]()
  %a : Tensor, %i : int = prim::Loop(%4, %39, %inp.1, %i.1) # test.py:16:8
    block0(%8 : int, %a.5 : Tensor, %i.8 : int):
      %a.2 : Tensor = aten::add_(%a.5, %i.8, %16) # test.py:17:12
      %i.6 : int = aten::add(%i.8, %16) # test.py:18:12
      %7 : bool = aten::lt(%i.6, %6) # test.py:16:14
      -> (%7, %a.2, %i.6)
  return (%a)

Hopefully it is much simpler than TensorFlow control flow.
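
An analogous experiment for conditionals (this LoopWithIf module is an illustrative addition, not from the thread) shows how prim::If blocks nest inside a prim::Loop body:

import torch


class LoopWithIf(torch.nn.Module):
    def forward(self, inp):
        a = inp
        for i in range(10):
            if bool(a.sum() > 100.0):
                a = a * 0.5
            else:
                a = a + i
        return a


# The printed graph nests a prim::If node (two blocks, one per branch)
# inside the prim::Loop body block.
print(torch.jit.script(LoopWithIf()).graph)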

@MarisaKirisame (Contributor) commented

One thing to keep in mind is that TorchScript has mutation, continue, while, etc., so the design should be prepared to generate code in A-Normal Form for correctness. Right now that is not needed, as the supported constructs are all purely functional, but the architecture should make such a change as simple as possible (e.g., not require rewriting all the code).
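
For reference, what A-Normal Form looks like in Relay can be seen with the existing ToANormalForm pass; a minimal sketch (exact printed output varies by TVM version):

import tvm
from tvm import relay

x = relay.var("x", shape=(4,), dtype="float32")
# A nested expression: ANF gives every intermediate result its own
# let-binding, fixing evaluation order -- the property that matters
# once effectful ops enter the graph.
body = relay.add(relay.multiply(x, x), x)
mod = tvm.IRModule.from_expr(relay.Function([x], body))
print(relay.transform.ToANormalForm()(mod))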

@masahi (Member, Author) commented Feb 29, 2020

@MarisaKirisame I have something relevant to share. I was told by one of the TorchScript devs on their forum that he is working on a "functionalization" pass (pytorch/pytorch#33020) that could help my use case. I'm not sure exactly what it does, but from looking at their code it seems to find a "functional" subset of nodes, i.e. nodes with no impure operations, and extract it as a subgraph. I guess it helps them apply some of their optimizations more aggressively.

He also has other related PRs in flight that aim at making the graph "more pure": one on removing in-place ops (pytorch/pytorch#33186), another on removing list append (pytorch/pytorch#33199). I ran into exactly these impure ops when trying to convert more realistic LSTM models than the one in this PR, so these new features in Torch could be useful to us later.

@MarisaKirisame (Contributor) commented

// Functional Graphs are not responsible for maintaining aliasing
// relationships. If an output of a functional graph escapes scope
// or is mutated then we might change semantics of the program if
// aliasing relationships are changed.

... WTF
I don't think a lot of smartness should go into the converter - if there are multiple converters, each would need its own smart hacks to remove references and such.
A better design IMO is to use PartialEvaluator/DeadCodeElimination/SomeOtherCustomPass to remove the references, while the converter produces code with effects and references without much cleaning.
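
For concreteness, here is a tiny hand-written Relay snippet with a reference cell, the kind of effectful code a converter could emit directly and leave for later passes to clean up (an illustrative sketch, not code from this PR):

from tvm import relay

x = relay.var("x", shape=(), dtype="float32")
r = relay.var("r")
u = relay.var("u")
# let r = ref(x); r := x + 1; !r
body = relay.Let(
    r,
    relay.RefCreate(x),
    relay.Let(u, relay.RefWrite(r, relay.add(x, relay.const(1.0))), relay.RefRead(r)),
)
print(relay.Function([x], body))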

Further review threads on python/tvm/relay/frontend/pytorch.py (resolved)
@masahi force-pushed the torch-control-flow branch 3 times, most recently from a83a729 to b1333cb on March 5, 2020
@masahi (Member, Author) commented Mar 10, 2020

@zhiics @MarisaKirisame @alexwong Comments have been addressed, and I have no plans to update this PR further. Can we merge this?

@zhiics (Member) left a comment


LGTM

@MarisaKirisame could you take a look and approve explicitly if it looks good to you as well?

@zhiics merged commit 06e9542 into apache:master on Mar 10, 2020
@zhiics (Member) commented Mar 10, 2020

Thanks @masahi @MarisaKirisame @alexwong

trevor-m pushed a commit to trevor-m/tvm that referenced this pull request Apr 16, 2020
* Add support for prim::If and prim::Loop with test cases

* rebase and fix tests

* add some comments

* simplifying, fix float cast

* parse -> convert

* recursively retrieve ops in get_all_op_names

* use multiple return values from block correctly, simplify loop convert

* choose dtype properly for zeros and ones

* simplifying, replace convert_inputs with _get_relay_input_vars

* fix for while loop with non input dependent init cond

* add assert on loop var update

* move the condition around

* better testing for seg models

* rebase fix, disable inception v3 in quant test as it is too slow to load with torch-1.4 + torchvision 0.5

* simplify and add more comparison op converter
zhiics pushed a commit to neo-ai/tvm that referenced this pull request Apr 17, 2020