Merge pull request #2 from onnx/gs/tf2onnx

initial drop for tf2onnx
onnx · Mar 16, 2018 · 12d32fa · 12d32fa
2 parents 1384ad3 + c6dca6c
commit 12d32fa
Show file tree

Hide file tree

Showing 30 changed files with 4,617 additions and 0 deletions.
diff --git a/.gitignore b/.gitignore
@@ -0,0 +1,14 @@
+.coverage
+*.pyc
+.idea
+build
+dist
+bin
+obj
+.ipynb_checkpoints
+__pycache__
+*.pyc
+*.swp
+.cache
+.eggs
+*.egg-info
diff --git a/README.md b/README.md
@@ -0,0 +1,140 @@
+tf2onnx - convert tensorflow models to onnx models.
+========
+
+Tf2onnx converts a tensorflow graph to an onnx graph.
+
+Tf2onnx is in its early development. Mileage will vary since tensorflow supports ~4 times the operations that the current onnx version supports. But standard models seem to be using mostly ops that onnx does support.
+
+# Status
+Baisc net and conv nets should work. A list of models that pass tests can be found [here](tests/run_pretrained_models.yaml)
+
+# Installation
+Install dependencies:
+```
+pip install onnx
+pip install tensorflow
+```
+If you want to run unit tests against the caffe2 onnx backend, build and install caffe2 and onnx-caffe2:
+```
+https://github.com/caffe2/caffe2
+https://github.com/onnx/onnx-caffe2
+```
+Once dependencies are installed, from the tf2onnx root folder call:
+```
+python setup.py install
+```
+or 
+```
+python setup.py develop
+```
+
+To create a wheel for distribution:
+```
+python setup.py bdist_wheel
+```
+# Usage
+```
+python -m tf2onnx.convert
+usage: convert.py [-h] --input INPUT [--output OUTPUT] --inputs INPUTS --outputs OUTPUTS [--pbtxt PBTXT] [--pretty] [--continue_on_error] [--verbose]
+```
+For example:
+```
+python -m tf2onnx.convert.py --input tests/models/fc-layers/frozen.pb --inputs X:0 --outputs output:0 --output tests/models/fc-layers/model.onnx --pretty --verbose
+```
+
+To convert a tensorflow model, tf2onnx expects a ```frozen tensorflow graph``` and the user needs to specify inputs and outputs for the graph. 
+To find the inputs and outputs for the tensorflow graph the model developer will know or you can consult tensorflow's [summarize_graph](https://github.com/tensorflow/tensorflow/tree/master/tensorflow/tools/graph_transforms) tool, for example:
+```
+summarize_graph --in_graph=tests/models/fc-layers/frozen.pb
+```
+
+The tensorflow tool to freeze the graph is [here](https://github.com/tensorflow/tensorflow/blob/master/tensorflow/python/tools/freeze_graph.py).
+
+For example:
+```
+tools=`python -c "import tensorflow as tf; print(tf.sysconfig.get_lib()+'/python/tools')"`
+
+python $tools/freeze_graph.py \
+    --input_graph=my_checkpoint_dir/graphdef.pb \
+    --input_binary=true \
+    --input_names=input:0 \
+    --output_node_names=output:0 \
+    --input_checkpoint=my_checkpoint_dir \
+    --output_graph=tests/models/fc-layers/frozen.pb
+```
+
+
+# Testing
+There are 2 types of tests.
+
+## Unit test
+```
+python setup.py test
+```
+
+## Validate pre-trained tensorflow models
+```
+python tests/run_pretrained_models.py
+usage: run_pretrained_models.py [-h] [--cache CACHE] [--tests TESTS] [--backend BACKEND] [--verbose] [--debug] [--config yaml-config]
+
+optional arguments:
+  -h, --help         show this help message and exit
+  --cache CACHE      pre-trained models cache dir
+  --tests TESTS      tests to run
+  --backend BACKEND  backend to use
+  --config           yaml config file
+  --verbose          verbose output
+  --debug            dump generated graph with shape info
+```
+```run_pretrained_models.py``` will run the tensorflow model, captures the tensorflow output and runs the same test against the specified onnx backend after converting the model. The only practical backend to use at this time is caffe2, and you need to install caffe2 for this to work.
+
+You call it for example with:
+```
+python tests/run_pretrained_models.py --backend caffe2 --config tests/run_pretrained_models.yaml
+```
+# How tf2onnx works
+While the protobuf format of onnx is not all that different than onnx, mileage will vary because tensorflow supports 4x the ops compared to the current version of onnx.
+The converter needs to take care of a few things:
+1. Convert the protobuf format. Since the format is similar this step is straight forward.
+2. Tensorflow types need to be mapped to their onnx equivalent.
+3. For many ops tensorflow passes parameters like shapes as inputs where onnx wants to see them as attributes. Since we use a frozen graph, the converter will fetch the input as constant, converts it to an attribute and remove the original input.
+4. Tensorflow in many cases composes ops out of multiple simpler ops. The converter will need to identify the subgraph for such ops, slice the subgraph out and replace it with the onnx equivalent. This can become fairly complex so we use a graph matching library for it. A good example of this is the tensorflow transpose op.
+5. Tensorflow's default data format is NHWC where onnx requires NCHW. The converter will insert transpose ops to deal with this.
+6. There are some ops like relu6 that are not supported in onnx but the converter can be composed out of other onnx ops.
+7. Onnx backends are new and their implementations are not complete yet. For some ops the converter generate ops with deal with issues in existing backends.
+
+### Step 1 - start with a frozen graph.
+tf2onnx starts with a frozen graph. This is because of item 3 above.
+
+### Step 2 - 1:1 convertion of the protobuf from tensorflow to onnx
+tf2onnx first does a simple convertion from the tensorflow protobuf format to the onnx protobuf format without looking at individual ops.
+We do this so we can use the onnx graph as internal representation and write helper functions around it.
+The code that does the conversion is in tensorflow_to_onnx(). tensorflow_to_onnx() will return the onnx graph and a dictionary with shape information from tensorflow. The shape information is helpfull in some cases when processing individual ops. 
+The onnx graph is wrapped in a Graph object and nodes in the graph are wrapped in a Node object to allow easier graph manipulations on the graph. All code that deals with nodes and graphs is in graph.py.
+
+### Step 3 - rewrite subgraphs
+In the next step we apply graph matching code on the graph to re-write subgraphs for ops like transpose and lstm. For an example looks at rewrite_transpose().
+
+### Step 4 - process individual ops
+In the fourth step we look at individual ops that need attention. The dictionary _OPS_MAPPING will map tensorflow op types to a method that is used to process the op. The simplest case is direct_op() where the op can be taken as is. Whenever possible we try to group ops into common processing, for example all ops that require dealing with broadcasting are mapped to broadcast_op(). For an op that composes the tensorflow op from multiple onnx ops, see relu6_op().
+
+### Step 5 - final processing
+Once all ops are converted, we need to do a topological sort since onnx requires it. process_tf_graph() is the method that takes care of all above steps.
+
+# Extending tf2onnx
+If you like to contribute and add new conversions to tf2onnx, the process is something like:
+1. See if the op fits into one of the existing mappings. If so adding it to _OPS_MAPPING is all that is needed.
+2. If the new op needs extra procesing, start a new mapping function.
+3. If the tensorflow op is composed of multiple ops, consider using a graph re-write. While this might be a little harder initially, it works better for complex patterns.
+4. Add a unit test in tests/test_backend.py. The unit tests mostly create the tensorflow graph, run it and capture the output, than convert to onnx, run against a onnx backend and compare tensorflow and onnx results. 
+5. If there are pre-trained models that use the new op, consider adding those to test/run_pretrained_models.py.
+
+# What is missing
+- lstm/gru support (working on this)
+- more testing
+- mode model coverage
+
+# License
+
+[MIT License](LICENSE)
+
diff --git a/VERSION_NUMBER b/VERSION_NUMBER
@@ -0,0 +1 @@
+0.0.0.1
diff --git a/build.bat b/build.bat
@@ -0,0 +1,2 @@
+python -m pytest --cov=tf2onnx
+python setup.py bdist_wheel
diff --git a/build.sh b/build.sh
@@ -0,0 +1,11 @@
+#!/bin/bash
+
+set -x
+
+
+apt-get install -y protobuf-compiler libprotoc-dev
+pip install setuptools 
+pip install onnx pytest-cov
+
+python setup.py test
+python setup.py bdist_wheel
diff --git a/setup.cfg b/setup.cfg
@@ -0,0 +1,6 @@
+[aliases]
+test=pytest
+
+[tool:pytest]
+addopts=--cov=tf2onnx --ignore=tests/test_backend.py
+#testpaths=tests/test_*.py
diff --git a/setup.py b/setup.py
@@ -0,0 +1,83 @@
+# Copyright (c) Microsoft Corporation. All rights reserved.
+# Licensed under the MIT license.
+
+from collections import namedtuple
+import os
+from setuptools import setup, find_packages, Command
+import distutils.command.build
+import setuptools.command.build_py
+import setuptools.command.develop
+import setuptools.command.install
+import subprocess
+from textwrap import dedent
+
+TOP_DIR = os.path.realpath(os.path.dirname(__file__))
+SRC_DIR = os.path.join(TOP_DIR, 'tf2onnx')
+
+try:
+    git_version = subprocess.check_output(['git', 'rev-parse', 'HEAD'], cwd=TOP_DIR).decode('ascii').strip()
+except (OSError, subprocess.CalledProcessError):
+    git_version = None
+
+with open(os.path.join(TOP_DIR, 'VERSION_NUMBER')) as version_file:
+    VersionInfo = namedtuple('VersionInfo', ['version', 'git_version'])(
+        version=version_file.read().strip(),
+        git_version=git_version
+    )
+
+
+class create_version(Command):
+    user_options = []
+
+    def initialize_options(self):
+        pass
+
+    def finalize_options(self):
+        pass
+
+    def run(self):
+        with open(os.path.join(SRC_DIR, 'version.py'), 'w') as f:
+            f.write(dedent('''
+            version = '{version}'
+            git_version = '{git_version}'
+            '''.format(**dict(VersionInfo._asdict()))))
+
+
+class build_py(setuptools.command.build_py.build_py):
+    def run(self):
+        self.run_command('create_version')
+        setuptools.command.build_py.build_py.run(self)
+
+
+class build(distutils.command.build.build):
+    def run(self):
+        self.run_command('build_py')
+
+
+class develop(setuptools.command.develop.develop):
+    def run(self):
+        self.run_command('create_version')
+        self.run_command('build')
+        setuptools.command.develop.develop.run(self)
+
+
+cmdclass = {
+    'create_version': create_version,
+    'build_py': build_py,
+    'build': build,
+    'develop': develop,
+}
+
+setup(
+    name="tf2onnx",
+    version=VersionInfo.version,
+    description="tensorflow to onnx converter",
+    setup_requires=['pytest-runner'],
+    tests_require=['numpy', 'pytest', 'pytest-cov', 'psutil'],
+    cmdclass=cmdclass,
+    packages=find_packages(),
+    author='onnx@microsoft.com',
+    author_email='onnx@microsoft.com',
+    url='https://github.com/onnx/tensorflow-onnx',
+    install_requires=['tensorflow', 'onnx', 'graphviz', 'pyyaml', 'pytest-cov']
+)
diff --git a/tests/beach.jpg b/tests/beach.jpg
diff --git a/tests/models/ae0/frozen.pb b/tests/models/ae0/frozen.pb
diff --git a/tests/models/conv-layers/frozen.pb b/tests/models/conv-layers/frozen.pb
diff --git a/tests/models/fc-layers/frozen.pb b/tests/models/fc-layers/frozen.pb
diff --git a/tests/models/lstm/frozen.pb b/tests/models/lstm/frozen.pb
Original file line number	Diff line number	Diff line change
		@@ -0,0 +1,2 @@
		python -m pytest --cov=tf2onnx
		python setup.py bdist_wheel