MLPerf GNMT inference

Description

This document has instructions for running MLPerf GNMT inference using Intel-optimized TensorFlow.

Datasets

Download and unzip the MLPerf GNMT model benchmarking data.

wget https://zenodo.org/record/2531868/files/gnmt_inference_data.zip
unzip gnmt_inference_data.zip
export DATASET_DIR=$(pwd)/nmt/data

Set the DATASET_DIR to point as instructed above when running MLPerf GNMT.

Quick Start Scripts

Script name	Description
`online_inference.sh`	Runs online inference (batch_size=1).
`batch_inference.sh`	Runs batch inference (batch_size=32).
`accuracy.sh`	Runs accuracy

Run the model

Setup your environment using the instructions below, depending on if you are using AI Kit:

Setup using AI Kit

Setup without AI Kit

To run using AI Kit you will need:

git
numactl
pip
wget
Bazel to build tensorflow addons
Activate the `tensorflow` conda environment
```
conda activate tensorflow
```

To run without AI Kit you will need:

Python 3
intel-tensorflow>=2.5.0
git
numactl
pip
wget
Bazel to build tensorflow addons

A clone of the Model Zoo repo

git clone https://github.com/IntelAI/models.git

After installing the prerequisites, download the pretrained model and set the PRETRAINED_MODEL environment variable to the .pb file path:

wget https://storage.googleapis.com/intel-optimized-tensorflow/models/v1_8/mlperf_gnmt_fp32_pretrained_model.pb
export PRETRAINED_MODEL=$(pwd)/mlperf_gnmt_fp32_pretrained_model.pb

MLPerf GNMT requires TensorFlow addons to be built with a patch from the model zoo. The snippet below shows how to clone the addons repo, apply the patch, and then build and install the TensorFlow addons wheel.

# TensorFlow addons (r0.5) build and installation instructions:
#   Clone TensorFlow addons (r0.5) and apply a patch: A patch file
#   is attached in Intel Model Zoo MLpref GNMT model scripts,
#   it fixes TensorFlow addons (r0.5) to work with TensorFlow
#   version 2.11, and prevents TensorFlow 2.0.0 to be installed
#   by default as a required dependency.
git clone --single-branch --branch=r0.5 https://github.com/tensorflow/addons.git
cd addons
git apply ../models/language_translation/tensorflow/mlperf_gnmt/gnmt-fix.patch

#   Build TensorFlow addons source code and create TensorFlow addons
#   pip wheel. Use bazel 6.0.0 version :
#   Answer yes to questions while running configure.sh
bash configure.sh
bazel build --enable_runfiles build_pip_pkg
bazel-bin/build_pip_pkg artifacts
pip install artifacts/tensorflow_addons-*.whl --no-deps

Once that has been completed, ensure you have the required environment variables set, and then run a quickstart script.

# cd to your model zoo directory
cd models

# Set env var paths
export DATASET_DIR=<path to the dataset>
export PRECISION=fp32
export OUTPUT_DIR=<path to the directory where log files will be written>
export PRETRAINED_MODEL=<path to the pretrained model frozen graph>
# For a custom batch size, set env var `BATCH_SIZE` or it will run with a default value.
export BATCH_SIZE=<customized batch size value>

# Run a quickstart script
./quickstart/language_translation/tensorflow/mlperf_gnmt/inference/cpu/<script name>.sh

Additional Resources

To run more advanced use cases, see the instructions for the available precisions FP32 for calling the launch_benchmark.py script directly.
To run the model using docker, please see the Intel® Developer Catalog workload container:
https://software.intel.com/content/www/us/en/develop/articles/containers/gnmt-fp32-inference-tensorflow-container.html.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

MLPerf GNMT inference

Description

Datasets

Quick Start Scripts

Run the model

Additional Resources

Files

README.md

Latest commit

History

README.md

File metadata and controls

MLPerf GNMT inference

Description

Datasets

Quick Start Scripts

Run the model

Additional Resources