Skip to content

Merlin: HugeCTR Backend V3.0.1

Compare
Choose a tag to compare
@yingcanw yingcanw released this 12 Apr 03:20
65f4be7

Release Notes

What’s New in Version 3.0.1

  • End-to-End Tutorial: In this release, we provide two tutorials on how to train a standard DLRM model using HugeCTR High-level python API and how to deploy the standard DLRM mode to Triton Inference Server using HugeCTR Backend. These tutorials explain the steps to train and inference with HugeCTR and NVTabular with the Merlin framework. The users could collect the inference benchmark by Triton performance analyzer tool.

  • Enhanced Model Scalability: To enhance the scalability of different recommendation model input requirements, we support input data without dense features.

  • Enhanced Model Control API: Fixed the issue that Parameter Server could not release GPU memory when using Triton model control API to unload the model at runtime.