Skip to content

Merlin: HugeCTR Backend V3.1

Compare
Choose a tag to compare
@yingcanw yingcanw released this 30 Jul 02:11
7a49582

Release Notes

What’s New in Version 3.1

  • Independent Parameter Server Configuration: In this release, HugeCTR backend has decoupled the Parameter Server-related configuration from the Triton configuration file(config.pbtxt), making it easier to configure the embedding table-related parameters per model. Especially for the configuration of multiple embedded tables per model, avoid too many command parameters when launching the Triton server.

  • Metrix: To Indicate GPU and request statistics, use Prometheus to gather metrics into usable, actionable entries, giving you the data you need to manage alerts and performance information in your environment.

  • Multiple Embedding Tables Model Deployment: To enhance the scalability of different number of embedding tables per model requirements, HugeCTR backend has supported the multiple embedding tables per model deployment.