Merlin: HugeCTR Backend V3.5
Release Notes
What’s New in Version 3.5
-
Hierarchical Parameter Server (HPS) Triton Backend:
The Hierarchical Parameter Server(HPS) Backend is a framework for embedding vectors looking up on large-scale embedding tables that was designed to effectively use GPU memory to accelerate the looking up by decoupling the embedding tables and embedding cache from the end-to-end inference pipeline of the deep recommendation model. For more information, please refer to Hierarchical Parameter Server. -
HPS interface encapsulation and exporting as library:
We encapsulate the HPS interfaces and deliver it as a standalone library. Besides, we provide HPS Python APIs and demonstrate the usage with a notebook. For more information, please refer to the HPS Demo. -
HPS performance optimization: use better method to determine partition number in database backends in HPS