Skip to content

Merlin: HugeCTR Backend V3.5

Compare
Choose a tag to compare
@yingcanw yingcanw released this 04 Apr 14:25
9975c81

Release Notes

What’s New in Version 3.5

  • Hierarchical Parameter Server (HPS) Triton Backend:
    The Hierarchical Parameter Server(HPS) Backend is a framework for embedding vectors looking up on large-scale embedding tables that was designed to effectively use GPU memory to accelerate the looking up by decoupling the embedding tables and embedding cache from the end-to-end inference pipeline of the deep recommendation model. For more information, please refer to Hierarchical Parameter Server.

  • HPS interface encapsulation and exporting as library:
    We encapsulate the HPS interfaces and deliver it as a standalone library. Besides, we provide HPS Python APIs and demonstrate the usage with a notebook. For more information, please refer to the HPS Demo.

  • HPS performance optimization: use better method to determine partition number in database backends in HPS