v1.2.0
Stable Release
-
Features
- GPU Performance improvements, 50%-300% improvement over vanilla Triton
- Performance improvements on CPU, optimize uvloop + multi-processing
- Huggingface Transformer example
- Binary input support, #37 , thanks @Aleksandar1932
-
Bug fixes
- stdout/stderr in inference service was not logged to dedicated Task