Skip to content

LinearInt8 layer for inference of int8-quantized LLMs and Arm intrinsics #3367

LinearInt8 layer for inference of int8-quantized LLMs and Arm intrinsics

LinearInt8 layer for inference of int8-quantized LLMs and Arm intrinsics #3367