diff --git a/docs/source/en/model_doc/zamba.md b/docs/source/en/model_doc/zamba.md index e019b90de486..5519b2b775f1 100644 --- a/docs/source/en/model_doc/zamba.md +++ b/docs/source/en/model_doc/zamba.md @@ -25,7 +25,7 @@ pip install mamba-ssm causal-conv1d>=1.2.0 ``` You also have to have the model on a CUDA device. -You can run the model not using the optimized Mamba kernels, but it is **not** recommended as it will result in significantly lower latencies. In order to do that, you'll need to specify `use_mamba_kernels=False` when loading the model. +You can run the model not using the optimized Mamba kernels, but it is **not** recommended as it will result in significantly higher latency. In order to do that, you'll need to specify `use_mamba_kernels=False` when loading the model. ## Inference