You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
It would be interesting to use this. But noScribe is deeply integrated with faster-whisper. We have to take a closer look how much the interfaces deviate from each other. Until then, I have some other things to finish. But I will keep this on my list, thank you.
The reason I am suggesting this is because it seems NoScribe could be even more optimized for Apple silicon.On a Macbook M4 24 GB NoScribe is not nearly as fast as the inference speed bump observable with LLM Studio before and after they implemented MLX, which leads me to assume that there might be a lot more optimizations possible for NoScribe on Apple hardware. It seems to me this is about unlocking some of the M4 architecture designed for inference, so it seems faster-whisper could benefit from this as welll.
Please consider implementing MLX, which promises significantly faster inference on Apple Silicone Macs.
Find the version prepared for Whisper here https://github.com/ml-explore/mlx-examples/tree/main/whisper
Implementation info for MLX: https://github.com/ml-explore/mlx
The text was updated successfully, but these errors were encountered: