Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

MLX for significantly faster inference on Apple Silicone (M1 - M4) #124

Open
menelic opened this issue Feb 3, 2025 · 2 comments
Open

MLX for significantly faster inference on Apple Silicone (M1 - M4) #124

menelic opened this issue Feb 3, 2025 · 2 comments
Labels
enhancement New feature or request

Comments

@menelic
Copy link

menelic commented Feb 3, 2025

Please consider implementing MLX, which promises significantly faster inference on Apple Silicone Macs.

Find the version prepared for Whisper here https://github.com/ml-explore/mlx-examples/tree/main/whisper

Implementation info for MLX: https://github.com/ml-explore/mlx

@kaixxx
Copy link
Owner

kaixxx commented Feb 3, 2025

It would be interesting to use this. But noScribe is deeply integrated with faster-whisper. We have to take a closer look how much the interfaces deviate from each other. Until then, I have some other things to finish. But I will keep this on my list, thank you.

@kaixxx kaixxx added the enhancement New feature or request label Feb 3, 2025
@menelic
Copy link
Author

menelic commented Feb 3, 2025

The reason I am suggesting this is because it seems NoScribe could be even more optimized for Apple silicon.On a Macbook M4 24 GB NoScribe is not nearly as fast as the inference speed bump observable with LLM Studio before and after they implemented MLX, which leads me to assume that there might be a lot more optimizations possible for NoScribe on Apple hardware. It seems to me this is about unlocking some of the M4 architecture designed for inference, so it seems faster-whisper could benefit from this as welll.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants