-
Notifications
You must be signed in to change notification settings - Fork 87
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
windows端部署 #75
Comments
您好,可以在windows端部署2.1B 模型。需要配置5G以上内存,Python3.9+,及如下依赖, 将模型文件 [https://huggingface.co/IEITYuan/Yuan2-2B-hf/tree/main]下载到指定路径yuan-2B-path,并手工关掉Flash attention即可。(手工关掉Flash attention:1. 修改 config.json中"use_flash_attention"为 false; 运行代码如下: print("Creat tokenizer...") print("Creat model...") inputs = tokenizer("请问目前最先进的机器学习算法有哪些?", return_tensors="pt")["input_ids"].to("cpu") |
@Longleaves 可以通过YuanChat实现在windows上的快速部署,详见: https://github.com/IEIT-Yuan/YuanChat |
基于YuanChat快速搭建对话应用教程,详见:https://github.com/IEIT-Yuan/YuanChat/blob/main/docs/%E5%9C%A8%E7%AC%94%E8%AE%B0%E6%9C%AC%E4%B8%8A%E5%BF%AB%E9%80%9F%E9%83%A8%E7%BD%B2YuanModel%E5%92%8CYuanChat.md |
请问能否部署在windows端呢?除了docker外,有没有直接详细的环境配置?
The text was updated successfully, but these errors were encountered: