-
Notifications
You must be signed in to change notification settings - Fork 373
Pull requests: InternLM/lmdeploy
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Fix some issues encountered by modelscope and community
Bug:P1
#2428
opened Sep 5, 2024 by
irexyc
Loading…
support min_p sampling parameter
enhancement
New feature or request
#2420
opened Sep 4, 2024 by
irexyc
Loading…
feat: support npu device on Ascend platform with 'torch_npu' package
#2341
opened Aug 20, 2024 by
jiajie-yang
Loading…
better formatted table of 'lmdeploy list'
improvement
WIP
#2289
opened Aug 12, 2024 by
lvhan028
Loading…
[Feature] support qqq(w4a8) for lmdeploy
#2274
opened Aug 9, 2024 by
HandH1998
Loading…
6 tasks done
[Feature] Support XTuner Lite Llava
enhancement
New feature or request
#2191
opened Jul 31, 2024 by
pppppM
Loading…
Custom backend support.
enhancement
New feature or request
#2104
opened Jul 22, 2024 by
grimoire
Loading…
[benchmark] optimize benchmark: counting tokenlizer tokens and error requests
#1607
opened May 17, 2024 by
NiuBlibing
Loading…
fix: update api_server_backend.py to adapt latest gradio
improvement
#1541
opened May 3, 2024 by
kv-chiu
Loading…
Visualize layer activations and weights to simplify the quantization process.
#607
opened Oct 24, 2023 by
HIT-cwh
Loading…
ProTip!
Follow long discussions with comments:>50.