InternLM / lmdeploy Public

Notifications You must be signed in to change notification settings
Fork 373
Star 4.1k

Code
Issues 263
Pull requests 23
Discussions
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Security
Insights

Pull requests: InternLM/lmdeploy

Labels 34 Milestones 0

New pull request New

23 Open 1,027 Closed

Author

Filter by author

Label

Filter by label

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Milestones

Filter by milestone

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

Fix some issues encountered by modelscope and community Bug:P1

#2428 opened Sep 5, 2024 by irexyc

Loading…

build: update ascend dockerfile

#2421 opened Sep 4, 2024 by CyCle1024

Loading…

support min_p sampling parameter enhancement

New feature or request

#2420 opened Sep 4, 2024 by irexyc

Loading…

Torchrun launching multiple api_server

#2402 opened Aug 30, 2024 by AllentDan

Loading…

More w8a8 models

#2373 opened Aug 26, 2024 by AllentDan • Draft

feat: support ascend qwen

#2357 opened Aug 22, 2024 by yao-fengchen

Loading…

feat: support ascend mixtral

#2356 opened Aug 22, 2024 by yao-fengchen

Loading…

feat: support npu device on Ascend platform with 'torch_npu' package

#2341 opened Aug 20, 2024 by jiajie-yang

Loading…

[Feature] Support vision module w8a8 inference

#2308 opened Aug 14, 2024 by AllentDan

Loading…

better formatted table of 'lmdeploy list' improvement WIP

#2289 opened Aug 12, 2024 by lvhan028

Loading…

[Feature] support qqq(w4a8) for lmdeploy

#2274 opened Aug 9, 2024 by HandH1998

Loading…

6 tasks done

[Feature] Support XTuner Lite Llava enhancement

New feature or request

#2191 opened Jul 31, 2024 by pppppM

Loading…

Custom backend support. enhancement

New feature or request

#2104 opened Jul 22, 2024 by grimoire

Loading…

Add prefix cache stats to usage

#2018 opened Jul 13, 2024 by ispobock

Loading…

feat: decouple input_ids and output_ids

#1855 opened Jun 25, 2024 by zhyncs

Loading…

Add Jetson platform support (by docker)

#1820 opened Jun 21, 2024 by BestAnHongjun

Loading…

support vl benchmark

#1662 opened May 27, 2024 by AllentDan

Loading…

[benchmark] optimize benchmark: counting tokenlizer tokens and error requests

#1607 opened May 17, 2024 by NiuBlibing

Loading…

support AI4Chem/ChemLLM-7B-Chat-1_5-SFT WIP

#1552 opened May 7, 2024 by lvhan028

Loading…

fix: update api_server_backend.py to adapt latest gradio improvement

#1541 opened May 3, 2024 by kv-chiu

Loading…

Log stats enhancement

New feature or request

#1423 opened Apr 11, 2024 by AllentDan

Loading…

support frequency penalty

#713 opened Nov 20, 2023 by RytonLi

Loading…

Visualize layer activations and weights to simplify the quantization process.

#607 opened Oct 24, 2023 by HIT-cwh

Loading…

ProTip! Follow long discussions with comments:>50.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly