[New Model]: Cogagent #4575

leoozy · 2024-05-03T06:36:55Z

The model to consider.

https://huggingface.co/THUDM/CogAgent

The closest model vllm already supports.

No response

What's your difficulty of supporting the model you want?

Vision models

JBurtn · 2024-06-22T04:22:42Z

Need to wait for / help with #4888 and #4942 before this can be implemented. Maybe even some more stuff.

afeldman-nm · 2024-07-08T21:43:23Z

> Need to wait for / help with #4888 and #4942 before this can be implemented. Maybe even some more stuff.

Quick update:

#4888 has landed, enabling the xFormers backend to support encoder attention, decoder self-attention, and decoder cross-attention. #4837 and #4888 (both of which have landed) were prerequisites for #4942. #4942 completes end-to-end support for encoder/decoder models with the xFormers backend and also introduces the BART model into vLLM. #4942 is still WIP, but I'm hoping to complete it soon.
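
For anyone following along who is unsure how the three attention types mentioned above differ, here is a minimal pure-Python sketch. It is not vLLM or xFormers code: the `attention` helper, the toy vectors, and the omission of learned query/key/value projections are all simplifying assumptions made for illustration.

```python
import math

def softmax(xs):
    # Numerically stable softmax over a list of floats.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def attention(queries, keys, values, causal=False):
    """Scaled dot-product attention over lists of vectors (hypothetical helper).

    With causal=True, query i may only attend to keys 0..i, which is the
    masking used for decoder self-attention.
    """
    d = len(keys[0])
    out = []
    for i, q in enumerate(queries):
        limit = i + 1 if causal else len(keys)
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(d)
                  for k in keys[:limit]]
        weights = softmax(scores)
        # zip stops at len(weights), so masked-out values are never used.
        out.append([sum(w * v[j] for w, v in zip(weights, values))
                    for j in range(len(values[0]))])
    return out

enc = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]  # toy encoder hidden states
dec = [[0.5, 0.5], [1.0, -1.0]]             # toy decoder hidden states

# Encoder attention: every encoder position sees every other one (no mask).
enc_out = attention(enc, enc, enc)
# Decoder self-attention: causal mask, each position sees only itself and earlier ones.
dec_self = attention(dec, dec, dec, causal=True)
# Decoder cross-attention: queries from the decoder, keys/values from the encoder output.
cross = attention(dec_self, enc_out, enc_out)
```

The only differences between the three cases are where the queries and keys/values come from and whether a causal mask is applied.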

@robertgshaw2-neuralmagic

leoozy added the "new model" label (Requests to new models) on May 3, 2024
