[Paper Reproduction] TaskMatrix #6544

Closed
shiyutang opened this issue Jul 28, 2023 · 2 comments
Labels
others unknown issue type triage

Comments

@shiyutang

shiyutang commented Jul 28, 2023

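Problem description

Background

The paper builds a system called Visual ChatGPT that combines different Visual Foundation Models so that users can interact with ChatGPT by 1) sending and receiving images as well as text; 2) posing complex visual questions or visual editing instructions that require several AI models to collaborate over multiple steps; and 3) giving feedback and asking for corrected results. The authors design a series of prompts that inject visual model information into ChatGPT, accounting for models with multiple inputs/outputs and models that need visual feedback. Experiments show that Visual ChatGPT opens the door to studying the visual roles of ChatGPT with the help of Visual Foundation Models.
Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models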
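
To make the "inject visual model information into ChatGPT through prompts" idea concrete, here is a minimal sketch of how tool descriptions could be folded into a system prompt. It is not taken from the TaskMatrix code: the `VisualTool` class, the `build_system_prompt` function, the two tool names, and the prompt wording are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class VisualTool:
    """Hypothetical wrapper around one visual foundation model."""
    name: str
    description: str            # tells the language model when and how to call the tool
    run: Callable[[str], str]   # input string (e.g. an image path) -> result string


def build_system_prompt(tools: List[VisualTool]) -> str:
    """Fold every tool's name and description into one system prompt."""
    lines = ["You can use the following tools to work with images:"]
    for tool in tools:
        lines.append(f"- {tool.name}: {tool.description}")
    # Assumed convention: images are passed around by file path, not as pixels.
    lines.append("Images are referred to by file path, e.g. image/example.png.")
    return "\n".join(lines)


# Stub tools: the lambdas stand in for real model calls.
tools = [
    VisualTool("caption_image", "Describe an image. Input: image path.",
               lambda path: f"a caption for {path}"),
    VisualTool("edit_image", "Edit an image. Input: image path, instruction.",
               lambda spec: "image/edited.png"),
]
prompt = build_system_prompt(tools)
print(prompt)
```

Because the language model only ever sees text, referring to images by file path is one simple way to let it chain several visual models across steps.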

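Steps:

1. Implement Visual ChatGPT and validate it. Code: https://github.com/microsoft/TaskMatrix
2. Integrate an open-source model such as chatglm v2 or llama v2 to build a Chinese version of Visual ChatGPT, and provide usage examples, documentation, and a UI. Submit to https://github.com/PaddlePaddle/PaddleNLP/tree/develop/pipelines/examples/agents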
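
One way to approach step 2 is to keep the agent loop independent of the language model, so that ChatGPT can be swapped for an open-source model without touching the tool logic. The sketch below is an assumption-laden illustration, not the TaskMatrix or PaddleNLP implementation: `run_agent`, `parse_action`, `fake_llm`, and the `Action:` / `Action Input:` reply format are hypothetical, and `fake_llm` is a scripted stand-in where a real ChatGLM2 or Llama 2 call would go.

```python
import re
from typing import Callable, Dict, Optional, Tuple

LLM = Callable[[str], str]    # prompt in, text out; any chat model can fill this role
Tool = Callable[[str], str]   # tool input in, observation out

# Assumed reply format for a tool call (a ReAct-style convention).
ACTION_RE = re.compile(r"Action:\s*(.+?)\s*\nAction Input:\s*(.+)")


def parse_action(reply: str) -> Optional[Tuple[str, str]]:
    """Return (tool_name, tool_input) if the reply requests a tool, else None."""
    match = ACTION_RE.search(reply)
    if match is None:
        return None
    return match.group(1).strip(), match.group(2).strip()


def run_agent(llm: LLM, tools: Dict[str, Tool], user_message: str,
              max_steps: int = 3) -> str:
    """Alternate between the model and the tools until the model answers in plain text."""
    transcript = f"Human: {user_message}\n"
    for _ in range(max_steps):
        reply = llm(transcript)
        action = parse_action(reply)
        if action is None:
            return reply  # no tool requested: treat the reply as the final answer
        name, arg = action
        tool = tools.get(name)
        observation = tool(arg) if tool else f"unknown tool {name}"
        transcript += f"{reply}\nObservation: {observation}\n"
    return "Stopped: step limit reached."


def fake_llm(transcript: str) -> str:
    """Scripted stand-in for a real model; replace with an actual model call."""
    if "Observation:" not in transcript:
        return "Action: caption_image\nAction Input: image/cat.png"
    return "The image shows a cat."


tools = {"caption_image": lambda path: f"a photo of a cat ({path})"}
answer = run_agent(fake_llm, tools, "What is in image/cat.png?")
print(answer)
```

With this split, supporting Chinese mostly comes down to passing a different `llm` callable and translating the prompts and tool descriptions.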

@Evan-master

I would like to claim this task.

@shiyutang
Author

OK. Please sign up at PaddlePaddle/PaddleOCR#10223 and add me on WeChat (transy-k) so I can add you to our discussion group for the assigned tasks.
