近期计划是什么？| What are the short-term plans? #280

hengtuibabai · 2023-05-09T06:33:33Z

停更了吗？近期计划是什么？
感觉离好用还差一点点了啊，停更的话太可惜了！

Has it stopped? What are the short-term plans?
I feel like just a little short of being useful. It would be a pity if I stopped the watch!

vinthony · 2023-05-10T16:32:39Z

vinthony · 2023-05-10T16:46:26Z

any new suggestions are also welcome : )

2793145003 · 2023-05-11T05:17:22Z

之前看到有人用 Diffusion model，用 Diffusion model 的话也许能做到原图分辨率？（不懂瞎说）

vinthony · 2023-05-11T05:52:02Z

diffusion-based 估计会很慢，不过face vid2vid也很慢就是了。质量上我从paper里没看到很大的差距因为他的diffusion不是SD，是自己训的。这里有一个备选是先用sadtalker生成keypoints，再用pre-trained 的controlnet-face去解。

lliang2003 · 2023-05-11T13:10:25Z

OpenTalker WEBUI, 集成Sadtalker和video-retalking的技术，做到输入图像和视频进行驱动和编辑。

canghaiyunfan · 2023-05-12T03:15:00Z

请问如何实现只针对头进行动作？

Niutonian · 2023-05-23T10:44:25Z

I love your project,
I think a good way to control ref_pose and ref_eyebllink would be awesome,
Another thing that I would love to see implemented is "Idle animation", in case no audio is played it goes through a preset sequence of blink and move without looking too weird

MoroseYu · 2023-06-04T04:53:39Z

不好停更啊，很期待~

vinthony · 2023-06-06T05:15:25Z

cool! thanks for your advise! will woking on it.

2793145003 · 2023-06-06T05:26:01Z

之后会解决脸色苍白双眼无神的问题吗？（还是我用的方法不对？
quick_demo里的full3，size=512，出来的结果脸会变得很白（去掉gfpgan也很白）
眼睛也一直虚着，去掉still之后可以眨眼但又没法贴回原图

vinthony · 2023-06-12T04:30:14Z

I love your project, I think a good way to control ref_pose and ref_eyebllink would be awesome, Another thing that I would love to see implemented is "Idle animation", in case no audio is played it goes through a preset sequence of blink and move without looking too weird

We have update this feature in #386, however, more work need to be done to make it better.

vinthony · 2023-06-12T04:38:32Z

这个我也发现了，可能是和训练数据有关。我找时间再训练一下模型。

之后会解决脸色苍白双眼无神的问题吗？（还是我用的方法不对？ quick_demo里的full3，size=512，出来的结果脸会变得很白（去掉gfpgan也很白）眼睛也一直虚着，去掉still之后可以眨眼但又没法贴回原图

zyl280505776 · 2023-06-15T10:09:52Z

挺好的，期待持续迭代

jyzd111 · 2023-06-17T13:04:02Z

大佬厉害，我用的国外的网站生成的和你效果差不多，你的还能自己调整参数。期待早日上2D，

grazder · 2023-06-23T17:49:01Z

what are your plans on cartoon images? like in makeittalk

vinthony · 2023-06-29T17:34:05Z

A lightweight facerender is added for generation, which might be working in real-time on GPU and 100x faster on Macbook. See the discussion #457.

vinthony · 2023-06-29T17:35:01Z

what are your plans on cartoon images? like in makeittalk

will try to add something like: https://github.com/pkhungurn/talking-head-anime-3-demo

Kedreamix · 2023-07-08T07:34:51Z

你好，我想问一下，有计划开源训练的代码么，也就是每一part的训练代码

FranM2030 · 2023-07-12T23:27:35Z

I've noticed that you're able to provide a head pose reference video, could we do the same for a half body. Provide a reference video for half body that drives the upper body movement along with the head? Just he head is a bit limited.

xyyyuuan · 2023-08-09T04:27:14Z

amd的显卡是不是只能跑在cpu上，amd不动

ifredom · 2023-08-23T18:23:46Z

感谢大兄弟，就差一点就很好用了，有空更更，不要鸽啦

skyliwq · 2023-08-30T08:20:12Z

加油加油

warycat · 2023-10-24T19:30:22Z

加油加油，这个东西太有用了，我要把它集成到我的app里。

Tybost · 2023-11-11T07:36:06Z

Still crossing my fingers for anime head support. ;)

slavakurilyak · 2023-11-20T23:33:11Z

When can we expect the next release? The last release was made 168 days ago.

creepcat-gh · 2023-11-22T05:31:03Z

大佬厉害，确实非常好用

zjy-2020 · 2023-11-22T07:51:35Z

大佬，网盘下载的压缩包李没有.pth文件和BFM, hub文件了吗
就是checkpoints/auido2exp_00300-model.pth，checkpoints/auido2pose_00140-model.pth， checkpoints/epoch_20.pth等文件。
没有话程序老是报错

rucieryi369 · 2023-11-30T09:41:22Z

会有微调的代码吗？谢谢

denvey · 2023-12-08T05:33:06Z

OpenTalker WEBUI

请问这个在哪里呀

XayerMorgan · 2024-03-03T22:52:23Z

Any plans for continued development. I was using this for a while, but suddenly it stopped working. Can't find the root cause, because automatic1111 and foocus work great. Standalone or automatic1111 give same errors. Would love to see this continue, and it's a remarkable piece of work.

oisilener1982 · 2024-04-19T06:27:06Z

Sadly Devs are very busy in other projects :(

ck1123456 · 2024-05-31T03:14:26Z

您好，请问我提供中文音频，生成的视频不是自己提供的声音，求指教

ck1123456 · 2024-05-31T03:14:54Z

您好，请问我提供中文音频，生成的视频不是自己提供的声音，求指教

oisilener1982 · 2024-06-10T21:03:30Z

When can we expect the next release? The last release was made 168 days ago.

Most likely there will no further updates. The last update was 9 months ago. With the coming release of EMO and VASA-1 sadtalker might really be dead, however they are not released yet

The other alternative is v-express by tencent but sadtalker is way better,

NinoNeumann · 2024-06-25T05:04:10Z

Now there might be something even better—hallo (https://github.com/fudan-generative-vision/hallo), and the exciting part is that this project is open-source.

oisilener1982 · 2024-07-01T13:23:53Z

Hallo is taking so long to render :( We need to upgrade to Rtx 5090 once it is released

gg22mm · 2024-10-12T02:31:47Z

看好这个项目，期待升级更牛的版本出来。 ~~ 虽然目前在图片生成视频已是 number on

gg22mm · 2024-10-12T02:39:40Z

Now there might be something even better—hallo (https://github.com/fudan-generative-vision/hallo), and the exciting part is that this project is open-source.

这说的这个不支持中文搞个毛线

gg22mm · 2024-10-12T08:23:06Z

hi，最近再肝paper，可能会在月底或者6月份开始继续更新。目前已经完成的部分和正在做的部分，比如:

* [x]  支持size参数可以控制输出图的分辨率，训了一个512x512的模型face render还在测试之中，期望可以干掉人脸增强器。

* [x]  mannually的控制crop的区域，可以得到更加自定义的结果，WEBUI。

* [x]  WEBUI更多功能，比如refpose等。

* [x]  接受了两种不同的自动crop方式，能够复现v0.0.1版本的效果。

* [x]  减少了一些依赖文件,比如dlib，用现在的方法更便于安装。

* [x]  加速模型和优化checkpoints，可能会不需要下载那么大的模型。

* [ ]  更加解耦的mappingnet的训练，可能可以支持只针对头进行动作。

* [x]  A simpler facerender model (or the TensorRT support) will be included to support faster generation: [A light-weight FaceRender is support for more than 100x faster rendering on Mac OS. #457](https://github.com/OpenTalker/SadTalker/discussions/457)

* [ ]  text-geneartion-webui

* [ ]  anime-generator.

* [ ]  OpenTalker WEBUI, 集成Sadtalker和video-retalking的技术，做到输入图像和视频进行驱动和编辑。

* [ ]  Fix API problem， [是否支持API调用的方式生成视频? #379](https://github.com/OpenTalker/SadTalker/issues/379)，[[bug] [gradio] Web API not working #374](https://github.com/OpenTalker/SadTalker/issues/374) , [这是个特别牛的项目，请问SD那边能用api调用吗? #251](https://github.com/OpenTalker/SadTalker/issues/251) ,

* [ ]  FPS, [Frame Per Second Option request. #294](https://github.com/OpenTalker/SadTalker/issues/294),

我感觉以后升级的话（可以往这方面发展，这样就不会有头部与身体衔接问题）：
1、直接生成全省图，现在的逻辑是先生成半身图如：

2、再经过full.才放大（就会有衔接问题，头部与身体会些错位，有抖动错位）

4.3_full.mp4

3、最后经过enhanced提升效果

vinthony added the discussion This will not be worked on label May 10, 2023

vinthony pinned this issue May 11, 2023

vinthony changed the title ~~停更了吗？近期计划是什么？| Has it stopped? What are the short-term plans?~~ 近期计划是什么？| What are the short-term plans? May 11, 2023

近期计划是什么？| What are the short-term plans? #280

近期计划是什么？| What are the short-term plans? #280

Comments

hengtuibabai commented May 9, 2023

vinthony commented May 10, 2023 • edited Loading

vinthony commented May 10, 2023 • edited Loading

2793145003 commented May 11, 2023

vinthony commented May 11, 2023

lliang2003 commented May 11, 2023

canghaiyunfan commented May 12, 2023

Niutonian commented May 23, 2023

MoroseYu commented Jun 4, 2023

vinthony commented Jun 6, 2023

2793145003 commented Jun 6, 2023

vinthony commented Jun 12, 2023

vinthony commented Jun 12, 2023 • edited Loading

zyl280505776 commented Jun 15, 2023

jyzd111 commented Jun 17, 2023

grazder commented Jun 23, 2023

vinthony commented Jun 29, 2023

vinthony commented Jun 29, 2023

Kedreamix commented Jul 8, 2023

FranM2030 commented Jul 12, 2023

xyyyuuan commented Aug 9, 2023

ifredom commented Aug 23, 2023

skyliwq commented Aug 30, 2023

warycat commented Oct 24, 2023

Tybost commented Nov 11, 2023

slavakurilyak commented Nov 20, 2023

creepcat-gh commented Nov 22, 2023

zjy-2020 commented Nov 22, 2023

rucieryi369 commented Nov 30, 2023

denvey commented Dec 8, 2023

XayerMorgan commented Mar 3, 2024

oisilener1982 commented Apr 19, 2024

ck1123456 commented May 31, 2024

ck1123456 commented May 31, 2024

oisilener1982 commented Jun 10, 2024

NinoNeumann commented Jun 25, 2024

oisilener1982 commented Jul 1, 2024

gg22mm commented Oct 12, 2024

gg22mm commented Oct 12, 2024

gg22mm commented Oct 12, 2024 • edited Loading

vinthony commented May 10, 2023 •

edited

Loading

vinthony commented May 10, 2023 •

edited

Loading

vinthony commented Jun 12, 2023 •

edited

Loading

gg22mm commented Oct 12, 2024 •

edited

Loading