TAEHV is a Tiny AutoEncoder for Hunyuan Video (& Wan 2.1). TAEHV can decode latents into videos more cheaply (in time & memory) than the full-size VAEs, at the cost of slightly lower quality.
Here's a comparison of the output & memory usage of the Full Hunyuan VAE vs. TAEHV:
See the profiling notebook for details on this comparison or the example notebook for a simpler demo.
Since Wan 2.1 uses the same input / output shapes as the Hunyuan VAE, you can also use TAEHV for Wan 2.1 decoding with the taew2_1.pth weights (see the Wan 2.1 example notebook).
For CogVideoX, try the taecvx.pth weights (see the example notebook).
You can disable temporal or spatial upscaling to get even cheaper decoding:

```python
# skip temporal upscaling, keep full spatial resolution
TAEHV(decoder_time_upscale=(False, False), decoder_space_upscale=(True, True, True))
# skip both temporal and spatial upscaling (cheapest)
TAEHV(decoder_time_upscale=(False, False), decoder_space_upscale=(False, False, False))
```
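To estimate what each flag costs you in output size, here's a small shape calculator. This is a hypothetical sketch, not code from the repo: it assumes each enabled stage doubles the corresponding dimension (so all-True matches the Hunyuan VAE's 8x spatial / 4x temporal compression) and the causal first-frame convention t_out = f * (t_in - 1) + 1; check taehv.py for the exact behavior.

```python
# Hypothetical helper: estimate the decoded video shape from TAEHV's
# upscaling flags. Assumes each enabled stage doubles its dimension and the
# causal frame convention t_out = f * (t_in - 1) + 1 (both assumptions,
# not verified against taehv.py).
def decoded_shape(t, h, w, time_upscale=(True, True), space_upscale=(True, True, True)):
    tf = 2 ** sum(time_upscale)   # temporal upscale factor (up to 4x)
    sf = 2 ** sum(space_upscale)  # spatial upscale factor (up to 8x)
    return (tf * (t - 1) + 1, h * sf, w * sf)

# Full decoding: 4 latent frames at 30x52 -> 13 frames at 240x416
print(decoded_shape(4, 30, 52))                               # (13, 240, 416)
# No temporal upscaling: same resolution, only 4 output frames
print(decoded_shape(4, 30, 52, time_upscale=(False, False)))  # (4, 240, 416)
```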
If you have a powerful GPU or are decoding at a reduced resolution, you can also set parallel=True in TAEHV.decode_video to decode all frames at once, which is faster but requires more memory.
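As a rough illustration of that tradeoff, here's a toy model (not TAEHV's actual code; decode_frame is a hypothetical stand-in for the per-frame decoder work): batching all frames maximizes parallelism but keeps every frame's activations resident at once, while sequential decoding holds only one frame at a time.

```python
# Toy sketch of the parallel-vs-sequential memory tradeoff.
# decode_frame is a hypothetical placeholder, NOT TAEHV's real decoder.
def decode_frame(latent):
    return [2 * x for x in latent]  # stand-in for per-frame decoder work

def decode_video(latents, parallel=False):
    if parallel:
        # one big batch: fast, but all frames' activations live at once
        frames = [decode_frame(z) for z in latents]
        peak_frames_resident = len(latents)
    else:
        # frame by frame: slower, but only one frame resident at a time
        frames = []
        for z in latents:
            frames.append(decode_frame(z))
        peak_frames_resident = 1
    return frames, peak_frames_resident

latents = [[1, 2], [3, 4], [5, 6]]
fast, peak_fast = decode_video(latents, parallel=True)
slow, peak_slow = decode_video(latents, parallel=False)
print(fast == slow, peak_fast, peak_slow)  # True 3 1
```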
TAEHV is still pretty experimental (specifically, it's a hacky finetune of TAEM1 :) using a fairly limited dataset) and I haven't tested it much yet. Please report quality / performance issues as you discover them.