This repo contains FineMotions's solution to the GENEA 2022 Challange
Our submission as well as some renders of our experiments could be found here
The whole repo contains code of the models along with various scripts:
process_motion.py
- extracts motion features from bvh-data.pymo
required.process_audio.py
- extracts features from audioprocess_text.py
- generetes one-hot encoddings for symbols of text transcriptsnormalize.py
- normalize motion features and store mean and max poses to npz filealign_data.py
- aligns motion, audio and text features to create dataset for modelstrain.py
- train one of the model by it's name:wav2gest
,recell
,recellseq
,feedforward
,seq2seq
,lstm
infer.py
- inference model, smooth, denormalize results and generate bvh
module src
contains code of the models and some utils
:
base
contains baseDataModule
to operate with various datafeedforward
- simple model generates motion by frame from window of features. Based on Kucherenko et al. 2019lstm
- predicts sequence of poses from aligned sequence of features via simple rnnrecell
- auto-regressive ReCell model, contains two systems and datasets for one-frame generation and for short sequences which allows teacher-forcing and zeroing techniquesseq2seq
- sequence-to-sequence model from https://github.com/FineMotion/GENEA_2020seqae
- unfinished autoencoder for sequences, has not been usedwav2gest
- modification ofseq2seq
allowing different lengths for input and output sequencesauto-encoder
- windowed auto-encoder