Preprocessing and feature extraction for raw voice data of DAIC-WOZ
- Run `download.sh` to download the DAIC-WOZ data
- Run `python main.py` to preprocess the raw audio and extract features
- Run `python daicwoz_label.py` to create labels
Based on the timestamps listed in the audio transcript file, the participant's speech segments are identified and all other segments are silenced to produce the preprocessed audio.
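As a rough illustration of this step, the sketch below keeps only the participant's turns and zeroes out everything else. The transcript format (tab-separated, with `start_time`, `stop_time`, and `speaker` columns) matches the DAIC-WOZ release, but the file paths are hypothetical:

```python
import numpy as np
import pandas as pd
import soundfile as sf

# Hypothetical paths; the actual layout depends on where download.sh puts the data.
wav_path = "data/303_AUDIO.wav"
transcript_path = "data/303_TRANSCRIPT.csv"

audio, sr = sf.read(wav_path)
transcript = pd.read_csv(transcript_path, sep="\t")

# Start from silence and copy back only the participant's turns.
out = np.zeros_like(audio)
for _, row in transcript.iterrows():
    if row["speaker"] == "Participant":
        start = int(row["start_time"] * sr)
        stop = int(row["stop_time"] * sr)
        out[start:stop] = audio[start:stop]

sf.write("data/303_PARTICIPANT.wav", out, sr)
```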
Because the timestamps in some transcript files are out of sync with the audio, they are first corrected by referring to adbailey1/daic_woz_process.
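The exact form of that correction is not spelled out here; below is a minimal sketch assuming it amounts to a constant per-session time shift, with the affected sessions and offset values to be taken from adbailey1/daic_woz_process:

```python
import pandas as pd

# Per-session offsets in seconds. The affected sessions and the real correction
# values come from adbailey1/daic_woz_process; this dict is only a placeholder.
TIME_OFFSETS: dict[int, float] = {}

def corrected_transcript(path: str, session_id: int) -> pd.DataFrame:
    """Load a transcript and shift its timestamps by the session's offset."""
    transcript = pd.read_csv(path, sep="\t")
    shift = TIME_OFFSETS.get(session_id, 0.0)
    transcript["start_time"] += shift
    transcript["stop_time"] += shift
    return transcript
```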
After this, per-second openSMILE features and VGGish features are extracted from the preprocessed audio. VGGish embeddings are computed with harritaylor/torchvggish, a PyTorch implementation of VGGish.
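One way to do both extractions is with the `opensmile` Python package and torchvggish loaded through `torch.hub`. In the sketch below, the eGeMAPS feature set and the one-second windowing are assumptions, not necessarily the configuration this repo uses, and the path is hypothetical:

```python
import opensmile
import pandas as pd
import soundfile as sf
import torch

wav_path = "data/303_PARTICIPANT.wav"  # hypothetical path from the previous step

# openSMILE functionals computed over consecutive one-second windows
# (eGeMAPS here is an assumption; the repo may use a different feature set).
smile = opensmile.Smile(
    feature_set=opensmile.FeatureSet.eGeMAPSv02,
    feature_level=opensmile.FeatureLevel.Functionals,
)
duration = sf.info(wav_path).duration
features = pd.concat(
    smile.process_file(wav_path, start=float(t), end=min(float(t) + 1.0, duration))
    for t in range(int(duration))
)

# VGGish embeddings: one 128-dim vector per ~0.96 s patch of audio.
vggish = torch.hub.load("harritaylor/torchvggish", "vggish")
vggish.eval()
embeddings = vggish.forward(wav_path)
```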
Finally, the label CSVs provided with DAIC-WOZ are combined to create the labels for model training.
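Concretely, this can be done by concatenating the AVEC 2017 split files shipped with the corpus. In the sketch below the paths are hypothetical, and the rename accounts for the test split using `PHQ_Binary`/`PHQ_Score` column names instead of `PHQ8_*` (worth verifying against your copy of the data):

```python
import pandas as pd

# Standard DAIC-WOZ / AVEC 2017 split files; directory layout is hypothetical.
train = pd.read_csv("labels/train_split_Depression_AVEC2017.csv")
dev = pd.read_csv("labels/dev_split_Depression_AVEC2017.csv")
test = pd.read_csv("labels/full_test_split.csv")

# Harmonize the test split's column names with the train/dev splits.
test = test.rename(columns={"PHQ_Binary": "PHQ8_Binary", "PHQ_Score": "PHQ8_Score"})

cols = ["Participant_ID", "PHQ8_Binary", "PHQ8_Score"]
labels = pd.concat([train[cols], dev[cols], test[cols]], ignore_index=True)
labels.to_csv("labels/all_labels.csv", index=False)
```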