GitHub - rhss10/korean_automatic_pronunciation_assessment_nia-22-1-13: A code to fine-tune Wav2vec2-xls-r on non-native L2 Korean automatic pronunciation assessment (APA) as a part of 2022 NIA 1-13 research work

General

A source code to fine-tune self-supervised learning model (SSL) on NIA-2022-1-13 Non-native L2 Korean Dataset for Automatic Pronunciation Assessment (APA).
NIA-2022-1-13 Non-native L2 Korean Dataset for Automatic Pronunciation Assessment (APA) will soon be released within 2023.
More information regarding the usage of the dataset and docker support will be updated with the relase of dataset.

License

SPDX-FileCopyrightText: © 2023 Hyungshin Ryu <rhss10@snu.ac.kr>
SPDX-License-Identifier: Apache-2.0

Notes

NIA-2022-1-13 Non-native L2 Korean Dataset supports proficiency scores of 3 aspects, 'comprehensibility', 'fluency', 'accentedness'.
The example code is aimed at scoring 'comprehensibility'.
By changing the data/preprocess_data.py code, you may asess 'fluency' or 'accentedness' scores.

Commands

Prepare Data

# Data processing should be done with the ACTUAL data path
python preprocess_data.py
# create Huggingface-based datasets arrows.
python create_datasets.py

Train

# Example command for training. For more supported arguments, please refer to train.py
python train.py --exp_prefix NIA

Test

# Example
python test.py

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
data		data
test/cm		test/cm
.gitignore		.gitignore
LICENSE.md		LICENSE.md
README.md		README.md
TODO.txt		TODO.txt
pcc.log		pcc.log
requirements.txt		requirements.txt
test.py		test.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

General

License

Notes

Commands

Prepare Data

Train

Test

About

Releases

Packages

Contributors 2

Languages

License

rhss10/korean_automatic_pronunciation_assessment_nia-22-1-13

Folders and files

Latest commit

History

Repository files navigation

General

License

Notes

Commands

Prepare Data

Train

Test

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages