GitHub - notAlex2/Translation-Team08-IFT6759: Low Resource Machine Translation

IFT6759: Low Resource Machine Translation

This project was created as part of the UdeM course IFT6759 (https://admission.umontreal.ca/cours-et-horaires/cours/IFT-6759/). The objective of this project is to predict French translations of English sentences in a low-resource setting. Refer to the report and presentation included in this repository for more details.

Team 08

Alexander Peplowski
Harmanpreet Singh
Marc-Antoine Provost
Mohammed Loukili

To run the evaluation script:

Go to the scripts folder.
Edit run_evaluator_script.sh and set --input-file-path and --target-file-path as required.
Submit the batch job by running sbatch run_evaluator_script.sh from inside the scripts folder.
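The steps above can be sketched as a small pre-flight check followed by the submission. This is only an illustration: check_eval_inputs is a hypothetical helper that is not part of the repository, and the file paths in the usage comment are placeholders. It verifies that both files exist before the job is queued, so that a typo in a path fails immediately rather than after the job has waited in the queue.

```shell
# Hypothetical helper (not part of the repository): verify that the input and
# target files given to the evaluator exist before submitting the batch job.
check_eval_inputs() {
    for f in "$1" "$2"; do
        if [ ! -f "$f" ]; then
            echo "missing file: $f" >&2
            return 1
        fi
    done
    echo "inputs ok"
}

# Example usage from inside the scripts folder (placeholder paths):
#   check_eval_inputs /path/to/input.en /path/to/target.fr \
#       && sbatch run_evaluator_script.sh
```

The same two paths should match the values set for --input-file-path and --target-file-path in run_evaluator_script.sh.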

Instructions for the team:

Coding Standards

Lint your code as per PEP8 before submitting a pull request
Add comments and docstrings to your code
Pull requests are required for merging to master for major changes
Use your own branch for major work, don't use master
No large files allowed in git
Mark the task as in progress on the Kanban board before starting work

K-Fold Strategy

Because we had only a small aligned dataset of 11k examples, we decided to use a single-fold, held-out validation strategy instead of K-fold cross-validation.
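A single held-out split of an aligned corpus can be sketched as follows. This is a minimal illustration under stated assumptions, not the project's actual split: holdout_split is a hypothetical helper, the validation size is whatever the caller passes, and the last lines of each file are held out without shuffling. Calling it with the same size on the English and French files keeps the sentence pairs aligned, provided both files have the same number of newline-terminated lines.

```shell
# Hypothetical helper (not part of the repository): hold out the last
# n_valid lines of a file as a validation set and keep the rest for training.
holdout_split() {
    src="$1"
    n_valid="$2"
    prefix="$3"
    total=$(wc -l < "$src")          # number of newline-terminated lines
    n_train=$((total - n_valid))
    head -n "$n_train" "$src" > "${prefix}.train"
    tail -n "$n_valid" "$src" > "${prefix}.valid"
}

# Example usage (placeholder file names and validation size):
#   holdout_split corpus.en 1000 split.en
#   holdout_split corpus.fr 1000 split.fr
```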

To set up a new local environment:

module load python/3.7
virtualenv ../local_env
source ../local_env/bin/activate
pip install -r requirements_local.txt

To set up a new server node environment:

module load python/3.7
virtualenv ../server_env --no-download
source ../server_env/bin/activate
pip install --no-index -r requirements.txt

Or, if no requirements.txt file is available:

pip install --no-index tensorflow-gpu==2 pandas numpy tqdm
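Before running any of the pip install commands above, it can help to confirm that the virtual environment is actually active, so that packages are not installed into the system or user site by mistake. The sketch below relies on the VIRTUAL_ENV variable that the activate script sets; require_virtualenv is a hypothetical helper, not part of the repository.

```shell
# Hypothetical helper (not part of the repository): fail early if no
# virtualenv is active. The activate script exports VIRTUAL_ENV.
require_virtualenv() {
    if [ -z "${VIRTUAL_ENV:-}" ]; then
        echo "no virtualenv active; source the env's bin/activate first" >&2
        return 1
    fi
    echo "using virtualenv: $VIRTUAL_ENV"
}

# Example usage:
#   source ../server_env/bin/activate
#   require_virtualenv && pip install --no-index -r requirements.txt
```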

Folders and files (187 commits):

code
data
model
notebooks
pretrained_embeddings
scripts
tokenizer_data_en_30k
tokenizer_data_fr_30k
.gitignore
README.md
final_report.pdf
presentation.pdf
requirements.txt
