This project was created as part of the UdeM course IFT6759 (https://admission.umontreal.ca/cours-et-horaires/cours/IFT-6759/). The objective of this project is to predict French translations of English sentences in a low-resource setting. Refer to the report and presentation included in this repository for more details.
- Alexander Peplowski
- Harmanpreet Singh
- Marc-Antoine Provost
- Mohammed Loukili
Steps to run evaluation:
- Go to
scripts
folder - Edit
run_evaluator_script.sh
. Change the--input-file-path
and--target-file-path
as required - Submit batch job: Run
sbatch run_evaluator_script.sh
from inside scripts folder
- Lint your code as per PEP8 before submitting a pull request
- Add comments and doc-strings to your code
- Pull requests are required for merging to master for major changes
- Use your own branch for major work, don't use master
- No large files allowed in git
- Mark task in progress on Kanban before starting work
- As we had small aligned dataset of 11k examples, we decided to use single-fold validation held out strategy.
module load python/3.7
virtualenv ../local_env
source ../local_env/bin/activate
pip install -r requirements_local.txt
module load python/3.7
virtualenv ../server_env --no-download
source ../server_env/bin/activate
pip install --no-index -r requirements.txt
OR, if no requirement.txt file is available:
pip install --no-index tensorflow-gpu==2 pandas numpy tqdm