Skip to content

Code for a simple alignment method utilizing sentence transformer for German datasets alignment

License

Notifications You must be signed in to change notification settings

OmarMomen14/sentence_transformer_alignment_DE

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

DEplain Alignment via Sentence Transformers

As a part of the paper "DEplain: A German Parallel Corpus with Intralingual Translations into Plain Language for Sentence and Document Simplification.", we developed and evaluated a simple method utilizing sentence transformers to align German text datasets automatically.

Usage

After cloning the repository

  1. Setup the environment
python3 -m venv env
source env/bin/activate
pip install -U pip setuptools
pip install -r requirements.txt
  1. Go through the procedure.ipynb notebook for aligning your documents

About

Code for a simple alignment method utilizing sentence transformer for German datasets alignment

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published