I'm a Senior Research Scientist at the Common Crawl Foundation.
I am interested in large corpora for training language models, especially for under-resourced and historical languages. I am also interested in tasks such as Named Entity Recognition (NER), Dependency Parsing, Part-of-Speech tagging, Machine Translation and document structuring.
I love coffee ☕️, cookies 🍪 and maths.