---
language: es
tags:
- causal-lm
- text-generation
datasets:
- oscar
---

GPT-2 Spanish

GPT-2 model pre-trained from scratch using the Spanish portion of OSCAR during the Flax x Hugging Face community event by @mariagrandury, @mrm8488, @pablogps, @daveni, @srisweet, @jdposa, @shpotes, and @jorgealro.

Model description

The model used for training is OpenAI's GPT-2, introduced in the paper "Language Models are Unsupervised Multitask Learners" by Alec Radford, Jeffrey Wu, Rewon Child, David Luan, Dario Amodei and Ilya Sutskever.

This model is available on the 🤗 Model Hub.

Intended uses & limitations

How to use (TODO)

Limitations and bias (TODO)

Training data

The model was trained on the Spanish portion of OSCAR (Open Super-large Crawled ALMAnaCH coRpus), a huge multilingual corpus obtained by language classification and filtering of the Common Crawl corpus using the goclassy architecture.

This corpus is available in the 🤗 Datasets library.

Training procedure (TODO)

Eval results (TODO)
