Checking that the LM actually trained #3728
Yes: simply …
I'd check if 'GPT2' works by sampling from a simple prompt. E.g.:
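A minimal sketch of that kind of check, assuming the trained checkpoint was saved to a placeholder path `./my-gpt2` (the path and prompt are illustrative, not from the original comment):

```python
# Quick sanity check: sample a continuation from the trained checkpoint.
# "./my-gpt2" and the prompt below are placeholders.
from transformers import GPT2LMHeadModel, GPT2TokenizerFast

tokenizer = GPT2TokenizerFast.from_pretrained("./my-gpt2")
model = GPT2LMHeadModel.from_pretrained("./my-gpt2")
model.eval()

input_ids = tokenizer.encode("The quick brown fox", return_tensors="pt")

# A model that actually trained should produce roughly coherent text;
# an untrained one tends to emit repetitive or random tokens.
output_ids = model.generate(
    input_ids,
    max_length=50,
    do_sample=True,
    top_k=50,
    top_p=0.95,
)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```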
Thanks for clarifying! I was about to consider sending a PR for a GenerationPipeline.
I have a branch that implements a GenerationPipeline, and the initial version already works for GPT models. The implementation is based on the approach taken in run_generation.py, which means the forward pass uses the model's generate() method. So far, the code works smoothly for the GPT models. Sample code:
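The original sample code was not preserved in this thread; below is a sketch of how the branch's GenerationPipeline might be invoked, assuming its constructor and call signature mirror the existing pipelines (the class is only available on that branch, not on master at the time of this comment):

```python
# Assumed usage of the branch's GenerationPipeline; the (model, tokenizer)
# constructor and the call signature are assumptions, not the exact sample.
from transformers import AutoModelWithLMHead, AutoTokenizer
from transformers import GenerationPipeline  # only available on the feature branch

model = AutoModelWithLMHead.from_pretrained("gpt2")
tokenizer = AutoTokenizer.from_pretrained("gpt2")

generator = GenerationPipeline(model=model, tokenizer=tokenizer)  # assumed signature
print(generator("The quick brown fox", max_length=30))            # assumed call
```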
However, the module still doesn't work with some of the other language models. I will do a root cause analysis on this and will send a PR as soon as I get it working on the rest of the language models that should support generation. For more details, you can check out this colab notebook, which shows the GPT models working so far and the remaining models failing in the later sections.
[UPDATE] The issues above have been resolved and I'm in the process of sending a PR. There is a Google Colab tutorial here for running the GenerationPipeline.
Your PR looks very nice so far :-) I will take a look early next week!
Thanks!
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.
I have trained a GPT-2 from scratch following the approach described in this post: https://huggingface.co/blog/how-to-train .
In step 4, where the author checks whether the trained model actually works, he uses the "fill-mask" pipeline, but that only works for models trained with a masked language modeling objective.
Is there something similar to "fill-mask" that I could use for my case?
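For reference, a sketch of the kind of check being asked about, using the text-generation pipeline that later transformers releases provide as the causal-LM counterpart of "fill-mask" (`./my-gpt2` is a placeholder for the trained checkpoint's directory):

```python
# Causal-LM analogue of the "fill-mask" check: generate text from the
# trained checkpoint. "./my-gpt2" is a placeholder output directory.
from transformers import pipeline

generator = pipeline("text-generation", model="./my-gpt2")
print(generator("The sun rises over", max_length=30, do_sample=True))
```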