Fine-tuning of the GPT-2 model for performing the addition operation task

aebogdanova/LM-Addition-Operation

Fine-tuning Language Model for Addition Operation Task

Description

This repository contains a dataset and scripts for fine-tuning the GPT-2 model on an addition task, following the methodology described in the article "Toolformer: Language Models Can Teach Themselves to Use Tools". The technical report is available here.
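For orientation, Toolformer marks tool use inline as an API call followed by its result. The sketch below builds one such annotated addition sample. The function name `annotate_addition`, the sentence template, and the exact call syntax are illustrative assumptions and may not match the format of the dataset in this repository.

```python
def annotate_addition(a: int, b: int) -> str:
    """Return a sentence with an inline Toolformer-style calculator call.

    Hypothetical helper: the template and call syntax are assumptions,
    not the format used by this repository's dataset.
    """
    # The call records both the query and the tool's result.
    call = f"[Calculator({a} + {b}) → {a + b}]"
    return f"{a} plus {b} equals {call} {a + b}."


print(annotate_addition(12, 30))
```

A model fine-tuned on text of this shape is expected to learn to emit the call before the answer, rather than to produce the sum unaided.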

Model

To run fine-tuning:

python train.py

Evaluation

The fine-tuned model is available here. Download it and place it in the ./model directory.

To see results:

python evaluate.py

Results

See accuracy scores in results/results_average.json.
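The layout of `results/results_average.json` is not documented here, so the following is only a sketch for inspecting it: it loads the file and prints every numeric value it finds, whatever the nesting. The helper names `flatten_scores` and `load_scores` are hypothetical and are not part of the repository's scripts.

```python
import json
from pathlib import Path


def flatten_scores(node, prefix=""):
    """Yield (dotted_key, value) pairs for every numeric leaf in nested JSON."""
    if isinstance(node, dict):
        for key, value in node.items():
            yield from flatten_scores(value, f"{prefix}{key}.")
    elif isinstance(node, list):
        for index, value in enumerate(node):
            yield from flatten_scores(value, f"{prefix}{index}.")
    elif isinstance(node, (int, float)) and not isinstance(node, bool):
        yield prefix.rstrip("."), node


def load_scores(path="results/results_average.json"):
    """Read the results file and return its numeric leaves as a flat dict."""
    with Path(path).open(encoding="utf-8") as f:
        return dict(flatten_scores(json.load(f)))


if __name__ == "__main__":
    results_path = Path("results/results_average.json")
    if results_path.exists():
        for name, score in load_scores(results_path).items():
            print(f"{name}: {score}")
    else:
        print(f"{results_path} not found; run evaluate.py first.")
```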
