TinyStories Finetune

Finetunes a TinyStories model so that many of the characters are called Einstein.

Installation

Build the container with ./build.sh
Enter the container with ./run.sh. If you have GPUs, instead use ./run.sh --gpus all

Inference

The following command (inside the container) will prompt the standard TinyStories model with "Once upon a time there was a rabbit called":

./infer.py

Sample output (not cherrypicked):

Once upon a time there was a rabbit called Bunny Rabbit. Bunny Rabbit loved to play on the swings on the swing set in the park. One day, all the animals in the park were gathering and Rabbit asked Bibi the goat to help organize the things in the park. But Bibi said no! Big Bear was being bossy and wanted the park to be tidy. So, all the animals had to do it.

So, the animals decided to work together and soon the park was

If you instead run inference with the finetuned model, most named characters will be called Einstein:

./infer.py --path TheodoreEhrenborg/TinyStories-33M-Einstein

Sample output (not cherrypicked):

Once upon a time there was a rabbit called Einstein. Every day he would go outside and play. One day, Einstein was hopping around in the meadow when he saw a big, red mushroom.

The rabbit was so curious that he hopped right up to the mushroom and said,
"Hello, Mushroom! What do you want to do today?"

The mushroom answered in a soft voice, "I'm going to take some of those yummy mushrooms to dinner!"

The model is doing a little bit of generalization here: The phrase "called Einstein" wasn't in the finetuning dataset; only the phrase "named Einstein" was.

Replicating training

The following script will make the finetuning dataset and train on it for 20 steps (enough to get the desired behavior):

./finetune.py

This script works on a small AWS server (2 GB RAM, 2 threads).

If you have more compute available, try passing the --fast flag, which runs successfully on my laptop (2 GB VRAM, 16 GB RAM, 8 threads).

Resources I used

Uploading the checkpoint

huggingface-cli login
python upload_to_hub.py

Name		Name	Last commit message	Last commit date
Latest commit History 72 Commits
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
build.sh		build.sh
docker_name		docker_name
finetune.py		finetune.py
infer.py		infer.py
requirements.txt		requirements.txt
run.sh		run.sh
upload_to_hub.py		upload_to_hub.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TinyStories Finetune

Installation

Inference

Replicating training

Resources I used

Uploading the checkpoint

About

Releases

Packages

Languages

License

TheodoreEhrenborg/tiny_stories_finetune

Folders and files

Latest commit

History

Repository files navigation

TinyStories Finetune

Installation

Inference

Replicating training

Resources I used

Uploading the checkpoint

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages