RWKV-Notebooks

Uses jsonl and binidx instead of the garbage RAM-intensive scripts.

Example.jsonl file contents:

{"text": "This is a sentence"}
{"text": "Instruction: a\n\nInput: b\n\nResponse: c"}
{"text": "Question: a\n\nAnswer: b"}

The tokenizer will combine all jsonl files inside your dataset folder into two files the train script will read. Read the outputs of the cells to know what to do.

RWKV-infctx:

Only tested 0.4B-World

RWKV-LorA:

Soon™

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.gitignore		.gitignore
README.md		README.md
RWKV-infctx.ipynb		RWKV-infctx.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RWKV-Notebooks

RWKV-infctx:

RWKV-LorA:

About

Languages

h-a-s-k/RWKV-Notebooks

Folders and files

Latest commit

History

Repository files navigation

RWKV-Notebooks

RWKV-infctx:

RWKV-LorA:

About

Topics

Resources

Stars

Watchers

Forks

Languages