# 1.5b-Trim

Testing the effectiveness of deep layers in 1.5b language models.

Paper Inspirations:
- https://arxiv.org/abs/2403.17887
- https://arxiv.org/abs/2402.17764

Model arch:
- https://huggingface.co/1bitLLM/bitnet_b1_58-3B (layer-trimming sketch below)

Method:
- huggingface/transformers#2483

Datasets:
- https://huggingface.co/datasets/cais/mmlu
- https://huggingface.co/datasets/google/boolq

Next Steps:
- Run test script
- Verify dataset pre-processing
- Verify output layers
- Run experiments (evaluation sketches below)
  - MMLU
  - BoolQ
- Formulate outputs
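The trimming step itself is not spelled out above, so the following is a minimal sketch assuming the simple strategy from the layer-pruning paper (https://arxiv.org/abs/2403.17887): drop a contiguous block of the deepest transformer layers and evaluate what remains. `N_TRIM` is a hypothetical parameter, and the `model.model.layers` attribute path assumes a LLaMA-style module layout; neither is confirmed by the references above.

```python
# Layer-trimming sketch: load the BitNet checkpoint and drop the
# deepest N_TRIM transformer blocks before evaluation.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_ID = "1bitLLM/bitnet_b1_58-3B"
N_TRIM = 4  # hypothetical: how many deep layers to remove

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForCausalLM.from_pretrained(
    MODEL_ID,
    torch_dtype=torch.float16,
    trust_remote_code=True,  # the checkpoint ships custom modeling code
)

# Assumption: a LLaMA-style layout keeps the transformer blocks in
# model.model.layers; slicing the list removes the deepest blocks
# from the forward pass.
layers = model.model.layers
model.model.layers = torch.nn.ModuleList(layers[: len(layers) - N_TRIM])
model.config.num_hidden_layers = len(model.model.layers)

model.eval()
print(f"Trimmed to {model.config.num_hidden_layers} layers")
```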
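For the MMLU experiment, one standard zero-shot harness is to format each question with lettered options and pick the letter whose next-token logit is highest. The sketch below follows that convention; the prompt template, truncation length, and the `"all"` config are choices made here, not taken from the references above. It reuses `model` and `tokenizer` from the trimming sketch.

```python
# MMLU sketch: zero-shot multiple choice scored by comparing the
# next-token logits of the answer letters A-D.
import torch
from datasets import load_dataset

mmlu = load_dataset("cais/mmlu", "all", split="test")

# Last sub-token of " A", " B", " C", " D" under this tokenizer.
choice_ids = [
    tokenizer(f" {c}", add_special_tokens=False).input_ids[-1] for c in "ABCD"
]

correct = 0
for ex in mmlu:
    options = "\n".join(f"{c}. {o}" for c, o in zip("ABCD", ex["choices"]))
    prompt = f"{ex['question']}\n{options}\nAnswer:"
    inputs = tokenizer(
        prompt, return_tensors="pt", truncation=True, max_length=2048
    ).to(model.device)
    with torch.no_grad():
        logits = model(**inputs).logits[0, -1]  # logits for the next token
    pred = max(range(4), key=lambda i: logits[choice_ids[i]].item())
    correct += int(pred == ex["answer"])  # ex["answer"] is an index 0-3

print(f"MMLU accuracy: {correct / len(mmlu):.3f}")
```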
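BoolQ can be scored the same way, comparing the logits of " yes" and " no" after a passage/question prompt. Again a sketch: the prompt format is an assumption, and `model`/`tokenizer` come from the trimming sketch above.

```python
# BoolQ sketch: zero-shot yes/no scoring by next-token logit comparison.
import torch
from datasets import load_dataset

boolq = load_dataset("google/boolq", split="validation")

yes_id = tokenizer(" yes", add_special_tokens=False).input_ids[-1]
no_id = tokenizer(" no", add_special_tokens=False).input_ids[-1]

correct = 0
for ex in boolq:
    prompt = f"{ex['passage']}\nQuestion: {ex['question']}?\nAnswer:"
    inputs = tokenizer(
        prompt, return_tensors="pt", truncation=True, max_length=2048
    ).to(model.device)
    with torch.no_grad():
        logits = model(**inputs).logits[0, -1]
    pred = bool(logits[yes_id] > logits[no_id])
    correct += int(pred == ex["answer"])  # ex["answer"] is a bool

print(f"BoolQ accuracy: {correct / len(boolq):.3f}")
```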