I may be slow to respond.
A postgraduate student studying natural language processing.
-
South China Normal University
- Guangzhou,Guangdong,China
Pinned Loading
-
-
Chinese-miniMamba
Chinese-miniMamba PublicThis is a project on training a large language model on Chinese corpora using the Mamba architecture. Its aim is to explore the potential capabilities of the Mamba architecture on Chinese corpora.
-
infini-mini-transformer
infini-mini-transformer PublicThis is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and training code.
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.