Popular repositories Loading
-
trlx
trlx PublicForked from thwu1/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
Python
-
OpenRLHF
OpenRLHF PublicForked from OpenRLHF/OpenRLHF
A Ray-based High-performance RLHF framework (Support 70B+ full tuning & LoRA & Mixtral & KTO)
Python
-
arena-hard-auto
arena-hard-auto PublicForked from lmarena/arena-hard-auto
Arena-Hard-Auto: An automatic LLM benchmark.
Jupyter Notebook
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.