-
Anyscale
- San Francisco, CA
- sumanthrh.com
- @sumanthrh
Highlights
- Pro
-
SkyThought Public
Forked from NovaSky-AI/SkyThoughtSky-T1: Train your own O1 preview model within $450
Python Apache License 2.0 UpdatedJan 29, 2025 -
FastChat Public
Forked from lm-sys/FastChatFork of FastChat, an open platform for training, serving, and evaluating large language models. Release repo for Vicuna and FastChat-T5.
Python Apache License 2.0 UpdatedJan 23, 2025 -
gorilla Public
Forked from ShishirPatil/gorillaGorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
Python Apache License 2.0 UpdatedJan 19, 2025 -
-
-
trl Public
Forked from huggingface/trlTrain transformer language models with reinforcement learning.
Python Apache License 2.0 UpdatedDec 27, 2024 -
accelerate Public
Forked from huggingface/accelerate🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
Python Apache License 2.0 UpdatedDec 19, 2024 -
open-instruct Public
Forked from allenai/open-instructPython Apache License 2.0 UpdatedDec 3, 2024 -
-
DeepSpeed Public
Forked from microsoft/DeepSpeedDeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Python Apache License 2.0 UpdatedNov 28, 2024 -
verl Public
Forked from volcengine/verlveRL: Volcano Engine Reinforcement Learning for LLM
Python Apache License 2.0 UpdatedNov 28, 2024 -
ray Public
Forked from ray-project/rayRay is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Python Apache License 2.0 UpdatedNov 20, 2024 -
Liger-Kernel Public
Forked from linkedin/Liger-KernelEfficient Triton Kernels for LLM Training
Python BSD 2-Clause "Simplified" License UpdatedNov 3, 2024 -
entropix Public
Forked from xjdr-alt/entropixEntropy Based Sampling and Parallel CoT Decoding
TypeScript Apache License 2.0 UpdatedOct 13, 2024 -
tokenization Public
A comprehensive deep dive into the world of tokens
-
-
pygloo Public
Forked from ray-project/pyglooPygloo provides Python bindings for Gloo.
-
transformers Public
Forked from huggingface/transformers🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Python Apache License 2.0 UpdatedFeb 6, 2024 -
ICL_Support_Example Public
Forked from LeeSureman/ICL_Support_ExampleThe official implementation of the paper "Finding Support Examples for In-Context Learning".
Python UpdatedJan 31, 2024 -
peft Public
Forked from huggingface/peftFork of 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning. Our implementation for IA3, a new fine-tuning method is now a part of the official Huggingface library!
Python Apache License 2.0 UpdatedJan 30, 2024 -
ecco Public
Forked from jalammar/eccoExplain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the behavior of Transformer-based language models (like GPT2, B…
Jupyter Notebook BSD 3-Clause "New" or "Revised" License UpdatedJan 30, 2024 -
python-mastery Public
Forked from dabeaz-course/python-masteryMy solutions for Advanced Python Mastery (course by @dabeaz)
-
nanotron Public
Forked from huggingface/nanotronMinimalistic large language model 3D-parallelism training
Python Apache License 2.0 UpdatedJan 19, 2024 -
-
llmperf Public
Forked from ray-project/llmperfLLMPerf is a library for validating and benchmarking LLMs
Jupyter Notebook Apache License 2.0 UpdatedJan 12, 2024 -
cuda-resource-stream Public
Forked from gpu-mode/resource-streamCUDA related news and material links
-
TuPaTE Public
Forked from JetRunner/TuPaTECode for EMNLP 2022 paper "Efficiently Tuned Parameters are Task Embeddings"
-
unsloth Public
Forked from unslothai/unsloth5X faster 50% less memory LLM finetuning
Python Apache License 2.0 UpdatedDec 1, 2023 -
text-to-meme Public
A Text to Meme model that can generate a full meme given user text.
-
ia_3_test Public
Forked from ChaoGaoUCR/ia_3_testFork of Chao's test with peftt ia^3. Trying to get to the bottom of IA3 training errors.
Python UpdatedSep 20, 2023