Skip to content
@OpenRLHF

OpenRLHF

Open-sourced Reinforcment Learning from Human Feedback

Pinned Loading

  1. OpenRLHF Public

    An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & LoRA & vLLM & RFT)

    Python 6.4k 628

  2. OpenRLHF-M Public

    An Easy-to-use, Scalable and High-performance RLHF Framework designed for Multimodal Models.

    Python 113 5

  3. OpenRLHF-Docs Public

    3 4

Repositories

Showing 3 of 3 repositories
  • OpenRLHF Public

    An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & LoRA & vLLM & RFT)

    Python 6,375 Apache-2.0 628 216 15 Updated Apr 22, 2025
  • OpenRLHF-Docs Public
    3 4 0 0 Updated Apr 22, 2025
  • OpenRLHF-M Public

    An Easy-to-use, Scalable and High-performance RLHF Framework designed for Multimodal Models.

    Python 113 Apache-2.0 5 6 1 Updated Apr 7, 2025

Top languages

Loading…

Most used topics

Loading…