🎯 Focusing
PhD in Reinforcement Learning, LLM Alignment, RLHF
- University of Cambridge
- https://holarissun.github.io/
- @HolarisSun
Pinned
- RewardModelingBeyondBradleyTerry: official implementation of the ICLR 2025 paper "Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and Alternatives". A minimal sketch of the Bradley-Terry objective appears after this list.
- RewardShifting: code for the NeurIPS 2022 paper "Exploiting Reward Shifting in Value-Based Deep RL". See the reward-shifting sketch after this list.
- Prompt-OIRL: code for the paper "Query-Dependent Prompt Evaluation and Optimization with Offline Inverse Reinforcement Learning".
- embedding-based-llm-alignment: codebase for the paper "Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUs".
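The Bradley-Terry model that the first pinned repository revisits is the standard starting point for preference-based reward modeling: the reward model is trained so that the chosen response scores higher than the rejected one. Below is a minimal PyTorch sketch of that baseline objective only; it is a generic illustration, not code from the repository, and the batch size and random scores are assumptions.

```python
import torch
import torch.nn.functional as F

def bradley_terry_loss(r_chosen: torch.Tensor, r_rejected: torch.Tensor) -> torch.Tensor:
    # Bradley-Terry models P(chosen > rejected) = sigmoid(r_chosen - r_rejected),
    # so minimizing the negative log-likelihood of observed preferences gives:
    return -F.logsigmoid(r_chosen - r_rejected).mean()

# Toy usage: random scalars stand in for a reward model's per-response scores
# (a batch of 4 preference pairs; shapes and values are illustrative only).
r_chosen = torch.randn(4, requires_grad=True)
r_rejected = torch.randn(4, requires_grad=True)
loss = bradley_terry_loss(r_chosen, r_rejected)
loss.backward()  # in real training, gradients flow back into the reward model
print(loss.item())
```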
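Reward shifting, the subject of the NeurIPS 2022 repository, adds a constant bias to the reward signal; in value-based deep RL such a shift acts roughly like changing the effective initialization of the Q-function (a negative shift is conservative, a positive one optimistic). A minimal Gymnasium-wrapper sketch of the general idea follows; it is not the repository's code, and the environment and shift value are illustrative assumptions.

```python
import gymnasium as gym

class RewardShiftWrapper(gym.RewardWrapper):
    """Add a constant c to every reward. With discount gamma, this shifts
    every return by c / (1 - gamma), which behaves like changing the
    effective initialization of a learned Q-function."""

    def __init__(self, env: gym.Env, shift: float):
        super().__init__(env)
        self.shift = shift

    def reward(self, reward: float) -> float:
        return reward + self.shift

# Illustrative usage with a negative (conservative) shift on CartPole.
env = RewardShiftWrapper(gym.make("CartPole-v1"), shift=-1.0)
obs, info = env.reset(seed=0)
obs, r, terminated, truncated, info = env.step(env.action_space.sample())
print(r)  # CartPole's per-step reward of 1.0, shifted to 0.0
```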