# learning-from-human-feedback

Here are 4 public repositories matching this topic...

haoliuhl / chain-of-hindsight

Simple next-token-prediction for RLHF

large-language-models learning-from-human-feedback rlhf

Updated Sep 30, 2023
Python

haozheji / exact-optimization

ICML 2024 - Official Repository for EXO: Towards Efficient Exact Optimization of Language Model Alignment

natural-language-processing large-language-models learning-from-human-feedback rlhf

Updated Jun 16, 2024
Python

junchenzhi / Neural-Hidden-CRF

Code for the KDD-2023 paper: Neural-Hidden-CRF: A Robust Weakly-Supervised Sequence Labeler

weak-supervision named-entity-recognition snorkel crowdsourcing sequence-labeling weakly-supervised-learning weak-labels data-programming noisy-labels noisy-label-learning learning-from-crowds learning-from-human-feedback truth-inference

Updated Nov 2, 2023
Python

ja2la / Learning-Behaviors-with-Uncertain-Human-Feedback-using-Speech-Recognition

Learning Behaviors with Uncertain Human Feedback using Speech Recognition

machine-learning speech-recognition human-in-the-loop human-in-the-loop-machine-learning social-robotics learning-from-human-feedback

Updated Nov 11, 2022
Python
