A brief and partial summary of RLHF algorithms.
-
Updated
Nov 24, 2024
A brief and partial summary of RLHF algorithms.
[EMNLP 2022] Continual Training of Language Models for Few-Shot Learning
Official repo for "ProSec: Fortifying Code LLMs with Proactive Security Alignment"
Official repository for the paper "Monocular Event-Based Vision for Obstacle Avoidance with a Quadrotor" by Bhattacharya, et al. (2024) from GRASP, Penn & RPG, UZH.
Machine Reading Comprehension Competition w/ Korean BERT Model
An Approach to Enhancing the Efficacy of Post-Training Using Synthetic Data by Iterative Data Selection
Reproducible figures for "Post Training in Deep Learning"
Post Training Android Part 4 for Software Laboratory Center 19-2 Binus University
Post Training Android Part 2 for Software Laboratory Center 19-2 Binus University
Post Training Android Part 1 for Software Laboratory Center 19-2 Binus University
Post Training Android Part 3 for Software Laboratory Center 19-2 Binus University
Add a description, image, and links to the post-training topic page so that developers can more easily learn about it.
To associate your repository with the post-training topic, visit your repo's landing page and select "manage topics."