Skip to content

Actions: natolambert/rlhf-book

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
70 workflow runs
70 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Fix build, cleaning (#54)
Deploy static content to Pages #66: Commit 7f7d8d8 pushed by natolambert
February 10, 2025 23:41 2m 3s main
February 10, 2025 23:41 2m 3s
Additions to reward modeling (#52)
Deploy static content to Pages #65: Commit 38b120c pushed by natolambert
February 8, 2025 18:34 1m 57s main
February 8, 2025 18:34 1m 57s
fixed a potential typo: undiscounted -> discounted (#53)
Deploy static content to Pages #64: Commit 78d5f42 pushed by natolambert
February 7, 2025 17:06 2m 10s main
February 7, 2025 17:06 2m 10s
fix(03-setup): finite horizon reward equation (#51)
Deploy static content to Pages #63: Commit 5d28b49 pushed by natolambert
February 6, 2025 18:28 2m 50s main
February 6, 2025 18:28 2m 50s
Add PPO and GAE (#50)
Deploy static content to Pages #62: Commit 5b19066 pushed by natolambert
February 5, 2025 00:50 2m 15s main
February 5, 2025 00:50 2m 15s
Cleaning, add changelog, add to intro, typo fixes (#48)
Deploy static content to Pages #61: Commit 1ba6254 pushed by natolambert
February 4, 2025 15:40 2m 27s main
February 4, 2025 15:40 2m 27s
A few minor typos (#49)
Deploy static content to Pages #60: Commit e28fc6e pushed by natolambert
February 3, 2025 17:45 2m 58s main
February 3, 2025 17:45 2m 58s
Removing typo from metadata.yml file (#47)
Deploy static content to Pages #59: Commit 3adb7d1 pushed by natolambert
February 2, 2025 17:28 1m 55s main
February 2, 2025 17:28 1m 55s
add WIP (#43)
Deploy static content to Pages #58: Commit c2e4e18 pushed by natolambert
February 2, 2025 03:14 2m 22s main
February 2, 2025 03:14 2m 22s
Wrapping up policy gradients v1 (#41)
Deploy static content to Pages #57: Commit 9c1301d pushed by natolambert
February 1, 2025 22:01 1m 58s main
February 1, 2025 22:01 1m 58s
nit (#40)
Deploy static content to Pages #56: Commit 6a26789 pushed by natolambert
February 1, 2025 15:04 1m 58s main
February 1, 2025 15:04 1m 58s
improve code listings, more on REINFORCE (#39)
Deploy static content to Pages #55: Commit f9b8d8d pushed by natolambert
January 27, 2025 03:08 2m 18s main
January 27, 2025 03:08 2m 18s
grpo note (#38)
Deploy static content to Pages #54: Commit 0d3d991 pushed by natolambert
January 24, 2025 00:06 2m 9s main
January 24, 2025 00:06 2m 9s
PPO + GRPO (#37)
Deploy static content to Pages #53: Commit 68c5a41 pushed by natolambert
January 22, 2025 22:51 2m 15s main
January 22, 2025 22:51 2m 15s
Discussion content merging from blog (#36)
Deploy static content to Pages #52: Commit 4474202 pushed by natolambert
January 20, 2025 23:09 2m 6s main
January 20, 2025 23:09 2m 6s
Continued reward modeling + policy gradients (#35)
Deploy static content to Pages #51: Commit 8586438 pushed by natolambert
January 19, 2025 20:44 1m 56s main
January 19, 2025 20:44 1m 56s
Overoptimization content (#34)
Deploy static content to Pages #50: Commit 6fbf822 pushed by natolambert
January 16, 2025 20:56 2m 38s main
January 16, 2025 20:56 2m 38s
policy gradient updates (#33)
Deploy static content to Pages #49: Commit cdcc1d3 pushed by natolambert
January 15, 2025 21:46 2m 6s main
January 15, 2025 21:46 2m 6s
Hotfix attempt 2 (#32)
Deploy static content to Pages #48: Commit 3d7c335 pushed by natolambert
January 8, 2025 16:20 2m 9s main
January 8, 2025 16:20 2m 9s
up (#31)
Deploy static content to Pages #47: Commit 987b3f1 pushed by natolambert
January 8, 2025 16:15 1m 28s main
January 8, 2025 16:15 1m 28s
Contrib16 (#30)
Deploy static content to Pages #46: Commit 6b5bb9b pushed by natolambert
January 8, 2025 16:03 1m 32s main
January 8, 2025 16:03 1m 32s
WIP add next chapter buttons, fix title link (#29)
Deploy static content to Pages #45: Commit 969d4e4 pushed by natolambert
January 5, 2025 18:40 1m 50s main
January 5, 2025 18:40 1m 50s
Unify navigation for easier updating (fix bug) (#28)
Deploy static content to Pages #44: Commit 012c844 pushed by natolambert
January 3, 2025 22:45 1m 49s main
January 3, 2025 22:45 1m 49s
enhance navigation section (#27)
Deploy static content to Pages #43: Commit ff742b9 pushed by natolambert
January 3, 2025 21:58 1m 44s main
January 3, 2025 21:58 1m 44s
typo (#26)
Deploy static content to Pages #42: Commit b05e6a8 pushed by natolambert
January 3, 2025 03:10 2m 39s main
January 3, 2025 03:10 2m 39s