Add DPO endpoint #1198

Josh-XT · 2024-05-30T14:53:01Z

Added DPO Endpoint

Added a new DPO endpoint, /api/agent/{agent_name}/dpo. It accepts a JSON body with user_input for the question and injected_memories for the number of memories to inject as context. The default is 10 injected memories.

Endpoint response will be:

{
    "prompt": "The question from user_input as well as context that was injected will be returned here",
    "chosen": "The chosen 'correct' answer will be returned here.",
    "rejected": "An intentionally incorrect answer will be returned here."
}
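
A minimal sketch of calling the endpoint from Python with only the standard library. The description above does not state the HTTP method, the server address, or the auth scheme, so the POST method, the http://localhost:7437 base URL used in the usage note, and the bearer Authorization header are all assumptions. build_dpo_request and fetch_dpo_pair are hypothetical helper names, not part of the project.

```python
import json
import urllib.parse
import urllib.request


def build_dpo_request(base_url, agent_name, user_input, injected_memories=10, api_key=None):
    """Build (but do not send) a request for the DPO endpoint.

    Assumptions: the endpoint accepts POST with a JSON body, and an
    optional API key is sent as a bearer token.
    """
    # URL-encode the agent name so spaces and slashes stay inside one path segment.
    agent = urllib.parse.quote(agent_name, safe="")
    url = f"{base_url.rstrip('/')}/api/agent/{agent}/dpo"
    body = json.dumps(
        {"user_input": user_input, "injected_memories": injected_memories}
    ).encode("utf-8")
    headers = {"Content-Type": "application/json"}
    if api_key:
        headers["Authorization"] = f"Bearer {api_key}"  # assumed auth scheme
    return urllib.request.Request(url, data=body, headers=headers, method="POST")


def fetch_dpo_pair(request, timeout=120):
    """Send the request and return the parsed JSON response body."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

With a server running, something like fetch_dpo_pair(build_dpo_request("http://localhost:7437", "my-agent", "What is DPO?")) should return a dict with the prompt, chosen, and rejected keys shown above. Building the request separately from sending it keeps the URL and body easy to inspect without a live server.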

More about DPO: https://huggingface.co/docs/trl/main/en/dpo_trainer
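
The prompt, chosen, and rejected fields match the preference-pair column names described in the linked TRL documentation, so responses can be collected into a dataset file. The sketch below validates each response and writes it as one JSON object per line (JSONL). The helper names to_jsonl_line and write_dpo_dataset are hypothetical, and JSONL is just one convenient storage choice, not something this PR prescribes.

```python
import json

REQUIRED_KEYS = ("prompt", "chosen", "rejected")


def to_jsonl_line(record):
    """Validate one endpoint response and serialize it as a single JSON line."""
    missing = [key for key in REQUIRED_KEYS if not isinstance(record.get(key), str)]
    if missing:
        raise ValueError(f"DPO record is missing string fields: {missing}")
    # Keep only the three preference-pair fields; drop anything else.
    return json.dumps({key: record[key] for key in REQUIRED_KEYS}, ensure_ascii=False)


def write_dpo_dataset(records, path):
    """Write validated records to a JSONL file and return how many were written."""
    count = 0
    with open(path, "w", encoding="utf-8") as handle:
        for record in records:
            handle.write(to_jsonl_line(record) + "\n")
            count += 1
    return count
```

Validating before writing means a malformed response fails loudly at collection time rather than later during training.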

Also added more activity logging and better error handling for web searches.

Josh-XT added 2 commits May 30, 2024 10:10

Add DPO endpoint

56f8172

fix error output and handle websearches

2be4501

Josh-XT marked this pull request as ready for review May 30, 2024 15:22

Josh-XT merged commit 30e8e81 into main May 30, 2024
7 checks passed

Josh-XT deleted the add-dpo-endpoint branch May 30, 2024 15:22
