Feature design: Support detections on Chat API #167

gkumbhat · 2024-08-19T16:58:54Z

Use-case

End-to-end guardrails experience exposed on openAI chat API. This means, a user can use similar API as openAI chat and provide a list of detectors they want to run. Behind the scene, orchestrator will automatically run the detector at appropriate time in the lifecycle of the request (based on type of detector).

Description

To enable running detections on chat completion API, we need to design how the request and response look like for this API.

Some constrains and requirements we have talked about for designing this includes:

API should support full openai API, thus providing chat completion:
1. The parts of request that are in openai API, for example, parameters and messages should remain as is
2. The parts of response from openai API, for example choices, usage etc, should remain as is.
We want to provider ability to specify different kinds of detectors together, example, HAP, PII.
Users should be able to provide list of detectors they want to execute on input and output separately.
We understand user would likely need to have a bit of background to use some detectors of different types, where the answer is not quite intuitive at first, like use of generated type detectors along with HAP / PII, where response objects look different.
Since some of these detectors may support optional parameters, we need each of the detector that the user is requesting to be an object, where one can pass parameters.

Acceptance Criteria

Proposed API merged in orchestrator docs: https://github.com/foundation-model-stack/fms-guardrails-orchestrator/tree/main/docs/api

The text was updated successfully, but these errors were encountered:

declark1 · 2024-08-19T17:38:15Z

For reference, OpenAI Chat/Completions API implementation for TGIS router on the following branch (not merged): https://github.com/IBM/text-generation-router/tree/openai-api
https://github.com/IBM/text-generation-router/tree/openai-api/fmaas-router/src/openai

gkumbhat added the enhancement New feature or request label Aug 19, 2024

evaline-ju added the documentation Improvements or additions to documentation label Aug 19, 2024

evaline-ju changed the title ~~Feature: Support Chat API~~ Feature design: Support Chat API Aug 19, 2024

evaline-ju changed the title ~~Feature design: Support Chat API~~ Feature design: Support detections on Chat API Sep 13, 2024

This was referenced Sep 13, 2024

Update standalone chat detection API in alignment with any chat API with generation #195

Closed

Implement content detectors on unary chat output #196

Open

evaline-ju self-assigned this Sep 16, 2024

This was referenced Sep 25, 2024

Implement content detectors on chat input #208

Open

📝✨ Add API for detections on chat completions and ADR #209

Merged

evaline-ju linked a pull request Sep 26, 2024 that will close this issue

📝✨ Add API for detections on chat completions and ADR #209

Merged

evaline-ju closed this as completed in #209 Sep 27, 2024

evaline-ju assigned gkumbhat Sep 27, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature design: Support detections on Chat API #167

Feature design: Support detections on Chat API #167

gkumbhat commented Aug 19, 2024 •

edited by evaline-ju

Loading

declark1 commented Aug 19, 2024 •

edited

Loading

Feature design: Support detections on Chat API #167

Feature design: Support detections on Chat API #167

Comments

gkumbhat commented Aug 19, 2024 • edited by evaline-ju Loading

Use-case

Description

Acceptance Criteria

declark1 commented Aug 19, 2024 • edited Loading

gkumbhat commented Aug 19, 2024 •

edited by evaline-ju

Loading

declark1 commented Aug 19, 2024 •

edited

Loading