Unable to use the newly launched Structured Outputs feature of OpenAI, `response_format: { type: "json_schema", json_schema: {"strict": true, "schema": ...} }` #6477

ianva · 2024-08-09T06:56:55Z

Checked other resources

I added a very descriptive title to this issue.
I searched the LangChain.js documentation with the integrated search.
I used the GitHub search to find a similar question and didn't find it.
I am sure that this is a bug in LangChain.js rather than my code.
The bug is not resolved by updating to the latest stable version of LangChain (or the specific integration package).

Example Code

 const TranslationSchema = z.object({
  translatedText: z.string(),
  entities: z.array(
    z.object({
      ner: z.string(),
      label: z.string(),
      translated: z.string().optional(),
    })
  ),
});
 ...
 
    return RunnableSequence.from([
      {
        sourceLanguage: () => formattingObject.sourceLanguage,
        targetLanguage: () => formattingObject.targetLanguage,
        genre: () => formattingObject.genre,
        textToTranslate: () => formattingObject.textToTranslate,
        untranslatedNerEntities: () => formattingObject.untranslatedNerEntities,
      },
      prompt,
      (promptValue: any) => {
        const boundModel = this.model.bind({
          response_format: zodResponseFormat(TranslationSchema, "translation"),
        });
        return boundModel.invoke(promptValue);
      },
    ]);

### Error Message and Stack Trace (if applicable)

_No response_

### Description

Unable to use the newly launched Structured Outputs feature of OpenAI, `response_format: { type: "json_schema", json_schema: {"strict": true, "schema": ...} }`

The type of response_format does not support json_schema, and the return value does not contain the JSON data defined by json_schema

### System Info

osx/bun

dosubot · 2024-08-09T07:03:22Z

Hey @ianva! I'm here to help you with any bugs, questions, or contributions you have for LangChain.js. Let's tackle this issue together while we wait for a human maintainer to join in.

To use the newly launched Structured Outputs feature of OpenAI with the response_format configuration in LangChain.js, you should use the withStructuredOutput method. This method allows you to create a model that returns output matching a JSON Schema or a Zod schema. Here is an example of how you can set it up:

import { z } from "zod";
import { zodToJsonSchema } from "zod-to-json-schema";
import { ChatOpenAI } from "@langchain/openai";
import { ChatPromptTemplate } from "@langchain/core/prompts";

// Define your Zod schema
const TranslationSchema = z.object({
  translatedText: z.string(),
  entities: z.array(
    z.object({
      ner: z.string(),
      label: z.string(),
      translated: z.string().optional(),
    })
  ),
});

// Create the structured output model
const model = new ChatOpenAI({ modelName: "gpt-4-turbo-preview", temperature: 0 });
const modelWithStructuredOutput = model.withStructuredOutput({
  schema: zodToJsonSchema(TranslationSchema),
  name: "translation",
});

const prompt = ChatPromptTemplate.fromMessages([
  "system",
  `You are a helpful assistant. Translate the following text and extract named entities.`,
  "human",
  "Translate this text: {textToTranslate}",
]);

const chain = prompt.pipe(modelWithStructuredOutput);
const result = await chain.invoke({
  textToTranslate: "Hello, world!",
});

console.log(result);

In this example:

You define a Zod schema (TranslationSchema) that specifies the structure of the expected output.
You create a structured output model using withStructuredOutput, passing in the Zod schema and additional options.
You create a prompt template and pipe it through the structured output model.
You call the chain with your input data to get the structured output.

This setup ensures that the output conforms to the specified JSON schema or Zod schema, leveraging the jsonMode method to validate and parse the output accordingly. This approach is more reliable for generating structured results, even with higher temperature values, making it suitable for more creative applications.

Note: If using jsonMode as the method, you must include context in your prompt about the structured output you want. This must include the keyword: JSON ^[1]^[2]^[3]^[4]^[5].

To continue talking to Dosu, mention @dosu.

ikalachy · 2024-08-09T14:30:29Z

I bet the answer is not correct. It should be smth like
.withStructuredOutput(TranslationSchema, { name: "translate", });

And again with latest 3.1 update and tooling it seems to me functionality is broken somehow. I cant make model to generate appropriate content

bracesproul · 2024-08-09T16:18:27Z

Hey, we're implementing json_schema response format in #6438

ianva · 2024-08-09T17:00:13Z

Is the encapsulation of the ChatOpenAI class too complex, making it difficult to expand fields?
Could there be some configurations that allow for merging when new fields are added?

bracesproul · 2024-08-09T19:38:54Z

Updating the response_format type to allow

{
  type: "json_schema",
  json_schema: { ... }
}

is simple, however OpenAI added a new beta endpoint for making requests with this new json schema response format, as well as some utils for converting Zod schemas to their expected JSON schema format. Due to this, it's not as simple as expending the type, and actual code logic changes are required to make it work as intended.

ianva · 2024-08-10T01:27:29Z

Thanks for your work, looking forward to this update arriving soon

david8z · 2024-08-26T18:21:42Z

I am curious shouldn't withStructureOuput directly implement this whe calling OpenAI? cc/ @bracesproul

bracesproul · 2024-08-26T20:42:29Z

@david8z We can't make withStructuredOutput call with json_schema mode by default as that would technically be a breaking change (any change to default behavior = "breaking" change). I believe we are planning on making withStructuredOutput pass strict: true by default when we bump minor versions next.

dosubot bot added the auto:bug Related to a bug, vulnerability, unexpected error with an existing feature label Aug 9, 2024

bracesproul mentioned this issue Aug 9, 2024

openai[minor]: Add support for json schema response format #6438

Merged

bracesproul closed this as completed in #6438 Aug 16, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Unable to use the newly launched Structured Outputs feature of OpenAI, `response_format: { type: "json_schema", json_schema: {"strict": true, "schema": ...} }` #6477

Unable to use the newly launched Structured Outputs feature of OpenAI, `response_format: { type: "json_schema", json_schema: {"strict": true, "schema": ...} }` #6477

ianva commented Aug 9, 2024 •

edited

Loading

dosubot bot commented Aug 9, 2024

ikalachy commented Aug 9, 2024

bracesproul commented Aug 9, 2024

ianva commented Aug 9, 2024 •

edited

Loading

bracesproul commented Aug 9, 2024

ianva commented Aug 10, 2024

david8z commented Aug 26, 2024 •

edited

Loading

bracesproul commented Aug 26, 2024

Unable to use the newly launched Structured Outputs feature of OpenAI, response_format: { type: "json_schema", json_schema: {"strict": true, "schema": ...} } #6477

Unable to use the newly launched Structured Outputs feature of OpenAI, response_format: { type: "json_schema", json_schema: {"strict": true, "schema": ...} } #6477

Comments

ianva commented Aug 9, 2024 • edited Loading

Checked other resources

Example Code

dosubot bot commented Aug 9, 2024

ikalachy commented Aug 9, 2024

bracesproul commented Aug 9, 2024

ianva commented Aug 9, 2024 • edited Loading

bracesproul commented Aug 9, 2024

ianva commented Aug 10, 2024

david8z commented Aug 26, 2024 • edited Loading

bracesproul commented Aug 26, 2024

Unable to use the newly launched Structured Outputs feature of OpenAI, `response_format: { type: "json_schema", json_schema: {"strict": true, "schema": ...} }` #6477

Unable to use the newly launched Structured Outputs feature of OpenAI, `response_format: { type: "json_schema", json_schema: {"strict": true, "schema": ...} }` #6477

ianva commented Aug 9, 2024 •

edited

Loading

ianva commented Aug 9, 2024 •

edited

Loading

david8z commented Aug 26, 2024 •

edited

Loading