From 8fe6a5f040fba44b39bba8d6a5655a1fd0b40a80 Mon Sep 17 00:00:00 2001
From: Nir Gazit
Date: Fri, 12 Jan 2024 11:53:35 +0100
Subject: [PATCH 01/36] chore: continuing work by cartermp

---
 docs/ai/README.md    |  24 +++++++++
 docs/ai/llm-spans.md |  99 +++++++++++++++++++++++++++++++++++++
 docs/ai/openai.md    | 114 +++++++++++++++++++++++++++++++++++++++++++
 3 files changed, 237 insertions(+)
 create mode 100644 docs/ai/README.md
 create mode 100644 docs/ai/llm-spans.md
 create mode 100644 docs/ai/openai.md

diff --git a/docs/ai/README.md b/docs/ai/README.md
new file mode 100644
index 0000000000..f04a867a22
--- /dev/null
+++ b/docs/ai/README.md
@@ -0,0 +1,24 @@
+
+
+# Semantic Conventions for AI systems
+
+**Status**: [Experimental][DocumentStatus]
+
+This document defines semantic conventions for the following kinds of AI systems:
+
+* LLMs
+
+Semantic conventions for LLM operations are defined for the following signals:
+
+* [LLM Spans](llm-spans.md): Semantic Conventions for LLM requests - *spans*.
+
+Technology-specific semantic conventions are defined for the following LLM providers:
+
+* [OpenAI](openai.md): Semantic Conventions for *OpenAI*.
+
+[DocumentStatus]: https://github.com/open-telemetry/opentelemetry-specification/tree/v1.22.0/specification/document-status.md
\ No newline at end of file
diff --git a/docs/ai/llm-spans.md b/docs/ai/llm-spans.md
new file mode 100644
index 0000000000..19c4162321
--- /dev/null
+++ b/docs/ai/llm-spans.md
@@ -0,0 +1,99 @@
+
+
+# Semantic Conventions for LLM requests
+
+**Status**: [Experimental][DocumentStatus]
+
+
+
+
+- [LLM Request attributes](#llm-request-attributes)
+- [Configuration](#configuration)
+- [Semantic Conventions for specific LLM technologies](#semantic-conventions-for-specific-llm-technologies)
+
+
+A request to an LLM is modeled as a span in a trace.
+
+The **span name** SHOULD be set to a low cardinality value representing the request made to an LLM.
+It MAY be the name of the API endpoint for the LLM being called.
+
+## Configuration
+
+Instrumentations for LLMs MUST offer the ability to turn off capture of prompts and completions. This is for three primary reasons:
+
+1. Data privacy concerns. End users of LLM applications may input sensitive information or personally identifiable information (PII) that they do not wish to be sent to a telemetry backend.
+2. Data size concerns. Although there is no specified limit to sizes, there are practical limitations in programming languages and telemetry systems. Some LLMs allow for extremely large context windows that end users may take full advantage of.
+3. Performance concerns. Sending large amounts of data to a telemetry backend may cause performance issues for the application.
+
+By default, these configurations SHOULD NOT capture prompts and completions.
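+
+Below is a minimal sketch of how an instrumentation might apply these rules when wrapping an LLM client call. The `capture_content` flag, the `call_llm` callable, and the fields read from its result (`model`, `total_tokens`, `finish_reason`, `text`) are illustrative assumptions rather than part of this convention; only the span-name guidance and the `llm.*` attribute names come from this document, and the event names shown are likewise illustrative.
+
+```python
+from opentelemetry import trace
+
+tracer = trace.get_tracer(__name__)
+
+
+def instrumented_chat(call_llm, prompt: str, capture_content: bool = False):
+    # Span name: a low-cardinality value for the request, e.g. the endpoint name.
+    with tracer.start_as_current_span("openai.chat") as span:
+        # Request attributes (example values; an instrumentation would read
+        # them from the actual request parameters).
+        span.set_attribute("llm.vendor", "openai")
+        span.set_attribute("llm.request.model", "gpt-4")
+        span.set_attribute("llm.request.max_tokens", 100)
+        span.set_attribute("llm.temperature", 0.0)
+
+        # `call_llm` is an assumed application-supplied callable that performs
+        # the actual LLM request and returns a result object.
+        result = call_llm(prompt)
+
+        # Response attributes taken from the (assumed) result object.
+        span.set_attribute("llm.response.model", result.model)
+        span.set_attribute("llm.usage.total_tokens", result.total_tokens)
+        span.set_attribute("llm.response.finish_reason", result.finish_reason)
+
+        # Prompt/completion capture is opt-in and disabled by default.
+        if capture_content:
+            span.add_event("llm.prompt", {"llm.prompt": prompt})
+            span.add_event("llm.completion", {"llm.completion": result.text})
+        return result
+```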
+
+## LLM Request attributes
+
+These attributes track input data and metadata for a request to an LLM. Each attribute represents a concept that is common to most LLMs.
+
+| Attribute | Type | Description | Examples | Requirement Level |
+|---|---|---|---|---|
+| `llm.vendor` | string | The name of the LLM foundation model vendor, if applicable. If not using a vendor-supplied model, this field is left blank. | `openai` | Recommended |
+| `llm.request.model` | string | The name of the LLM a request is being made to. If the LLM is supplied by a vendor, then the value must be the exact name of the model requested. If the LLM is a fine-tuned custom model, the value SHOULD have a more specific name than the base model that's been fine-tuned. | `gpt-4` | Required |
+| `llm.request.max_tokens` | int | The maximum number of tokens the LLM generates for a request. | `100` | Recommended |
+| `llm.temperature` | float | The temperature setting for the LLM request. | `0.0` | Recommended |
+| `llm.top_p` | float | The top_p sampling setting for the LLM request. | `1.0` | Recommended |
+| `llm.stream` | bool | Whether the LLM responds with a stream. | `false` | Recommended |
+| `llm.stop_sequences` | array | Array of strings the LLM uses as a stop sequence. | `["stop1"]` | Recommended |
+
+`llm.request.model` has the following list of well-known values. If one of them applies, then the respective value MUST be used; otherwise, a custom value MAY be used.
+
+| Value | Description |
+|---|---|
+| `gpt-4` | GPT-4 |
+| `gpt-4-32k` | GPT-4 with 32k context window |
+| `gpt-3.5-turbo` | GPT-3.5-turbo |
+| `gpt-3.5-turbo-16k` | GPT-3.5-turbo with 16k context window |
+| `claude-instant-1` | Claude Instant (latest version) |
+| `claude-2` | Claude 2 (latest version) |
+| `other-llm` | Any LLM not listed in this table. Use for any fine-tuned version of a model. |
+
+
+## LLM Response attributes
+
+These attributes track output data and metadata for a response from an LLM. Each attribute represents a concept that is common to most LLMs.
+
+| Attribute | Type | Description | Examples | Requirement Level |
+|---|---|---|---|---|
+| `llm.response.id` | string | The unique identifier for the completion. | `chatcmpl-123` | Recommended |
+| `llm.response.model` | string | The name of the LLM a response was generated from. If the LLM is supplied by a vendor, then the value must be the exact name of the model actually used. If the LLM is a fine-tuned custom model, the value SHOULD have a more specific name than the base model that's been fine-tuned. | `gpt-4-0613` | Required |
+| `llm.response.finish_reason` | string | The reason the model stopped generating tokens. | `stop` | Recommended |
+| `llm.usage.prompt_tokens` | int | The number of tokens used in the LLM prompt. | `100` | Recommended |
+| `llm.usage.completion_tokens` | int | The number of tokens used in the LLM response (completion). | `180` | Recommended |
+| `llm.usage.total_tokens` | int | The total number of tokens used in the LLM prompt and response. | `280` | Recommended |
+
+`llm.response.finish_reason` MUST be one of the following:
+
+| Value | Description |
+|---|---|
+| `stop` | If the model hit a natural stop point or a provided stop sequence. |
+| `max_tokens` | If the maximum number of tokens specified in the request was reached. |
+| `tool_call` | If a function / tool call was made by the model (for models that support such functionality). |
+
+
+## Events
+
+In the lifetime of an LLM span, an event for prompts sent and completions received MAY be created, depending on the configuration of the instrumentation.
+
+
+| Attribute | Type | Description | Examples | Requirement Level |
+|---|---|---|---|---|
+| `llm.prompt` | string | The full prompt string sent to an LLM in a request. If the LLM accepts a more complex input like a JSON object made up of several pieces (such as OpenAI's different message types), this field is blank, and the prompt is instead captured in an event determined by the specific LLM technology semantic convention. | `\n\nHuman:You are an AI assistant that tells jokes. Can you tell me a joke about OpenTelemetry?\n\nAssistant:` | Recommended |
+
+
+| Attribute | Type | Description | Examples | Requirement Level |
+|---|---|---|---|---|
+| `llm.completion` | string | The full response string from an LLM.
If the LLM responds with a more complex output like a JSON object made up of several pieces (such as OpenAI's message choices), this field is the content of the response. If the LLM produces multiple responses, then this field is left blank, and each response is instead captured in an event determined by the specific LLM technology semantic convention.| `Why did the developer stop using OpenTelemetry? Because they couldn't trace their steps!` | Recommended | + + +[DocumentStatus]: https://github.com/open-telemetry/opentelemetry-specification/tree/v1.22.0/specification/document-status.md \ No newline at end of file diff --git a/docs/ai/openai.md b/docs/ai/openai.md new file mode 100644 index 0000000000..4c7acf404a --- /dev/null +++ b/docs/ai/openai.md @@ -0,0 +1,114 @@ + + +# Semantic Conventions for OpenAI Spans + +**Status**: [Experimental][DocumentStatus] + +This document outlines the Semantic Conventions specific to +[OpenAI](https://platform.openai.com/) spans, extending the general semantics +found in the [LLM Semantic Conventions](llm-spans.md). These conventions are +designed to standardize telemetry data for OpenAI interactions, particularly +focusing on the `/chat/completions` endpoint. By following to these guidelines, +developers can ensure consistent, meaningful, and easily interpretable telemetry +data across different applications and platforms. + +## Chat Completions + +The span name for OpenAI chat completions SHOULD be `openai.chat` +to maintain consistency and clarity in telemetry data. + +## Request Attributes + +These are the attributes when instrumenting OpenAI LLM requests with the +`/chat/completions` endpoint. + + +| Attribute | Type | Description | Examples | Requirement Level | +|---|---|---|---|---| +| `llm.vendor` | string | The name of the LLM foundation model vendor, if applicable. If not using a vendor-supplied model, this field is left blank. | `openai` | Recommended | +| `llm.request.model` | string | The name of the LLM a request is being made to. If the LLM is supplied by a vendor, then the value must be the exact name of the model requested. If the LLM is a fine-tuned custom model, the value SHOULD have a more specific name than the base model that's been fine-tuned. | `gpt-4` | Required | +| `llm.request.max_tokens` | int | The maximum number of tokens the LLM generates for a request. | `100` | Recommended | +| `llm.temperature` | float | The temperature setting for the LLM request. | `0.0` | Recommended | +| `llm.top_p` | float | The top_p sampling setting for the LLM request. | `1.0` | Recommended | +| `llm.stream` | bool | Whether the LLM responds with a stream. | `false` | Recommended | +| `llm.stop_sequences` | array | Array of strings the LLM uses as a stop sequence. | `["stop1"]` | Recommended | +| `llm.openai.n` | integer | The number of completions to generate. | `1` | Recommended | +| `llm.openai.presence_penalty` | float | If present, the `presence_penalty` used in an OpenAI request. Value is between -2.0 and 2.0. | `-0.5` | Recommended | +| `llm.openai.frequency_penalty` | float | If present, the `frequency_penalty` used in an OpenAI request. Value is between -2.0 and 2.0. | `-0.5` | Recommended | +| `llm.openai.logit_bias` | string | If present, the JSON-encoded string of a `logit_bias` used in an OpenAI request. | `{2435:-100, 640:-100}` | Recommended | +| `llm.openai.user` | string | If present, the `user` used in an OpenAI request. 
| `bob` | Opt-in | +| `llm.openai.response_format` | string | An object specifying the format that the model must output. Either `text` or `json_object` | `text` | Recommended | +| `llm.openai.seed` | integer | Seed used in request to improve determinism. | `1234` | Recommended | + + +## Response attributes + +Attributes for chat completion responses SHOULD follow these conventions: + + +| Attribute | Type | Description | Examples | Requirement Level | +|---|---|---|---|---| +| `llm.response.id` | string | The unique identifier for the completion. | `chatcmpl-123` | Recommended | +| `llm.response.model` | string | The name of the LLM a response is being made to. If the LLM is supplied by a vendor, then the value must be the exact name of the model actually used. If the LLM is a fine-tuned custom model, the value SHOULD have a more specific name than the base model that's been fine-tuned. | `gpt-4-0613` | Required | +| `llm.response.finish_reason` | string | The reason the model stopped generating tokens | `stop` | Recommended | +| `llm.usage.prompt_tokens` | int | The number of tokens used in the LLM prompt. | `100` | Recommended | +| `llm.usage.completion_tokens` | int | The number of tokens used in the LLM response (completion). | `180` | Recommended | +| `llm.usage.total_tokens` | int | The total number of tokens used in the LLM prompt and response. | `280` | Recommended | +| `llm.openai.created` | int | The UNIX timestamp (in seconds) if when the completion was created. | `1677652288` | Recommended | +| `llm.openai.system_fingerprint` | string | This fingerprint represents the backend configuration that the model runs with. | asdf987123 | Recommended | + + +## Request Events + +In the lifetime of an LLM span, an event for prompts sent and completions received MAY be created, depending on the configuration of the instrumentation. +Because OpenAI uses a more complex prompt structure, these events will be used instead of the generic ones detailed in the [LLM Semantic Conventions](llm-spans.md). + +### Prompt Events + +Prompt event name SHOULD be `llm.openai.prompt`. + + +| Attribute | Type | Description | Examples | Requirement Level | +|---|---|---|---|---| +| `role` | string | The role of the prompt author, can be one of `system`, `user`, `assistant`, or `tool` | `system` | Required | +| `content` | string | The content for a given OpenAI response, denoted by ``. The value for `` starts with 0, where 0 is the first message. | `Why did the developer stop using OpenTelemetry? Because they couldn't trace their steps!` | Required | +| `tool_call_id` | string | If role is `tool` or `function`, then this tool call that this message is responding to. | `get_current_weather` | Conditionally Required: If `role` is `tool`. | + + +### Tools Events + +Tools event name SHOULD be `llm.openai.tool`, specifying potential tools or functions the LLM can use. + + +| Attribute | Type | Description | Examples | Requirement Level | +|---|---|---|---|---| +| `type` | string | They type of the tool. Currently, only `function` is supported. | `function` | Required | +| `function.name` | string | The name of the function to be called. | `get_weather` | Required ! +| `function.description` | string | A description of what the function does, used by the model to choose when and how to call the function. | `` | Required | +| `function.parameters` | string | JSON-encoded string of the parameter object for the function. 
| `{"type": "object", "properties": {}}` | Required | + + +### Choice Events + +Recording details about Choices in each response MAY be included as +Span Events. + +Choice event name SHOULD be `llm.openai.choice`. + +If there is more than one `tool_call`, separate events SHOULD be used. + + +| `type` | string | Either `delta` or `message`. | `message` | Required | +|---|---|---|---|---| +| `finish_reason` | string | The reason the OpenAI model stopped generating tokens for this chunk. | `stop` | Recommended | +| `role` | string | The assigned role for a given OpenAI response, denoted by ``. The value for `` starts with 0, where 0 is the first message. | `system` | Required | +| `content` | string | The content for a given OpenAI response, denoted by ``. The value for `` starts with 0, where 0 is the first message. | `Why did the developer stop using OpenTelemetry? Because they couldn't trace their steps!` | Required | +| `tool_call.id` | string | If exists, the ID of the tool call. | `call_BP08xxEhU60txNjnz3z9R4h9` | Required | +| `tool_call.type` | string | Currently only `function` is supported. | `function` | Required | +| `tool_call.function.name` | string | If exists, the name of a function call for a given OpenAI response, denoted by ``. The value for `` starts with 0, where 0 is the first message. | `get_weather_report` | Required | +| `tool_call.function.arguments` | string | If exists, the arguments to call a function call with for a given OpenAI response, denoted by ``. The value for `` starts with 0, where 0 is the first message. | `{"type": "object", "properties": {"some":"data"}}` | Required | + + +[DocumentStatus]: https://github.com/open-telemetry/opentelemetry-specification/tree/v1.22.0/specification/document-status.md \ No newline at end of file From a521fc1cc960997f259256852c2ffd799b41ddd7 Mon Sep 17 00:00:00 2001 From: Drew Robbins Date: Mon, 22 Jan 2024 19:43:56 +0000 Subject: [PATCH 02/36] Update to use Yaml model files --- docs/ai/llm-spans.md | 78 ++++++------- docs/ai/openai.md | 111 ++++++++++-------- docs/attributes-registry/llm.md | 125 ++++++++++++++++++++ model/registry/llm.yaml | 194 ++++++++++++++++++++++++++++++++ model/trace/llm.yaml | 164 +++++++++++++++++++++++++++ 5 files changed, 581 insertions(+), 91 deletions(-) create mode 100644 docs/attributes-registry/llm.md create mode 100644 model/registry/llm.yaml create mode 100644 model/trace/llm.yaml diff --git a/docs/ai/llm-spans.md b/docs/ai/llm-spans.md index 19c4162321..9ed8134795 100644 --- a/docs/ai/llm-spans.md +++ b/docs/ai/llm-spans.md @@ -2,7 +2,7 @@ linkTitle: LLM Calls ---> -# Semantic Conventions for LLM requests +# Semantic Conventions for LLM Spans **Status**: [Experimental][DocumentStatus] @@ -10,9 +10,10 @@ linkTitle: LLM Calls -- [LLM Request attributes](#llm-request-attributes) - [Configuration](#configuration) -- [Semantic Conventions for specific LLM technologies](#semantic-conventions-for-specific-llm-technologies) +- [LLM Request attributes](#llm-request-attributes) +- [LLM Response attributes](#llm-response-attributes) +- [LLM Span Events](#llm-span-events) @@ -35,65 +36,52 @@ By default, these configurations SHOULD NOT capture prompts and completions. These attributes track input data and metadata for a request to an LLM. Each attribute represents a concept that is common to most LLMs. - + | Attribute | Type | Description | Examples | Requirement Level | |---|---|---|---|---| -| `llm.vendor` | string | The name of the LLM foundation model vendor, if applicable. 
If not using a vendor-supplied model, this field is left blank. | `openai` | Recommended | -| `llm.request.model` | string | The name of the LLM a request is being made to. If the LLM is supplied by a vendor, then the value must be the exact name of the model requested. If the LLM is a fine-tuned custom model, the value SHOULD have a more specific name than the base model that's been fine-tuned. | `gpt-4` | Required | -| `llm.request.max_tokens` | int | The maximum number of tokens the LLM generates for a request. | `100` | Recommended | -| `llm.temperature` | float | The temperature setting for the LLM request. | `0.0` | Recommended | -| `llm.top_p` | float | The top_p sampling setting for the LLM request. | `1.0` | Recommended | -| `llm.stream` | bool | Whether the LLM responds with a stream. | `false` | Recommended | -| `llm.stop_sequences` | array | Array of strings the LLM uses as a stop sequence. | `["stop1"]` | Recommended | - -`llm.model` has the following list of well-known values. If one of them applies, then the respective value MUST be used, otherwise a custom value MAY be used. - -| Value | Description | -|---|---| -| `gpt-4` | GPT-4 | -| `gpt-4-32k` | GPT-4 with 32k context window | -| `gpt-3.5-turbo` | GPT-3.5-turbo | -| `gpt-3.5-turbo-16k` | GPT-3.5-turbo with 16k context window| -| `claude-instant-1` | Claude Instant (latest version) | -| `claude-2` | Claude 2 (latest version) | -| `other-llm` | Any LLM not listed in this table. Use for any fine-tuned version of a model. | +| [`llm.request.max_tokens`](../attributes-registry/llm.md) | int | The maximum number of tokens the LLM generates for a request. | `100` | Recommended | +| [`llm.request.model`](../attributes-registry/llm.md) | string | The name of the LLM a request is being made to. [1] | `gpt-4` | Required | +| [`llm.stop_sequences`](../attributes-registry/llm.md) | string | Array of strings the LLM uses as a stop sequence. | `stop1` | Recommended | +| [`llm.stream`](../attributes-registry/llm.md) | boolean | Whether the LLM responds with a stream. | `False` | Recommended | +| [`llm.temperature`](../attributes-registry/llm.md) | double | The temperature setting for the LLM request. | `0.0` | Recommended | +| [`llm.top_p`](../attributes-registry/llm.md) | double | The top_p sampling setting for the LLM request. | `1.0` | Recommended | +| [`llm.vendor`](../attributes-registry/llm.md) | string | The name of the LLM foundation model vendor, if applicable. [2] | `openai` | Recommended | + +**[1]:** The name of the LLM a request is being made to. If the LLM is supplied by a vendor, then the value must be the exact name of the model requested. If the LLM is a fine-tuned custom model, the value should have a more specific name than the base model that's been fine-tuned. + +**[2]:** The name of the LLM foundation model vendor, if applicable. If not using a vendor-supplied model, this field is left blank. ## LLM Response attributes These attributes track output data and metadata for a response from an LLM. Each attribute represents a concept that is common to most LLMs. - + | Attribute | Type | Description | Examples | Requirement Level | |---|---|---|---|---| -| `llm.response.id` | string | The unique identifier for the completion. | `chatcmpl-123` | Recommended | -| `llm.response.model` | string | The name of the LLM a response is being made to. If the LLM is supplied by a vendor, then the value must be the exact name of the model actually used. 
If the LLM is a fine-tuned custom model, the value SHOULD have a more specific name than the base model that's been fine-tuned. | `gpt-4-0613` | Required | -| `llm.response.finish_reason` | string | The reason the model stopped generating tokens | `stop` | Recommended | -| `llm.usage.prompt_tokens` | int | The number of tokens used in the LLM prompt. | `100` | Recommended | -| `llm.usage.completion_tokens` | int | The number of tokens used in the LLM response (completion). | `180` | Recommended | -| `llm.usage.total_tokens` | int | The total number of tokens used in the LLM prompt and response. | `280` | Recommended | - -`llm.response.finish_reason` MUST be one of the following: - -| Value | Description | -|---|---| -| `stop` | If the model hit a natural stop point or a provided stop sequence. | -| `max_tokens` | If the maximum number of tokens specified in the request was reached. | -| `tool_call` | If a function / tool call was made by the model (for models that support such functionality). | +| [`llm.response.finish_reason`](../attributes-registry/llm.md) | string | The reason the model stopped generating tokens. | `stop` | Recommended | +| [`llm.response.id`](../attributes-registry/llm.md) | string | The unique identifier for the completion. | `chatcmpl-123` | Recommended | +| [`llm.response.model`](../attributes-registry/llm.md) | string | The name of the LLM a response is being made to. [1] | `gpt-4-0613` | Required | +| [`llm.usage.completion_tokens`](../attributes-registry/llm.md) | int | The number of tokens used in the LLM response (completion). | `180` | Recommended | +| [`llm.usage.prompt_tokens`](../attributes-registry/llm.md) | int | The number of tokens used in the LLM prompt. | `100` | Recommended | +| [`llm.usage.total_tokens`](../attributes-registry/llm.md) | int | The total number of tokens used in the LLM prompt and response. | `280` | Recommended | + +**[1]:** The name of the LLM a response is being made to. If the LLM is supplied by a vendor, then the value must be the exact name of the model actually used. If the LLM is a fine-tuned custom model, the value should have a more specific name than the base model that's been fine-tuned. -## Events +## LLM Span Events In the lifetime of an LLM span, an event for prompts sent and completions received MAY be created, depending on the configuration of the instrumentation. - + | Attribute | Type | Description | Examples | Requirement Level | -| `llm.prompt` | string | The full prompt string sent to an LLM in a request. If the LLM accepts a more complex input like a JSON object made up of several pieces (such as OpenAI's different message types), this field is blank, and the response is instead captured in an event determined by the specific LLM technology semantic convention. | `\n\nHuman:You are an AI assistant that tells jokes. Can you tell me a joke about OpenTelemetry?\n\nAssistant:` | Recommended | - +|---|---|---|---|---| +| [`llm.completion`](../attributes-registry/llm.md) | string | The full response string from an LLM in a response. [1] | `Why did the developer stop using OpenTelemetry? Because they couldnt trace their steps!` | Recommended | +| [`llm.prompt`](../attributes-registry/llm.md) | string | The full prompt string sent to an LLM in a request. [2] | `\\n\\nHuman:You are an AI assistant that tells jokes. 
Can you tell me a joke about OpenTelemetry?\\n\\nAssistant:` | Recommended | - -| Attribute | Type | Description | Examples | Requirement Level | -| `llm.completion` | string | The full response string from an LLM. If the LLM responds with a more complex output like a JSON object made up of several pieces (such as OpenAI's message choices), this field is the content of the response. If the LLM produces multiple responses, then this field is left blank, and each response is instead captured in an event determined by the specific LLM technology semantic convention.| `Why did the developer stop using OpenTelemetry? Because they couldn't trace their steps!` | Recommended | +**[1]:** The full response string from an LLM. If the LLM responds with a more complex output like a JSON object made up of several pieces (such as OpenAI's message choices), this field is the content of the response. If the LLM produces multiple responses, then this field is left blank, and each response is instead captured in an event determined by the specific LLM technology semantic convention. + +**[2]:** The full prompt string sent to an LLM in a request. If the LLM accepts a more complex input like a JSON object, this field is blank, and the response is instead captured in an event determined by the specific LLM technology semantic convention. [DocumentStatus]: https://github.com/open-telemetry/opentelemetry-specification/tree/v1.22.0/specification/document-status.md \ No newline at end of file diff --git a/docs/ai/openai.md b/docs/ai/openai.md index 4c7acf404a..7dcbee5a6d 100644 --- a/docs/ai/openai.md +++ b/docs/ai/openai.md @@ -14,53 +14,63 @@ focusing on the `/chat/completions` endpoint. By following to these guidelines, developers can ensure consistent, meaningful, and easily interpretable telemetry data across different applications and platforms. + + +- [Chat Completions](#chat-completions) + * [Request Attributes](#request-attributes) + * [Response attributes](#response-attributes) +- [OpenAI Span Events](#openai-span-events) + * [Prompt Events](#prompt-events) + * [Tools Events](#tools-events) + * [Choice Events](#choice-events) + + + ## Chat Completions The span name for OpenAI chat completions SHOULD be `openai.chat` to maintain consistency and clarity in telemetry data. -## Request Attributes +### Request Attributes These are the attributes when instrumenting OpenAI LLM requests with the `/chat/completions` endpoint. - + | Attribute | Type | Description | Examples | Requirement Level | |---|---|---|---|---| -| `llm.vendor` | string | The name of the LLM foundation model vendor, if applicable. If not using a vendor-supplied model, this field is left blank. | `openai` | Recommended | -| `llm.request.model` | string | The name of the LLM a request is being made to. If the LLM is supplied by a vendor, then the value must be the exact name of the model requested. If the LLM is a fine-tuned custom model, the value SHOULD have a more specific name than the base model that's been fine-tuned. | `gpt-4` | Required | -| `llm.request.max_tokens` | int | The maximum number of tokens the LLM generates for a request. | `100` | Recommended | -| `llm.temperature` | float | The temperature setting for the LLM request. | `0.0` | Recommended | -| `llm.top_p` | float | The top_p sampling setting for the LLM request. | `1.0` | Recommended | -| `llm.stream` | bool | Whether the LLM responds with a stream. | `false` | Recommended | -| `llm.stop_sequences` | array | Array of strings the LLM uses as a stop sequence. 
| `["stop1"]` | Recommended | -| `llm.openai.n` | integer | The number of completions to generate. | `1` | Recommended | -| `llm.openai.presence_penalty` | float | If present, the `presence_penalty` used in an OpenAI request. Value is between -2.0 and 2.0. | `-0.5` | Recommended | -| `llm.openai.frequency_penalty` | float | If present, the `frequency_penalty` used in an OpenAI request. Value is between -2.0 and 2.0. | `-0.5` | Recommended | -| `llm.openai.logit_bias` | string | If present, the JSON-encoded string of a `logit_bias` used in an OpenAI request. | `{2435:-100, 640:-100}` | Recommended | -| `llm.openai.user` | string | If present, the `user` used in an OpenAI request. | `bob` | Opt-in | -| `llm.openai.response_format` | string | An object specifying the format that the model must output. Either `text` or `json_object` | `text` | Recommended | -| `llm.openai.seed` | integer | Seed used in request to improve determinism. | `1234` | Recommended | +| [`llm.openai.logit_bias`](../attributes-registry/llm.md) | string | If present, the JSON-encoded string of a `logit_bias` used in an OpenAI request | `{2435:-100, 640:-100}` | Recommended | +| [`llm.openai.presence_penalty`](../attributes-registry/llm.md) | double | If present, the `presence_penalty` used in an OpenAI request. Value is between -2.0 and 2.0. | `-0.5` | Recommended | +| [`llm.openai.response_format`](../attributes-registry/llm.md) | string | An object specifying the format that the model must output. Either `text` or `json_object` | `text` | Recommended | +| [`llm.openai.user`](../attributes-registry/llm.md) | string | If present, the `user` used in an OpenAI request. | `bob` | Recommended | +| [`llm.request.max_tokens`](../attributes-registry/llm.md) | int | The maximum number of tokens the LLM generates for a request. | `100` | Recommended | +| [`llm.request.model`](../attributes-registry/llm.md) | string | The name of the LLM a request is being made to. [1] | `gpt-4` | Required | +| [`llm.stop_sequences`](../attributes-registry/llm.md) | string | Array of strings the LLM uses as a stop sequence. | `stop1` | Recommended | +| [`llm.stream`](../attributes-registry/llm.md) | boolean | Whether the LLM responds with a stream. | `False` | Recommended | +| [`llm.temperature`](../attributes-registry/llm.md) | double | The temperature setting for the LLM request. | `0.0` | Recommended | +| [`llm.top_p`](../attributes-registry/llm.md) | double | The top_p sampling setting for the LLM request. | `1.0` | Recommended | +| [`llm.vendor`](../attributes-registry/llm.md) | string | The name of the LLM foundation model vendor, if applicable. | `openai`; `microsoft` | Recommended | + +**[1]:** The name of the LLM a request is being made to. If the LLM is supplied by a vendor, then the value must be the exact name of the model requested. If the LLM is a fine-tuned custom model, the value should have a more specific name than the base model that's been fine-tuned. -## Response attributes +### Response attributes Attributes for chat completion responses SHOULD follow these conventions: - + | Attribute | Type | Description | Examples | Requirement Level | |---|---|---|---|---| -| `llm.response.id` | string | The unique identifier for the completion. | `chatcmpl-123` | Recommended | -| `llm.response.model` | string | The name of the LLM a response is being made to. If the LLM is supplied by a vendor, then the value must be the exact name of the model actually used. 
If the LLM is a fine-tuned custom model, the value SHOULD have a more specific name than the base model that's been fine-tuned. | `gpt-4-0613` | Required | -| `llm.response.finish_reason` | string | The reason the model stopped generating tokens | `stop` | Recommended | -| `llm.usage.prompt_tokens` | int | The number of tokens used in the LLM prompt. | `100` | Recommended | -| `llm.usage.completion_tokens` | int | The number of tokens used in the LLM response (completion). | `180` | Recommended | -| `llm.usage.total_tokens` | int | The total number of tokens used in the LLM prompt and response. | `280` | Recommended | -| `llm.openai.created` | int | The UNIX timestamp (in seconds) if when the completion was created. | `1677652288` | Recommended | -| `llm.openai.system_fingerprint` | string | This fingerprint represents the backend configuration that the model runs with. | asdf987123 | Recommended | +| [`llm.openai.created`](../attributes-registry/llm.md) | int | The UNIX timestamp (in seconds) if when the completion was created. | `1677652288` | Recommended | +| [`llm.openai.seed`](../attributes-registry/llm.md) | int | Seed used in request to improve determinism. | `1234` | Recommended | +| [`llm.response.finish_reason`](../attributes-registry/llm.md) | string | The reason the model stopped generating tokens. | `stop` | Recommended | +| [`llm.response.id`](../attributes-registry/llm.md) | string | The unique identifier for the completion. | `chatcmpl-123` | Recommended | +| [`llm.usage.completion_tokens`](../attributes-registry/llm.md) | int | The number of tokens used in the LLM response (completion). | `180` | Recommended | +| [`llm.usage.prompt_tokens`](../attributes-registry/llm.md) | int | The number of tokens used in the LLM prompt. | `100` | Recommended | +| [`llm.usage.total_tokens`](../attributes-registry/llm.md) | int | The total number of tokens used in the LLM prompt and response. | `280` | Recommended | -## Request Events +## OpenAI Span Events In the lifetime of an LLM span, an event for prompts sent and completions received MAY be created, depending on the configuration of the instrumentation. Because OpenAI uses a more complex prompt structure, these events will be used instead of the generic ones detailed in the [LLM Semantic Conventions](llm-spans.md). @@ -69,25 +79,25 @@ Because OpenAI uses a more complex prompt structure, these events will be used i Prompt event name SHOULD be `llm.openai.prompt`. - + | Attribute | Type | Description | Examples | Requirement Level | |---|---|---|---|---| -| `role` | string | The role of the prompt author, can be one of `system`, `user`, `assistant`, or `tool` | `system` | Required | -| `content` | string | The content for a given OpenAI response, denoted by ``. The value for `` starts with 0, where 0 is the first message. | `Why did the developer stop using OpenTelemetry? Because they couldn't trace their steps!` | Required | -| `tool_call_id` | string | If role is `tool` or `function`, then this tool call that this message is responding to. | `get_current_weather` | Conditionally Required: If `role` is `tool`. | +| [`llm.openai.content`](../attributes-registry/llm.md) | string | The content for a given OpenAI response. | `Why did the developer stop using OpenTelemetry? 
Because they couldn't trace their steps!` | Required | +| [`llm.openai.role`](../attributes-registry/llm.md) | string | The role of the prompt author, can be one of `system`, `user`, `assistant`, or `tool` | `user` | Required | +| [`llm.openai.tool_call.id`](../attributes-registry/llm.md) | string | If role is `tool` or `function`, then this tool call that this message is responding to. | `get_current_weather` | Conditionally Required: Required if the prompt role is `tool`. | ### Tools Events Tools event name SHOULD be `llm.openai.tool`, specifying potential tools or functions the LLM can use. - + | Attribute | Type | Description | Examples | Requirement Level | |---|---|---|---|---| -| `type` | string | They type of the tool. Currently, only `function` is supported. | `function` | Required | -| `function.name` | string | The name of the function to be called. | `get_weather` | Required ! -| `function.description` | string | A description of what the function does, used by the model to choose when and how to call the function. | `` | Required | -| `function.parameters` | string | JSON-encoded string of the parameter object for the function. | `{"type": "object", "properties": {}}` | Required | +| [`llm.openai.function.description`](../attributes-registry/llm.md) | string | A description of what the function does, used by the model to choose when and how to call the function. | `Gets the current weather for a location` | Required | +| [`llm.openai.function.name`](../attributes-registry/llm.md) | string | The name of the function to be called. | `get_weather` | Required | +| [`llm.openai.function.parameters`](../attributes-registry/llm.md) | string | JSON-encoded string of the parameter object for the function. | `{"type": "object", "properties": {}}` | Required | +| [`llm.openai.tool_call.type`](../attributes-registry/llm.md) | string | The type of the tool. Currently, only `function` is supported. | `function` | Required | ### Choice Events @@ -97,18 +107,27 @@ Span Events. Choice event name SHOULD be `llm.openai.choice`. -If there is more than one `tool_call`, separate events SHOULD be used. +If there is more than one `choice`, separate events SHOULD be used. - -| `type` | string | Either `delta` or `message`. | `message` | Required | + +| Attribute | Type | Description | Examples | Requirement Level | |---|---|---|---|---| -| `finish_reason` | string | The reason the OpenAI model stopped generating tokens for this chunk. | `stop` | Recommended | -| `role` | string | The assigned role for a given OpenAI response, denoted by ``. The value for `` starts with 0, where 0 is the first message. | `system` | Required | -| `content` | string | The content for a given OpenAI response, denoted by ``. The value for `` starts with 0, where 0 is the first message. | `Why did the developer stop using OpenTelemetry? Because they couldn't trace their steps!` | Required | -| `tool_call.id` | string | If exists, the ID of the tool call. | `call_BP08xxEhU60txNjnz3z9R4h9` | Required | -| `tool_call.type` | string | Currently only `function` is supported. | `function` | Required | -| `tool_call.function.name` | string | If exists, the name of a function call for a given OpenAI response, denoted by ``. The value for `` starts with 0, where 0 is the first message. | `get_weather_report` | Required | -| `tool_call.function.arguments` | string | If exists, the arguments to call a function call with for a given OpenAI response, denoted by ``. The value for `` starts with 0, where 0 is the first message. 
| `{"type": "object", "properties": {"some":"data"}}` | Required | +| [`llm.openai.choice.type`](../attributes-registry/llm.md) | string | The type of the choice, either `delta` or `message`. | `message` | Required | +| [`llm.openai.content`](../attributes-registry/llm.md) | string | The content for a given OpenAI response. | `Why did the developer stop using OpenTelemetry? Because they couldn't trace their steps!` | Required | +| [`llm.openai.function.arguments`](../attributes-registry/llm.md) | string | If exists, the arguments to call a function call with for a given OpenAI response, denoted by ``. The value for `` starts with 0, where 0 is the first message. | `{"type": "object", "properties": {"some":"data"}}` | Conditionally Required: [1] | +| [`llm.openai.function.name`](../attributes-registry/llm.md) | string | The name of the function to be called. | `get_weather` | Conditionally Required: [2] | +| [`llm.openai.role`](../attributes-registry/llm.md) | string | The role of the prompt author, can be one of `system`, `user`, `assistant`, or `tool` | `user` | Required | +| [`llm.openai.tool_call.id`](../attributes-registry/llm.md) | string | If role is `tool` or `function`, then this tool call that this message is responding to. | `get_current_weather` | Conditionally Required: [3] | +| [`llm.openai.tool_call.type`](../attributes-registry/llm.md) | string | The type of the tool. Currently, only `function` is supported. | `function` | Conditionally Required: [4] | +| [`llm.response.finish_reason`](../attributes-registry/llm.md) | string | The reason the model stopped generating tokens. | `stop` | Recommended | + +**[1]:** Required if the choice is the result of a tool call of type `function`. + +**[2]:** Required if the choice is the result of a tool call of type `function`. + +**[3]:** Required if the choice is the result of a tool call. + +**[4]:** Required if the choice is the result of a tool call. [DocumentStatus]: https://github.com/open-telemetry/opentelemetry-specification/tree/v1.22.0/specification/document-status.md \ No newline at end of file diff --git a/docs/attributes-registry/llm.md b/docs/attributes-registry/llm.md new file mode 100644 index 0000000000..8c203ba211 --- /dev/null +++ b/docs/attributes-registry/llm.md @@ -0,0 +1,125 @@ + + +# Large Language Model (LLM) + + + +- [Generic LLM Attributes](#generic-llm-attributes) + * [Request Attributes](#request-attributes) + * [Response Attributes](#response-attributes) + * [Event Attributes](#event-attributes) +- [OpenAI Attributes](#openai-attributes) + * [Request Attributes](#request-attributes-1) + * [Response Attributes](#response-attributes-1) + * [Event Attributes](#event-attributes-1) + + + +## Generic LLM Attributes + +### Request Attributes + + +| Attribute | Type | Description | Examples | +|---|---|---|---| +| `llm.request.max_tokens` | int | The maximum number of tokens the LLM generates for a request. | `100` | +| `llm.request.model` | string | The name of the LLM a request is being made to. | `gpt-4` | +| `llm.stop_sequences` | string | Array of strings the LLM uses as a stop sequence. | `stop1` | +| `llm.stream` | boolean | Whether the LLM responds with a stream. | `False` | +| `llm.temperature` | double | The temperature setting for the LLM request. | `0.0` | +| `llm.top_p` | double | The top_p sampling setting for the LLM request. | `1.0` | +| `llm.vendor` | string | The name of the LLM foundation model vendor, if applicable. 
| `openai` | + + +### Response Attributes + + +| Attribute | Type | Description | Examples | +|---|---|---|---| +| `llm.response.finish_reason` | string | The reason the model stopped generating tokens. | `stop` | +| `llm.response.id` | string | The unique identifier for the completion. | `chatcmpl-123` | +| `llm.response.model` | string | The name of the LLM a response is being made to. | `gpt-4-0613` | +| `llm.usage.completion_tokens` | int | The number of tokens used in the LLM response (completion). | `180` | +| `llm.usage.prompt_tokens` | int | The number of tokens used in the LLM prompt. | `100` | +| `llm.usage.total_tokens` | int | The total number of tokens used in the LLM prompt and response. | `280` | + + +### Event Attributes + + +| Attribute | Type | Description | Examples | +|---|---|---|---| +| `llm.completion` | string | The full response string from an LLM in a response. | `Why did the developer stop using OpenTelemetry? Because they couldnt trace their steps!` | +| `llm.prompt` | string | The full prompt string sent to an LLM in a request. | `\\n\\nHuman:You are an AI assistant that tells jokes. Can you tell me a joke about OpenTelemetry?\\n\\nAssistant:` | + + +## OpenAI Attributes + +### Request Attributes + + +| Attribute | Type | Description | Examples | +|---|---|---|---| +| `llm.openai.frequency_penalty` | double | If present, the `frequency_penalty` used in an OpenAI request. Value is between -2.0 and 2.0. | `-0.5` | +| `llm.openai.logit_bias` | string | If present, the JSON-encoded string of a `logit_bias` used in an OpenAI request | `{2435:-100, 640:-100}` | +| `llm.openai.presence_penalty` | double | If present, the `presence_penalty` used in an OpenAI request. Value is between -2.0 and 2.0. | `-0.5` | +| `llm.openai.response_format` | string | An object specifying the format that the model must output. Either `text` or `json_object` | `text` | +| `llm.openai.seed` | int | Seed used in request to improve determinism. | `1234` | +| `llm.openai.user` | string | If present, the `user` used in an OpenAI request. | `bob` | + +`llm.openai.response_format` MUST be one of the following: + +| Value | Description | +|---|---| +| `text` | text | +| `json_object` | json_object | + + +### Response Attributes + + +| Attribute | Type | Description | Examples | +|---|---|---|---| +| `llm.openai.created` | int | The UNIX timestamp (in seconds) if when the completion was created. | `1677652288` | +| `llm.openai.system_fingerprint` | string | This fingerprint represents the backend configuration that the model runs with. | `asdf987123` | + + +### Event Attributes + + +| Attribute | Type | Description | Examples | +|---|---|---|---| +| `llm.openai.choice.type` | string | The type of the choice, either `delta` or `message`. | `message` | +| `llm.openai.content` | string | The content for a given OpenAI response. | `Why did the developer stop using OpenTelemetry? Because they couldn't trace their steps!` | +| `llm.openai.finish_reason` | string | The reason the OpenAI model stopped generating tokens for this chunk. | `stop` | +| `llm.openai.function.arguments` | string | If exists, the arguments to call a function call with for a given OpenAI response, denoted by ``. The value for `` starts with 0, where 0 is the first message. | `{"type": "object", "properties": {"some":"data"}}` | +| `llm.openai.function.description` | string | A description of what the function does, used by the model to choose when and how to call the function. 
| `Gets the current weather for a location` | +| `llm.openai.function.name` | string | The name of the function to be called. | `get_weather` | +| `llm.openai.function.parameters` | string | JSON-encoded string of the parameter object for the function. | `{"type": "object", "properties": {}}` | +| `llm.openai.role` | string | The role of the prompt author, can be one of `system`, `user`, `assistant`, or `tool` | `user` | +| `llm.openai.tool_call.id` | string | If role is `tool` or `function`, then this tool call that this message is responding to. | `get_current_weather` | +| `llm.openai.tool_call.type` | string | The type of the tool. Currently, only `function` is supported. | `function` | + +`llm.openai.choice.type` MUST be one of the following: + +| Value | Description | +|---|---| +| `delta` | delta | +| `message` | message | + +`llm.openai.role` MUST be one of the following: + +| Value | Description | +|---|---| +| `system` | system | +| `user` | user | +| `assistant` | assistant | +| `tool` | tool | + +`llm.openai.tool_call.type` MUST be one of the following: + +| Value | Description | +|---|---| +| `function` | function | + \ No newline at end of file diff --git a/model/registry/llm.yaml b/model/registry/llm.yaml new file mode 100644 index 0000000000..60912165db --- /dev/null +++ b/model/registry/llm.yaml @@ -0,0 +1,194 @@ +groups: + - id: registry.llm + prefix: llm + type: attribute_group + brief: > + This document defines the attributes used to describe telemetry in the context of LLM (Large Language Models) requests and responses. + attributes: + - id: vendor + type: string + brief: The name of the LLM foundation model vendor, if applicable. + examples: 'openai' + tag: llm-generic-request + - id: request.model + type: string + brief: The name of the LLM a request is being made to. + examples: 'gpt-4' + tag: llm-generic-request + - id: request.max_tokens + type: int + brief: The maximum number of tokens the LLM generates for a request. + examples: [100] + tag: llm-generic-request + - id: temperature + type: double + brief: The temperature setting for the LLM request. + examples: [0.0] + tag: llm-generic-request + - id: top_p + type: double + brief: The top_p sampling setting for the LLM request. + examples: [1.0] + tag: llm-generic-request + - id: stream + type: boolean + brief: Whether the LLM responds with a stream. + examples: [false] + tag: llm-generic-request + - id: stop_sequences + type: string + brief: Array of strings the LLM uses as a stop sequence. + examples: ["stop1"] + tag: llm-generic-request + - id: response.id + type: string + brief: The unique identifier for the completion. + examples: ['chatcmpl-123'] + tag: llm-generic-response + - id: response.model + type: string + brief: The name of the LLM a response is being made to. + examples: ['gpt-4-0613'] + tag: llm-generic-response + - id: response.finish_reason + type: string + brief: The reason the model stopped generating tokens. + examples: ['stop'] + tag: llm-generic-response + - id: usage.prompt_tokens + type: int + brief: The number of tokens used in the LLM prompt. + examples: [100] + tag: llm-generic-response + - id: usage.completion_tokens + type: int + brief: The number of tokens used in the LLM response (completion). + examples: [180] + tag: llm-generic-response + - id: usage.total_tokens + type: int + brief: The total number of tokens used in the LLM prompt and response. 
+ examples: [280] + tag: llm-generic-response + - id: prompt + type: string + brief: The full prompt string sent to an LLM in a request. + examples: ['\\n\\nHuman:You are an AI assistant that tells jokes. Can you tell me a joke about OpenTelemetry?\\n\\nAssistant:'] + tag: llm-generic-events + - id: completion + type: string + brief: The full response string from an LLM in a response. + examples: ['Why did the developer stop using OpenTelemetry? Because they couldnt trace their steps!'] + tag: llm-generic-events + - id: openai.presence_penalty + type: double + brief: If present, the `presence_penalty` used in an OpenAI request. Value is between -2.0 and 2.0. + examples: -0.5 + tag: tech-specific-openai-request + - id: openai.frequency_penalty + type: double + brief: If present, the `frequency_penalty` used in an OpenAI request. Value is between -2.0 and 2.0. + examples: -0.5 + tag: tech-specific-openai-request + - id: openai.logit_bias + type: string + brief: If present, the JSON-encoded string of a `logit_bias` used in an OpenAI request + examples: ['{2435:-100, 640:-100}'] + tag: tech-specific-openai-request + - id: openai.user + type: string + brief: If present, the `user` used in an OpenAI request. + examples: ['bob'] + tag: tech-specific-openai-request + - id: openai.response_format + type: + members: + - id: text + value: 'text' + - id: json_object + value: 'json_object' + brief: An object specifying the format that the model must output. Either `text` or `json_object` + examples: 'text' + tag: tech-specific-openai-request + - id: openai.seed + type: int + brief: Seed used in request to improve determinism. + examples: 1234 + tag: tech-specific-openai-request + - id: openai.created + type: int + brief: The UNIX timestamp (in seconds) if when the completion was created. + examples: 1677652288 + tag: tech-specific-openai-response + - id: openai.system_fingerprint + type: string + brief: This fingerprint represents the backend configuration that the model runs with. + examples: 'asdf987123' + tag: tech-specific-openai-response + - id: openai.role + type: + members: + - id: system + value: 'system' + - id: user + value: 'user' + - id: assistant + value: 'assistant' + - id: tool + value: 'tool' + brief: The role of the prompt author, can be one of `system`, `user`, `assistant`, or `tool` + examples: 'user' + tag: tech-specific-openai-events + - id: openai.content + type: string + brief: The content for a given OpenAI response. + examples: 'Why did the developer stop using OpenTelemetry? Because they couldn''t trace their steps!' + tag: tech-specific-openai-events + - id: openai.function.name + type: string + brief: The name of the function to be called. + examples: 'get_weather' + tag: tech-specific-openai-events + - id: openai.function.description + type: string + brief: A description of what the function does, used by the model to choose when and how to call the function. + examples: 'Gets the current weather for a location' + tag: tech-specific-openai-events + - id: openai.function.parameters + type: string + brief: JSON-encoded string of the parameter object for the function. + examples: '{"type": "object", "properties": {}}' + tag: tech-specific-openai-events + - id: openai.function.arguments + type: string + brief: If exists, the arguments to call a function call with for a given OpenAI response, denoted by ``. The value for `` starts with 0, where 0 is the first message. 
+ examples: '{"type": "object", "properties": {"some":"data"}}' + tag: tech-specific-openai-events + - id: openai.finish_reason + type: string + brief: The reason the OpenAI model stopped generating tokens for this chunk. + examples: 'stop' + tag: tech-specific-openai-events + - id: openai.tool_call.id + type: string + brief: If role is `tool` or `function`, then this tool call that this message is responding to. + examples: 'get_current_weather' + tag: tech-specific-openai-events + - id: openai.tool_call.type + type: + members: + - id: function + value: 'function' + brief: The type of the tool. Currently, only `function` is supported. + examples: 'function' + tag: tech-specific-openai-events + - id: openai.choice.type + type: + members: + - id: delta + value: 'delta' + - id: message + value: 'message' + brief: The type of the choice, either `delta` or `message`. + examples: 'message' + tag: tech-specific-openai-events \ No newline at end of file diff --git a/model/trace/llm.yaml b/model/trace/llm.yaml new file mode 100644 index 0000000000..a4ee102374 --- /dev/null +++ b/model/trace/llm.yaml @@ -0,0 +1,164 @@ +groups: + - id: llm.request + type: span + brief: > + A request to an LLM is modeled as a span in a trace. The span name should be a low cardinality value representing the request made to an LLM, like the name of the API endpoint being called. + attributes: + - ref: llm.vendor + requirement_level: recommended + note: > + The name of the LLM foundation model vendor, if applicable. If not using a vendor-supplied model, this field is left blank. + - ref: llm.request.model + requirement_level: required + note: > + The name of the LLM a request is being made to. If the LLM is supplied by a vendor, then the value must be the exact name of the model requested. If the LLM is a fine-tuned custom model, the value should have a more specific name than the base model that's been fine-tuned. + - ref: llm.request.max_tokens + requirement_level: recommended + - ref: llm.temperature + requirement_level: recommended + - ref: llm.top_p + requirement_level: recommended + - ref: llm.stream + requirement_level: recommended + - ref: llm.stop_sequences + requirement_level: recommended + + - id: llm.response + type: span + brief: > + These attributes track output data and metadata for a response from an LLM. Each attribute represents a concept that is common to most LLMs. + attributes: + - ref: llm.response.id + requirement_level: recommended + - ref: llm.response.model + requirement_level: required + note: > + The name of the LLM a response is being made to. If the LLM is supplied by a vendor, then the value must be the exact name of the model actually used. If the LLM is a fine-tuned custom model, the value should have a more specific name than the base model that's been fine-tuned. + - ref: llm.response.finish_reason + requirement_level: recommended + - ref: llm.usage.prompt_tokens + requirement_level: recommended + - ref: llm.usage.completion_tokens + requirement_level: recommended + - ref: llm.usage.total_tokens + requirement_level: recommended + + - id: llm.events + type: span + brief: > + In the lifetime of an LLM span, events for prompts sent and completions received may be created, depending on the configuration of the instrumentation. + attributes: + - ref: llm.prompt + requirement_level: recommended + note: > + The full prompt string sent to an LLM in a request. 
If the LLM accepts a more complex input like a JSON object, this field is blank, and the response is instead captured in an event determined by the specific LLM technology semantic convention. + - ref: llm.completion + requirement_level: recommended + note: > + The full response string from an LLM. If the LLM responds with a more complex output like a JSON object made up of several pieces (such as OpenAI's message choices), this field is the content of the response. If the LLM produces multiple responses, then this field is left blank, and each response is instead captured in an event determined by the specific LLM technology semantic convention. + + - id: llm.openai + type: span + brief: > + These are the attributes when instrumenting OpenAI LLM requests with the `/chat/completions` endpoint. + attributes: + - ref: llm.vendor + requirement_level: recommended + examples: ['openai', 'microsoft'] + tag: tech-specific-openai-request + - ref: llm.request.model + requirement_level: required + note: > + The name of the LLM a request is being made to. If the LLM is supplied by a vendor, then the value must be the exact name of the model requested. If the LLM is a fine-tuned custom model, the value should have a more specific name than the base model that's been fine-tuned. + tag: tech-specific-openai-request + - ref: llm.request.max_tokens + tag: tech-specific-openai-request + - ref: llm.temperature + tag: tech-specific-openai-request + - ref: llm.top_p + tag: tech-specific-openai-request + - ref: llm.stream + tag: tech-specific-openai-request + - ref: llm.stop_sequences + tag: tech-specific-openai-request + - ref: llm.openai.presence_penalty + tag: tech-specific-openai-request + - ref: llm.openai.logit_bias + tag: tech-specific-openai-request + - ref: llm.openai.user + tag: tech-specific-openai-request + - ref: llm.openai.response_format + tag: tech-specific-openai-request + - ref: llm.openai.seed + tag: tech-specific-openai-response + - ref: llm.response.id + tag: tech-specific-openai-response + - ref: llm.response.finish_reason + tag: tech-specific-openai-response + - ref: llm.usage.prompt_tokens + tag: tech-specific-openai-response + - ref: llm.usage.completion_tokens + tag: tech-specific-openai-response + - ref: llm.usage.total_tokens + tag: tech-specific-openai-response + - ref: llm.openai.created + tag: tech-specific-openai-response + - ref: llm.openai.system_fingerprint + tag: tech-sepecifc-openai-response + + - id: llm.openai.prompt + type: span + brief: > + These are the attributes when instrumenting OpenAI LLM requests and recording prompts in the request. + attributes: + - ref: llm.openai.role + requirement_level: required + - ref: llm.openai.content + requirement_level: required + - ref: llm.openai.tool_call.id + requirement_level: + conditionally_required: > + Required if the prompt role is `tool`. + + - id: llm.openai.tool + type: span + brief: > + These are the attributes when instrumenting OpenAI LLM requests that specify tools (or functions) the LLM can use. + attributes: + - ref: llm.openai.tool_call.type + requirement_level: required + - ref: llm.openai.function.name + requirement_level: required + - ref: llm.openai.function.description + requirement_level: required + - ref: llm.openai.function.parameters + requirement_level: required + + - id: llm.openai.choice + type: span + brief: > + These are the attributes when instrumenting OpenAI LLM requests and recording choices in the result. 
+ attributes: + - ref: llm.openai.choice.type + requirement_level: required + - ref: llm.response.finish_reason + - ref: llm.openai.role + requirement_level: required + - ref: llm.openai.content + requirement_level: required + - ref: llm.openai.tool_call.id + requirement_level: + conditionally_required: > + Required if the choice is the result of a tool call. + - ref: llm.openai.tool_call.type + requirement_level: + conditionally_required: > + Required if the choice is the result of a tool call. + - ref: llm.openai.function.name + requirement_level: + conditionally_required: > + Required if the choice is the result of a tool call of type `function`. + - ref: llm.openai.function.arguments + requirement_level: + conditionally_required: > + Required if the choice is the result of a tool call of type `function`. \ No newline at end of file From 5843c65f3c66f6c76c898f6d1ad84f8b8eb76d0f Mon Sep 17 00:00:00 2001 From: Nir Gazit Date: Tue, 23 Jan 2024 19:30:32 +0100 Subject: [PATCH 03/36] chore: fixes in yaml according to reviews --- docs/ai/llm-spans.md | 87 --------------------- docs/ai/openai.md | 133 -------------------------------- docs/attributes-registry/llm.md | 33 ++++---- model/registry/llm.yaml | 59 +++++++------- model/trace/llm.yaml | 78 +++++++++++-------- 5 files changed, 88 insertions(+), 302 deletions(-) delete mode 100644 docs/ai/llm-spans.md delete mode 100644 docs/ai/openai.md diff --git a/docs/ai/llm-spans.md b/docs/ai/llm-spans.md deleted file mode 100644 index 9ed8134795..0000000000 --- a/docs/ai/llm-spans.md +++ /dev/null @@ -1,87 +0,0 @@ - - -# Semantic Conventions for LLM Spans - -**Status**: [Experimental][DocumentStatus] - - - - - -- [Configuration](#configuration) -- [LLM Request attributes](#llm-request-attributes) -- [LLM Response attributes](#llm-response-attributes) -- [LLM Span Events](#llm-span-events) - - - -A request to an LLM is modeled as a span in a trace. - -The **span name** SHOULD be set to a low cardinality value representing the request made to an LLM. -It MAY be a name of the API endpoint for the LLM being called. - -## Configuration - -Instrumentations for LLMs MUST offer the ability to turn off capture of prompts and completions. This is for three primary reasons: - -1. Data privacy concerns. End users of LLM applications may input sensitive information or personally identifiable information (PII) that they do not wish to be sent to a telemetry backend. -2. Data size concerns. Although there is no specified limit to sizes, there are practical limitations in programming languages and telemety systems. Some LLMs allow for extremely large context windows that end users may take full advantage of. -3. Performance concerns. Sending large amounts of data to a telemetry backend may cause performance issues for the application. - -By default, these configurations SHOULD NOT capture prompts and completions. - -## LLM Request attributes - -These attributes track input data and metadata for a request to an LLM. Each attribute represents a concept that is common to most LLMs. - - -| Attribute | Type | Description | Examples | Requirement Level | -|---|---|---|---|---| -| [`llm.request.max_tokens`](../attributes-registry/llm.md) | int | The maximum number of tokens the LLM generates for a request. | `100` | Recommended | -| [`llm.request.model`](../attributes-registry/llm.md) | string | The name of the LLM a request is being made to. 
[1] | `gpt-4` | Required | -| [`llm.stop_sequences`](../attributes-registry/llm.md) | string | Array of strings the LLM uses as a stop sequence. | `stop1` | Recommended | -| [`llm.stream`](../attributes-registry/llm.md) | boolean | Whether the LLM responds with a stream. | `False` | Recommended | -| [`llm.temperature`](../attributes-registry/llm.md) | double | The temperature setting for the LLM request. | `0.0` | Recommended | -| [`llm.top_p`](../attributes-registry/llm.md) | double | The top_p sampling setting for the LLM request. | `1.0` | Recommended | -| [`llm.vendor`](../attributes-registry/llm.md) | string | The name of the LLM foundation model vendor, if applicable. [2] | `openai` | Recommended | - -**[1]:** The name of the LLM a request is being made to. If the LLM is supplied by a vendor, then the value must be the exact name of the model requested. If the LLM is a fine-tuned custom model, the value should have a more specific name than the base model that's been fine-tuned. - -**[2]:** The name of the LLM foundation model vendor, if applicable. If not using a vendor-supplied model, this field is left blank. - - -## LLM Response attributes - -These attributes track output data and metadata for a response from an LLM. Each attribute represents a concept that is common to most LLMs. - - -| Attribute | Type | Description | Examples | Requirement Level | -|---|---|---|---|---| -| [`llm.response.finish_reason`](../attributes-registry/llm.md) | string | The reason the model stopped generating tokens. | `stop` | Recommended | -| [`llm.response.id`](../attributes-registry/llm.md) | string | The unique identifier for the completion. | `chatcmpl-123` | Recommended | -| [`llm.response.model`](../attributes-registry/llm.md) | string | The name of the LLM a response is being made to. [1] | `gpt-4-0613` | Required | -| [`llm.usage.completion_tokens`](../attributes-registry/llm.md) | int | The number of tokens used in the LLM response (completion). | `180` | Recommended | -| [`llm.usage.prompt_tokens`](../attributes-registry/llm.md) | int | The number of tokens used in the LLM prompt. | `100` | Recommended | -| [`llm.usage.total_tokens`](../attributes-registry/llm.md) | int | The total number of tokens used in the LLM prompt and response. | `280` | Recommended | - -**[1]:** The name of the LLM a response is being made to. If the LLM is supplied by a vendor, then the value must be the exact name of the model actually used. If the LLM is a fine-tuned custom model, the value should have a more specific name than the base model that's been fine-tuned. - - -## LLM Span Events - -In the lifetime of an LLM span, an event for prompts sent and completions received MAY be created, depending on the configuration of the instrumentation. - - -| Attribute | Type | Description | Examples | Requirement Level | -|---|---|---|---|---| -| [`llm.completion`](../attributes-registry/llm.md) | string | The full response string from an LLM in a response. [1] | `Why did the developer stop using OpenTelemetry? Because they couldnt trace their steps!` | Recommended | -| [`llm.prompt`](../attributes-registry/llm.md) | string | The full prompt string sent to an LLM in a request. [2] | `\\n\\nHuman:You are an AI assistant that tells jokes. Can you tell me a joke about OpenTelemetry?\\n\\nAssistant:` | Recommended | - -**[1]:** The full response string from an LLM. 
If the LLM responds with a more complex output like a JSON object made up of several pieces (such as OpenAI's message choices), this field is the content of the response. If the LLM produces multiple responses, then this field is left blank, and each response is instead captured in an event determined by the specific LLM technology semantic convention. - -**[2]:** The full prompt string sent to an LLM in a request. If the LLM accepts a more complex input like a JSON object, this field is blank, and the response is instead captured in an event determined by the specific LLM technology semantic convention. - - -[DocumentStatus]: https://github.com/open-telemetry/opentelemetry-specification/tree/v1.22.0/specification/document-status.md \ No newline at end of file diff --git a/docs/ai/openai.md b/docs/ai/openai.md deleted file mode 100644 index 7dcbee5a6d..0000000000 --- a/docs/ai/openai.md +++ /dev/null @@ -1,133 +0,0 @@ - - -# Semantic Conventions for OpenAI Spans - -**Status**: [Experimental][DocumentStatus] - -This document outlines the Semantic Conventions specific to -[OpenAI](https://platform.openai.com/) spans, extending the general semantics -found in the [LLM Semantic Conventions](llm-spans.md). These conventions are -designed to standardize telemetry data for OpenAI interactions, particularly -focusing on the `/chat/completions` endpoint. By following to these guidelines, -developers can ensure consistent, meaningful, and easily interpretable telemetry -data across different applications and platforms. - - - -- [Chat Completions](#chat-completions) - * [Request Attributes](#request-attributes) - * [Response attributes](#response-attributes) -- [OpenAI Span Events](#openai-span-events) - * [Prompt Events](#prompt-events) - * [Tools Events](#tools-events) - * [Choice Events](#choice-events) - - - -## Chat Completions - -The span name for OpenAI chat completions SHOULD be `openai.chat` -to maintain consistency and clarity in telemetry data. - -### Request Attributes - -These are the attributes when instrumenting OpenAI LLM requests with the -`/chat/completions` endpoint. - - -| Attribute | Type | Description | Examples | Requirement Level | -|---|---|---|---|---| -| [`llm.openai.logit_bias`](../attributes-registry/llm.md) | string | If present, the JSON-encoded string of a `logit_bias` used in an OpenAI request | `{2435:-100, 640:-100}` | Recommended | -| [`llm.openai.presence_penalty`](../attributes-registry/llm.md) | double | If present, the `presence_penalty` used in an OpenAI request. Value is between -2.0 and 2.0. | `-0.5` | Recommended | -| [`llm.openai.response_format`](../attributes-registry/llm.md) | string | An object specifying the format that the model must output. Either `text` or `json_object` | `text` | Recommended | -| [`llm.openai.user`](../attributes-registry/llm.md) | string | If present, the `user` used in an OpenAI request. | `bob` | Recommended | -| [`llm.request.max_tokens`](../attributes-registry/llm.md) | int | The maximum number of tokens the LLM generates for a request. | `100` | Recommended | -| [`llm.request.model`](../attributes-registry/llm.md) | string | The name of the LLM a request is being made to. [1] | `gpt-4` | Required | -| [`llm.stop_sequences`](../attributes-registry/llm.md) | string | Array of strings the LLM uses as a stop sequence. | `stop1` | Recommended | -| [`llm.stream`](../attributes-registry/llm.md) | boolean | Whether the LLM responds with a stream. 
| `False` | Recommended | -| [`llm.temperature`](../attributes-registry/llm.md) | double | The temperature setting for the LLM request. | `0.0` | Recommended | -| [`llm.top_p`](../attributes-registry/llm.md) | double | The top_p sampling setting for the LLM request. | `1.0` | Recommended | -| [`llm.vendor`](../attributes-registry/llm.md) | string | The name of the LLM foundation model vendor, if applicable. | `openai`; `microsoft` | Recommended | - -**[1]:** The name of the LLM a request is being made to. If the LLM is supplied by a vendor, then the value must be the exact name of the model requested. If the LLM is a fine-tuned custom model, the value should have a more specific name than the base model that's been fine-tuned. - - -### Response attributes - -Attributes for chat completion responses SHOULD follow these conventions: - - -| Attribute | Type | Description | Examples | Requirement Level | -|---|---|---|---|---| -| [`llm.openai.created`](../attributes-registry/llm.md) | int | The UNIX timestamp (in seconds) if when the completion was created. | `1677652288` | Recommended | -| [`llm.openai.seed`](../attributes-registry/llm.md) | int | Seed used in request to improve determinism. | `1234` | Recommended | -| [`llm.response.finish_reason`](../attributes-registry/llm.md) | string | The reason the model stopped generating tokens. | `stop` | Recommended | -| [`llm.response.id`](../attributes-registry/llm.md) | string | The unique identifier for the completion. | `chatcmpl-123` | Recommended | -| [`llm.usage.completion_tokens`](../attributes-registry/llm.md) | int | The number of tokens used in the LLM response (completion). | `180` | Recommended | -| [`llm.usage.prompt_tokens`](../attributes-registry/llm.md) | int | The number of tokens used in the LLM prompt. | `100` | Recommended | -| [`llm.usage.total_tokens`](../attributes-registry/llm.md) | int | The total number of tokens used in the LLM prompt and response. | `280` | Recommended | - - -## OpenAI Span Events - -In the lifetime of an LLM span, an event for prompts sent and completions received MAY be created, depending on the configuration of the instrumentation. -Because OpenAI uses a more complex prompt structure, these events will be used instead of the generic ones detailed in the [LLM Semantic Conventions](llm-spans.md). - -### Prompt Events - -Prompt event name SHOULD be `llm.openai.prompt`. - - -| Attribute | Type | Description | Examples | Requirement Level | -|---|---|---|---|---| -| [`llm.openai.content`](../attributes-registry/llm.md) | string | The content for a given OpenAI response. | `Why did the developer stop using OpenTelemetry? Because they couldn't trace their steps!` | Required | -| [`llm.openai.role`](../attributes-registry/llm.md) | string | The role of the prompt author, can be one of `system`, `user`, `assistant`, or `tool` | `user` | Required | -| [`llm.openai.tool_call.id`](../attributes-registry/llm.md) | string | If role is `tool` or `function`, then this tool call that this message is responding to. | `get_current_weather` | Conditionally Required: Required if the prompt role is `tool`. | - - -### Tools Events - -Tools event name SHOULD be `llm.openai.tool`, specifying potential tools or functions the LLM can use. - - -| Attribute | Type | Description | Examples | Requirement Level | -|---|---|---|---|---| -| [`llm.openai.function.description`](../attributes-registry/llm.md) | string | A description of what the function does, used by the model to choose when and how to call the function. 
| `Gets the current weather for a location` | Required | -| [`llm.openai.function.name`](../attributes-registry/llm.md) | string | The name of the function to be called. | `get_weather` | Required | -| [`llm.openai.function.parameters`](../attributes-registry/llm.md) | string | JSON-encoded string of the parameter object for the function. | `{"type": "object", "properties": {}}` | Required | -| [`llm.openai.tool_call.type`](../attributes-registry/llm.md) | string | The type of the tool. Currently, only `function` is supported. | `function` | Required | - - -### Choice Events - -Recording details about Choices in each response MAY be included as -Span Events. - -Choice event name SHOULD be `llm.openai.choice`. - -If there is more than one `choice`, separate events SHOULD be used. - - -| Attribute | Type | Description | Examples | Requirement Level | -|---|---|---|---|---| -| [`llm.openai.choice.type`](../attributes-registry/llm.md) | string | The type of the choice, either `delta` or `message`. | `message` | Required | -| [`llm.openai.content`](../attributes-registry/llm.md) | string | The content for a given OpenAI response. | `Why did the developer stop using OpenTelemetry? Because they couldn't trace their steps!` | Required | -| [`llm.openai.function.arguments`](../attributes-registry/llm.md) | string | If exists, the arguments to call a function call with for a given OpenAI response, denoted by ``. The value for `` starts with 0, where 0 is the first message. | `{"type": "object", "properties": {"some":"data"}}` | Conditionally Required: [1] | -| [`llm.openai.function.name`](../attributes-registry/llm.md) | string | The name of the function to be called. | `get_weather` | Conditionally Required: [2] | -| [`llm.openai.role`](../attributes-registry/llm.md) | string | The role of the prompt author, can be one of `system`, `user`, `assistant`, or `tool` | `user` | Required | -| [`llm.openai.tool_call.id`](../attributes-registry/llm.md) | string | If role is `tool` or `function`, then this tool call that this message is responding to. | `get_current_weather` | Conditionally Required: [3] | -| [`llm.openai.tool_call.type`](../attributes-registry/llm.md) | string | The type of the tool. Currently, only `function` is supported. | `function` | Conditionally Required: [4] | -| [`llm.response.finish_reason`](../attributes-registry/llm.md) | string | The reason the model stopped generating tokens. | `stop` | Recommended | - -**[1]:** Required if the choice is the result of a tool call of type `function`. - -**[2]:** Required if the choice is the result of a tool call of type `function`. - -**[3]:** Required if the choice is the result of a tool call. - -**[4]:** Required if the choice is the result of a tool call. - - -[DocumentStatus]: https://github.com/open-telemetry/opentelemetry-specification/tree/v1.22.0/specification/document-status.md \ No newline at end of file diff --git a/docs/attributes-registry/llm.md b/docs/attributes-registry/llm.md index 8c203ba211..5dfb91d272 100644 --- a/docs/attributes-registry/llm.md +++ b/docs/attributes-registry/llm.md @@ -25,11 +25,11 @@ |---|---|---|---| | `llm.request.max_tokens` | int | The maximum number of tokens the LLM generates for a request. | `100` | | `llm.request.model` | string | The name of the LLM a request is being made to. | `gpt-4` | -| `llm.stop_sequences` | string | Array of strings the LLM uses as a stop sequence. | `stop1` | -| `llm.stream` | boolean | Whether the LLM responds with a stream. 
| `False` | -| `llm.temperature` | double | The temperature setting for the LLM request. | `0.0` | -| `llm.top_p` | double | The top_p sampling setting for the LLM request. | `1.0` | -| `llm.vendor` | string | The name of the LLM foundation model vendor, if applicable. | `openai` | +| `llm.request.stop_sequences` | string | Array of strings the LLM uses as a stop sequence. | `stop1` | +| `llm.request.stream` | boolean | Whether the LLM responds with a stream. | `False` | +| `llm.request.temperature` | double | The temperature setting for the LLM request. | `0.0` | +| `llm.request.top_p` | double | The top_p sampling setting for the LLM request. | `1.0` | +| `llm.request.vendor` | string | The name of the LLM foundation model vendor, if applicable. | `openai` | ### Response Attributes @@ -61,14 +61,14 @@ | Attribute | Type | Description | Examples | |---|---|---|---| -| `llm.openai.frequency_penalty` | double | If present, the `frequency_penalty` used in an OpenAI request. Value is between -2.0 and 2.0. | `-0.5` | -| `llm.openai.logit_bias` | string | If present, the JSON-encoded string of a `logit_bias` used in an OpenAI request | `{2435:-100, 640:-100}` | -| `llm.openai.presence_penalty` | double | If present, the `presence_penalty` used in an OpenAI request. Value is between -2.0 and 2.0. | `-0.5` | -| `llm.openai.response_format` | string | An object specifying the format that the model must output. Either `text` or `json_object` | `text` | -| `llm.openai.seed` | int | Seed used in request to improve determinism. | `1234` | -| `llm.openai.user` | string | If present, the `user` used in an OpenAI request. | `bob` | +| `llm.request.openai.frequency_penalty` | double | If present, the `frequency_penalty` used in an OpenAI request. Value is between -2.0 and 2.0. | `-0.5` | +| `llm.request.openai.logit_bias` | string | If present, the JSON-encoded string of a `logit_bias` used in an OpenAI request | `{2435:-100, 640:-100}` | +| `llm.request.openai.presence_penalty` | double | If present, the `presence_penalty` used in an OpenAI request. Value is between -2.0 and 2.0. | `-0.5` | +| `llm.request.openai.response_format` | string | An object specifying the format that the model must output. Either `text` or `json_object` | `text` | +| `llm.request.openai.seed` | int | Seed used in request to improve determinism. | `1234` | +| `llm.request.openai.user` | string | If present, the `user` used in an OpenAI request. | `bob` | -`llm.openai.response_format` MUST be one of the following: +`llm.request.openai.response_format` MUST be one of the following: | Value | Description | |---|---| @@ -81,8 +81,8 @@ | Attribute | Type | Description | Examples | |---|---|---|---| -| `llm.openai.created` | int | The UNIX timestamp (in seconds) if when the completion was created. | `1677652288` | -| `llm.openai.system_fingerprint` | string | This fingerprint represents the backend configuration that the model runs with. | `asdf987123` | +| `llm.response.openai.created` | int | The UNIX timestamp (in seconds) if when the completion was created. | `1677652288` | +| `llm.response.openai.system_fingerprint` | string | This fingerprint represents the backend configuration that the model runs with. | `asdf987123` | ### Event Attributes @@ -92,14 +92,13 @@ |---|---|---|---| | `llm.openai.choice.type` | string | The type of the choice, either `delta` or `message`. | `message` | | `llm.openai.content` | string | The content for a given OpenAI response. | `Why did the developer stop using OpenTelemetry? 
Because they couldn't trace their steps!` | -| `llm.openai.finish_reason` | string | The reason the OpenAI model stopped generating tokens for this chunk. | `stop` | | `llm.openai.function.arguments` | string | If exists, the arguments to call a function call with for a given OpenAI response, denoted by ``. The value for `` starts with 0, where 0 is the first message. | `{"type": "object", "properties": {"some":"data"}}` | | `llm.openai.function.description` | string | A description of what the function does, used by the model to choose when and how to call the function. | `Gets the current weather for a location` | | `llm.openai.function.name` | string | The name of the function to be called. | `get_weather` | | `llm.openai.function.parameters` | string | JSON-encoded string of the parameter object for the function. | `{"type": "object", "properties": {}}` | | `llm.openai.role` | string | The role of the prompt author, can be one of `system`, `user`, `assistant`, or `tool` | `user` | +| `llm.openai.tool.type` | string | The type of the tool. Currently, only `function` is supported. | `function` | | `llm.openai.tool_call.id` | string | If role is `tool` or `function`, then this tool call that this message is responding to. | `get_current_weather` | -| `llm.openai.tool_call.type` | string | The type of the tool. Currently, only `function` is supported. | `function` | `llm.openai.choice.type` MUST be one of the following: @@ -117,7 +116,7 @@ | `assistant` | assistant | | `tool` | tool | -`llm.openai.tool_call.type` MUST be one of the following: +`llm.openai.tool.type` MUST be one of the following: | Value | Description | |---|---| diff --git a/model/registry/llm.yaml b/model/registry/llm.yaml index 60912165db..9bb6ee669b 100644 --- a/model/registry/llm.yaml +++ b/model/registry/llm.yaml @@ -5,7 +5,7 @@ groups: brief: > This document defines the attributes used to describe telemetry in the context of LLM (Large Language Models) requests and responses. attributes: - - id: vendor + - id: request.vendor type: string brief: The name of the LLM foundation model vendor, if applicable. examples: 'openai' @@ -20,22 +20,22 @@ groups: brief: The maximum number of tokens the LLM generates for a request. examples: [100] tag: llm-generic-request - - id: temperature + - id: request.temperature type: double brief: The temperature setting for the LLM request. examples: [0.0] tag: llm-generic-request - - id: top_p + - id: request.top_p type: double brief: The top_p sampling setting for the LLM request. examples: [1.0] tag: llm-generic-request - - id: stream + - id: request.stream type: boolean brief: Whether the LLM responds with a stream. examples: [false] tag: llm-generic-request - - id: stop_sequences + - id: request.stop_sequences type: string brief: Array of strings the LLM uses as a stop sequence. examples: ["stop1"] @@ -80,27 +80,27 @@ groups: brief: The full response string from an LLM in a response. examples: ['Why did the developer stop using OpenTelemetry? Because they couldnt trace their steps!'] tag: llm-generic-events - - id: openai.presence_penalty + - id: request.openai.presence_penalty type: double brief: If present, the `presence_penalty` used in an OpenAI request. Value is between -2.0 and 2.0. examples: -0.5 tag: tech-specific-openai-request - - id: openai.frequency_penalty + - id: request.openai.frequency_penalty type: double brief: If present, the `frequency_penalty` used in an OpenAI request. Value is between -2.0 and 2.0. 
examples: -0.5 tag: tech-specific-openai-request - - id: openai.logit_bias + - id: request.openai.logit_bias type: string brief: If present, the JSON-encoded string of a `logit_bias` used in an OpenAI request examples: ['{2435:-100, 640:-100}'] tag: tech-specific-openai-request - - id: openai.user + - id: request.openai.user type: string brief: If present, the `user` used in an OpenAI request. examples: ['bob'] tag: tech-specific-openai-request - - id: openai.response_format + - id: request.openai.response_format type: members: - id: text @@ -110,17 +110,17 @@ groups: brief: An object specifying the format that the model must output. Either `text` or `json_object` examples: 'text' tag: tech-specific-openai-request - - id: openai.seed + - id: request.openai.seed type: int brief: Seed used in request to improve determinism. examples: 1234 tag: tech-specific-openai-request - - id: openai.created + - id: response.openai.created type: int brief: The UNIX timestamp (in seconds) if when the completion was created. examples: 1677652288 tag: tech-specific-openai-response - - id: openai.system_fingerprint + - id: response.openai.system_fingerprint type: string brief: This fingerprint represents the backend configuration that the model runs with. examples: 'asdf987123' @@ -139,10 +139,13 @@ groups: brief: The role of the prompt author, can be one of `system`, `user`, `assistant`, or `tool` examples: 'user' tag: tech-specific-openai-events - - id: openai.content - type: string - brief: The content for a given OpenAI response. - examples: 'Why did the developer stop using OpenTelemetry? Because they couldn''t trace their steps!' + - id: openai.tool.type + type: + members: + - id: function + value: 'function' + brief: The type of the tool. Currently, only `function` is supported. + examples: 'function' tag: tech-specific-openai-events - id: openai.function.name type: string @@ -159,28 +162,20 @@ groups: brief: JSON-encoded string of the parameter object for the function. examples: '{"type": "object", "properties": {}}' tag: tech-specific-openai-events - - id: openai.function.arguments - type: string - brief: If exists, the arguments to call a function call with for a given OpenAI response, denoted by ``. The value for `` starts with 0, where 0 is the first message. - examples: '{"type": "object", "properties": {"some":"data"}}' - tag: tech-specific-openai-events - - id: openai.finish_reason + - id: openai.content type: string - brief: The reason the OpenAI model stopped generating tokens for this chunk. - examples: 'stop' + brief: The content for a given OpenAI response. + examples: 'Why did the developer stop using OpenTelemetry? Because they couldn''t trace their steps!' tag: tech-specific-openai-events - id: openai.tool_call.id type: string brief: If role is `tool` or `function`, then this tool call that this message is responding to. examples: 'get_current_weather' tag: tech-specific-openai-events - - id: openai.tool_call.type - type: - members: - - id: function - value: 'function' - brief: The type of the tool. Currently, only `function` is supported. - examples: 'function' + - id: openai.function.arguments + type: string + brief: If exists, the arguments to call a function call with for a given OpenAI response, denoted by ``. The value for `` starts with 0, where 0 is the first message. 
+ examples: '{"type": "object", "properties": {"some":"data"}}' tag: tech-specific-openai-events - id: openai.choice.type type: diff --git a/model/trace/llm.yaml b/model/trace/llm.yaml index a4ee102374..4df11e1b5c 100644 --- a/model/trace/llm.yaml +++ b/model/trace/llm.yaml @@ -4,7 +4,7 @@ groups: brief: > A request to an LLM is modeled as a span in a trace. The span name should be a low cardinality value representing the request made to an LLM, like the name of the API endpoint being called. attributes: - - ref: llm.vendor + - ref: llm.request.vendor requirement_level: recommended note: > The name of the LLM foundation model vendor, if applicable. If not using a vendor-supplied model, this field is left blank. @@ -14,20 +14,14 @@ groups: The name of the LLM a request is being made to. If the LLM is supplied by a vendor, then the value must be the exact name of the model requested. If the LLM is a fine-tuned custom model, the value should have a more specific name than the base model that's been fine-tuned. - ref: llm.request.max_tokens requirement_level: recommended - - ref: llm.temperature + - ref: llm.request.temperature requirement_level: recommended - - ref: llm.top_p + - ref: llm.request.top_p requirement_level: recommended - - ref: llm.stream + - ref: llm.request.stream requirement_level: recommended - - ref: llm.stop_sequences + - ref: llm.request.stop_sequences requirement_level: recommended - - - id: llm.response - type: span - brief: > - These attributes track output data and metadata for a response from an LLM. Each attribute represents a concept that is common to most LLMs. - attributes: - ref: llm.response.id requirement_level: recommended - ref: llm.response.model @@ -42,9 +36,13 @@ groups: requirement_level: recommended - ref: llm.usage.total_tokens requirement_level: recommended + events: + - llm.content.prompt + - llm.content.completion - - id: llm.events - type: span + - id: llm.content.prompt + name: llm.content.prompt + type: event brief: > In the lifetime of an LLM span, events for prompts sent and completions received may be created, depending on the configuration of the instrumentation. attributes: @@ -52,6 +50,13 @@ groups: requirement_level: recommended note: > The full prompt string sent to an LLM in a request. If the LLM accepts a more complex input like a JSON object, this field is blank, and the response is instead captured in an event determined by the specific LLM technology semantic convention. + + - id: llm.content.completion + name: llm.content.completion + type: event + brief: > + In the lifetime of an LLM span, events for prompts sent and completions received may be created, depending on the configuration of the instrumentation. + attributes: - ref: llm.completion requirement_level: recommended note: > @@ -62,7 +67,7 @@ groups: brief: > These are the attributes when instrumenting OpenAI LLM requests with the `/chat/completions` endpoint. 
attributes: - - ref: llm.vendor + - ref: llm.request.vendor requirement_level: recommended examples: ['openai', 'microsoft'] tag: tech-specific-openai-request @@ -73,23 +78,23 @@ groups: tag: tech-specific-openai-request - ref: llm.request.max_tokens tag: tech-specific-openai-request - - ref: llm.temperature + - ref: llm.request.temperature tag: tech-specific-openai-request - - ref: llm.top_p + - ref: llm.request.top_p tag: tech-specific-openai-request - - ref: llm.stream + - ref: llm.request.stream tag: tech-specific-openai-request - - ref: llm.stop_sequences + - ref: llm.request.stop_sequences tag: tech-specific-openai-request - - ref: llm.openai.presence_penalty + - ref: llm.request.openai.presence_penalty tag: tech-specific-openai-request - - ref: llm.openai.logit_bias + - ref: llm.request.openai.logit_bias tag: tech-specific-openai-request - - ref: llm.openai.user + - ref: llm.request.openai.user tag: tech-specific-openai-request - - ref: llm.openai.response_format + - ref: llm.request.openai.response_format tag: tech-specific-openai-request - - ref: llm.openai.seed + - ref: llm.request.openai.seed tag: tech-specific-openai-response - ref: llm.response.id tag: tech-specific-openai-response @@ -101,13 +106,18 @@ groups: tag: tech-specific-openai-response - ref: llm.usage.total_tokens tag: tech-specific-openai-response - - ref: llm.openai.created + - ref: llm.response.openai.created tag: tech-specific-openai-response - - ref: llm.openai.system_fingerprint + - ref: llm.response.openai.system_fingerprint tag: tech-sepecifc-openai-response + events: + - llm.content.openai.prompt + - llm.content.openai.tool + - llm.content.openai.completion.choice - - id: llm.openai.prompt - type: span + - id: llm.content.openai.prompt + name: llm.content.openai.prompt + type: event brief: > These are the attributes when instrumenting OpenAI LLM requests and recording prompts in the request. attributes: @@ -120,12 +130,13 @@ groups: conditionally_required: > Required if the prompt role is `tool`. - - id: llm.openai.tool - type: span + - id: llm.content.openai.tool + name: llm.content.openai.tool + type: event brief: > These are the attributes when instrumenting OpenAI LLM requests that specify tools (or functions) the LLM can use. attributes: - - ref: llm.openai.tool_call.type + - ref: llm.openai.tool.type requirement_level: required - ref: llm.openai.function.name requirement_level: required @@ -134,8 +145,9 @@ groups: - ref: llm.openai.function.parameters requirement_level: required - - id: llm.openai.choice - type: span + - id: llm.content.openai.completion.choice + name: llm.content.openai.completion.choice + type: event brief: > These are the attributes when instrumenting OpenAI LLM requests and recording choices in the result. attributes: @@ -150,7 +162,7 @@ groups: requirement_level: conditionally_required: > Required if the choice is the result of a tool call. - - ref: llm.openai.tool_call.type + - ref: llm.openai.tool.type requirement_level: conditionally_required: > Required if the choice is the result of a tool call. 
From 0891f913fb51a61d8baf100582b41b2b2b000346 Mon Sep 17 00:00:00 2001 From: Nir Gazit Date: Fri, 12 Jan 2024 11:53:35 +0100 Subject: [PATCH 04/36] chore: @lmolkova reviews --- docs/ai/README.md | 2 +- docs/ai/llm-spans.md | 99 ++++++++++++++++++++++++++++++++++ docs/ai/openai.md | 114 ++++++++++++++++++++++++++++++++++++++++ model/registry/llm.yaml | 6 +-- model/trace/llm.yaml | 16 +++--- 5 files changed, 225 insertions(+), 12 deletions(-) create mode 100644 docs/ai/llm-spans.md create mode 100644 docs/ai/openai.md diff --git a/docs/ai/README.md b/docs/ai/README.md index f04a867a22..855503f97c 100644 --- a/docs/ai/README.md +++ b/docs/ai/README.md @@ -21,4 +21,4 @@ Technology specific semantic conventions are defined for the following LLM provi * [OpenAI](openai.md): Semantic Conventions for *OpenAI*. -[DocumentStatus]: https://github.com/open-telemetry/opentelemetry-specification/tree/v1.22.0/specification/document-status.md \ No newline at end of file +[DocumentStatus]: https://github.com/open-telemetry/opentelemetry-specification/tree/v1.26.0/specification/document-status.md \ No newline at end of file diff --git a/docs/ai/llm-spans.md b/docs/ai/llm-spans.md new file mode 100644 index 0000000000..19c4162321 --- /dev/null +++ b/docs/ai/llm-spans.md @@ -0,0 +1,99 @@ + + +# Semantic Conventions for LLM requests + +**Status**: [Experimental][DocumentStatus] + + + + + +- [LLM Request attributes](#llm-request-attributes) +- [Configuration](#configuration) +- [Semantic Conventions for specific LLM technologies](#semantic-conventions-for-specific-llm-technologies) + + + +A request to an LLM is modeled as a span in a trace. + +The **span name** SHOULD be set to a low cardinality value representing the request made to an LLM. +It MAY be a name of the API endpoint for the LLM being called. + +## Configuration + +Instrumentations for LLMs MUST offer the ability to turn off capture of prompts and completions. This is for three primary reasons: + +1. Data privacy concerns. End users of LLM applications may input sensitive information or personally identifiable information (PII) that they do not wish to be sent to a telemetry backend. +2. Data size concerns. Although there is no specified limit to sizes, there are practical limitations in programming languages and telemety systems. Some LLMs allow for extremely large context windows that end users may take full advantage of. +3. Performance concerns. Sending large amounts of data to a telemetry backend may cause performance issues for the application. + +By default, these configurations SHOULD NOT capture prompts and completions. + +## LLM Request attributes + +These attributes track input data and metadata for a request to an LLM. Each attribute represents a concept that is common to most LLMs. + + +| Attribute | Type | Description | Examples | Requirement Level | +|---|---|---|---|---| +| `llm.vendor` | string | The name of the LLM foundation model vendor, if applicable. If not using a vendor-supplied model, this field is left blank. | `openai` | Recommended | +| `llm.request.model` | string | The name of the LLM a request is being made to. If the LLM is supplied by a vendor, then the value must be the exact name of the model requested. If the LLM is a fine-tuned custom model, the value SHOULD have a more specific name than the base model that's been fine-tuned. | `gpt-4` | Required | +| `llm.request.max_tokens` | int | The maximum number of tokens the LLM generates for a request. 
| `100` | Recommended |
+| `llm.temperature` | float | The temperature setting for the LLM request. | `0.0` | Recommended |
+| `llm.top_p` | float | The top_p sampling setting for the LLM request. | `1.0` | Recommended |
+| `llm.stream` | bool | Whether the LLM responds with a stream. | `false` | Recommended |
+| `llm.stop_sequences` | array | Array of strings the LLM uses as a stop sequence. | `["stop1"]` | Recommended |
+
+`llm.request.model` has the following list of well-known values. If one of them applies, then the respective value MUST be used, otherwise a custom value MAY be used.
+
+| Value | Description |
+|---|---|
+| `gpt-4` | GPT-4 |
+| `gpt-4-32k` | GPT-4 with 32k context window |
+| `gpt-3.5-turbo` | GPT-3.5-turbo |
+| `gpt-3.5-turbo-16k` | GPT-3.5-turbo with 16k context window |
+| `claude-instant-1` | Claude Instant (latest version) |
+| `claude-2` | Claude 2 (latest version) |
+| `other-llm` | Any LLM not listed in this table. Use for any fine-tuned version of a model. |
+
+
+## LLM Response attributes
+
+These attributes track output data and metadata for a response from an LLM. Each attribute represents a concept that is common to most LLMs.
+
+
+| Attribute | Type | Description | Examples | Requirement Level |
+|---|---|---|---|---|
+| `llm.response.id` | string | The unique identifier for the completion. | `chatcmpl-123` | Recommended |
+| `llm.response.model` | string | The name of the LLM a response is being made to. If the LLM is supplied by a vendor, then the value must be the exact name of the model actually used. If the LLM is a fine-tuned custom model, the value SHOULD have a more specific name than the base model that's been fine-tuned. | `gpt-4-0613` | Required |
+| `llm.response.finish_reason` | string | The reason the model stopped generating tokens. | `stop` | Recommended |
+| `llm.usage.prompt_tokens` | int | The number of tokens used in the LLM prompt. | `100` | Recommended |
+| `llm.usage.completion_tokens` | int | The number of tokens used in the LLM response (completion). | `180` | Recommended |
+| `llm.usage.total_tokens` | int | The total number of tokens used in the LLM prompt and response. | `280` | Recommended |
+
+`llm.response.finish_reason` MUST be one of the following:
+
+| Value | Description |
+|---|---|
+| `stop` | If the model hit a natural stop point or a provided stop sequence. |
+| `max_tokens` | If the maximum number of tokens specified in the request was reached. |
+| `tool_call` | If a function / tool call was made by the model (for models that support such functionality). |
+
+
+## Events
+
+In the lifetime of an LLM span, an event for prompts sent and completions received MAY be created, depending on the configuration of the instrumentation.
+
+
+| Attribute | Type | Description | Examples | Requirement Level |
+|---|---|---|---|---|
+| `llm.prompt` | string | The full prompt string sent to an LLM in a request. If the LLM accepts a more complex input like a JSON object made up of several pieces (such as OpenAI's different message types), this field is blank, and the response is instead captured in an event determined by the specific LLM technology semantic convention. | `\n\nHuman:You are an AI assistant that tells jokes. Can you tell me a joke about OpenTelemetry?\n\nAssistant:` | Recommended |
+
+
+
+| Attribute | Type | Description | Examples | Requirement Level |
+|---|---|---|---|---|
+| `llm.completion` | string | The full response string from an LLM. If the LLM responds with a more complex output like a JSON object made up of several pieces (such as OpenAI's message choices), this field is the content of the response. If the LLM produces multiple responses, then this field is left blank, and each response is instead captured in an event determined by the specific LLM technology semantic convention. | `Why did the developer stop using OpenTelemetry? Because they couldn't trace their steps!` | Recommended |
+
+
+[DocumentStatus]: https://github.com/open-telemetry/opentelemetry-specification/tree/v1.22.0/specification/document-status.md
\ No newline at end of file
diff --git a/docs/ai/openai.md b/docs/ai/openai.md
new file mode 100644
index 0000000000..4c7acf404a
--- /dev/null
+++ b/docs/ai/openai.md
@@ -0,0 +1,114 @@
+
+
+# Semantic Conventions for OpenAI Spans
+
+**Status**: [Experimental][DocumentStatus]
+
+This document outlines the Semantic Conventions specific to
+[OpenAI](https://platform.openai.com/) spans, extending the general semantics
+found in the [LLM Semantic Conventions](llm-spans.md). These conventions are
+designed to standardize telemetry data for OpenAI interactions, particularly
+focusing on the `/chat/completions` endpoint. By following these guidelines,
+developers can ensure consistent, meaningful, and easily interpretable telemetry
+data across different applications and platforms.
+
+## Chat Completions
+
+The span name for OpenAI chat completions SHOULD be `openai.chat`
+to maintain consistency and clarity in telemetry data.
+
+## Request Attributes
+
+These are the attributes when instrumenting OpenAI LLM requests with the
+`/chat/completions` endpoint.
+
+
+| Attribute | Type | Description | Examples | Requirement Level |
+|---|---|---|---|---|
+| `llm.vendor` | string | The name of the LLM foundation model vendor, if applicable. If not using a vendor-supplied model, this field is left blank. | `openai` | Recommended |
+| `llm.request.model` | string | The name of the LLM a request is being made to. If the LLM is supplied by a vendor, then the value must be the exact name of the model requested. If the LLM is a fine-tuned custom model, the value SHOULD have a more specific name than the base model that's been fine-tuned. | `gpt-4` | Required |
+| `llm.request.max_tokens` | int | The maximum number of tokens the LLM generates for a request. | `100` | Recommended |
+| `llm.temperature` | float | The temperature setting for the LLM request. | `0.0` | Recommended |
+| `llm.top_p` | float | The top_p sampling setting for the LLM request. | `1.0` | Recommended |
+| `llm.stream` | bool | Whether the LLM responds with a stream. | `false` | Recommended |
+| `llm.stop_sequences` | array | Array of strings the LLM uses as a stop sequence. | `["stop1"]` | Recommended |
+| `llm.openai.n` | integer | The number of completions to generate. | `1` | Recommended |
+| `llm.openai.presence_penalty` | float | If present, the `presence_penalty` used in an OpenAI request. Value is between -2.0 and 2.0. | `-0.5` | Recommended |
+| `llm.openai.frequency_penalty` | float | If present, the `frequency_penalty` used in an OpenAI request. Value is between -2.0 and 2.0. | `-0.5` | Recommended |
+| `llm.openai.logit_bias` | string | If present, the JSON-encoded string of a `logit_bias` used in an OpenAI request. | `{2435:-100, 640:-100}` | Recommended |
+| `llm.openai.user` | string | If present, the `user` used in an OpenAI request. | `bob` | Opt-in |
+| `llm.openai.response_format` | string | An object specifying the format that the model must output. Either `text` or `json_object` | `text` | Recommended |
+| `llm.openai.seed` | integer | Seed used in request to improve determinism. | `1234` | Recommended |
+
+
+## Response Attributes
+
+Attributes for chat completion responses SHOULD follow these conventions:
+
+
+| Attribute | Type | Description | Examples | Requirement Level |
+|---|---|---|---|---|
+| `llm.response.id` | string | The unique identifier for the completion. | `chatcmpl-123` | Recommended |
+| `llm.response.model` | string | The name of the LLM a response is being made to. If the LLM is supplied by a vendor, then the value must be the exact name of the model actually used. If the LLM is a fine-tuned custom model, the value SHOULD have a more specific name than the base model that's been fine-tuned. | `gpt-4-0613` | Required |
+| `llm.response.finish_reason` | string | The reason the model stopped generating tokens. | `stop` | Recommended |
+| `llm.usage.prompt_tokens` | int | The number of tokens used in the LLM prompt. | `100` | Recommended |
+| `llm.usage.completion_tokens` | int | The number of tokens used in the LLM response (completion). | `180` | Recommended |
+| `llm.usage.total_tokens` | int | The total number of tokens used in the LLM prompt and response. | `280` | Recommended |
+| `llm.openai.created` | int | The UNIX timestamp (in seconds) of when the completion was created. | `1677652288` | Recommended |
+| `llm.openai.system_fingerprint` | string | This fingerprint represents the backend configuration that the model runs with. | `asdf987123` | Recommended |
+
+
+## Request Events
+
+In the lifetime of an LLM span, an event for prompts sent and completions received MAY be created, depending on the configuration of the instrumentation.
+Because OpenAI uses a more complex prompt structure, these events will be used instead of the generic ones detailed in the [LLM Semantic Conventions](llm-spans.md).
+
+### Prompt Events
+
+Prompt event name SHOULD be `llm.openai.prompt`.
+
+
+| Attribute | Type | Description | Examples | Requirement Level |
+|---|---|---|---|---|
+| `role` | string | The role of the prompt author, can be one of `system`, `user`, `assistant`, or `tool` | `system` | Required |
+| `content` | string | The content for a given OpenAI response, denoted by ``. The value for `` starts with 0, where 0 is the first message. | `Why did the developer stop using OpenTelemetry? Because they couldn't trace their steps!` | Required |
+| `tool_call_id` | string | If role is `tool` or `function`, the ID of the tool call that this message is responding to. | `get_current_weather` | Conditionally Required: If `role` is `tool`. |
+
+
+### Tools Events
+
+Tools event name SHOULD be `llm.openai.tool`, specifying potential tools or functions the LLM can use.
+
+
+| Attribute | Type | Description | Examples | Requirement Level |
+|---|---|---|---|---|
+| `type` | string | The type of the tool. Currently, only `function` is supported. | `function` | Required |
+| `function.name` | string | The name of the function to be called. | `get_weather` | Required |
+| `function.description` | string | A description of what the function does, used by the model to choose when and how to call the function. | `Gets the current weather for a location` | Required |
+| `function.parameters` | string | JSON-encoded string of the parameter object for the function.
| `{"type": "object", "properties": {}}` | Required | + + +### Choice Events + +Recording details about Choices in each response MAY be included as +Span Events. + +Choice event name SHOULD be `llm.openai.choice`. + +If there is more than one `tool_call`, separate events SHOULD be used. + + +| `type` | string | Either `delta` or `message`. | `message` | Required | +|---|---|---|---|---| +| `finish_reason` | string | The reason the OpenAI model stopped generating tokens for this chunk. | `stop` | Recommended | +| `role` | string | The assigned role for a given OpenAI response, denoted by ``. The value for `` starts with 0, where 0 is the first message. | `system` | Required | +| `content` | string | The content for a given OpenAI response, denoted by ``. The value for `` starts with 0, where 0 is the first message. | `Why did the developer stop using OpenTelemetry? Because they couldn't trace their steps!` | Required | +| `tool_call.id` | string | If exists, the ID of the tool call. | `call_BP08xxEhU60txNjnz3z9R4h9` | Required | +| `tool_call.type` | string | Currently only `function` is supported. | `function` | Required | +| `tool_call.function.name` | string | If exists, the name of a function call for a given OpenAI response, denoted by ``. The value for `` starts with 0, where 0 is the first message. | `get_weather_report` | Required | +| `tool_call.function.arguments` | string | If exists, the arguments to call a function call with for a given OpenAI response, denoted by ``. The value for `` starts with 0, where 0 is the first message. | `{"type": "object", "properties": {"some":"data"}}` | Required | + + +[DocumentStatus]: https://github.com/open-telemetry/opentelemetry-specification/tree/v1.22.0/specification/document-status.md \ No newline at end of file diff --git a/model/registry/llm.yaml b/model/registry/llm.yaml index 9bb6ee669b..d45bad3368 100644 --- a/model/registry/llm.yaml +++ b/model/registry/llm.yaml @@ -5,7 +5,7 @@ groups: brief: > This document defines the attributes used to describe telemetry in the context of LLM (Large Language Models) requests and responses. attributes: - - id: request.vendor + - id: system type: string brief: The name of the LLM foundation model vendor, if applicable. examples: 'openai' @@ -30,7 +30,7 @@ groups: brief: The top_p sampling setting for the LLM request. examples: [1.0] tag: llm-generic-request - - id: request.stream + - id: request.is_stream type: boolean brief: Whether the LLM responds with a stream. examples: [false] @@ -41,7 +41,7 @@ groups: examples: ["stop1"] tag: llm-generic-request - id: response.id - type: string + type: string[] brief: The unique identifier for the completion. examples: ['chatcmpl-123'] tag: llm-generic-response diff --git a/model/trace/llm.yaml b/model/trace/llm.yaml index 4df11e1b5c..17fe1e709f 100644 --- a/model/trace/llm.yaml +++ b/model/trace/llm.yaml @@ -4,7 +4,7 @@ groups: brief: > A request to an LLM is modeled as a span in a trace. The span name should be a low cardinality value representing the request made to an LLM, like the name of the API endpoint being called. attributes: - - ref: llm.request.vendor + - ref: llm.system requirement_level: recommended note: > The name of the LLM foundation model vendor, if applicable. If not using a vendor-supplied model, this field is left blank. 
@@ -18,7 +18,7 @@ groups: requirement_level: recommended - ref: llm.request.top_p requirement_level: recommended - - ref: llm.request.stream + - ref: llm.request.is_stream requirement_level: recommended - ref: llm.request.stop_sequences requirement_level: recommended @@ -65,9 +65,9 @@ groups: - id: llm.openai type: span brief: > - These are the attributes when instrumenting OpenAI LLM requests with the `/chat/completions` endpoint. + A span representing a request to OpenAI's API, providing additional information on top of the generic llm.request. attributes: - - ref: llm.request.vendor + - ref: llm.system requirement_level: recommended examples: ['openai', 'microsoft'] tag: tech-specific-openai-request @@ -82,7 +82,7 @@ groups: tag: tech-specific-openai-request - ref: llm.request.top_p tag: tech-specific-openai-request - - ref: llm.request.stream + - ref: llm.request.is_stream tag: tech-specific-openai-request - ref: llm.request.stop_sequences tag: tech-specific-openai-request @@ -119,7 +119,7 @@ groups: name: llm.content.openai.prompt type: event brief: > - These are the attributes when instrumenting OpenAI LLM requests and recording prompts in the request. + This event is fired when a completion request is sent to OpenAI, specifying the prompt that was sent. attributes: - ref: llm.openai.role requirement_level: required @@ -134,7 +134,7 @@ groups: name: llm.content.openai.tool type: event brief: > - These are the attributes when instrumenting OpenAI LLM requests that specify tools (or functions) the LLM can use. + This event is fired when a completion request is sent to OpenAI, specifying tools that the LLM can use. attributes: - ref: llm.openai.tool.type requirement_level: required @@ -149,7 +149,7 @@ groups: name: llm.content.openai.completion.choice type: event brief: > - These are the attributes when instrumenting OpenAI LLM requests and recording choices in the result. + This event is fired when a completion response is returned from OpenAI, specifying one possibile completion returned by the LLM. attributes: - ref: llm.openai.choice.type requirement_level: required From d5a9753ddf682ffb9eefca92aeec8f94202cca2f Mon Sep 17 00:00:00 2001 From: Drew Robbins Date: Sun, 28 Jan 2024 12:16:00 -0800 Subject: [PATCH 05/36] Add OpenAI metrics --- docs/ai/README.md | 3 +- docs/ai/openai-metrics.md | 375 +++++++++++++++++++++++++++++++++ model/metrics/llm-metrics.yaml | 109 ++++++++++ model/registry/llm.yaml | 9 + 4 files changed, 495 insertions(+), 1 deletion(-) create mode 100644 docs/ai/openai-metrics.md create mode 100644 model/metrics/llm-metrics.yaml diff --git a/docs/ai/README.md b/docs/ai/README.md index 855503f97c..bf83b94856 100644 --- a/docs/ai/README.md +++ b/docs/ai/README.md @@ -19,6 +19,7 @@ Semantic conventions for LLM operations are defined for the following signals: Technology specific semantic conventions are defined for the following LLM providers: -* [OpenAI](openai.md): Semantic Conventions for *OpenAI*. +* [OpenAI](openai.md): Semantic Conventions for *OpenAI* spans. +* [OpenAI Metrics](openai-metrics.md): Semantic Conventions for *OpenAI* metrics. 
[DocumentStatus]: https://github.com/open-telemetry/opentelemetry-specification/tree/v1.26.0/specification/document-status.md
\ No newline at end of file
diff --git a/docs/ai/openai-metrics.md b/docs/ai/openai-metrics.md
new file mode 100644
index 0000000000..5b231da602
--- /dev/null
+++ b/docs/ai/openai-metrics.md
@@ -0,0 +1,375 @@
+
+
+# Semantic Conventions for OpenAI Metrics
+
+**Status**: [Experimental][DocumentStatus]
+
+This document defines semantic conventions for OpenAI client metrics.
+
+
+
+
+
+- [Chat completions](#chat-completions)
+  * [Metric: `openai.chat_completions.tokens`](#metric-openaichat_completionstokens)
+  * [Metric: `openai.chat_completions.choices`](#metric-openaichat_completionschoices)
+  * [Metric: `openai.chat_completions.duration`](#metric-openaichat_completionsduration)
+- [Embeddings](#embeddings)
+  * [Metric: `openai.embeddings.tokens`](#metric-openaiembeddingstokens)
+  * [Metric: `openai.embeddings.vector_size`](#metric-openaiembeddingsvector_size)
+  * [Metric: `openai.embeddings.duration`](#metric-openaiembeddingsduration)
+- [Image generation](#image-generation)
+  * [Metric: `openai.image_generations.duration`](#metric-openaiimage_generationsduration)
+
+
+
+## Chat completions
+
+### Metric: `openai.chat_completions.tokens`
+
+**Status**: [Experimental][DocumentStatus]
+
+This metric is required.
+
+
+| Name | Instrument Type | Unit (UCUM) | Description |
+| -------- | --------------- | ----------- | -------------- |
+| `llm.openai.chat_completions.tokens` | Counter | `token` | Number of tokens used in prompt and completions. |
+
+
+
+| Attribute | Type | Description | Examples | Requirement Level |
+|---|---|---|---|---|
+| [`error.type`](../attributes-registry/error.md) | string | Describes a class of error the operation ended with. [1] | `timeout`; `java.net.UnknownHostException`; `server_certificate_invalid`; `500` | Conditionally Required: if the operation ended in error |
+| [`llm.response.model`](../attributes-registry/llm.md) | string | The name of the LLM a response is being made to. | `gpt-4-0613` | Required |
+| [`llm.usage.token_type`](../attributes-registry/llm.md) | string | The type of token. | `prompt` | Recommended |
+| [`server.address`](../attributes-registry/server.md) | string | Server domain name if available without reverse DNS lookup; otherwise, IP address or Unix domain socket name. [2] | `example.com`; `10.1.2.80`; `/tmp/my.sock` | Required |
+
+**[1]:** The `error.type` SHOULD be predictable and SHOULD have low cardinality.
+Instrumentations SHOULD document the list of errors they report.
+
+The cardinality of `error.type` within one instrumentation library SHOULD be low.
+Telemetry consumers that aggregate data from multiple instrumentation libraries and applications
+should be prepared for `error.type` to have high cardinality at query time when no
+additional filters are applied.
+
+If the operation has completed successfully, instrumentations SHOULD NOT set `error.type`.
+
+If a specific domain defines its own set of error identifiers (such as HTTP or gRPC status codes),
+it's RECOMMENDED to:
+
+* Use a domain-specific attribute
+* Set `error.type` to capture all errors, regardless of whether they are defined within the domain-specific set or not.
+
+**[2]:** When observed from the client side, and when communicating through an intermediary, `server.address` SHOULD represent the server address behind any intermediaries, for example proxies, if it's available.
+
+`error.type` has the following list of well-known values.
If one of them applies, then the respective value MUST be used, otherwise a custom value MAY be used. + +| Value | Description | +|---|---| +| `_OTHER` | A fallback error value to be used when the instrumentation doesn't define a custom value. | + +`llm.usage.token_type` MUST be one of the following: + +| Value | Description | +|---|---| +| `prompt` | prompt | +| `completion` | completion | + + +### Metric: `openai.chat_completions.choices` + +**Status**: [Experimental][DocumentStatus] + +This metric is required. + + +| Name | Instrument Type | Unit (UCUM) | Description | +| -------- | --------------- | ----------- | -------------- | +| `llm.openai.chat_completions.choices` | Counter | `choice` | Number of choices returned by chat completions call | + + + +| Attribute | Type | Description | Examples | Requirement Level | +|---|---|---|---|---| +| [`error.type`](../attributes-registry/error.md) | string | Describes a class of error the operation ended with. [1] | `timeout`; `java.net.UnknownHostException`; `server_certificate_invalid`; `500` | Conditionally Required: if the operation ended in error | +| [`llm.response.finish_reason`](../attributes-registry/llm.md) | string | The reason the model stopped generating tokens. | `stop` | Recommended | +| [`llm.response.model`](../attributes-registry/llm.md) | string | The name of the LLM a response is being made to. | `gpt-4-0613` | Required | +| [`server.address`](../attributes-registry/server.md) | string | Server domain name if available without reverse DNS lookup; otherwise, IP address or Unix domain socket name. [2] | `example.com`; `10.1.2.80`; `/tmp/my.sock` | Required | + +**[1]:** The `error.type` SHOULD be predictable and SHOULD have low cardinality. +Instrumentations SHOULD document the list of errors they report. + +The cardinality of `error.type` within one instrumentation library SHOULD be low. +Telemetry consumers that aggregate data from multiple instrumentation libraries and applications +should be prepared for `error.type` to have high cardinality at query time when no +additional filters are applied. + +If the operation has completed successfully, instrumentations SHOULD NOT set `error.type`. + +If a specific domain defines its own set of error identifiers (such as HTTP or gRPC status codes), +it's RECOMMENDED to: + +* Use a domain-specific attribute +* Set `error.type` to capture all errors, regardless of whether they are defined within the domain-specific set or not. + +**[2]:** When observed from the client side, and when communicating through an intermediary, `server.address` SHOULD represent the server address behind any intermediaries, for example proxies, if it's available. + +`error.type` has the following list of well-known values. If one of them applies, then the respective value MUST be used, otherwise a custom value MAY be used. + +| Value | Description | +|---|---| +| `_OTHER` | A fallback error value to be used when the instrumentation doesn't define a custom value. | + + + +### Metric: `openai.chat_completions.duration` + +**Status**: [Experimental][DocumentStatus] + +This metric is required. + +This metric SHOULD be specified with +[`ExplicitBucketBoundaries`](https://github.com/open-telemetry/opentelemetry-specification/tree/v1.22.0/specification/metrics/api.md#instrument-advice) +of `[ 0, 0.005, 0.01, 0.025, 0.05, 0.075, 0.1, 0.25, 0.5, 0.75, 1, 2.5, 5, 7.5, 10 ]`. 
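For illustration only — the following sketch is not part of the convention or of this patch — the bucket-boundary advice above can be honored with the OpenTelemetry Python SDK by registering a `View` with the recommended boundaries. The instrumentation scope name, the recorded value, and the attribute values are hypothetical, and an SDK with native instrument-advice support would not need the `View`.

```python
# Sketch: configure the recommended bucket boundaries for the
# `llm.openai.chat_completions.duration` histogram via an SDK View,
# then record one duration per completed chat completions call.
from opentelemetry import metrics
from opentelemetry.sdk.metrics import MeterProvider
from opentelemetry.sdk.metrics.view import ExplicitBucketHistogramAggregation, View

DURATION_BUCKETS = [0, 0.005, 0.01, 0.025, 0.05, 0.075, 0.1,
                    0.25, 0.5, 0.75, 1, 2.5, 5, 7.5, 10]

provider = MeterProvider(
    views=[
        View(
            instrument_name="llm.openai.chat_completions.duration",
            aggregation=ExplicitBucketHistogramAggregation(boundaries=DURATION_BUCKETS),
        )
    ],
)
metrics.set_meter_provider(provider)

meter = metrics.get_meter("example.openai.instrumentation")  # hypothetical scope name
duration = meter.create_histogram(
    name="llm.openai.chat_completions.duration",
    unit="s",
    description="Duration of chat completion operation",
)

# Attribute names follow the requirement levels described in this document.
duration.record(
    0.42,  # elapsed seconds for one request (hypothetical value)
    attributes={
        "llm.response.model": "gpt-4-0613",
        "server.address": "api.openai.com",
    },
)
```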
+ + +| Name | Instrument Type | Unit (UCUM) | Description | +| -------- | --------------- | ----------- | -------------- | +| `llm.openai.chat_completions.duration` | Histogram | `s` | Duration of chat completion operation | + + + +| Attribute | Type | Description | Examples | Requirement Level | +|---|---|---|---|---| +| [`error.type`](../attributes-registry/error.md) | string | Describes a class of error the operation ended with. [1] | `timeout`; `java.net.UnknownHostException`; `server_certificate_invalid`; `500` | Conditionally Required: if the operation ended in error | +| [`llm.response.finish_reason`](../attributes-registry/llm.md) | string | The reason the model stopped generating tokens. | `stop` | Recommended | +| [`llm.response.model`](../attributes-registry/llm.md) | string | The name of the LLM a response is being made to. | `gpt-4-0613` | Required | +| [`server.address`](../attributes-registry/server.md) | string | Server domain name if available without reverse DNS lookup; otherwise, IP address or Unix domain socket name. [2] | `example.com`; `10.1.2.80`; `/tmp/my.sock` | Required | + +**[1]:** The `error.type` SHOULD be predictable and SHOULD have low cardinality. +Instrumentations SHOULD document the list of errors they report. + +The cardinality of `error.type` within one instrumentation library SHOULD be low. +Telemetry consumers that aggregate data from multiple instrumentation libraries and applications +should be prepared for `error.type` to have high cardinality at query time when no +additional filters are applied. + +If the operation has completed successfully, instrumentations SHOULD NOT set `error.type`. + +If a specific domain defines its own set of error identifiers (such as HTTP or gRPC status codes), +it's RECOMMENDED to: + +* Use a domain-specific attribute +* Set `error.type` to capture all errors, regardless of whether they are defined within the domain-specific set or not. + +**[2]:** When observed from the client side, and when communicating through an intermediary, `server.address` SHOULD represent the server address behind any intermediaries, for example proxies, if it's available. + +`error.type` has the following list of well-known values. If one of them applies, then the respective value MUST be used, otherwise a custom value MAY be used. + +| Value | Description | +|---|---| +| `_OTHER` | A fallback error value to be used when the instrumentation doesn't define a custom value. | + + +## Embeddings + +### Metric: `openai.embeddings.tokens` + +**Status**: [Experimental][DocumentStatus] + +This metric is required. + + +| Name | Instrument Type | Unit (UCUM) | Description | +| -------- | --------------- | ----------- | -------------- | +| `llm.openai.embeddings.tokens` | Counter | `token` | Number of tokens used in prompt and completions. | + + + +| Attribute | Type | Description | Examples | Requirement Level | +|---|---|---|---|---| +| [`error.type`](../attributes-registry/error.md) | string | Describes a class of error the operation ended with. [1] | `timeout`; `java.net.UnknownHostException`; `server_certificate_invalid`; `500` | Conditionally Required: if the operation ended in error | +| [`llm.response.model`](../attributes-registry/llm.md) | string | The name of the LLM a response is being made to. | `gpt-4-0613` | Required | +| [`llm.usage.token_type`](../attributes-registry/llm.md) | string | The type of token. 
| `prompt` | Recommended | +| [`server.address`](../attributes-registry/server.md) | string | Server domain name if available without reverse DNS lookup; otherwise, IP address or Unix domain socket name. [2] | `example.com`; `10.1.2.80`; `/tmp/my.sock` | Required | + +**[1]:** The `error.type` SHOULD be predictable and SHOULD have low cardinality. +Instrumentations SHOULD document the list of errors they report. + +The cardinality of `error.type` within one instrumentation library SHOULD be low. +Telemetry consumers that aggregate data from multiple instrumentation libraries and applications +should be prepared for `error.type` to have high cardinality at query time when no +additional filters are applied. + +If the operation has completed successfully, instrumentations SHOULD NOT set `error.type`. + +If a specific domain defines its own set of error identifiers (such as HTTP or gRPC status codes), +it's RECOMMENDED to: + +* Use a domain-specific attribute +* Set `error.type` to capture all errors, regardless of whether they are defined within the domain-specific set or not. + +**[2]:** When observed from the client side, and when communicating through an intermediary, `server.address` SHOULD represent the server address behind any intermediaries, for example proxies, if it's available. + +`error.type` has the following list of well-known values. If one of them applies, then the respective value MUST be used, otherwise a custom value MAY be used. + +| Value | Description | +|---|---| +| `_OTHER` | A fallback error value to be used when the instrumentation doesn't define a custom value. | + +`llm.usage.token_type` MUST be one of the following: + +| Value | Description | +|---|---| +| `prompt` | prompt | +| `completion` | completion | + + +### Metric: `openai.embeddings.vector_size` + +**Status**: [Experimental][DocumentStatus] + +This metric is required. + + +| Name | Instrument Type | Unit (UCUM) | Description | +| -------- | --------------- | ----------- | -------------- | +| `llm.openai.embeddings.vector_size` | Counter | `element` | he size of returned vector. | + + + +| Attribute | Type | Description | Examples | Requirement Level | +|---|---|---|---|---| +| [`error.type`](../attributes-registry/error.md) | string | Describes a class of error the operation ended with. [1] | `timeout`; `java.net.UnknownHostException`; `server_certificate_invalid`; `500` | Conditionally Required: if the operation ended in error | +| [`llm.response.model`](../attributes-registry/llm.md) | string | The name of the LLM a response is being made to. | `gpt-4-0613` | Required | +| [`server.address`](../attributes-registry/server.md) | string | Server domain name if available without reverse DNS lookup; otherwise, IP address or Unix domain socket name. [2] | `example.com`; `10.1.2.80`; `/tmp/my.sock` | Required | + +**[1]:** The `error.type` SHOULD be predictable and SHOULD have low cardinality. +Instrumentations SHOULD document the list of errors they report. + +The cardinality of `error.type` within one instrumentation library SHOULD be low. +Telemetry consumers that aggregate data from multiple instrumentation libraries and applications +should be prepared for `error.type` to have high cardinality at query time when no +additional filters are applied. + +If the operation has completed successfully, instrumentations SHOULD NOT set `error.type`. 
+ +If a specific domain defines its own set of error identifiers (such as HTTP or gRPC status codes), +it's RECOMMENDED to: + +* Use a domain-specific attribute +* Set `error.type` to capture all errors, regardless of whether they are defined within the domain-specific set or not. + +**[2]:** When observed from the client side, and when communicating through an intermediary, `server.address` SHOULD represent the server address behind any intermediaries, for example proxies, if it's available. + +`error.type` has the following list of well-known values. If one of them applies, then the respective value MUST be used, otherwise a custom value MAY be used. + +| Value | Description | +|---|---| +| `_OTHER` | A fallback error value to be used when the instrumentation doesn't define a custom value. | + + +### Metric: `openai.embeddings.duration` + +**Status**: [Experimental][DocumentStatus] + +This metric is required. + +This metric SHOULD be specified with +[`ExplicitBucketBoundaries`](https://github.com/open-telemetry/opentelemetry-specification/tree/v1.22.0/specification/metrics/api.md#instrument-advice) +of `[ 0, 0.005, 0.01, 0.025, 0.05, 0.075, 0.1, 0.25, 0.5, 0.75, 1, 2.5, 5, 7.5, 10 ]`. + + +| Name | Instrument Type | Unit (UCUM) | Description | +| -------- | --------------- | ----------- | -------------- | +| `llm.openai.embeddings.duration` | Histogram | `s` | Duration of embeddings operation | + + + +| Attribute | Type | Description | Examples | Requirement Level | +|---|---|---|---|---| +| [`error.type`](../attributes-registry/error.md) | string | Describes a class of error the operation ended with. [1] | `timeout`; `java.net.UnknownHostException`; `server_certificate_invalid`; `500` | Conditionally Required: if the operation ended in error | +| [`llm.response.model`](../attributes-registry/llm.md) | string | The name of the LLM a response is being made to. | `gpt-4-0613` | Required | +| [`server.address`](../attributes-registry/server.md) | string | Server domain name if available without reverse DNS lookup; otherwise, IP address or Unix domain socket name. [2] | `example.com`; `10.1.2.80`; `/tmp/my.sock` | Required | + +**[1]:** The `error.type` SHOULD be predictable and SHOULD have low cardinality. +Instrumentations SHOULD document the list of errors they report. + +The cardinality of `error.type` within one instrumentation library SHOULD be low. +Telemetry consumers that aggregate data from multiple instrumentation libraries and applications +should be prepared for `error.type` to have high cardinality at query time when no +additional filters are applied. + +If the operation has completed successfully, instrumentations SHOULD NOT set `error.type`. + +If a specific domain defines its own set of error identifiers (such as HTTP or gRPC status codes), +it's RECOMMENDED to: + +* Use a domain-specific attribute +* Set `error.type` to capture all errors, regardless of whether they are defined within the domain-specific set or not. + +**[2]:** When observed from the client side, and when communicating through an intermediary, `server.address` SHOULD represent the server address behind any intermediaries, for example proxies, if it's available. + +`error.type` has the following list of well-known values. If one of them applies, then the respective value MUST be used, otherwise a custom value MAY be used. + +| Value | Description | +|---|---| +| `_OTHER` | A fallback error value to be used when the instrumentation doesn't define a custom value. 
| + + +[DocumentStatus]: https://github.com/open-telemetry/opentelemetry-specification/tree/v1.22.0/specification/document-status.md + +## Image generation + +### Metric: `openai.image_generations.duration` + +**Status**: [Experimental][DocumentStatus] + +This metric is required. + +This metric SHOULD be specified with +[`ExplicitBucketBoundaries`](https://github.com/open-telemetry/opentelemetry-specification/tree/v1.22.0/specification/metrics/api.md#instrument-advice) +of `[ 0, 0.005, 0.01, 0.025, 0.05, 0.075, 0.1, 0.25, 0.5, 0.75, 1, 2.5, 5, 7.5, 10 ]`. + + +| Name | Instrument Type | Unit (UCUM) | Description | +| -------- | --------------- | ----------- | -------------- | +| `llm.openai.image_generations.duration` | Histogram | `s` | Duration of image generations operation | + + + +| Attribute | Type | Description | Examples | Requirement Level | +|---|---|---|---|---| +| [`error.type`](../attributes-registry/error.md) | string | Describes a class of error the operation ended with. [1] | `timeout`; `java.net.UnknownHostException`; `server_certificate_invalid`; `500` | Recommended | +| [`llm.response.model`](../attributes-registry/llm.md) | string | The name of the LLM a response is being made to. | `gpt-4-0613` | Required | +| [`server.address`](../attributes-registry/server.md) | string | Server domain name if available without reverse DNS lookup; otherwise, IP address or Unix domain socket name. [2] | `example.com`; `10.1.2.80`; `/tmp/my.sock` | Required | + +**[1]:** The `error.type` SHOULD be predictable and SHOULD have low cardinality. +Instrumentations SHOULD document the list of errors they report. + +The cardinality of `error.type` within one instrumentation library SHOULD be low. +Telemetry consumers that aggregate data from multiple instrumentation libraries and applications +should be prepared for `error.type` to have high cardinality at query time when no +additional filters are applied. + +If the operation has completed successfully, instrumentations SHOULD NOT set `error.type`. + +If a specific domain defines its own set of error identifiers (such as HTTP or gRPC status codes), +it's RECOMMENDED to: + +* Use a domain-specific attribute +* Set `error.type` to capture all errors, regardless of whether they are defined within the domain-specific set or not. + +**[2]:** When observed from the client side, and when communicating through an intermediary, `server.address` SHOULD represent the server address behind any intermediaries, for example proxies, if it's available. + +`error.type` has the following list of well-known values. If one of them applies, then the respective value MUST be used, otherwise a custom value MAY be used. + +| Value | Description | +|---|---| +| `_OTHER` | A fallback error value to be used when the instrumentation doesn't define a custom value. | + + +[DocumentStatus]: https://github.com/open-telemetry/opentelemetry-specification/tree/v1.22.0/specification/document-status.md \ No newline at end of file diff --git a/model/metrics/llm-metrics.yaml b/model/metrics/llm-metrics.yaml new file mode 100644 index 0000000000..2ca1ff3b41 --- /dev/null +++ b/model/metrics/llm-metrics.yaml @@ -0,0 +1,109 @@ +groups: + - id: metric.openai.chat_completions.tokens + type: metric + metric_name: llm.openai.chat_completions.tokens + brief: "Number of tokens used in prompt and completions." 
+ instrument: counter + unit: "token" + stability: experimental + attributes: + - ref: llm.response.model + requirement_level: required + - ref: error.type + requirement_level: + conditionally_required: "if the operation ended in error" + - ref: llm.usage.token_type + - ref: server.address + requirement_level: required + - id: metric.openai.chat_completions.choices + type: metric + metric_name: llm.openai.chat_completions.choices + brief: "Number of choices returned by chat completions call" + instrument: counter + unit: "choice" + stability: experimental + attributes: + - ref: llm.response.model + requirement_level: required + - ref: error.type + requirement_level: + conditionally_required: "if the operation ended in error" + - ref: llm.response.finish_reason + - ref: server.address + requirement_level: required + - id: metric.openai.chat_completions.duration + type: metric + metric_name: llm.openai.chat_completions.duration + brief: "Duration of chat completion operation" + instrument: histogram + unit: 's' + stability: experimental + attributes: + - ref: llm.response.model + requirement_level: required + - ref: error.type + requirement_level: + conditionally_required: "if the operation ended in error" + - ref: llm.response.finish_reason + - ref: server.address + requirement_level: required + - id: metric.openai.embeddings.tokens + type: metric + metric_name: llm.openai.embeddings.tokens + brief: "Number of tokens used in prompt and completions." + instrument: counter + unit: "token" + stability: experimental + attributes: + - ref: llm.response.model + requirement_level: required + - ref: error.type + requirement_level: + conditionally_required: "if the operation ended in error" + - ref: llm.usage.token_type + - ref: server.address + requirement_level: required + - id: metric.openai.embeddings.vector_size + type: metric + metric_name: llm.openai.embeddings.vector_size + brief: "he size of returned vector." + instrument: counter + unit: "element" + stability: experimental + attributes: + - ref: llm.response.model + requirement_level: required + - ref: error.type + requirement_level: + conditionally_required: "if the operation ended in error" + - ref: server.address + requirement_level: required + - id: metric.openai.embeddings.duration + type: metric + metric_name: llm.openai.embeddings.duration + brief: "Duration of embeddings operation" + instrument: histogram + unit: 's' + stability: experimental + attributes: + - ref: llm.response.model + requirement_level: required + - ref: error.type + requirement_level: + conditionally_required: "if the operation ended in error" + - ref: server.address + requirement_level: required + - id: metric.openai.image_generations.duration + type: metric + metric_name: llm.openai.image_generations.duration + brief: "Duration of image generations operation" + instrument: histogram + unit: 's' + stability: experimental + attributes: + - ref: llm.response.model + requirement_level: required + - ref: error.type + conditionally_required: "if the operation ended in error" + - ref: server.address + requirement_level: required \ No newline at end of file diff --git a/model/registry/llm.yaml b/model/registry/llm.yaml index d45bad3368..1f59626ef4 100644 --- a/model/registry/llm.yaml +++ b/model/registry/llm.yaml @@ -55,6 +55,15 @@ groups: brief: The reason the model stopped generating tokens. 
examples: ['stop'] tag: llm-generic-response + - id: usage.token_type + type: + members: + - id: prompt + value: 'prompt' + - id: completion + value: 'completion' + brief: The type of token. + examples: ['prompt'] - id: usage.prompt_tokens type: int brief: The number of tokens used in the LLM prompt. From 0ef1c1b190a811520b0ce332ef5e5c4e1847e0f0 Mon Sep 17 00:00:00 2001 From: Drew Robbins Date: Mon, 29 Jan 2024 02:16:02 +0000 Subject: [PATCH 06/36] Fix linting errors --- docs/ai/README.md | 2 +- docs/ai/llm-spans.md | 7 ++++--- docs/ai/openai-metrics.md | 5 +---- docs/ai/openai.md | 26 +++++++++++++------------- 4 files changed, 19 insertions(+), 21 deletions(-) diff --git a/docs/ai/README.md b/docs/ai/README.md index bf83b94856..d5d51dcd75 100644 --- a/docs/ai/README.md +++ b/docs/ai/README.md @@ -22,4 +22,4 @@ Technology specific semantic conventions are defined for the following LLM provi * [OpenAI](openai.md): Semantic Conventions for *OpenAI* spans. * [OpenAI Metrics](openai-metrics.md): Semantic Conventions for *OpenAI* metrics. -[DocumentStatus]: https://github.com/open-telemetry/opentelemetry-specification/tree/v1.26.0/specification/document-status.md \ No newline at end of file +[DocumentStatus]: https://github.com/open-telemetry/opentelemetry-specification/tree/v1.26.0/specification/document-status.md diff --git a/docs/ai/llm-spans.md b/docs/ai/llm-spans.md index 19c4162321..12884056f9 100644 --- a/docs/ai/llm-spans.md +++ b/docs/ai/llm-spans.md @@ -10,9 +10,10 @@ linkTitle: LLM Calls -- [LLM Request attributes](#llm-request-attributes) - [Configuration](#configuration) -- [Semantic Conventions for specific LLM technologies](#semantic-conventions-for-specific-llm-technologies) +- [LLM Request attributes](#llm-request-attributes) +- [LLM Response attributes](#llm-response-attributes) +- [Events](#events) @@ -96,4 +97,4 @@ In the lifetime of an LLM span, an event for prompts sent and completions receiv | `llm.completion` | string | The full response string from an LLM. If the LLM responds with a more complex output like a JSON object made up of several pieces (such as OpenAI's message choices), this field is the content of the response. If the LLM produces multiple responses, then this field is left blank, and each response is instead captured in an event determined by the specific LLM technology semantic convention.| `Why did the developer stop using OpenTelemetry? Because they couldn't trace their steps!` | Recommended | -[DocumentStatus]: https://github.com/open-telemetry/opentelemetry-specification/tree/v1.22.0/specification/document-status.md \ No newline at end of file +[DocumentStatus]: https://github.com/open-telemetry/opentelemetry-specification/tree/v1.22.0/specification/document-status.md diff --git a/docs/ai/openai-metrics.md b/docs/ai/openai-metrics.md index 5b231da602..656148318a 100644 --- a/docs/ai/openai-metrics.md +++ b/docs/ai/openai-metrics.md @@ -124,7 +124,6 @@ it's RECOMMENDED to: | `_OTHER` | A fallback error value to be used when the instrumentation doesn't define a custom value. | - ### Metric: `openai.chat_completions.duration` **Status**: [Experimental][DocumentStatus] @@ -320,8 +319,6 @@ it's RECOMMENDED to: | `_OTHER` | A fallback error value to be used when the instrumentation doesn't define a custom value. 
| -[DocumentStatus]: https://github.com/open-telemetry/opentelemetry-specification/tree/v1.22.0/specification/document-status.md - ## Image generation ### Metric: `openai.image_generations.duration` @@ -372,4 +369,4 @@ it's RECOMMENDED to: | `_OTHER` | A fallback error value to be used when the instrumentation doesn't define a custom value. | -[DocumentStatus]: https://github.com/open-telemetry/opentelemetry-specification/tree/v1.22.0/specification/document-status.md \ No newline at end of file +[DocumentStatus]: https://github.com/open-telemetry/opentelemetry-specification/tree/v1.22.0/specification/document-status.md diff --git a/docs/ai/openai.md b/docs/ai/openai.md index 4c7acf404a..8105be0f29 100644 --- a/docs/ai/openai.md +++ b/docs/ai/openai.md @@ -6,22 +6,22 @@ linkTitle: OpenAI **Status**: [Experimental][DocumentStatus] -This document outlines the Semantic Conventions specific to -[OpenAI](https://platform.openai.com/) spans, extending the general semantics -found in the [LLM Semantic Conventions](llm-spans.md). These conventions are -designed to standardize telemetry data for OpenAI interactions, particularly -focusing on the `/chat/completions` endpoint. By following to these guidelines, +This document outlines the Semantic Conventions specific to +[OpenAI](https://platform.openai.com/) spans, extending the general semantics +found in the [LLM Semantic Conventions](llm-spans.md). These conventions are +designed to standardize telemetry data for OpenAI interactions, particularly +focusing on the `/chat/completions` endpoint. By following to these guidelines, developers can ensure consistent, meaningful, and easily interpretable telemetry data across different applications and platforms. ## Chat Completions -The span name for OpenAI chat completions SHOULD be `openai.chat` +The span name for OpenAI chat completions SHOULD be `openai.chat` to maintain consistency and clarity in telemetry data. ## Request Attributes -These are the attributes when instrumenting OpenAI LLM requests with the +These are the attributes when instrumenting OpenAI LLM requests with the `/chat/completions` endpoint. @@ -67,7 +67,7 @@ Because OpenAI uses a more complex prompt structure, these events will be used i ### Prompt Events -Prompt event name SHOULD be `llm.openai.prompt`. +Prompt event name SHOULD be `llm.openai.prompt`. | Attribute | Type | Description | Examples | Requirement Level | @@ -87,15 +87,15 @@ Tools event name SHOULD be `llm.openai.tool`, specifying potential tools or func | `type` | string | They type of the tool. Currently, only `function` is supported. | `function` | Required | | `function.name` | string | The name of the function to be called. | `get_weather` | Required ! | `function.description` | string | A description of what the function does, used by the model to choose when and how to call the function. | `` | Required | -| `function.parameters` | string | JSON-encoded string of the parameter object for the function. | `{"type": "object", "properties": {}}` | Required | +| `function.parameters` | string | JSON-encoded string of the parameter object for the function. | `{"type": "object", "properties": {}}` | Required | ### Choice Events -Recording details about Choices in each response MAY be included as -Span Events. +Recording details about Choices in each response MAY be included as +Span Events. -Choice event name SHOULD be `llm.openai.choice`. +Choice event name SHOULD be `llm.openai.choice`. If there is more than one `tool_call`, separate events SHOULD be used. 
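As an illustrative sketch only (not part of this change), an instrumentation might attach one event per returned choice to the enclosing `openai.chat` span roughly as follows. The event and attribute names mirror the wording above but are renamed in later commits of this series, and the scope name and response data are hypothetical.

```python
# Sketch: add one choice event per returned choice to the `openai.chat` span.
from opentelemetry import trace

tracer = trace.get_tracer("example.openai.instrumentation")  # hypothetical scope name


def record_choice_events(span, choices):
    """Emit one `llm.openai.choice` event for each choice in a chat completion response."""
    for choice in choices:
        message = choice.get("message", {})
        span.add_event(
            "llm.openai.choice",
            attributes={
                "type": "message",
                "finish_reason": choice.get("finish_reason", ""),
                "role": message.get("role", ""),
                "content": message.get("content", ""),
            },
        )


with tracer.start_as_current_span("openai.chat") as span:
    # Hypothetical parsed `choices` array from an OpenAI chat completion response.
    choices = [
        {"finish_reason": "stop",
         "message": {"role": "assistant", "content": "Hello from the sketch!"}}
    ]
    record_choice_events(span, choices)
```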
@@ -111,4 +111,4 @@ If there is more than one `tool_call`, separate events SHOULD be used. | `tool_call.function.arguments` | string | If exists, the arguments to call a function call with for a given OpenAI response, denoted by ``. The value for `` starts with 0, where 0 is the first message. | `{"type": "object", "properties": {"some":"data"}}` | Required | -[DocumentStatus]: https://github.com/open-telemetry/opentelemetry-specification/tree/v1.22.0/specification/document-status.md \ No newline at end of file +[DocumentStatus]: https://github.com/open-telemetry/opentelemetry-specification/tree/v1.22.0/specification/document-status.md From fd57c6cf6fb1fd05c02591e26d38654d84532a69 Mon Sep 17 00:00:00 2001 From: Drew Robbins Date: Mon, 29 Jan 2024 02:27:49 +0000 Subject: [PATCH 07/36] Fix yamllint errors --- model/metrics/llm-metrics.yaml | 6 ++--- model/registry/llm.yaml | 6 ++--- model/trace/llm.yaml | 40 +++++++++++++++++++++++----------- 3 files changed, 33 insertions(+), 19 deletions(-) diff --git a/model/metrics/llm-metrics.yaml b/model/metrics/llm-metrics.yaml index 2ca1ff3b41..75db1e31ff 100644 --- a/model/metrics/llm-metrics.yaml +++ b/model/metrics/llm-metrics.yaml @@ -102,8 +102,8 @@ groups: stability: experimental attributes: - ref: llm.response.model - requirement_level: required - - ref: error.type + requirement_level: conditionally_required: "if the operation ended in error" + - ref: error.type - ref: server.address - requirement_level: required \ No newline at end of file + requirement_level: required diff --git a/model/registry/llm.yaml b/model/registry/llm.yaml index 1f59626ef4..31bf953b94 100644 --- a/model/registry/llm.yaml +++ b/model/registry/llm.yaml @@ -56,7 +56,7 @@ groups: examples: ['stop'] tag: llm-generic-response - id: usage.token_type - type: + type: members: - id: prompt value: 'prompt' @@ -183,7 +183,7 @@ groups: tag: tech-specific-openai-events - id: openai.function.arguments type: string - brief: If exists, the arguments to call a function call with for a given OpenAI response, denoted by ``. The value for `` starts with 0, where 0 is the first message. + brief: If exists, the arguments to call a function call with for a given OpenAI response, denoted by ``. The value for `` starts with 0, where 0 is the first message. examples: '{"type": "object", "properties": {"some":"data"}}' tag: tech-specific-openai-events - id: openai.choice.type @@ -195,4 +195,4 @@ groups: value: 'message' brief: The type of the choice, either `delta` or `message`. examples: 'message' - tag: tech-specific-openai-events \ No newline at end of file + tag: tech-specific-openai-events diff --git a/model/trace/llm.yaml b/model/trace/llm.yaml index 17fe1e709f..1c844732b0 100644 --- a/model/trace/llm.yaml +++ b/model/trace/llm.yaml @@ -11,7 +11,9 @@ groups: - ref: llm.request.model requirement_level: required note: > - The name of the LLM a request is being made to. If the LLM is supplied by a vendor, then the value must be the exact name of the model requested. If the LLM is a fine-tuned custom model, the value should have a more specific name than the base model that's been fine-tuned. + The name of the LLM a request is being made to. If the LLM is supplied by a vendor, + then the value must be the exact name of the model requested. If the LLM is a fine-tuned + custom model, the value should have a more specific name than the base model that's been fine-tuned. 
- ref: llm.request.max_tokens requirement_level: recommended - ref: llm.request.temperature @@ -27,7 +29,9 @@ groups: - ref: llm.response.model requirement_level: required note: > - The name of the LLM a response is being made to. If the LLM is supplied by a vendor, then the value must be the exact name of the model actually used. If the LLM is a fine-tuned custom model, the value should have a more specific name than the base model that's been fine-tuned. + The name of the LLM a response is being made to. If the LLM is supplied by a vendor, + then the value must be the exact name of the model actually used. If the LLM is a + fine-tuned custom model, the value should have a more specific name than the base model that's been fine-tuned. - ref: llm.response.finish_reason requirement_level: recommended - ref: llm.usage.prompt_tokens @@ -44,13 +48,16 @@ groups: name: llm.content.prompt type: event brief: > - In the lifetime of an LLM span, events for prompts sent and completions received may be created, depending on the configuration of the instrumentation. + In the lifetime of an LLM span, events for prompts sent and completions received + may be created, depending on the configuration of the instrumentation. attributes: - ref: llm.prompt requirement_level: recommended note: > - The full prompt string sent to an LLM in a request. If the LLM accepts a more complex input like a JSON object, this field is blank, and the response is instead captured in an event determined by the specific LLM technology semantic convention. - + The full prompt string sent to an LLM in a request. If the LLM accepts a more + complex input like a JSON object, this field is blank, and the response is + instead captured in an event determined by the specific LLM technology semantic convention. + - id: llm.content.completion name: llm.content.completion type: event @@ -60,7 +67,11 @@ groups: - ref: llm.completion requirement_level: recommended note: > - The full response string from an LLM. If the LLM responds with a more complex output like a JSON object made up of several pieces (such as OpenAI's message choices), this field is the content of the response. If the LLM produces multiple responses, then this field is left blank, and each response is instead captured in an event determined by the specific LLM technology semantic convention. + The full response string from an LLM. If the LLM responds with a more + complex output like a JSON object made up of several pieces (such as OpenAI's message choices), + this field is the content of the response. If the LLM produces multiple responses, then this + field is left blank, and each response is instead captured in an event determined by the specific + LLM technology semantic convention. - id: llm.openai type: span @@ -74,7 +85,10 @@ groups: - ref: llm.request.model requirement_level: required note: > - The name of the LLM a request is being made to. If the LLM is supplied by a vendor, then the value must be the exact name of the model requested. If the LLM is a fine-tuned custom model, the value should have a more specific name than the base model that's been fine-tuned. + The name of the LLM a request is being made to. If the LLM is supplied by a + vendor, then the value must be the exact name of the model requested. If the + LLM is a fine-tuned custom model, the value should have a more specific name + than the base model that's been fine-tuned. 
tag: tech-specific-openai-request - ref: llm.request.max_tokens tag: tech-specific-openai-request @@ -126,7 +140,7 @@ groups: - ref: llm.openai.content requirement_level: required - ref: llm.openai.tool_call.id - requirement_level: + requirement_level: conditionally_required: > Required if the prompt role is `tool`. @@ -159,18 +173,18 @@ groups: - ref: llm.openai.content requirement_level: required - ref: llm.openai.tool_call.id - requirement_level: + requirement_level: conditionally_required: > Required if the choice is the result of a tool call. - ref: llm.openai.tool.type - requirement_level: + requirement_level: conditionally_required: > Required if the choice is the result of a tool call. - ref: llm.openai.function.name - requirement_level: + requirement_level: conditionally_required: > Required if the choice is the result of a tool call of type `function`. - ref: llm.openai.function.arguments - requirement_level: + requirement_level: conditionally_required: > - Required if the choice is the result of a tool call of type `function`. \ No newline at end of file + Required if the choice is the result of a tool call of type `function`. From c80b80c329faaf2b17cf74ec04bb4b5b1c5b5b07 Mon Sep 17 00:00:00 2001 From: Drew Robbins Date: Mon, 29 Jan 2024 05:04:35 +0000 Subject: [PATCH 08/36] Regenerate markdown based on yaml model --- docs/ai/llm-spans.md | 80 ++++++++++------------ docs/ai/openai-metrics.md | 2 +- docs/ai/openai.md | 113 +++++++++++++++++--------------- docs/attributes-registry/llm.md | 6 +- 4 files changed, 96 insertions(+), 105 deletions(-) diff --git a/docs/ai/llm-spans.md b/docs/ai/llm-spans.md index 12884056f9..894d464786 100644 --- a/docs/ai/llm-spans.md +++ b/docs/ai/llm-spans.md @@ -12,7 +12,6 @@ linkTitle: LLM Calls - [Configuration](#configuration) - [LLM Request attributes](#llm-request-attributes) -- [LLM Response attributes](#llm-response-attributes) - [Events](#events) @@ -36,65 +35,52 @@ By default, these configurations SHOULD NOT capture prompts and completions. These attributes track input data and metadata for a request to an LLM. Each attribute represents a concept that is common to most LLMs. - + | Attribute | Type | Description | Examples | Requirement Level | |---|---|---|---|---| -| `llm.vendor` | string | The name of the LLM foundation model vendor, if applicable. If not using a vendor-supplied model, this field is left blank. | `openai` | Recommended | -| `llm.request.model` | string | The name of the LLM a request is being made to. If the LLM is supplied by a vendor, then the value must be the exact name of the model requested. If the LLM is a fine-tuned custom model, the value SHOULD have a more specific name than the base model that's been fine-tuned. | `gpt-4` | Required | -| `llm.request.max_tokens` | int | The maximum number of tokens the LLM generates for a request. | `100` | Recommended | -| `llm.temperature` | float | The temperature setting for the LLM request. | `0.0` | Recommended | -| `llm.top_p` | float | The top_p sampling setting for the LLM request. | `1.0` | Recommended | -| `llm.stream` | bool | Whether the LLM responds with a stream. | `false` | Recommended | -| `llm.stop_sequences` | array | Array of strings the LLM uses as a stop sequence. | `["stop1"]` | Recommended | - -`llm.model` has the following list of well-known values. If one of them applies, then the respective value MUST be used, otherwise a custom value MAY be used. 
- -| Value | Description | -|---|---| -| `gpt-4` | GPT-4 | -| `gpt-4-32k` | GPT-4 with 32k context window | -| `gpt-3.5-turbo` | GPT-3.5-turbo | -| `gpt-3.5-turbo-16k` | GPT-3.5-turbo with 16k context window| -| `claude-instant-1` | Claude Instant (latest version) | -| `claude-2` | Claude 2 (latest version) | -| `other-llm` | Any LLM not listed in this table. Use for any fine-tuned version of a model. | +| [`llm.request.is_stream`](../attributes-registry/llm.md) | boolean | Whether the LLM responds with a stream. | `False` | Recommended | +| [`llm.request.max_tokens`](../attributes-registry/llm.md) | int | The maximum number of tokens the LLM generates for a request. | `100` | Recommended | +| [`llm.request.model`](../attributes-registry/llm.md) | string | The name of the LLM a request is being made to. [1] | `gpt-4` | Required | +| [`llm.request.stop_sequences`](../attributes-registry/llm.md) | string | Array of strings the LLM uses as a stop sequence. | `stop1` | Recommended | +| [`llm.request.temperature`](../attributes-registry/llm.md) | double | The temperature setting for the LLM request. | `0.0` | Recommended | +| [`llm.request.top_p`](../attributes-registry/llm.md) | double | The top_p sampling setting for the LLM request. | `1.0` | Recommended | +| [`llm.response.finish_reason`](../attributes-registry/llm.md) | string | The reason the model stopped generating tokens. | `stop` | Recommended | +| [`llm.response.id`](../attributes-registry/llm.md) | string[] | The unique identifier for the completion. | `[chatcmpl-123]` | Recommended | +| [`llm.response.model`](../attributes-registry/llm.md) | string | The name of the LLM a response is being made to. [2] | `gpt-4-0613` | Required | +| [`llm.system`](../attributes-registry/llm.md) | string | The name of the LLM foundation model vendor, if applicable. [3] | `openai` | Recommended | +| [`llm.usage.completion_tokens`](../attributes-registry/llm.md) | int | The number of tokens used in the LLM response (completion). | `180` | Recommended | +| [`llm.usage.prompt_tokens`](../attributes-registry/llm.md) | int | The number of tokens used in the LLM prompt. | `100` | Recommended | +| [`llm.usage.total_tokens`](../attributes-registry/llm.md) | int | The total number of tokens used in the LLM prompt and response. | `280` | Recommended | + +**[1]:** The name of the LLM a request is being made to. If the LLM is supplied by a vendor, then the value must be the exact name of the model requested. If the LLM is a fine-tuned custom model, the value should have a more specific name than the base model that's been fine-tuned. + +**[2]:** The name of the LLM a response is being made to. If the LLM is supplied by a vendor, then the value must be the exact name of the model actually used. If the LLM is a fine-tuned custom model, the value should have a more specific name than the base model that's been fine-tuned. + +**[3]:** The name of the LLM foundation model vendor, if applicable. If not using a vendor-supplied model, this field is left blank. -## LLM Response attributes +## Events + +In the lifetime of an LLM span, an event for prompts sent and completions received MAY be created, depending on the configuration of the instrumentation. -These attributes track output data and metadata for a response from an LLM. Each attribute represents a concept that is common to most LLMs. + +The event name MUST be `llm.content.prompt`. 
- | Attribute | Type | Description | Examples | Requirement Level | |---|---|---|---|---| -| `llm.response.id` | string | The unique identifier for the completion. | `chatcmpl-123` | Recommended | -| `llm.response.model` | string | The name of the LLM a response is being made to. If the LLM is supplied by a vendor, then the value must be the exact name of the model actually used. If the LLM is a fine-tuned custom model, the value SHOULD have a more specific name than the base model that's been fine-tuned. | `gpt-4-0613` | Required | -| `llm.response.finish_reason` | string | The reason the model stopped generating tokens | `stop` | Recommended | -| `llm.usage.prompt_tokens` | int | The number of tokens used in the LLM prompt. | `100` | Recommended | -| `llm.usage.completion_tokens` | int | The number of tokens used in the LLM response (completion). | `180` | Recommended | -| `llm.usage.total_tokens` | int | The total number of tokens used in the LLM prompt and response. | `280` | Recommended | - -`llm.response.finish_reason` MUST be one of the following: - -| Value | Description | -|---|---| -| `stop` | If the model hit a natural stop point or a provided stop sequence. | -| `max_tokens` | If the maximum number of tokens specified in the request was reached. | -| `tool_call` | If a function / tool call was made by the model (for models that support such functionality). | - +| [`llm.prompt`](../attributes-registry/llm.md) | string | The full prompt string sent to an LLM in a request. [1] | `\\n\\nHuman:You are an AI assistant that tells jokes. Can you tell me a joke about OpenTelemetry?\\n\\nAssistant:` | Recommended | -## Events +**[1]:** The full prompt string sent to an LLM in a request. If the LLM accepts a more complex input like a JSON object, this field is blank, and the response is instead captured in an event determined by the specific LLM technology semantic convention. + -In the lifetime of an LLM span, an event for prompts sent and completions received MAY be created, depending on the configuration of the instrumentation. + +The event name MUST be `llm.content.completion`. - | Attribute | Type | Description | Examples | Requirement Level | -| `llm.prompt` | string | The full prompt string sent to an LLM in a request. If the LLM accepts a more complex input like a JSON object made up of several pieces (such as OpenAI's different message types), this field is blank, and the response is instead captured in an event determined by the specific LLM technology semantic convention. | `\n\nHuman:You are an AI assistant that tells jokes. Can you tell me a joke about OpenTelemetry?\n\nAssistant:` | Recommended | - +|---|---|---|---|---| +| [`llm.completion`](../attributes-registry/llm.md) | string | The full response string from an LLM in a response. [1] | `Why did the developer stop using OpenTelemetry? Because they couldnt trace their steps!` | Recommended | - -| Attribute | Type | Description | Examples | Requirement Level | -| `llm.completion` | string | The full response string from an LLM. If the LLM responds with a more complex output like a JSON object made up of several pieces (such as OpenAI's message choices), this field is the content of the response. If the LLM produces multiple responses, then this field is left blank, and each response is instead captured in an event determined by the specific LLM technology semantic convention.| `Why did the developer stop using OpenTelemetry? 
Because they couldn't trace their steps!` | Recommended | +**[1]:** The full response string from an LLM. If the LLM responds with a more complex output like a JSON object made up of several pieces (such as OpenAI's message choices), this field is the content of the response. If the LLM produces multiple responses, then this field is left blank, and each response is instead captured in an event determined by the specific LLM technology semantic convention. [DocumentStatus]: https://github.com/open-telemetry/opentelemetry-specification/tree/v1.22.0/specification/document-status.md diff --git a/docs/ai/openai-metrics.md b/docs/ai/openai-metrics.md index 656148318a..bf39882d9e 100644 --- a/docs/ai/openai-metrics.md +++ b/docs/ai/openai-metrics.md @@ -341,7 +341,7 @@ of `[ 0, 0.005, 0.01, 0.025, 0.05, 0.075, 0.1, 0.25, 0.5, 0.75, 1, 2.5, 5, 7.5, | Attribute | Type | Description | Examples | Requirement Level | |---|---|---|---|---| | [`error.type`](../attributes-registry/error.md) | string | Describes a class of error the operation ended with. [1] | `timeout`; `java.net.UnknownHostException`; `server_certificate_invalid`; `500` | Recommended | -| [`llm.response.model`](../attributes-registry/llm.md) | string | The name of the LLM a response is being made to. | `gpt-4-0613` | Required | +| [`llm.response.model`](../attributes-registry/llm.md) | string | The name of the LLM a response is being made to. | `gpt-4-0613` | Conditionally Required: if the operation ended in error | | [`server.address`](../attributes-registry/server.md) | string | Server domain name if available without reverse DNS lookup; otherwise, IP address or Unix domain socket name. [2] | `example.com`; `10.1.2.80`; `/tmp/my.sock` | Required | **[1]:** The `error.type` SHOULD be predictable and SHOULD have low cardinality. diff --git a/docs/ai/openai.md b/docs/ai/openai.md index 8105be0f29..001d751ce5 100644 --- a/docs/ai/openai.md +++ b/docs/ai/openai.md @@ -24,40 +24,30 @@ to maintain consistency and clarity in telemetry data. These are the attributes when instrumenting OpenAI LLM requests with the `/chat/completions` endpoint. - + | Attribute | Type | Description | Examples | Requirement Level | |---|---|---|---|---| -| `llm.vendor` | string | The name of the LLM foundation model vendor, if applicable. If not using a vendor-supplied model, this field is left blank. | `openai` | Recommended | -| `llm.request.model` | string | The name of the LLM a request is being made to. If the LLM is supplied by a vendor, then the value must be the exact name of the model requested. If the LLM is a fine-tuned custom model, the value SHOULD have a more specific name than the base model that's been fine-tuned. | `gpt-4` | Required | -| `llm.request.max_tokens` | int | The maximum number of tokens the LLM generates for a request. | `100` | Recommended | -| `llm.temperature` | float | The temperature setting for the LLM request. | `0.0` | Recommended | -| `llm.top_p` | float | The top_p sampling setting for the LLM request. | `1.0` | Recommended | -| `llm.stream` | bool | Whether the LLM responds with a stream. | `false` | Recommended | -| `llm.stop_sequences` | array | Array of strings the LLM uses as a stop sequence. | `["stop1"]` | Recommended | -| `llm.openai.n` | integer | The number of completions to generate. | `1` | Recommended | -| `llm.openai.presence_penalty` | float | If present, the `presence_penalty` used in an OpenAI request. Value is between -2.0 and 2.0. 
| `-0.5` | Recommended | -| `llm.openai.frequency_penalty` | float | If present, the `frequency_penalty` used in an OpenAI request. Value is between -2.0 and 2.0. | `-0.5` | Recommended | -| `llm.openai.logit_bias` | string | If present, the JSON-encoded string of a `logit_bias` used in an OpenAI request. | `{2435:-100, 640:-100}` | Recommended | -| `llm.openai.user` | string | If present, the `user` used in an OpenAI request. | `bob` | Opt-in | -| `llm.openai.response_format` | string | An object specifying the format that the model must output. Either `text` or `json_object` | `text` | Recommended | -| `llm.openai.seed` | integer | Seed used in request to improve determinism. | `1234` | Recommended | - - -## Response attributes - -Attributes for chat completion responses SHOULD follow these conventions: - - -| Attribute | Type | Description | Examples | Requirement Level | -|---|---|---|---|---| -| `llm.response.id` | string | The unique identifier for the completion. | `chatcmpl-123` | Recommended | -| `llm.response.model` | string | The name of the LLM a response is being made to. If the LLM is supplied by a vendor, then the value must be the exact name of the model actually used. If the LLM is a fine-tuned custom model, the value SHOULD have a more specific name than the base model that's been fine-tuned. | `gpt-4-0613` | Required | -| `llm.response.finish_reason` | string | The reason the model stopped generating tokens | `stop` | Recommended | -| `llm.usage.prompt_tokens` | int | The number of tokens used in the LLM prompt. | `100` | Recommended | -| `llm.usage.completion_tokens` | int | The number of tokens used in the LLM response (completion). | `180` | Recommended | -| `llm.usage.total_tokens` | int | The total number of tokens used in the LLM prompt and response. | `280` | Recommended | -| `llm.openai.created` | int | The UNIX timestamp (in seconds) if when the completion was created. | `1677652288` | Recommended | -| `llm.openai.system_fingerprint` | string | This fingerprint represents the backend configuration that the model runs with. | asdf987123 | Recommended | +| [`llm.request.is_stream`](../attributes-registry/llm.md) | boolean | Whether the LLM responds with a stream. | `False` | Recommended | +| [`llm.request.max_tokens`](../attributes-registry/llm.md) | int | The maximum number of tokens the LLM generates for a request. | `100` | Recommended | +| [`llm.request.model`](../attributes-registry/llm.md) | string | The name of the LLM a request is being made to. [1] | `gpt-4` | Required | +| [`llm.request.openai.logit_bias`](../attributes-registry/llm.md) | string | If present, the JSON-encoded string of a `logit_bias` used in an OpenAI request | `{2435:-100, 640:-100}` | Recommended | +| [`llm.request.openai.presence_penalty`](../attributes-registry/llm.md) | double | If present, the `presence_penalty` used in an OpenAI request. Value is between -2.0 and 2.0. | `-0.5` | Recommended | +| [`llm.request.openai.response_format`](../attributes-registry/llm.md) | string | An object specifying the format that the model must output. Either `text` or `json_object` | `text` | Recommended | +| [`llm.request.openai.seed`](../attributes-registry/llm.md) | int | Seed used in request to improve determinism. | `1234` | Recommended | +| [`llm.request.openai.user`](../attributes-registry/llm.md) | string | If present, the `user` used in an OpenAI request. 
| `bob` | Recommended | +| [`llm.request.stop_sequences`](../attributes-registry/llm.md) | string | Array of strings the LLM uses as a stop sequence. | `stop1` | Recommended | +| [`llm.request.temperature`](../attributes-registry/llm.md) | double | The temperature setting for the LLM request. | `0.0` | Recommended | +| [`llm.request.top_p`](../attributes-registry/llm.md) | double | The top_p sampling setting for the LLM request. | `1.0` | Recommended | +| [`llm.response.finish_reason`](../attributes-registry/llm.md) | string | The reason the model stopped generating tokens. | `stop` | Recommended | +| [`llm.response.id`](../attributes-registry/llm.md) | string[] | The unique identifier for the completion. | `[chatcmpl-123]` | Recommended | +| [`llm.response.openai.created`](../attributes-registry/llm.md) | int | The UNIX timestamp (in seconds) if when the completion was created. | `1677652288` | Recommended | +| [`llm.response.openai.system_fingerprint`](../attributes-registry/llm.md) | string | This fingerprint represents the backend configuration that the model runs with. | `asdf987123` | Recommended | +| [`llm.system`](../attributes-registry/llm.md) | string | The name of the LLM foundation model vendor, if applicable. | `openai`; `microsoft` | Recommended | +| [`llm.usage.completion_tokens`](../attributes-registry/llm.md) | int | The number of tokens used in the LLM response (completion). | `180` | Recommended | +| [`llm.usage.prompt_tokens`](../attributes-registry/llm.md) | int | The number of tokens used in the LLM prompt. | `100` | Recommended | +| [`llm.usage.total_tokens`](../attributes-registry/llm.md) | int | The total number of tokens used in the LLM prompt and response. | `280` | Recommended | + +**[1]:** The name of the LLM a request is being made to. If the LLM is supplied by a vendor, then the value must be the exact name of the model requested. If the LLM is a fine-tuned custom model, the value should have a more specific name than the base model that's been fine-tuned. ## Request Events @@ -67,27 +57,31 @@ Because OpenAI uses a more complex prompt structure, these events will be used i ### Prompt Events -Prompt event name SHOULD be `llm.openai.prompt`. +Prompt event name SHOULD be `llm.content.openai.prompt`. + + +The event name MUST be `llm.content.openai.prompt`. - | Attribute | Type | Description | Examples | Requirement Level | |---|---|---|---|---| -| `role` | string | The role of the prompt author, can be one of `system`, `user`, `assistant`, or `tool` | `system` | Required | -| `content` | string | The content for a given OpenAI response, denoted by ``. The value for `` starts with 0, where 0 is the first message. | `Why did the developer stop using OpenTelemetry? Because they couldn't trace their steps!` | Required | -| `tool_call_id` | string | If role is `tool` or `function`, then this tool call that this message is responding to. | `get_current_weather` | Conditionally Required: If `role` is `tool`. | +| [`llm.openai.content`](../attributes-registry/llm.md) | string | The content for a given OpenAI response. | `Why did the developer stop using OpenTelemetry? Because they couldn't trace their steps!` | Required | +| [`llm.openai.role`](../attributes-registry/llm.md) | string | The role of the prompt author, can be one of `system`, `user`, `assistant`, or `tool` | `user` | Required | +| [`llm.openai.tool_call.id`](../attributes-registry/llm.md) | string | If role is `tool` or `function`, then this tool call that this message is responding to. 
| `get_current_weather` | Conditionally Required: Required if the prompt role is `tool`. | ### Tools Events -Tools event name SHOULD be `llm.openai.tool`, specifying potential tools or functions the LLM can use. +Tools event name SHOULD be `llm.content.openai.tool`, specifying potential tools or functions the LLM can use. + + +The event name MUST be `llm.content.openai.tool`. - | Attribute | Type | Description | Examples | Requirement Level | |---|---|---|---|---| -| `type` | string | They type of the tool. Currently, only `function` is supported. | `function` | Required | -| `function.name` | string | The name of the function to be called. | `get_weather` | Required ! -| `function.description` | string | A description of what the function does, used by the model to choose when and how to call the function. | `` | Required | -| `function.parameters` | string | JSON-encoded string of the parameter object for the function. | `{"type": "object", "properties": {}}` | Required | +| [`llm.openai.function.description`](../attributes-registry/llm.md) | string | A description of what the function does, used by the model to choose when and how to call the function. | `Gets the current weather for a location` | Required | +| [`llm.openai.function.name`](../attributes-registry/llm.md) | string | The name of the function to be called. | `get_weather` | Required | +| [`llm.openai.function.parameters`](../attributes-registry/llm.md) | string | JSON-encoded string of the parameter object for the function. | `{"type": "object", "properties": {}}` | Required | +| [`llm.openai.tool.type`](../attributes-registry/llm.md) | string | The type of the tool. Currently, only `function` is supported. | `function` | Required | ### Choice Events @@ -95,20 +89,31 @@ Tools event name SHOULD be `llm.openai.tool`, specifying potential tools or func Recording details about Choices in each response MAY be included as Span Events. -Choice event name SHOULD be `llm.openai.choice`. +Choice event name SHOULD be `llm.content.openai.choice`. + +If there is more than one `choice`, separate events SHOULD be used. -If there is more than one `tool_call`, separate events SHOULD be used. + +The event name MUST be `llm.content.openai.completion.choice`. - -| `type` | string | Either `delta` or `message`. | `message` | Required | +| Attribute | Type | Description | Examples | Requirement Level | |---|---|---|---|---| -| `finish_reason` | string | The reason the OpenAI model stopped generating tokens for this chunk. | `stop` | Recommended | -| `role` | string | The assigned role for a given OpenAI response, denoted by ``. The value for `` starts with 0, where 0 is the first message. | `system` | Required | -| `content` | string | The content for a given OpenAI response, denoted by ``. The value for `` starts with 0, where 0 is the first message. | `Why did the developer stop using OpenTelemetry? Because they couldn't trace their steps!` | Required | -| `tool_call.id` | string | If exists, the ID of the tool call. | `call_BP08xxEhU60txNjnz3z9R4h9` | Required | -| `tool_call.type` | string | Currently only `function` is supported. | `function` | Required | -| `tool_call.function.name` | string | If exists, the name of a function call for a given OpenAI response, denoted by ``. The value for `` starts with 0, where 0 is the first message. | `get_weather_report` | Required | -| `tool_call.function.arguments` | string | If exists, the arguments to call a function call with for a given OpenAI response, denoted by ``. 
The value for `` starts with 0, where 0 is the first message. | `{"type": "object", "properties": {"some":"data"}}` | Required | +| [`llm.openai.choice.type`](../attributes-registry/llm.md) | string | The type of the choice, either `delta` or `message`. | `message` | Required | +| [`llm.openai.content`](../attributes-registry/llm.md) | string | The content for a given OpenAI response. | `Why did the developer stop using OpenTelemetry? Because they couldn't trace their steps!` | Required | +| [`llm.openai.function.arguments`](../attributes-registry/llm.md) | string | If exists, the arguments to call a function call with for a given OpenAI response, denoted by ``. The value for `` starts with 0, where 0 is the first message. | `{"type": "object", "properties": {"some":"data"}}` | Conditionally Required: [1] | +| [`llm.openai.function.name`](../attributes-registry/llm.md) | string | The name of the function to be called. | `get_weather` | Conditionally Required: [2] | +| [`llm.openai.role`](../attributes-registry/llm.md) | string | The role of the prompt author, can be one of `system`, `user`, `assistant`, or `tool` | `user` | Required | +| [`llm.openai.tool.type`](../attributes-registry/llm.md) | string | The type of the tool. Currently, only `function` is supported. | `function` | Conditionally Required: [3] | +| [`llm.openai.tool_call.id`](../attributes-registry/llm.md) | string | If role is `tool` or `function`, then this tool call that this message is responding to. | `get_current_weather` | Conditionally Required: [4] | +| [`llm.response.finish_reason`](../attributes-registry/llm.md) | string | The reason the model stopped generating tokens. | `stop` | Recommended | + +**[1]:** Required if the choice is the result of a tool call of type `function`. + +**[2]:** Required if the choice is the result of a tool call of type `function`. + +**[3]:** Required if the choice is the result of a tool call. + +**[4]:** Required if the choice is the result of a tool call. [DocumentStatus]: https://github.com/open-telemetry/opentelemetry-specification/tree/v1.22.0/specification/document-status.md diff --git a/docs/attributes-registry/llm.md b/docs/attributes-registry/llm.md index 5dfb91d272..e9a66e9021 100644 --- a/docs/attributes-registry/llm.md +++ b/docs/attributes-registry/llm.md @@ -23,13 +23,13 @@ | Attribute | Type | Description | Examples | |---|---|---|---| +| `llm.request.is_stream` | boolean | Whether the LLM responds with a stream. | `False` | | `llm.request.max_tokens` | int | The maximum number of tokens the LLM generates for a request. | `100` | | `llm.request.model` | string | The name of the LLM a request is being made to. | `gpt-4` | | `llm.request.stop_sequences` | string | Array of strings the LLM uses as a stop sequence. | `stop1` | -| `llm.request.stream` | boolean | Whether the LLM responds with a stream. | `False` | | `llm.request.temperature` | double | The temperature setting for the LLM request. | `0.0` | | `llm.request.top_p` | double | The top_p sampling setting for the LLM request. | `1.0` | -| `llm.request.vendor` | string | The name of the LLM foundation model vendor, if applicable. | `openai` | +| `llm.system` | string | The name of the LLM foundation model vendor, if applicable. | `openai` | ### Response Attributes @@ -38,7 +38,7 @@ | Attribute | Type | Description | Examples | |---|---|---|---| | `llm.response.finish_reason` | string | The reason the model stopped generating tokens. | `stop` | -| `llm.response.id` | string | The unique identifier for the completion. 
| `chatcmpl-123` | +| `llm.response.id` | string[] | The unique identifier for the completion. | `[chatcmpl-123]` | | `llm.response.model` | string | The name of the LLM a response is being made to. | `gpt-4-0613` | | `llm.usage.completion_tokens` | int | The number of tokens used in the LLM response (completion). | `180` | | `llm.usage.prompt_tokens` | int | The number of tokens used in the LLM prompt. | `100` | From e1b1d6a16258d879538f11da37faa6d3ca9418da Mon Sep 17 00:00:00 2001 From: Nir Gazit Date: Tue, 19 Mar 2024 13:42:50 +0200 Subject: [PATCH 09/36] minimal set of llm semconv --- docs/ai/README.md | 5 - docs/ai/llm-spans.md | 41 ++-- docs/ai/openai-metrics.md | 372 -------------------------------- docs/ai/openai.md | 119 ---------- docs/attributes-registry/llm.md | 96 ++------- model/metrics/llm-metrics.yaml | 109 ---------- model/registry/llm.yaml | 141 +----------- model/trace/llm.yaml | 173 ++------------- 8 files changed, 59 insertions(+), 997 deletions(-) delete mode 100644 docs/ai/openai-metrics.md delete mode 100644 docs/ai/openai.md delete mode 100644 model/metrics/llm-metrics.yaml diff --git a/docs/ai/README.md b/docs/ai/README.md index d5d51dcd75..31bc5795cd 100644 --- a/docs/ai/README.md +++ b/docs/ai/README.md @@ -17,9 +17,4 @@ Semantic conventions for LLM operations are defined for the following signals: * [LLM Spans](llm-spans.md): Semantic Conventions for LLM requests - *spans*. -Technology specific semantic conventions are defined for the following LLM providers: - -* [OpenAI](openai.md): Semantic Conventions for *OpenAI* spans. -* [OpenAI Metrics](openai-metrics.md): Semantic Conventions for *OpenAI* metrics. - [DocumentStatus]: https://github.com/open-telemetry/opentelemetry-specification/tree/v1.26.0/specification/document-status.md diff --git a/docs/ai/llm-spans.md b/docs/ai/llm-spans.md index 894d464786..767bfc8ffb 100644 --- a/docs/ai/llm-spans.md +++ b/docs/ai/llm-spans.md @@ -35,22 +35,19 @@ By default, these configurations SHOULD NOT capture prompts and completions. These attributes track input data and metadata for a request to an LLM. Each attribute represents a concept that is common to most LLMs. - + | Attribute | Type | Description | Examples | Requirement Level | |---|---|---|---|---| -| [`llm.request.is_stream`](../attributes-registry/llm.md) | boolean | Whether the LLM responds with a stream. | `False` | Recommended | -| [`llm.request.max_tokens`](../attributes-registry/llm.md) | int | The maximum number of tokens the LLM generates for a request. | `100` | Recommended | -| [`llm.request.model`](../attributes-registry/llm.md) | string | The name of the LLM a request is being made to. [1] | `gpt-4` | Required | -| [`llm.request.stop_sequences`](../attributes-registry/llm.md) | string | Array of strings the LLM uses as a stop sequence. | `stop1` | Recommended | -| [`llm.request.temperature`](../attributes-registry/llm.md) | double | The temperature setting for the LLM request. | `0.0` | Recommended | -| [`llm.request.top_p`](../attributes-registry/llm.md) | double | The top_p sampling setting for the LLM request. | `1.0` | Recommended | -| [`llm.response.finish_reason`](../attributes-registry/llm.md) | string | The reason the model stopped generating tokens. | `stop` | Recommended | -| [`llm.response.id`](../attributes-registry/llm.md) | string[] | The unique identifier for the completion. | `[chatcmpl-123]` | Recommended | -| [`llm.response.model`](../attributes-registry/llm.md) | string | The name of the LLM a response is being made to. 
[2] | `gpt-4-0613` | Required | -| [`llm.system`](../attributes-registry/llm.md) | string | The name of the LLM foundation model vendor, if applicable. [3] | `openai` | Recommended | -| [`llm.usage.completion_tokens`](../attributes-registry/llm.md) | int | The number of tokens used in the LLM response (completion). | `180` | Recommended | -| [`llm.usage.prompt_tokens`](../attributes-registry/llm.md) | int | The number of tokens used in the LLM prompt. | `100` | Recommended | -| [`llm.usage.total_tokens`](../attributes-registry/llm.md) | int | The total number of tokens used in the LLM prompt and response. | `280` | Recommended | +| [`gen_ai.llm.request.max_tokens`](../attributes-registry/llm.md) | int | The maximum number of tokens the LLM generates for a request. | `100` | Recommended | +| [`gen_ai.llm.request.model`](../attributes-registry/llm.md) | string | The name of the LLM a request is being made to. [1] | `gpt-4` | Required | +| [`gen_ai.llm.request.temperature`](../attributes-registry/llm.md) | double | The temperature setting for the LLM request. | `0.0` | Recommended | +| [`gen_ai.llm.request.top_p`](../attributes-registry/llm.md) | double | The top_p sampling setting for the LLM request. | `1.0` | Recommended | +| [`gen_ai.llm.response.finish_reason`](../attributes-registry/llm.md) | string[] | Array of reasons the model stopped generating tokens, corresponding to each generation received. | `[['stop']]` | Recommended | +| [`gen_ai.llm.response.id`](../attributes-registry/llm.md) | string | The unique identifier for the completion. | `chatcmpl-123` | Recommended | +| [`gen_ai.llm.response.model`](../attributes-registry/llm.md) | string | The name of the LLM a response is being made to. [2] | `gpt-4-0613` | Required | +| [`gen_ai.llm.system`](../attributes-registry/llm.md) | string | The name of the LLM foundation model vendor, if applicable. [3] | `openai` | Recommended | +| [`gen_ai.llm.usage.completion_tokens`](../attributes-registry/llm.md) | int | The number of tokens used in the LLM response (completion). | `180` | Recommended | +| [`gen_ai.llm.usage.prompt_tokens`](../attributes-registry/llm.md) | int | The number of tokens used in the LLM prompt. | `100` | Recommended | **[1]:** The name of the LLM a request is being made to. If the LLM is supplied by a vendor, then the value must be the exact name of the model requested. If the LLM is a fine-tuned custom model, the value should have a more specific name than the base model that's been fine-tuned. @@ -63,24 +60,24 @@ These attributes track input data and metadata for a request to an LLM. Each att In the lifetime of an LLM span, an event for prompts sent and completions received MAY be created, depending on the configuration of the instrumentation. - -The event name MUST be `llm.content.prompt`. + +The event name MUST be `gen_ai.llm.content.prompt`. | Attribute | Type | Description | Examples | Requirement Level | |---|---|---|---|---| -| [`llm.prompt`](../attributes-registry/llm.md) | string | The full prompt string sent to an LLM in a request. [1] | `\\n\\nHuman:You are an AI assistant that tells jokes. Can you tell me a joke about OpenTelemetry?\\n\\nAssistant:` | Recommended | +| [`gen_ai.llm.prompt`](../attributes-registry/llm.md) | string | The full prompt string sent to an LLM in a request. [1] | `\\n\\nHuman:You are an AI assistant that tells jokes. Can you tell me a joke about OpenTelemetry?\\n\\nAssistant:` | Recommended | -**[1]:** The full prompt string sent to an LLM in a request. 
If the LLM accepts a more complex input like a JSON object, this field is blank, and the response is instead captured in an event determined by the specific LLM technology semantic convention. +**[1]:** The full prompt sent to an LLM in a request, structured as a JSON in OpenAI's format. - -The event name MUST be `llm.content.completion`. + +The event name MUST be `gen_ai.llm.content.completion`. | Attribute | Type | Description | Examples | Requirement Level | |---|---|---|---|---| -| [`llm.completion`](../attributes-registry/llm.md) | string | The full response string from an LLM in a response. [1] | `Why did the developer stop using OpenTelemetry? Because they couldnt trace their steps!` | Recommended | +| [`gen_ai.llm.completion`](../attributes-registry/llm.md) | string | The full response string from an LLM in a response. [1] | `Why did the developer stop using OpenTelemetry? Because they couldnt trace their steps!` | Recommended | -**[1]:** The full response string from an LLM. If the LLM responds with a more complex output like a JSON object made up of several pieces (such as OpenAI's message choices), this field is the content of the response. If the LLM produces multiple responses, then this field is left blank, and each response is instead captured in an event determined by the specific LLM technology semantic convention. +**[1]:** The full response from an LLM, structured as a JSON in OpenAI's format. [DocumentStatus]: https://github.com/open-telemetry/opentelemetry-specification/tree/v1.22.0/specification/document-status.md diff --git a/docs/ai/openai-metrics.md b/docs/ai/openai-metrics.md deleted file mode 100644 index bf39882d9e..0000000000 --- a/docs/ai/openai-metrics.md +++ /dev/null @@ -1,372 +0,0 @@ - - -# Semantic Conventions for OpenAI Matrics - -**Status**: [Experimental][DocumentStatus] - -This document defines semantic conventions for OpenAI client metrics. - - - - - -- [Chat completions](#chat-completions) - * [Metric: `openai.chat_completions.tokens`](#metric-openaichat_completionstokens) - * [Metric: `openai.chat_completions.choices`](#metric-openaichat_completionschoices) - * [Metric: `openai.chat_completions.duration`](#metric-openaichat_completionsduration) -- [Embeddings](#embeddings) - * [Metric: `openai.embeddings.tokens`](#metric-openaiembeddingstokens) - * [Metric: `openai.embeddings.vector_size`](#metric-openaiembeddingsvector_size) - * [Metric: `openai.embeddings.duration`](#metric-openaiembeddingsduration) -- [Image generation](#image-generation) - * [Metric: `openai.image_generations.duration`](#metric-openaiimage_generationsduration) - - - -## Chat completions - -### Metric: `openai.chat_completions.tokens` - -**Status**: [Experimental][DocumentStatus] - -This metric is required. - - -| Name | Instrument Type | Unit (UCUM) | Description | -| -------- | --------------- | ----------- | -------------- | -| `llm.openai.chat_completions.tokens` | Counter | `token` | Number of tokens used in prompt and completions. | - - - -| Attribute | Type | Description | Examples | Requirement Level | -|---|---|---|---|---| -| [`error.type`](../attributes-registry/error.md) | string | Describes a class of error the operation ended with. [1] | `timeout`; `java.net.UnknownHostException`; `server_certificate_invalid`; `500` | Conditionally Required: if the operation ended in error | -| [`llm.response.model`](../attributes-registry/llm.md) | string | The name of the LLM a response is being made to. 
| `gpt-4-0613` | Required | -| [`llm.usage.token_type`](../attributes-registry/llm.md) | string | The type of token. | `prompt` | Recommended | -| [`server.address`](../attributes-registry/server.md) | string | Server domain name if available without reverse DNS lookup; otherwise, IP address or Unix domain socket name. [2] | `example.com`; `10.1.2.80`; `/tmp/my.sock` | Required | - -**[1]:** The `error.type` SHOULD be predictable and SHOULD have low cardinality. -Instrumentations SHOULD document the list of errors they report. - -The cardinality of `error.type` within one instrumentation library SHOULD be low. -Telemetry consumers that aggregate data from multiple instrumentation libraries and applications -should be prepared for `error.type` to have high cardinality at query time when no -additional filters are applied. - -If the operation has completed successfully, instrumentations SHOULD NOT set `error.type`. - -If a specific domain defines its own set of error identifiers (such as HTTP or gRPC status codes), -it's RECOMMENDED to: - -* Use a domain-specific attribute -* Set `error.type` to capture all errors, regardless of whether they are defined within the domain-specific set or not. - -**[2]:** When observed from the client side, and when communicating through an intermediary, `server.address` SHOULD represent the server address behind any intermediaries, for example proxies, if it's available. - -`error.type` has the following list of well-known values. If one of them applies, then the respective value MUST be used, otherwise a custom value MAY be used. - -| Value | Description | -|---|---| -| `_OTHER` | A fallback error value to be used when the instrumentation doesn't define a custom value. | - -`llm.usage.token_type` MUST be one of the following: - -| Value | Description | -|---|---| -| `prompt` | prompt | -| `completion` | completion | - - -### Metric: `openai.chat_completions.choices` - -**Status**: [Experimental][DocumentStatus] - -This metric is required. - - -| Name | Instrument Type | Unit (UCUM) | Description | -| -------- | --------------- | ----------- | -------------- | -| `llm.openai.chat_completions.choices` | Counter | `choice` | Number of choices returned by chat completions call | - - - -| Attribute | Type | Description | Examples | Requirement Level | -|---|---|---|---|---| -| [`error.type`](../attributes-registry/error.md) | string | Describes a class of error the operation ended with. [1] | `timeout`; `java.net.UnknownHostException`; `server_certificate_invalid`; `500` | Conditionally Required: if the operation ended in error | -| [`llm.response.finish_reason`](../attributes-registry/llm.md) | string | The reason the model stopped generating tokens. | `stop` | Recommended | -| [`llm.response.model`](../attributes-registry/llm.md) | string | The name of the LLM a response is being made to. | `gpt-4-0613` | Required | -| [`server.address`](../attributes-registry/server.md) | string | Server domain name if available without reverse DNS lookup; otherwise, IP address or Unix domain socket name. [2] | `example.com`; `10.1.2.80`; `/tmp/my.sock` | Required | - -**[1]:** The `error.type` SHOULD be predictable and SHOULD have low cardinality. -Instrumentations SHOULD document the list of errors they report. - -The cardinality of `error.type` within one instrumentation library SHOULD be low. 
-Telemetry consumers that aggregate data from multiple instrumentation libraries and applications -should be prepared for `error.type` to have high cardinality at query time when no -additional filters are applied. - -If the operation has completed successfully, instrumentations SHOULD NOT set `error.type`. - -If a specific domain defines its own set of error identifiers (such as HTTP or gRPC status codes), -it's RECOMMENDED to: - -* Use a domain-specific attribute -* Set `error.type` to capture all errors, regardless of whether they are defined within the domain-specific set or not. - -**[2]:** When observed from the client side, and when communicating through an intermediary, `server.address` SHOULD represent the server address behind any intermediaries, for example proxies, if it's available. - -`error.type` has the following list of well-known values. If one of them applies, then the respective value MUST be used, otherwise a custom value MAY be used. - -| Value | Description | -|---|---| -| `_OTHER` | A fallback error value to be used when the instrumentation doesn't define a custom value. | - - -### Metric: `openai.chat_completions.duration` - -**Status**: [Experimental][DocumentStatus] - -This metric is required. - -This metric SHOULD be specified with -[`ExplicitBucketBoundaries`](https://github.com/open-telemetry/opentelemetry-specification/tree/v1.22.0/specification/metrics/api.md#instrument-advice) -of `[ 0, 0.005, 0.01, 0.025, 0.05, 0.075, 0.1, 0.25, 0.5, 0.75, 1, 2.5, 5, 7.5, 10 ]`. - - -| Name | Instrument Type | Unit (UCUM) | Description | -| -------- | --------------- | ----------- | -------------- | -| `llm.openai.chat_completions.duration` | Histogram | `s` | Duration of chat completion operation | - - - -| Attribute | Type | Description | Examples | Requirement Level | -|---|---|---|---|---| -| [`error.type`](../attributes-registry/error.md) | string | Describes a class of error the operation ended with. [1] | `timeout`; `java.net.UnknownHostException`; `server_certificate_invalid`; `500` | Conditionally Required: if the operation ended in error | -| [`llm.response.finish_reason`](../attributes-registry/llm.md) | string | The reason the model stopped generating tokens. | `stop` | Recommended | -| [`llm.response.model`](../attributes-registry/llm.md) | string | The name of the LLM a response is being made to. | `gpt-4-0613` | Required | -| [`server.address`](../attributes-registry/server.md) | string | Server domain name if available without reverse DNS lookup; otherwise, IP address or Unix domain socket name. [2] | `example.com`; `10.1.2.80`; `/tmp/my.sock` | Required | - -**[1]:** The `error.type` SHOULD be predictable and SHOULD have low cardinality. -Instrumentations SHOULD document the list of errors they report. - -The cardinality of `error.type` within one instrumentation library SHOULD be low. -Telemetry consumers that aggregate data from multiple instrumentation libraries and applications -should be prepared for `error.type` to have high cardinality at query time when no -additional filters are applied. - -If the operation has completed successfully, instrumentations SHOULD NOT set `error.type`. - -If a specific domain defines its own set of error identifiers (such as HTTP or gRPC status codes), -it's RECOMMENDED to: - -* Use a domain-specific attribute -* Set `error.type` to capture all errors, regardless of whether they are defined within the domain-specific set or not. 
- -**[2]:** When observed from the client side, and when communicating through an intermediary, `server.address` SHOULD represent the server address behind any intermediaries, for example proxies, if it's available. - -`error.type` has the following list of well-known values. If one of them applies, then the respective value MUST be used, otherwise a custom value MAY be used. - -| Value | Description | -|---|---| -| `_OTHER` | A fallback error value to be used when the instrumentation doesn't define a custom value. | - - -## Embeddings - -### Metric: `openai.embeddings.tokens` - -**Status**: [Experimental][DocumentStatus] - -This metric is required. - - -| Name | Instrument Type | Unit (UCUM) | Description | -| -------- | --------------- | ----------- | -------------- | -| `llm.openai.embeddings.tokens` | Counter | `token` | Number of tokens used in prompt and completions. | - - - -| Attribute | Type | Description | Examples | Requirement Level | -|---|---|---|---|---| -| [`error.type`](../attributes-registry/error.md) | string | Describes a class of error the operation ended with. [1] | `timeout`; `java.net.UnknownHostException`; `server_certificate_invalid`; `500` | Conditionally Required: if the operation ended in error | -| [`llm.response.model`](../attributes-registry/llm.md) | string | The name of the LLM a response is being made to. | `gpt-4-0613` | Required | -| [`llm.usage.token_type`](../attributes-registry/llm.md) | string | The type of token. | `prompt` | Recommended | -| [`server.address`](../attributes-registry/server.md) | string | Server domain name if available without reverse DNS lookup; otherwise, IP address or Unix domain socket name. [2] | `example.com`; `10.1.2.80`; `/tmp/my.sock` | Required | - -**[1]:** The `error.type` SHOULD be predictable and SHOULD have low cardinality. -Instrumentations SHOULD document the list of errors they report. - -The cardinality of `error.type` within one instrumentation library SHOULD be low. -Telemetry consumers that aggregate data from multiple instrumentation libraries and applications -should be prepared for `error.type` to have high cardinality at query time when no -additional filters are applied. - -If the operation has completed successfully, instrumentations SHOULD NOT set `error.type`. - -If a specific domain defines its own set of error identifiers (such as HTTP or gRPC status codes), -it's RECOMMENDED to: - -* Use a domain-specific attribute -* Set `error.type` to capture all errors, regardless of whether they are defined within the domain-specific set or not. - -**[2]:** When observed from the client side, and when communicating through an intermediary, `server.address` SHOULD represent the server address behind any intermediaries, for example proxies, if it's available. - -`error.type` has the following list of well-known values. If one of them applies, then the respective value MUST be used, otherwise a custom value MAY be used. - -| Value | Description | -|---|---| -| `_OTHER` | A fallback error value to be used when the instrumentation doesn't define a custom value. | - -`llm.usage.token_type` MUST be one of the following: - -| Value | Description | -|---|---| -| `prompt` | prompt | -| `completion` | completion | - - -### Metric: `openai.embeddings.vector_size` - -**Status**: [Experimental][DocumentStatus] - -This metric is required. 
- - -| Name | Instrument Type | Unit (UCUM) | Description | -| -------- | --------------- | ----------- | -------------- | -| `llm.openai.embeddings.vector_size` | Counter | `element` | he size of returned vector. | - - - -| Attribute | Type | Description | Examples | Requirement Level | -|---|---|---|---|---| -| [`error.type`](../attributes-registry/error.md) | string | Describes a class of error the operation ended with. [1] | `timeout`; `java.net.UnknownHostException`; `server_certificate_invalid`; `500` | Conditionally Required: if the operation ended in error | -| [`llm.response.model`](../attributes-registry/llm.md) | string | The name of the LLM a response is being made to. | `gpt-4-0613` | Required | -| [`server.address`](../attributes-registry/server.md) | string | Server domain name if available without reverse DNS lookup; otherwise, IP address or Unix domain socket name. [2] | `example.com`; `10.1.2.80`; `/tmp/my.sock` | Required | - -**[1]:** The `error.type` SHOULD be predictable and SHOULD have low cardinality. -Instrumentations SHOULD document the list of errors they report. - -The cardinality of `error.type` within one instrumentation library SHOULD be low. -Telemetry consumers that aggregate data from multiple instrumentation libraries and applications -should be prepared for `error.type` to have high cardinality at query time when no -additional filters are applied. - -If the operation has completed successfully, instrumentations SHOULD NOT set `error.type`. - -If a specific domain defines its own set of error identifiers (such as HTTP or gRPC status codes), -it's RECOMMENDED to: - -* Use a domain-specific attribute -* Set `error.type` to capture all errors, regardless of whether they are defined within the domain-specific set or not. - -**[2]:** When observed from the client side, and when communicating through an intermediary, `server.address` SHOULD represent the server address behind any intermediaries, for example proxies, if it's available. - -`error.type` has the following list of well-known values. If one of them applies, then the respective value MUST be used, otherwise a custom value MAY be used. - -| Value | Description | -|---|---| -| `_OTHER` | A fallback error value to be used when the instrumentation doesn't define a custom value. | - - -### Metric: `openai.embeddings.duration` - -**Status**: [Experimental][DocumentStatus] - -This metric is required. - -This metric SHOULD be specified with -[`ExplicitBucketBoundaries`](https://github.com/open-telemetry/opentelemetry-specification/tree/v1.22.0/specification/metrics/api.md#instrument-advice) -of `[ 0, 0.005, 0.01, 0.025, 0.05, 0.075, 0.1, 0.25, 0.5, 0.75, 1, 2.5, 5, 7.5, 10 ]`. - - -| Name | Instrument Type | Unit (UCUM) | Description | -| -------- | --------------- | ----------- | -------------- | -| `llm.openai.embeddings.duration` | Histogram | `s` | Duration of embeddings operation | - - - -| Attribute | Type | Description | Examples | Requirement Level | -|---|---|---|---|---| -| [`error.type`](../attributes-registry/error.md) | string | Describes a class of error the operation ended with. [1] | `timeout`; `java.net.UnknownHostException`; `server_certificate_invalid`; `500` | Conditionally Required: if the operation ended in error | -| [`llm.response.model`](../attributes-registry/llm.md) | string | The name of the LLM a response is being made to. 
| `gpt-4-0613` | Required | -| [`server.address`](../attributes-registry/server.md) | string | Server domain name if available without reverse DNS lookup; otherwise, IP address or Unix domain socket name. [2] | `example.com`; `10.1.2.80`; `/tmp/my.sock` | Required | - -**[1]:** The `error.type` SHOULD be predictable and SHOULD have low cardinality. -Instrumentations SHOULD document the list of errors they report. - -The cardinality of `error.type` within one instrumentation library SHOULD be low. -Telemetry consumers that aggregate data from multiple instrumentation libraries and applications -should be prepared for `error.type` to have high cardinality at query time when no -additional filters are applied. - -If the operation has completed successfully, instrumentations SHOULD NOT set `error.type`. - -If a specific domain defines its own set of error identifiers (such as HTTP or gRPC status codes), -it's RECOMMENDED to: - -* Use a domain-specific attribute -* Set `error.type` to capture all errors, regardless of whether they are defined within the domain-specific set or not. - -**[2]:** When observed from the client side, and when communicating through an intermediary, `server.address` SHOULD represent the server address behind any intermediaries, for example proxies, if it's available. - -`error.type` has the following list of well-known values. If one of them applies, then the respective value MUST be used, otherwise a custom value MAY be used. - -| Value | Description | -|---|---| -| `_OTHER` | A fallback error value to be used when the instrumentation doesn't define a custom value. | - - -## Image generation - -### Metric: `openai.image_generations.duration` - -**Status**: [Experimental][DocumentStatus] - -This metric is required. - -This metric SHOULD be specified with -[`ExplicitBucketBoundaries`](https://github.com/open-telemetry/opentelemetry-specification/tree/v1.22.0/specification/metrics/api.md#instrument-advice) -of `[ 0, 0.005, 0.01, 0.025, 0.05, 0.075, 0.1, 0.25, 0.5, 0.75, 1, 2.5, 5, 7.5, 10 ]`. - - -| Name | Instrument Type | Unit (UCUM) | Description | -| -------- | --------------- | ----------- | -------------- | -| `llm.openai.image_generations.duration` | Histogram | `s` | Duration of image generations operation | - - - -| Attribute | Type | Description | Examples | Requirement Level | -|---|---|---|---|---| -| [`error.type`](../attributes-registry/error.md) | string | Describes a class of error the operation ended with. [1] | `timeout`; `java.net.UnknownHostException`; `server_certificate_invalid`; `500` | Recommended | -| [`llm.response.model`](../attributes-registry/llm.md) | string | The name of the LLM a response is being made to. | `gpt-4-0613` | Conditionally Required: if the operation ended in error | -| [`server.address`](../attributes-registry/server.md) | string | Server domain name if available without reverse DNS lookup; otherwise, IP address or Unix domain socket name. [2] | `example.com`; `10.1.2.80`; `/tmp/my.sock` | Required | - -**[1]:** The `error.type` SHOULD be predictable and SHOULD have low cardinality. -Instrumentations SHOULD document the list of errors they report. - -The cardinality of `error.type` within one instrumentation library SHOULD be low. -Telemetry consumers that aggregate data from multiple instrumentation libraries and applications -should be prepared for `error.type` to have high cardinality at query time when no -additional filters are applied. 
- -If the operation has completed successfully, instrumentations SHOULD NOT set `error.type`. - -If a specific domain defines its own set of error identifiers (such as HTTP or gRPC status codes), -it's RECOMMENDED to: - -* Use a domain-specific attribute -* Set `error.type` to capture all errors, regardless of whether they are defined within the domain-specific set or not. - -**[2]:** When observed from the client side, and when communicating through an intermediary, `server.address` SHOULD represent the server address behind any intermediaries, for example proxies, if it's available. - -`error.type` has the following list of well-known values. If one of them applies, then the respective value MUST be used, otherwise a custom value MAY be used. - -| Value | Description | -|---|---| -| `_OTHER` | A fallback error value to be used when the instrumentation doesn't define a custom value. | - - -[DocumentStatus]: https://github.com/open-telemetry/opentelemetry-specification/tree/v1.22.0/specification/document-status.md diff --git a/docs/ai/openai.md b/docs/ai/openai.md deleted file mode 100644 index 001d751ce5..0000000000 --- a/docs/ai/openai.md +++ /dev/null @@ -1,119 +0,0 @@ - - -# Semantic Conventions for OpenAI Spans - -**Status**: [Experimental][DocumentStatus] - -This document outlines the Semantic Conventions specific to -[OpenAI](https://platform.openai.com/) spans, extending the general semantics -found in the [LLM Semantic Conventions](llm-spans.md). These conventions are -designed to standardize telemetry data for OpenAI interactions, particularly -focusing on the `/chat/completions` endpoint. By following to these guidelines, -developers can ensure consistent, meaningful, and easily interpretable telemetry -data across different applications and platforms. - -## Chat Completions - -The span name for OpenAI chat completions SHOULD be `openai.chat` -to maintain consistency and clarity in telemetry data. - -## Request Attributes - -These are the attributes when instrumenting OpenAI LLM requests with the -`/chat/completions` endpoint. - - -| Attribute | Type | Description | Examples | Requirement Level | -|---|---|---|---|---| -| [`llm.request.is_stream`](../attributes-registry/llm.md) | boolean | Whether the LLM responds with a stream. | `False` | Recommended | -| [`llm.request.max_tokens`](../attributes-registry/llm.md) | int | The maximum number of tokens the LLM generates for a request. | `100` | Recommended | -| [`llm.request.model`](../attributes-registry/llm.md) | string | The name of the LLM a request is being made to. [1] | `gpt-4` | Required | -| [`llm.request.openai.logit_bias`](../attributes-registry/llm.md) | string | If present, the JSON-encoded string of a `logit_bias` used in an OpenAI request | `{2435:-100, 640:-100}` | Recommended | -| [`llm.request.openai.presence_penalty`](../attributes-registry/llm.md) | double | If present, the `presence_penalty` used in an OpenAI request. Value is between -2.0 and 2.0. | `-0.5` | Recommended | -| [`llm.request.openai.response_format`](../attributes-registry/llm.md) | string | An object specifying the format that the model must output. Either `text` or `json_object` | `text` | Recommended | -| [`llm.request.openai.seed`](../attributes-registry/llm.md) | int | Seed used in request to improve determinism. | `1234` | Recommended | -| [`llm.request.openai.user`](../attributes-registry/llm.md) | string | If present, the `user` used in an OpenAI request. 
| `bob` | Recommended | -| [`llm.request.stop_sequences`](../attributes-registry/llm.md) | string | Array of strings the LLM uses as a stop sequence. | `stop1` | Recommended | -| [`llm.request.temperature`](../attributes-registry/llm.md) | double | The temperature setting for the LLM request. | `0.0` | Recommended | -| [`llm.request.top_p`](../attributes-registry/llm.md) | double | The top_p sampling setting for the LLM request. | `1.0` | Recommended | -| [`llm.response.finish_reason`](../attributes-registry/llm.md) | string | The reason the model stopped generating tokens. | `stop` | Recommended | -| [`llm.response.id`](../attributes-registry/llm.md) | string[] | The unique identifier for the completion. | `[chatcmpl-123]` | Recommended | -| [`llm.response.openai.created`](../attributes-registry/llm.md) | int | The UNIX timestamp (in seconds) if when the completion was created. | `1677652288` | Recommended | -| [`llm.response.openai.system_fingerprint`](../attributes-registry/llm.md) | string | This fingerprint represents the backend configuration that the model runs with. | `asdf987123` | Recommended | -| [`llm.system`](../attributes-registry/llm.md) | string | The name of the LLM foundation model vendor, if applicable. | `openai`; `microsoft` | Recommended | -| [`llm.usage.completion_tokens`](../attributes-registry/llm.md) | int | The number of tokens used in the LLM response (completion). | `180` | Recommended | -| [`llm.usage.prompt_tokens`](../attributes-registry/llm.md) | int | The number of tokens used in the LLM prompt. | `100` | Recommended | -| [`llm.usage.total_tokens`](../attributes-registry/llm.md) | int | The total number of tokens used in the LLM prompt and response. | `280` | Recommended | - -**[1]:** The name of the LLM a request is being made to. If the LLM is supplied by a vendor, then the value must be the exact name of the model requested. If the LLM is a fine-tuned custom model, the value should have a more specific name than the base model that's been fine-tuned. - - -## Request Events - -In the lifetime of an LLM span, an event for prompts sent and completions received MAY be created, depending on the configuration of the instrumentation. -Because OpenAI uses a more complex prompt structure, these events will be used instead of the generic ones detailed in the [LLM Semantic Conventions](llm-spans.md). - -### Prompt Events - -Prompt event name SHOULD be `llm.content.openai.prompt`. - - -The event name MUST be `llm.content.openai.prompt`. - -| Attribute | Type | Description | Examples | Requirement Level | -|---|---|---|---|---| -| [`llm.openai.content`](../attributes-registry/llm.md) | string | The content for a given OpenAI response. | `Why did the developer stop using OpenTelemetry? Because they couldn't trace their steps!` | Required | -| [`llm.openai.role`](../attributes-registry/llm.md) | string | The role of the prompt author, can be one of `system`, `user`, `assistant`, or `tool` | `user` | Required | -| [`llm.openai.tool_call.id`](../attributes-registry/llm.md) | string | If role is `tool` or `function`, then this tool call that this message is responding to. | `get_current_weather` | Conditionally Required: Required if the prompt role is `tool`. | - - -### Tools Events - -Tools event name SHOULD be `llm.content.openai.tool`, specifying potential tools or functions the LLM can use. - - -The event name MUST be `llm.content.openai.tool`. 
- -| Attribute | Type | Description | Examples | Requirement Level | -|---|---|---|---|---| -| [`llm.openai.function.description`](../attributes-registry/llm.md) | string | A description of what the function does, used by the model to choose when and how to call the function. | `Gets the current weather for a location` | Required | -| [`llm.openai.function.name`](../attributes-registry/llm.md) | string | The name of the function to be called. | `get_weather` | Required | -| [`llm.openai.function.parameters`](../attributes-registry/llm.md) | string | JSON-encoded string of the parameter object for the function. | `{"type": "object", "properties": {}}` | Required | -| [`llm.openai.tool.type`](../attributes-registry/llm.md) | string | The type of the tool. Currently, only `function` is supported. | `function` | Required | - - -### Choice Events - -Recording details about Choices in each response MAY be included as -Span Events. - -Choice event name SHOULD be `llm.content.openai.choice`. - -If there is more than one `choice`, separate events SHOULD be used. - - -The event name MUST be `llm.content.openai.completion.choice`. - -| Attribute | Type | Description | Examples | Requirement Level | -|---|---|---|---|---| -| [`llm.openai.choice.type`](../attributes-registry/llm.md) | string | The type of the choice, either `delta` or `message`. | `message` | Required | -| [`llm.openai.content`](../attributes-registry/llm.md) | string | The content for a given OpenAI response. | `Why did the developer stop using OpenTelemetry? Because they couldn't trace their steps!` | Required | -| [`llm.openai.function.arguments`](../attributes-registry/llm.md) | string | If exists, the arguments to call a function call with for a given OpenAI response, denoted by ``. The value for `` starts with 0, where 0 is the first message. | `{"type": "object", "properties": {"some":"data"}}` | Conditionally Required: [1] | -| [`llm.openai.function.name`](../attributes-registry/llm.md) | string | The name of the function to be called. | `get_weather` | Conditionally Required: [2] | -| [`llm.openai.role`](../attributes-registry/llm.md) | string | The role of the prompt author, can be one of `system`, `user`, `assistant`, or `tool` | `user` | Required | -| [`llm.openai.tool.type`](../attributes-registry/llm.md) | string | The type of the tool. Currently, only `function` is supported. | `function` | Conditionally Required: [3] | -| [`llm.openai.tool_call.id`](../attributes-registry/llm.md) | string | If role is `tool` or `function`, then this tool call that this message is responding to. | `get_current_weather` | Conditionally Required: [4] | -| [`llm.response.finish_reason`](../attributes-registry/llm.md) | string | The reason the model stopped generating tokens. | `stop` | Recommended | - -**[1]:** Required if the choice is the result of a tool call of type `function`. - -**[2]:** Required if the choice is the result of a tool call of type `function`. - -**[3]:** Required if the choice is the result of a tool call. - -**[4]:** Required if the choice is the result of a tool call. 
- - -[DocumentStatus]: https://github.com/open-telemetry/opentelemetry-specification/tree/v1.22.0/specification/document-status.md diff --git a/docs/attributes-registry/llm.md b/docs/attributes-registry/llm.md index e9a66e9021..e9d5d43cb8 100644 --- a/docs/attributes-registry/llm.md +++ b/docs/attributes-registry/llm.md @@ -23,13 +23,11 @@ | Attribute | Type | Description | Examples | |---|---|---|---| -| `llm.request.is_stream` | boolean | Whether the LLM responds with a stream. | `False` | -| `llm.request.max_tokens` | int | The maximum number of tokens the LLM generates for a request. | `100` | -| `llm.request.model` | string | The name of the LLM a request is being made to. | `gpt-4` | -| `llm.request.stop_sequences` | string | Array of strings the LLM uses as a stop sequence. | `stop1` | -| `llm.request.temperature` | double | The temperature setting for the LLM request. | `0.0` | -| `llm.request.top_p` | double | The top_p sampling setting for the LLM request. | `1.0` | -| `llm.system` | string | The name of the LLM foundation model vendor, if applicable. | `openai` | +| `gen_ai.llm.request.max_tokens` | int | The maximum number of tokens the LLM generates for a request. | `100` | +| `gen_ai.llm.request.model` | string | The name of the LLM a request is being made to. | `gpt-4` | +| `gen_ai.llm.request.temperature` | double | The temperature setting for the LLM request. | `0.0` | +| `gen_ai.llm.request.top_p` | double | The top_p sampling setting for the LLM request. | `1.0` | +| `gen_ai.llm.system` | string | The name of the LLM foundation model vendor, if applicable. | `openai` | ### Response Attributes @@ -37,12 +35,11 @@ | Attribute | Type | Description | Examples | |---|---|---|---| -| `llm.response.finish_reason` | string | The reason the model stopped generating tokens. | `stop` | -| `llm.response.id` | string[] | The unique identifier for the completion. | `[chatcmpl-123]` | -| `llm.response.model` | string | The name of the LLM a response is being made to. | `gpt-4-0613` | -| `llm.usage.completion_tokens` | int | The number of tokens used in the LLM response (completion). | `180` | -| `llm.usage.prompt_tokens` | int | The number of tokens used in the LLM prompt. | `100` | -| `llm.usage.total_tokens` | int | The total number of tokens used in the LLM prompt and response. | `280` | +| `gen_ai.llm.response.finish_reason` | string[] | Array of reasons the model stopped generating tokens, corresponding to each generation received. | `[['stop']]` | +| `gen_ai.llm.response.id` | string | The unique identifier for the completion. | `chatcmpl-123` | +| `gen_ai.llm.response.model` | string | The name of the LLM a response is being made to. | `gpt-4-0613` | +| `gen_ai.llm.usage.completion_tokens` | int | The number of tokens used in the LLM response (completion). | `180` | +| `gen_ai.llm.usage.prompt_tokens` | int | The number of tokens used in the LLM prompt. | `100` | ### Event Attributes @@ -50,75 +47,6 @@ | Attribute | Type | Description | Examples | |---|---|---|---| -| `llm.completion` | string | The full response string from an LLM in a response. | `Why did the developer stop using OpenTelemetry? Because they couldnt trace their steps!` | -| `llm.prompt` | string | The full prompt string sent to an LLM in a request. | `\\n\\nHuman:You are an AI assistant that tells jokes. 
Can you tell me a joke about OpenTelemetry?\\n\\nAssistant:` | - - -## OpenAI Attributes - -### Request Attributes - - -| Attribute | Type | Description | Examples | -|---|---|---|---| -| `llm.request.openai.frequency_penalty` | double | If present, the `frequency_penalty` used in an OpenAI request. Value is between -2.0 and 2.0. | `-0.5` | -| `llm.request.openai.logit_bias` | string | If present, the JSON-encoded string of a `logit_bias` used in an OpenAI request | `{2435:-100, 640:-100}` | -| `llm.request.openai.presence_penalty` | double | If present, the `presence_penalty` used in an OpenAI request. Value is between -2.0 and 2.0. | `-0.5` | -| `llm.request.openai.response_format` | string | An object specifying the format that the model must output. Either `text` or `json_object` | `text` | -| `llm.request.openai.seed` | int | Seed used in request to improve determinism. | `1234` | -| `llm.request.openai.user` | string | If present, the `user` used in an OpenAI request. | `bob` | - -`llm.request.openai.response_format` MUST be one of the following: - -| Value | Description | -|---|---| -| `text` | text | -| `json_object` | json_object | - - -### Response Attributes - - -| Attribute | Type | Description | Examples | -|---|---|---|---| -| `llm.response.openai.created` | int | The UNIX timestamp (in seconds) if when the completion was created. | `1677652288` | -| `llm.response.openai.system_fingerprint` | string | This fingerprint represents the backend configuration that the model runs with. | `asdf987123` | - - -### Event Attributes - - -| Attribute | Type | Description | Examples | -|---|---|---|---| -| `llm.openai.choice.type` | string | The type of the choice, either `delta` or `message`. | `message` | -| `llm.openai.content` | string | The content for a given OpenAI response. | `Why did the developer stop using OpenTelemetry? Because they couldn't trace their steps!` | -| `llm.openai.function.arguments` | string | If exists, the arguments to call a function call with for a given OpenAI response, denoted by ``. The value for `` starts with 0, where 0 is the first message. | `{"type": "object", "properties": {"some":"data"}}` | -| `llm.openai.function.description` | string | A description of what the function does, used by the model to choose when and how to call the function. | `Gets the current weather for a location` | -| `llm.openai.function.name` | string | The name of the function to be called. | `get_weather` | -| `llm.openai.function.parameters` | string | JSON-encoded string of the parameter object for the function. | `{"type": "object", "properties": {}}` | -| `llm.openai.role` | string | The role of the prompt author, can be one of `system`, `user`, `assistant`, or `tool` | `user` | -| `llm.openai.tool.type` | string | The type of the tool. Currently, only `function` is supported. | `function` | -| `llm.openai.tool_call.id` | string | If role is `tool` or `function`, then this tool call that this message is responding to. | `get_current_weather` | - -`llm.openai.choice.type` MUST be one of the following: - -| Value | Description | -|---|---| -| `delta` | delta | -| `message` | message | - -`llm.openai.role` MUST be one of the following: - -| Value | Description | -|---|---| -| `system` | system | -| `user` | user | -| `assistant` | assistant | -| `tool` | tool | - -`llm.openai.tool.type` MUST be one of the following: - -| Value | Description | -|---|---| -| `function` | function | +| `gen_ai.llm.completion` | string | The full response string from an LLM in a response. 
| `Why did the developer stop using OpenTelemetry? Because they couldnt trace their steps!` | +| `gen_ai.llm.prompt` | string | The full prompt string sent to an LLM in a request. | `\\n\\nHuman:You are an AI assistant that tells jokes. Can you tell me a joke about OpenTelemetry?\\n\\nAssistant:` | \ No newline at end of file diff --git a/model/metrics/llm-metrics.yaml b/model/metrics/llm-metrics.yaml deleted file mode 100644 index 75db1e31ff..0000000000 --- a/model/metrics/llm-metrics.yaml +++ /dev/null @@ -1,109 +0,0 @@ -groups: - - id: metric.openai.chat_completions.tokens - type: metric - metric_name: llm.openai.chat_completions.tokens - brief: "Number of tokens used in prompt and completions." - instrument: counter - unit: "token" - stability: experimental - attributes: - - ref: llm.response.model - requirement_level: required - - ref: error.type - requirement_level: - conditionally_required: "if the operation ended in error" - - ref: llm.usage.token_type - - ref: server.address - requirement_level: required - - id: metric.openai.chat_completions.choices - type: metric - metric_name: llm.openai.chat_completions.choices - brief: "Number of choices returned by chat completions call" - instrument: counter - unit: "choice" - stability: experimental - attributes: - - ref: llm.response.model - requirement_level: required - - ref: error.type - requirement_level: - conditionally_required: "if the operation ended in error" - - ref: llm.response.finish_reason - - ref: server.address - requirement_level: required - - id: metric.openai.chat_completions.duration - type: metric - metric_name: llm.openai.chat_completions.duration - brief: "Duration of chat completion operation" - instrument: histogram - unit: 's' - stability: experimental - attributes: - - ref: llm.response.model - requirement_level: required - - ref: error.type - requirement_level: - conditionally_required: "if the operation ended in error" - - ref: llm.response.finish_reason - - ref: server.address - requirement_level: required - - id: metric.openai.embeddings.tokens - type: metric - metric_name: llm.openai.embeddings.tokens - brief: "Number of tokens used in prompt and completions." - instrument: counter - unit: "token" - stability: experimental - attributes: - - ref: llm.response.model - requirement_level: required - - ref: error.type - requirement_level: - conditionally_required: "if the operation ended in error" - - ref: llm.usage.token_type - - ref: server.address - requirement_level: required - - id: metric.openai.embeddings.vector_size - type: metric - metric_name: llm.openai.embeddings.vector_size - brief: "he size of returned vector." 
- instrument: counter - unit: "element" - stability: experimental - attributes: - - ref: llm.response.model - requirement_level: required - - ref: error.type - requirement_level: - conditionally_required: "if the operation ended in error" - - ref: server.address - requirement_level: required - - id: metric.openai.embeddings.duration - type: metric - metric_name: llm.openai.embeddings.duration - brief: "Duration of embeddings operation" - instrument: histogram - unit: 's' - stability: experimental - attributes: - - ref: llm.response.model - requirement_level: required - - ref: error.type - requirement_level: - conditionally_required: "if the operation ended in error" - - ref: server.address - requirement_level: required - - id: metric.openai.image_generations.duration - type: metric - metric_name: llm.openai.image_generations.duration - brief: "Duration of image generations operation" - instrument: histogram - unit: 's' - stability: experimental - attributes: - - ref: llm.response.model - requirement_level: - conditionally_required: "if the operation ended in error" - - ref: error.type - - ref: server.address - requirement_level: required diff --git a/model/registry/llm.yaml b/model/registry/llm.yaml index 31bf953b94..777d975827 100644 --- a/model/registry/llm.yaml +++ b/model/registry/llm.yaml @@ -1,6 +1,6 @@ groups: - id: registry.llm - prefix: llm + prefix: gen_ai.llm type: attribute_group brief: > This document defines the attributes used to describe telemetry in the context of LLM (Large Language Models) requests and responses. @@ -30,18 +30,8 @@ groups: brief: The top_p sampling setting for the LLM request. examples: [1.0] tag: llm-generic-request - - id: request.is_stream - type: boolean - brief: Whether the LLM responds with a stream. - examples: [false] - tag: llm-generic-request - - id: request.stop_sequences - type: string - brief: Array of strings the LLM uses as a stop sequence. - examples: ["stop1"] - tag: llm-generic-request - id: response.id - type: string[] + type: string brief: The unique identifier for the completion. examples: ['chatcmpl-123'] tag: llm-generic-response @@ -51,19 +41,10 @@ groups: examples: ['gpt-4-0613'] tag: llm-generic-response - id: response.finish_reason - type: string - brief: The reason the model stopped generating tokens. - examples: ['stop'] + type: string[] + brief: Array of reasons the model stopped generating tokens, corresponding to each generation received. + examples: [['stop']] tag: llm-generic-response - - id: usage.token_type - type: - members: - - id: prompt - value: 'prompt' - - id: completion - value: 'completion' - brief: The type of token. - examples: ['prompt'] - id: usage.prompt_tokens type: int brief: The number of tokens used in the LLM prompt. @@ -74,11 +55,6 @@ groups: brief: The number of tokens used in the LLM response (completion). examples: [180] tag: llm-generic-response - - id: usage.total_tokens - type: int - brief: The total number of tokens used in the LLM prompt and response. - examples: [280] - tag: llm-generic-response - id: prompt type: string brief: The full prompt string sent to an LLM in a request. @@ -89,110 +65,3 @@ groups: brief: The full response string from an LLM in a response. examples: ['Why did the developer stop using OpenTelemetry? Because they couldnt trace their steps!'] tag: llm-generic-events - - id: request.openai.presence_penalty - type: double - brief: If present, the `presence_penalty` used in an OpenAI request. Value is between -2.0 and 2.0. 
- examples: -0.5 - tag: tech-specific-openai-request - - id: request.openai.frequency_penalty - type: double - brief: If present, the `frequency_penalty` used in an OpenAI request. Value is between -2.0 and 2.0. - examples: -0.5 - tag: tech-specific-openai-request - - id: request.openai.logit_bias - type: string - brief: If present, the JSON-encoded string of a `logit_bias` used in an OpenAI request - examples: ['{2435:-100, 640:-100}'] - tag: tech-specific-openai-request - - id: request.openai.user - type: string - brief: If present, the `user` used in an OpenAI request. - examples: ['bob'] - tag: tech-specific-openai-request - - id: request.openai.response_format - type: - members: - - id: text - value: 'text' - - id: json_object - value: 'json_object' - brief: An object specifying the format that the model must output. Either `text` or `json_object` - examples: 'text' - tag: tech-specific-openai-request - - id: request.openai.seed - type: int - brief: Seed used in request to improve determinism. - examples: 1234 - tag: tech-specific-openai-request - - id: response.openai.created - type: int - brief: The UNIX timestamp (in seconds) if when the completion was created. - examples: 1677652288 - tag: tech-specific-openai-response - - id: response.openai.system_fingerprint - type: string - brief: This fingerprint represents the backend configuration that the model runs with. - examples: 'asdf987123' - tag: tech-specific-openai-response - - id: openai.role - type: - members: - - id: system - value: 'system' - - id: user - value: 'user' - - id: assistant - value: 'assistant' - - id: tool - value: 'tool' - brief: The role of the prompt author, can be one of `system`, `user`, `assistant`, or `tool` - examples: 'user' - tag: tech-specific-openai-events - - id: openai.tool.type - type: - members: - - id: function - value: 'function' - brief: The type of the tool. Currently, only `function` is supported. - examples: 'function' - tag: tech-specific-openai-events - - id: openai.function.name - type: string - brief: The name of the function to be called. - examples: 'get_weather' - tag: tech-specific-openai-events - - id: openai.function.description - type: string - brief: A description of what the function does, used by the model to choose when and how to call the function. - examples: 'Gets the current weather for a location' - tag: tech-specific-openai-events - - id: openai.function.parameters - type: string - brief: JSON-encoded string of the parameter object for the function. - examples: '{"type": "object", "properties": {}}' - tag: tech-specific-openai-events - - id: openai.content - type: string - brief: The content for a given OpenAI response. - examples: 'Why did the developer stop using OpenTelemetry? Because they couldn''t trace their steps!' - tag: tech-specific-openai-events - - id: openai.tool_call.id - type: string - brief: If role is `tool` or `function`, then this tool call that this message is responding to. - examples: 'get_current_weather' - tag: tech-specific-openai-events - - id: openai.function.arguments - type: string - brief: If exists, the arguments to call a function call with for a given OpenAI response, denoted by ``. The value for `` starts with 0, where 0 is the first message. - examples: '{"type": "object", "properties": {"some":"data"}}' - tag: tech-specific-openai-events - - id: openai.choice.type - type: - members: - - id: delta - value: 'delta' - - id: message - value: 'message' - brief: The type of the choice, either `delta` or `message`. 
- examples: 'message' - tag: tech-specific-openai-events diff --git a/model/trace/llm.yaml b/model/trace/llm.yaml index 1c844732b0..9883773a29 100644 --- a/model/trace/llm.yaml +++ b/model/trace/llm.yaml @@ -1,190 +1,63 @@ groups: - - id: llm.request + - id: gen_ai.llm.request type: span brief: > A request to an LLM is modeled as a span in a trace. The span name should be a low cardinality value representing the request made to an LLM, like the name of the API endpoint being called. attributes: - - ref: llm.system + - ref: gen_ai.llm.system requirement_level: recommended note: > The name of the LLM foundation model vendor, if applicable. If not using a vendor-supplied model, this field is left blank. - - ref: llm.request.model + - ref: gen_ai.llm.request.model requirement_level: required note: > The name of the LLM a request is being made to. If the LLM is supplied by a vendor, then the value must be the exact name of the model requested. If the LLM is a fine-tuned custom model, the value should have a more specific name than the base model that's been fine-tuned. - - ref: llm.request.max_tokens + - ref: gen_ai.llm.request.max_tokens requirement_level: recommended - - ref: llm.request.temperature + - ref: gen_ai.llm.request.temperature requirement_level: recommended - - ref: llm.request.top_p + - ref: gen_ai.llm.request.top_p requirement_level: recommended - - ref: llm.request.is_stream + - ref: gen_ai.llm.response.id requirement_level: recommended - - ref: llm.request.stop_sequences - requirement_level: recommended - - ref: llm.response.id - requirement_level: recommended - - ref: llm.response.model + - ref: gen_ai.llm.response.model requirement_level: required note: > The name of the LLM a response is being made to. If the LLM is supplied by a vendor, then the value must be the exact name of the model actually used. If the LLM is a fine-tuned custom model, the value should have a more specific name than the base model that's been fine-tuned. - - ref: llm.response.finish_reason - requirement_level: recommended - - ref: llm.usage.prompt_tokens + - ref: gen_ai.llm.response.finish_reason requirement_level: recommended - - ref: llm.usage.completion_tokens + - ref: gen_ai.llm.usage.prompt_tokens requirement_level: recommended - - ref: llm.usage.total_tokens + - ref: gen_ai.llm.usage.completion_tokens requirement_level: recommended events: - - llm.content.prompt - - llm.content.completion + - gen_ai.llm.content.prompt + - gen_ai.llm.content.completion - - id: llm.content.prompt - name: llm.content.prompt + - id: gen_ai.llm.content.prompt + name: gen_ai.llm.content.prompt type: event brief: > In the lifetime of an LLM span, events for prompts sent and completions received may be created, depending on the configuration of the instrumentation. attributes: - - ref: llm.prompt + - ref: gen_ai.llm.prompt requirement_level: recommended note: > - The full prompt string sent to an LLM in a request. If the LLM accepts a more - complex input like a JSON object, this field is blank, and the response is - instead captured in an event determined by the specific LLM technology semantic convention. + The full prompt sent to an LLM in a request, structured as a JSON in OpenAI's format. - - id: llm.content.completion - name: llm.content.completion + - id: gen_ai.llm.content.completion + name: gen_ai.llm.content.completion type: event brief: > - In the lifetime of an LLM span, events for prompts sent and completions received may be created, depending on the configuration of the instrumentation. 
- attributes: - - ref: llm.completion - requirement_level: recommended - note: > - The full response string from an LLM. If the LLM responds with a more - complex output like a JSON object made up of several pieces (such as OpenAI's message choices), - this field is the content of the response. If the LLM produces multiple responses, then this - field is left blank, and each response is instead captured in an event determined by the specific - LLM technology semantic convention. - - - id: llm.openai - type: span - brief: > - A span representing a request to OpenAI's API, providing additional information on top of the generic llm.request. + In the lifetime of an LLM span, events for prompts sent and completions received + may be created, depending on the configuration of the instrumentation. attributes: - - ref: llm.system + - ref: gen_ai.llm.completion requirement_level: recommended - examples: ['openai', 'microsoft'] - tag: tech-specific-openai-request - - ref: llm.request.model - requirement_level: required note: > - The name of the LLM a request is being made to. If the LLM is supplied by a - vendor, then the value must be the exact name of the model requested. If the - LLM is a fine-tuned custom model, the value should have a more specific name - than the base model that's been fine-tuned. - tag: tech-specific-openai-request - - ref: llm.request.max_tokens - tag: tech-specific-openai-request - - ref: llm.request.temperature - tag: tech-specific-openai-request - - ref: llm.request.top_p - tag: tech-specific-openai-request - - ref: llm.request.is_stream - tag: tech-specific-openai-request - - ref: llm.request.stop_sequences - tag: tech-specific-openai-request - - ref: llm.request.openai.presence_penalty - tag: tech-specific-openai-request - - ref: llm.request.openai.logit_bias - tag: tech-specific-openai-request - - ref: llm.request.openai.user - tag: tech-specific-openai-request - - ref: llm.request.openai.response_format - tag: tech-specific-openai-request - - ref: llm.request.openai.seed - tag: tech-specific-openai-response - - ref: llm.response.id - tag: tech-specific-openai-response - - ref: llm.response.finish_reason - tag: tech-specific-openai-response - - ref: llm.usage.prompt_tokens - tag: tech-specific-openai-response - - ref: llm.usage.completion_tokens - tag: tech-specific-openai-response - - ref: llm.usage.total_tokens - tag: tech-specific-openai-response - - ref: llm.response.openai.created - tag: tech-specific-openai-response - - ref: llm.response.openai.system_fingerprint - tag: tech-sepecifc-openai-response - events: - - llm.content.openai.prompt - - llm.content.openai.tool - - llm.content.openai.completion.choice - - - id: llm.content.openai.prompt - name: llm.content.openai.prompt - type: event - brief: > - This event is fired when a completion request is sent to OpenAI, specifying the prompt that was sent. - attributes: - - ref: llm.openai.role - requirement_level: required - - ref: llm.openai.content - requirement_level: required - - ref: llm.openai.tool_call.id - requirement_level: - conditionally_required: > - Required if the prompt role is `tool`. - - - id: llm.content.openai.tool - name: llm.content.openai.tool - type: event - brief: > - This event is fired when a completion request is sent to OpenAI, specifying tools that the LLM can use. 
- attributes: - - ref: llm.openai.tool.type - requirement_level: required - - ref: llm.openai.function.name - requirement_level: required - - ref: llm.openai.function.description - requirement_level: required - - ref: llm.openai.function.parameters - requirement_level: required - - - id: llm.content.openai.completion.choice - name: llm.content.openai.completion.choice - type: event - brief: > - This event is fired when a completion response is returned from OpenAI, specifying one possibile completion returned by the LLM. - attributes: - - ref: llm.openai.choice.type - requirement_level: required - - ref: llm.response.finish_reason - - ref: llm.openai.role - requirement_level: required - - ref: llm.openai.content - requirement_level: required - - ref: llm.openai.tool_call.id - requirement_level: - conditionally_required: > - Required if the choice is the result of a tool call. - - ref: llm.openai.tool.type - requirement_level: - conditionally_required: > - Required if the choice is the result of a tool call. - - ref: llm.openai.function.name - requirement_level: - conditionally_required: > - Required if the choice is the result of a tool call of type `function`. - - ref: llm.openai.function.arguments - requirement_level: - conditionally_required: > - Required if the choice is the result of a tool call of type `function`. + The full response from an LLM, structured as a JSON in OpenAI's format. \ No newline at end of file From 5c6df3e783d6c5482a2a83862e9fcec6696088c0 Mon Sep 17 00:00:00 2001 From: Nir Gazit Date: Tue, 19 Mar 2024 15:55:06 +0200 Subject: [PATCH 10/36] fix: prompt/completion format --- docs/ai/llm-spans.md | 4 ++-- docs/attributes-registry/llm.md | 8 ++------ model/registry/llm.yaml | 8 ++++---- 3 files changed, 8 insertions(+), 12 deletions(-) diff --git a/docs/ai/llm-spans.md b/docs/ai/llm-spans.md index 767bfc8ffb..9a58096ed0 100644 --- a/docs/ai/llm-spans.md +++ b/docs/ai/llm-spans.md @@ -65,7 +65,7 @@ The event name MUST be `gen_ai.llm.content.prompt`. | Attribute | Type | Description | Examples | Requirement Level | |---|---|---|---|---| -| [`gen_ai.llm.prompt`](../attributes-registry/llm.md) | string | The full prompt string sent to an LLM in a request. [1] | `\\n\\nHuman:You are an AI assistant that tells jokes. Can you tell me a joke about OpenTelemetry?\\n\\nAssistant:` | Recommended | +| [`gen_ai.llm.prompt`](../attributes-registry/llm.md) | string | The full prompt sent to an LLM, as a stringified JSON in OpenAI's format. [1] | `[{'role': 'user', 'content': 'What is the capital of France?'}]` | Recommended | **[1]:** The full prompt sent to an LLM in a request, structured as a JSON in OpenAI's format. @@ -75,7 +75,7 @@ The event name MUST be `gen_ai.llm.content.completion`. | Attribute | Type | Description | Examples | Requirement Level | |---|---|---|---|---| -| [`gen_ai.llm.completion`](../attributes-registry/llm.md) | string | The full response string from an LLM in a response. [1] | `Why did the developer stop using OpenTelemetry? Because they couldnt trace their steps!` | Recommended | +| [`gen_ai.llm.completion`](../attributes-registry/llm.md) | string | The full response received from the LLM, as a stringified JSON in OpenAI's format. [1] | `[{'role': 'assistant', 'content': 'The capital of France is Paris.'}]` | Recommended | **[1]:** The full response from an LLM, structured as a JSON in OpenAI's format. 
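As a concrete illustration of the format described above, the following minimal Python sketch shows how the `gen_ai.llm.prompt` and `gen_ai.llm.completion` values can be produced with the standard `json` module; the variable names are assumptions made for the example, while the attribute semantics follow the definitions in this patch.

```python
import json

# gen_ai.llm.prompt / gen_ai.llm.completion each hold a single string attribute:
# the messages serialized as JSON in OpenAI's chat message format.
messages = [{"role": "user", "content": "What is the capital of France?"}]
prompt_value = json.dumps(messages)
# '[{"role": "user", "content": "What is the capital of France?"}]'

choices = [{"role": "assistant", "content": "The capital of France is Paris."}]
completion_value = json.dumps(choices)
# '[{"role": "assistant", "content": "The capital of France is Paris."}]'
```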
diff --git a/docs/attributes-registry/llm.md b/docs/attributes-registry/llm.md index e9d5d43cb8..ea77923dfb 100644 --- a/docs/attributes-registry/llm.md +++ b/docs/attributes-registry/llm.md @@ -9,10 +9,6 @@ * [Request Attributes](#request-attributes) * [Response Attributes](#response-attributes) * [Event Attributes](#event-attributes) -- [OpenAI Attributes](#openai-attributes) - * [Request Attributes](#request-attributes-1) - * [Response Attributes](#response-attributes-1) - * [Event Attributes](#event-attributes-1) @@ -47,6 +43,6 @@ | Attribute | Type | Description | Examples | |---|---|---|---| -| `gen_ai.llm.completion` | string | The full response string from an LLM in a response. | `Why did the developer stop using OpenTelemetry? Because they couldnt trace their steps!` | -| `gen_ai.llm.prompt` | string | The full prompt string sent to an LLM in a request. | `\\n\\nHuman:You are an AI assistant that tells jokes. Can you tell me a joke about OpenTelemetry?\\n\\nAssistant:` | +| `gen_ai.llm.completion` | string | The full response received from the LLM, as a stringified JSON in OpenAI's format. | `[{'role': 'assistant', 'content': 'The capital of France is Paris.'}]` | +| `gen_ai.llm.prompt` | string | The full prompt sent to an LLM, as a stringified JSON in OpenAI's format. | `[{'role': 'user', 'content': 'What is the capital of France?'}]` | \ No newline at end of file diff --git a/model/registry/llm.yaml b/model/registry/llm.yaml index 777d975827..e9708d5a85 100644 --- a/model/registry/llm.yaml +++ b/model/registry/llm.yaml @@ -57,11 +57,11 @@ groups: tag: llm-generic-response - id: prompt type: string - brief: The full prompt string sent to an LLM in a request. - examples: ['\\n\\nHuman:You are an AI assistant that tells jokes. Can you tell me a joke about OpenTelemetry?\\n\\nAssistant:'] + brief: The full prompt sent to an LLM, as a stringified JSON in OpenAI's format. + examples: ["[{'role': 'user', 'content': 'What is the capital of France?'}]"] tag: llm-generic-events - id: completion type: string - brief: The full response string from an LLM in a response. - examples: ['Why did the developer stop using OpenTelemetry? Because they couldnt trace their steps!'] + brief: The full response received from the LLM, as a stringified JSON in OpenAI's format. 
+ examples: ["[{'role': 'assistant', 'content': 'The capital of France is Paris.'}]"] tag: llm-generic-events From bcc34732758f4b90e6e52d3aa756253698228d98 Mon Sep 17 00:00:00 2001 From: Nir Gazit Date: Wed, 20 Mar 2024 15:59:35 +0200 Subject: [PATCH 11/36] fix: lint and CI errors --- .github/ISSUE_TEMPLATE/bug_report.yaml | 1 + .github/ISSUE_TEMPLATE/change_proposal.yaml | 1 + .github/ISSUE_TEMPLATE/new-conventions.yaml | 1 + docs/attributes-registry/llm.md | 6 +++--- model/registry/llm.yaml | 2 +- model/trace/llm.yaml | 4 ++-- 6 files changed, 9 insertions(+), 6 deletions(-) diff --git a/.github/ISSUE_TEMPLATE/bug_report.yaml b/.github/ISSUE_TEMPLATE/bug_report.yaml index 08dca47531..9880ce7604 100644 --- a/.github/ISSUE_TEMPLATE/bug_report.yaml +++ b/.github/ISSUE_TEMPLATE/bug_report.yaml @@ -36,6 +36,7 @@ body: - area:host - area:http - area:k8s + - area:llm - area:messaging - area:network - area:oci diff --git a/.github/ISSUE_TEMPLATE/change_proposal.yaml b/.github/ISSUE_TEMPLATE/change_proposal.yaml index edaa3a4a75..a70efbd965 100644 --- a/.github/ISSUE_TEMPLATE/change_proposal.yaml +++ b/.github/ISSUE_TEMPLATE/change_proposal.yaml @@ -29,6 +29,7 @@ body: - area:host - area:http - area:k8s + - area:llm - area:messaging - area:network - area:oci diff --git a/.github/ISSUE_TEMPLATE/new-conventions.yaml b/.github/ISSUE_TEMPLATE/new-conventions.yaml index 8a72b6bff2..84f8d9d03f 100644 --- a/.github/ISSUE_TEMPLATE/new-conventions.yaml +++ b/.github/ISSUE_TEMPLATE/new-conventions.yaml @@ -38,6 +38,7 @@ body: - area:host - area:http - area:k8s + - area:llm - area:messaging - area:network - area:oci diff --git a/docs/attributes-registry/llm.md b/docs/attributes-registry/llm.md index ea77923dfb..b4e4f03ced 100644 --- a/docs/attributes-registry/llm.md +++ b/docs/attributes-registry/llm.md @@ -6,9 +6,9 @@ - [Generic LLM Attributes](#generic-llm-attributes) - * [Request Attributes](#request-attributes) - * [Response Attributes](#response-attributes) - * [Event Attributes](#event-attributes) + - [Request Attributes](#request-attributes) + - [Response Attributes](#response-attributes) + - [Event Attributes](#event-attributes) diff --git a/model/registry/llm.yaml b/model/registry/llm.yaml index e9708d5a85..999691c777 100644 --- a/model/registry/llm.yaml +++ b/model/registry/llm.yaml @@ -62,6 +62,6 @@ groups: tag: llm-generic-events - id: completion type: string - brief: The full response received from the LLM, as a stringified JSON in OpenAI's format. + brief: The full response received from the LLM, as a stringified JSON in OpenAI's format. examples: ["[{'role': 'assistant', 'content': 'The capital of France is Paris.'}]"] tag: llm-generic-events diff --git a/model/trace/llm.yaml b/model/trace/llm.yaml index 9883773a29..0bdb23b2a3 100644 --- a/model/trace/llm.yaml +++ b/model/trace/llm.yaml @@ -54,10 +54,10 @@ groups: name: gen_ai.llm.content.completion type: event brief: > - In the lifetime of an LLM span, events for prompts sent and completions received + In the lifetime of an LLM span, events for prompts sent and completions received may be created, depending on the configuration of the instrumentation. attributes: - ref: gen_ai.llm.completion requirement_level: recommended note: > - The full response from an LLM, structured as a JSON in OpenAI's format. \ No newline at end of file + The full response from an LLM, structured as a JSON in OpenAI's format. 
From b7ddb90a6f952a2f3d37db3aac3f3d1376791692 Mon Sep 17 00:00:00 2001 From: Nir Gazit Date: Wed, 20 Mar 2024 16:08:00 +0200 Subject: [PATCH 12/36] fix: llm -> gen-ai --- .chloggen/first-gen-ai.yaml | 22 +++++++++++++++++ .github/ISSUE_TEMPLATE/bug_report.yaml | 2 +- .github/ISSUE_TEMPLATE/change_proposal.yaml | 2 +- .github/ISSUE_TEMPLATE/new-conventions.yaml | 2 +- .../attributes-registry/{llm.md => gen-ai.md} | 0 docs/{ai => gen-ai}/README.md | 0 docs/{ai => gen-ai}/llm-spans.md | 24 +++++++++---------- model/registry/{llm.yaml => gen-ai.yaml} | 0 model/trace/{llm.yaml => gen-ai.yaml} | 0 9 files changed, 37 insertions(+), 15 deletions(-) create mode 100755 .chloggen/first-gen-ai.yaml rename docs/attributes-registry/{llm.md => gen-ai.md} (100%) rename docs/{ai => gen-ai}/README.md (100%) rename docs/{ai => gen-ai}/llm-spans.md (62%) rename model/registry/{llm.yaml => gen-ai.yaml} (100%) rename model/trace/{llm.yaml => gen-ai.yaml} (100%) diff --git a/.chloggen/first-gen-ai.yaml b/.chloggen/first-gen-ai.yaml new file mode 100755 index 0000000000..ab49109161 --- /dev/null +++ b/.chloggen/first-gen-ai.yaml @@ -0,0 +1,22 @@ +# Use this changelog template to create an entry for release notes. +# +# If your change doesn't affect end users you should instead start +# your pull request title with [chore] or use the "Skip Changelog" label. + +# One of 'breaking', 'deprecation', 'new_component', 'enhancement', 'bug_fix' +change_type: new_component + +# The name of the area of concern in the attributes-registry, (e.g. http, cloud, db) +component: gen-ai + +# A brief description of the change. Surround your text with quotes ("") if it needs to start with a backtick (`). +note: Introducing semantic conventions for LLM applications. + +# Mandatory: One or more tracking issues related to the change. You can use the PR number here if no issue exists. +# The values here must be integers. +issues: [327] + +# (Optional) One or more lines of additional information to render under the primary note. +# These lines will be padded with 2 spaces and then inserted directly into the document. +# Use pipe (|) for multiline entries. 
+subtext: diff --git a/.github/ISSUE_TEMPLATE/bug_report.yaml b/.github/ISSUE_TEMPLATE/bug_report.yaml index 9880ce7604..e47436566b 100644 --- a/.github/ISSUE_TEMPLATE/bug_report.yaml +++ b/.github/ISSUE_TEMPLATE/bug_report.yaml @@ -33,10 +33,10 @@ body: - area:error - area:exception - area:faas + - area:gen-ai - area:host - area:http - area:k8s - - area:llm - area:messaging - area:network - area:oci diff --git a/.github/ISSUE_TEMPLATE/change_proposal.yaml b/.github/ISSUE_TEMPLATE/change_proposal.yaml index a70efbd965..13d3ffee93 100644 --- a/.github/ISSUE_TEMPLATE/change_proposal.yaml +++ b/.github/ISSUE_TEMPLATE/change_proposal.yaml @@ -26,10 +26,10 @@ body: - area:error - area:exception - area:faas + - area:gen-ai - area:host - area:http - area:k8s - - area:llm - area:messaging - area:network - area:oci diff --git a/.github/ISSUE_TEMPLATE/new-conventions.yaml b/.github/ISSUE_TEMPLATE/new-conventions.yaml index 84f8d9d03f..0b5f4b8f49 100644 --- a/.github/ISSUE_TEMPLATE/new-conventions.yaml +++ b/.github/ISSUE_TEMPLATE/new-conventions.yaml @@ -35,10 +35,10 @@ body: - area:error - area:exception - area:faas + - area:gen-ai - area:host - area:http - area:k8s - - area:llm - area:messaging - area:network - area:oci diff --git a/docs/attributes-registry/llm.md b/docs/attributes-registry/gen-ai.md similarity index 100% rename from docs/attributes-registry/llm.md rename to docs/attributes-registry/gen-ai.md diff --git a/docs/ai/README.md b/docs/gen-ai/README.md similarity index 100% rename from docs/ai/README.md rename to docs/gen-ai/README.md diff --git a/docs/ai/llm-spans.md b/docs/gen-ai/llm-spans.md similarity index 62% rename from docs/ai/llm-spans.md rename to docs/gen-ai/llm-spans.md index 9a58096ed0..8fea1143ed 100644 --- a/docs/ai/llm-spans.md +++ b/docs/gen-ai/llm-spans.md @@ -38,16 +38,16 @@ These attributes track input data and metadata for a request to an LLM. Each att | Attribute | Type | Description | Examples | Requirement Level | |---|---|---|---|---| -| [`gen_ai.llm.request.max_tokens`](../attributes-registry/llm.md) | int | The maximum number of tokens the LLM generates for a request. | `100` | Recommended | -| [`gen_ai.llm.request.model`](../attributes-registry/llm.md) | string | The name of the LLM a request is being made to. [1] | `gpt-4` | Required | -| [`gen_ai.llm.request.temperature`](../attributes-registry/llm.md) | double | The temperature setting for the LLM request. | `0.0` | Recommended | -| [`gen_ai.llm.request.top_p`](../attributes-registry/llm.md) | double | The top_p sampling setting for the LLM request. | `1.0` | Recommended | -| [`gen_ai.llm.response.finish_reason`](../attributes-registry/llm.md) | string[] | Array of reasons the model stopped generating tokens, corresponding to each generation received. | `[['stop']]` | Recommended | -| [`gen_ai.llm.response.id`](../attributes-registry/llm.md) | string | The unique identifier for the completion. | `chatcmpl-123` | Recommended | -| [`gen_ai.llm.response.model`](../attributes-registry/llm.md) | string | The name of the LLM a response is being made to. [2] | `gpt-4-0613` | Required | -| [`gen_ai.llm.system`](../attributes-registry/llm.md) | string | The name of the LLM foundation model vendor, if applicable. [3] | `openai` | Recommended | -| [`gen_ai.llm.usage.completion_tokens`](../attributes-registry/llm.md) | int | The number of tokens used in the LLM response (completion). 
| `180` | Recommended | -| [`gen_ai.llm.usage.prompt_tokens`](../attributes-registry/llm.md) | int | The number of tokens used in the LLM prompt. | `100` | Recommended | +| [`gen_ai.llm.request.max_tokens`](../attributes-registry/gen-ai.md) | int | The maximum number of tokens the LLM generates for a request. | `100` | Recommended | +| [`gen_ai.llm.request.model`](../attributes-registry/gen-ai.md) | string | The name of the LLM a request is being made to. [1] | `gpt-4` | Required | +| [`gen_ai.llm.request.temperature`](../attributes-registry/gen-ai.md) | double | The temperature setting for the LLM request. | `0.0` | Recommended | +| [`gen_ai.llm.request.top_p`](../attributes-registry/gen-ai.md) | double | The top_p sampling setting for the LLM request. | `1.0` | Recommended | +| [`gen_ai.llm.response.finish_reason`](../attributes-registry/gen-ai.md) | string[] | Array of reasons the model stopped generating tokens, corresponding to each generation received. | `[['stop']]` | Recommended | +| [`gen_ai.llm.response.id`](../attributes-registry/gen-ai.md) | string | The unique identifier for the completion. | `chatcmpl-123` | Recommended | +| [`gen_ai.llm.response.model`](../attributes-registry/gen-ai.md) | string | The name of the LLM a response is being made to. [2] | `gpt-4-0613` | Required | +| [`gen_ai.llm.system`](../attributes-registry/gen-ai.md) | string | The name of the LLM foundation model vendor, if applicable. [3] | `openai` | Recommended | +| [`gen_ai.llm.usage.completion_tokens`](../attributes-registry/gen-ai.md) | int | The number of tokens used in the LLM response (completion). | `180` | Recommended | +| [`gen_ai.llm.usage.prompt_tokens`](../attributes-registry/gen-ai.md) | int | The number of tokens used in the LLM prompt. | `100` | Recommended | **[1]:** The name of the LLM a request is being made to. If the LLM is supplied by a vendor, then the value must be the exact name of the model requested. If the LLM is a fine-tuned custom model, the value should have a more specific name than the base model that's been fine-tuned. @@ -65,7 +65,7 @@ The event name MUST be `gen_ai.llm.content.prompt`. | Attribute | Type | Description | Examples | Requirement Level | |---|---|---|---|---| -| [`gen_ai.llm.prompt`](../attributes-registry/llm.md) | string | The full prompt sent to an LLM, as a stringified JSON in OpenAI's format. [1] | `[{'role': 'user', 'content': 'What is the capital of France?'}]` | Recommended | +| [`gen_ai.llm.prompt`](../attributes-registry/gen-ai.md) | string | The full prompt sent to an LLM, as a stringified JSON in OpenAI's format. [1] | `[{'role': 'user', 'content': 'What is the capital of France?'}]` | Recommended | **[1]:** The full prompt sent to an LLM in a request, structured as a JSON in OpenAI's format. @@ -75,7 +75,7 @@ The event name MUST be `gen_ai.llm.content.completion`. | Attribute | Type | Description | Examples | Requirement Level | |---|---|---|---|---| -| [`gen_ai.llm.completion`](../attributes-registry/llm.md) | string | The full response received from the LLM, as a stringified JSON in OpenAI's format. [1] | `[{'role': 'assistant', 'content': 'The capital of France is Paris.'}]` | Recommended | +| [`gen_ai.llm.completion`](../attributes-registry/gen-ai.md) | string | The full response received from the LLM, as a stringified JSON in OpenAI's format. [1] | `[{'role': 'assistant', 'content': 'The capital of France is Paris.'}]` | Recommended | **[1]:** The full response from an LLM, structured as a JSON in OpenAI's format. 
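The prompt and completion events above are opt-in: instrumentations that capture content must offer a way to turn that capture off, and content should not be captured by default. One possible shape for that gate is sketched below, assuming the OpenTelemetry Python API; the environment variable name and helper function are illustrative assumptions, since the convention does not define a configuration mechanism.

```python
import os

# Assumed opt-in switch; the convention does not standardize this variable name.
_CAPTURE_CONTENT = os.environ.get(
    "OTEL_INSTRUMENTATION_LLM_CAPTURE_CONTENT", "false"
).strip().lower() == "true"

def maybe_record_content(span, prompt_json, completion_json):
    """Attach prompt/completion events only when content capture is enabled."""
    if not _CAPTURE_CONTENT:
        return  # default: prompt and completion content is not recorded
    span.add_event(
        "gen_ai.llm.content.prompt",
        attributes={"gen_ai.llm.prompt": prompt_json},
    )
    span.add_event(
        "gen_ai.llm.content.completion",
        attributes={"gen_ai.llm.completion": completion_json},
    )
```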
diff --git a/model/registry/llm.yaml b/model/registry/gen-ai.yaml similarity index 100% rename from model/registry/llm.yaml rename to model/registry/gen-ai.yaml diff --git a/model/trace/llm.yaml b/model/trace/gen-ai.yaml similarity index 100% rename from model/trace/llm.yaml rename to model/trace/gen-ai.yaml From 774556907ddd6a7288104c7220cbdf95d8d4bb7a Mon Sep 17 00:00:00 2001 From: Nir Gazit Date: Thu, 21 Mar 2024 13:44:37 +0200 Subject: [PATCH 13/36] fix: following @lmolkova review --- .chloggen/first-gen-ai.yaml | 2 +- .github/CODEOWNERS | 7 +++++ docs/attributes-registry/gen-ai.md | 30 ++++++++++-------- docs/gen-ai/llm-spans.md | 43 +++++++++++++------------- model/registry/gen-ai.yaml | 11 +++++-- model/trace/gen-ai.yaml | 49 ++++++++++++++++-------------- 6 files changed, 83 insertions(+), 59 deletions(-) diff --git a/.chloggen/first-gen-ai.yaml b/.chloggen/first-gen-ai.yaml index ab49109161..7539ba83c2 100755 --- a/.chloggen/first-gen-ai.yaml +++ b/.chloggen/first-gen-ai.yaml @@ -10,7 +10,7 @@ change_type: new_component component: gen-ai # A brief description of the change. Surround your text with quotes ("") if it needs to start with a backtick (`). -note: Introducing semantic conventions for LLM applications. +note: Introducing semantic conventions for LLM clients. # Mandatory: One or more tracking issues related to the change. You can use the PR number here if no issue exists. # The values here must be integers. diff --git a/.github/CODEOWNERS b/.github/CODEOWNERS index 3556a7bbe4..fa4bdc724e 100644 --- a/.github/CODEOWNERS +++ b/.github/CODEOWNERS @@ -78,4 +78,11 @@ /model/metrics/dotnet/ @open-telemetry/specs-semconv-approvers @open-telemetry/semconv-dotnet-approver @open-telemetry/semconv-http-approvers /docs/dotnet/ @open-telemetry/specs-semconv-approvers @open-telemetry/semconv-dotnet-approver @open-telemetry/semconv-http-approvers +# Gen-AI semantic conventions approvers +/model/registry/gen-ai.yaml @open-telemetry/specs-semconv-approvers @open-telemetry/semconv-llm-approvers -approvers +/model/metrics/gen-ai.yaml @open-telemetry/specs-semconv-approvers @open-telemetry/semconv-llm-approvers -approvers +/model/trace/gen-ai.yaml @open-telemetry/specs-semconv-approvers @open-telemetry/semconv-llm-approvers -approvers +/docs/gen-ai/ @open-telemetry/specs-semconv-approvers @open-telemetry/semconv-llm-approvers -approvers +/docs/attributes-registry/gen-ai.md @open-telemetry/specs-semconv-approvers @open-telemetry/semconv-llm-approvers -approvers + # TODO - Add semconv area experts diff --git a/docs/attributes-registry/gen-ai.md b/docs/attributes-registry/gen-ai.md index b4e4f03ced..47710e7c78 100644 --- a/docs/attributes-registry/gen-ai.md +++ b/docs/attributes-registry/gen-ai.md @@ -19,11 +19,17 @@ | Attribute | Type | Description | Examples | |---|---|---|---| -| `gen_ai.llm.request.max_tokens` | int | The maximum number of tokens the LLM generates for a request. | `100` | -| `gen_ai.llm.request.model` | string | The name of the LLM a request is being made to. | `gpt-4` | -| `gen_ai.llm.request.temperature` | double | The temperature setting for the LLM request. | `0.0` | -| `gen_ai.llm.request.top_p` | double | The top_p sampling setting for the LLM request. | `1.0` | -| `gen_ai.llm.system` | string | The name of the LLM foundation model vendor, if applicable. | `openai` | +| `gen_ai.request.max_tokens` | int | The maximum number of tokens the LLM generates for a request. | `100` | +| `gen_ai.request.model` | string | The name of the LLM a request is being made to. 
| `gpt-4` | +| `gen_ai.request.temperature` | double | The temperature setting for the LLM request. | `0.0` | +| `gen_ai.request.top_p` | double | The top_p sampling setting for the LLM request. | `1.0` | +| `gen_ai.system` | string | The name of the LLM foundation model vendor, if applicable. | `openai` | + +`gen_ai.system` has the following list of well-known values. If one of them applies, then the respective value MUST be used, otherwise a custom value MAY be used. + +| Value | Description | +|---|---| +| `OpenAI` | OpenAI models like GPT, DALL-E, Sora, etc. | ### Response Attributes @@ -31,11 +37,11 @@ | Attribute | Type | Description | Examples | |---|---|---|---| -| `gen_ai.llm.response.finish_reason` | string[] | Array of reasons the model stopped generating tokens, corresponding to each generation received. | `[['stop']]` | -| `gen_ai.llm.response.id` | string | The unique identifier for the completion. | `chatcmpl-123` | -| `gen_ai.llm.response.model` | string | The name of the LLM a response is being made to. | `gpt-4-0613` | -| `gen_ai.llm.usage.completion_tokens` | int | The number of tokens used in the LLM response (completion). | `180` | -| `gen_ai.llm.usage.prompt_tokens` | int | The number of tokens used in the LLM prompt. | `100` | +| `gen_ai.response.finish_reasons` | string[] | Array of reasons the model stopped generating tokens, corresponding to each generation received. | `[['stop']]` | +| `gen_ai.response.id` | string | The unique identifier for the completion. | `chatcmpl-123` | +| `gen_ai.response.model` | string | The name of the LLM a response is being made to. | `gpt-4-0613` | +| `gen_ai.usage.completion_tokens` | int | The number of tokens used in the LLM response (completion). | `180` | +| `gen_ai.usage.prompt_tokens` | int | The number of tokens used in the LLM prompt. | `100` | ### Event Attributes @@ -43,6 +49,6 @@ | Attribute | Type | Description | Examples | |---|---|---|---| -| `gen_ai.llm.completion` | string | The full response received from the LLM, as a stringified JSON in OpenAI's format. | `[{'role': 'assistant', 'content': 'The capital of France is Paris.'}]` | -| `gen_ai.llm.prompt` | string | The full prompt sent to an LLM, as a stringified JSON in OpenAI's format. | `[{'role': 'user', 'content': 'What is the capital of France?'}]` | +| `gen_ai.completion` | string | The full response received from the LLM, as a stringified JSON in OpenAI's format. | `[{'role': 'assistant', 'content': 'The capital of France is Paris.'}]` | +| `gen_ai.prompt` | string | The full prompt sent to an LLM, as a stringified JSON in OpenAI's format. | `[{'role': 'user', 'content': 'What is the capital of France?'}]` | \ No newline at end of file diff --git a/docs/gen-ai/llm-spans.md b/docs/gen-ai/llm-spans.md index 8fea1143ed..34aeebff04 100644 --- a/docs/gen-ai/llm-spans.md +++ b/docs/gen-ai/llm-spans.md @@ -18,12 +18,13 @@ linkTitle: LLM Calls A request to an LLM is modeled as a span in a trace. -The **span name** SHOULD be set to a low cardinality value representing the request made to an LLM. -It MAY be a name of the API endpoint for the LLM being called. +The **span name** SHOULD be set to a low cardinality value describing an operation made to an LLM. +For example, the API name such as [Create chat completion](https://platform.openai.com/docs/api-reference/chat/create) ## Configuration -Instrumentations for LLMs MUST offer the ability to turn off capture of prompts and completions. 
This is for three primary reasons: +Instrumentations for LLMs MAY capture prompts and completions. +Instrumentations that support it, MUST offer the ability to turn off capture of prompts and completions. This is for three primary reasons: 1. Data privacy concerns. End users of LLM applications may input sensitive information or personally identifiable information (PII) that they do not wish to be sent to a telemetry backend. 2. Data size concerns. Although there is no specified limit to sizes, there are practical limitations in programming languages and telemety systems. Some LLMs allow for extremely large context windows that end users may take full advantage of. @@ -35,47 +36,47 @@ By default, these configurations SHOULD NOT capture prompts and completions. These attributes track input data and metadata for a request to an LLM. Each attribute represents a concept that is common to most LLMs. - + | Attribute | Type | Description | Examples | Requirement Level | |---|---|---|---|---| -| [`gen_ai.llm.request.max_tokens`](../attributes-registry/gen-ai.md) | int | The maximum number of tokens the LLM generates for a request. | `100` | Recommended | -| [`gen_ai.llm.request.model`](../attributes-registry/gen-ai.md) | string | The name of the LLM a request is being made to. [1] | `gpt-4` | Required | -| [`gen_ai.llm.request.temperature`](../attributes-registry/gen-ai.md) | double | The temperature setting for the LLM request. | `0.0` | Recommended | -| [`gen_ai.llm.request.top_p`](../attributes-registry/gen-ai.md) | double | The top_p sampling setting for the LLM request. | `1.0` | Recommended | -| [`gen_ai.llm.response.finish_reason`](../attributes-registry/gen-ai.md) | string[] | Array of reasons the model stopped generating tokens, corresponding to each generation received. | `[['stop']]` | Recommended | -| [`gen_ai.llm.response.id`](../attributes-registry/gen-ai.md) | string | The unique identifier for the completion. | `chatcmpl-123` | Recommended | -| [`gen_ai.llm.response.model`](../attributes-registry/gen-ai.md) | string | The name of the LLM a response is being made to. [2] | `gpt-4-0613` | Required | -| [`gen_ai.llm.system`](../attributes-registry/gen-ai.md) | string | The name of the LLM foundation model vendor, if applicable. [3] | `openai` | Recommended | -| [`gen_ai.llm.usage.completion_tokens`](../attributes-registry/gen-ai.md) | int | The number of tokens used in the LLM response (completion). | `180` | Recommended | -| [`gen_ai.llm.usage.prompt_tokens`](../attributes-registry/gen-ai.md) | int | The number of tokens used in the LLM prompt. | `100` | Recommended | +| [`gen_ai.request.max_tokens`](../attributes-registry/gen-ai.md) | int | The maximum number of tokens the LLM generates for a request. | `100` | Recommended | +| [`gen_ai.request.model`](../attributes-registry/gen-ai.md) | string | The name of the LLM a request is being made to. [1] | `gpt-4` | Required | +| [`gen_ai.request.temperature`](../attributes-registry/gen-ai.md) | double | The temperature setting for the LLM request. | `0.0` | Recommended | +| [`gen_ai.request.top_p`](../attributes-registry/gen-ai.md) | double | The top_p sampling setting for the LLM request. | `1.0` | Recommended | +| [`gen_ai.response.finish_reasons`](../attributes-registry/gen-ai.md) | string[] | Array of reasons the model stopped generating tokens, corresponding to each generation received. | `[['stop']]` | Recommended | +| [`gen_ai.response.id`](../attributes-registry/gen-ai.md) | string | The unique identifier for the completion. 
| `chatcmpl-123` | Recommended | +| [`gen_ai.response.model`](../attributes-registry/gen-ai.md) | string | The name of the LLM a response is being made to. [2] | `gpt-4-0613` | Conditionally Required: if response was received | +| [`gen_ai.system`](../attributes-registry/gen-ai.md) | string | The name of the LLM foundation model vendor, if applicable. [3] | `openai` | Required | +| [`gen_ai.usage.completion_tokens`](../attributes-registry/gen-ai.md) | int | The number of tokens used in the LLM response (completion). | `180` | Recommended | +| [`gen_ai.usage.prompt_tokens`](../attributes-registry/gen-ai.md) | int | The number of tokens used in the LLM prompt. | `100` | Recommended | **[1]:** The name of the LLM a request is being made to. If the LLM is supplied by a vendor, then the value must be the exact name of the model requested. If the LLM is a fine-tuned custom model, the value should have a more specific name than the base model that's been fine-tuned. **[2]:** The name of the LLM a response is being made to. If the LLM is supplied by a vendor, then the value must be the exact name of the model actually used. If the LLM is a fine-tuned custom model, the value should have a more specific name than the base model that's been fine-tuned. -**[3]:** The name of the LLM foundation model vendor, if applicable. If not using a vendor-supplied model, this field is left blank. +**[3]:** The name of the LLM foundation model vendor, if applicable. If not using a vendor-supplied model, provide a custom friendly name, such as a name of the company or project. If the instrumetnation reports any attributes specific to a custom model, the value provided in the `gen_ai.system` SHOULD match the custom attribute namespace segment. For example, if `gen_ai.system` is set to `the_best_llm`, custom attributes should be added in the `gen_ai.the_best_llm.*` namespace. If none of above options apply, the instrumentation should set `_OTHER`. ## Events In the lifetime of an LLM span, an event for prompts sent and completions received MAY be created, depending on the configuration of the instrumentation. - -The event name MUST be `gen_ai.llm.content.prompt`. + +The event name MUST be `gen_ai.content.prompt`. | Attribute | Type | Description | Examples | Requirement Level | |---|---|---|---|---| -| [`gen_ai.llm.prompt`](../attributes-registry/gen-ai.md) | string | The full prompt sent to an LLM, as a stringified JSON in OpenAI's format. [1] | `[{'role': 'user', 'content': 'What is the capital of France?'}]` | Recommended | +| [`gen_ai.prompt`](../attributes-registry/gen-ai.md) | string | The full prompt sent to an LLM, as a stringified JSON in OpenAI's format. [1] | `[{'role': 'user', 'content': 'What is the capital of France?'}]` | Recommended | **[1]:** The full prompt sent to an LLM in a request, structured as a JSON in OpenAI's format. - -The event name MUST be `gen_ai.llm.content.completion`. + +The event name MUST be `gen_ai.content.completion`. | Attribute | Type | Description | Examples | Requirement Level | |---|---|---|---|---| -| [`gen_ai.llm.completion`](../attributes-registry/gen-ai.md) | string | The full response received from the LLM, as a stringified JSON in OpenAI's format. [1] | `[{'role': 'assistant', 'content': 'The capital of France is Paris.'}]` | Recommended | +| [`gen_ai.completion`](../attributes-registry/gen-ai.md) | string | The full response received from the LLM, as a stringified JSON in OpenAI's format. 
[1] | `[{'role': 'assistant', 'content': 'The capital of France is Paris.'}]` | Recommended | **[1]:** The full response from an LLM, structured as a JSON in OpenAI's format. diff --git a/model/registry/gen-ai.yaml b/model/registry/gen-ai.yaml index 999691c777..28d96ccfd3 100644 --- a/model/registry/gen-ai.yaml +++ b/model/registry/gen-ai.yaml @@ -1,12 +1,17 @@ groups: - id: registry.llm - prefix: gen_ai.llm + prefix: gen_ai type: attribute_group brief: > This document defines the attributes used to describe telemetry in the context of LLM (Large Language Models) requests and responses. attributes: - id: system - type: string + type: + allow_custom_values: true + members: + - id: openai + value: "OpenAI" + brief: 'OpenAI models like GPT, DALL-E, Sora, etc.' brief: The name of the LLM foundation model vendor, if applicable. examples: 'openai' tag: llm-generic-request @@ -40,7 +45,7 @@ groups: brief: The name of the LLM a response is being made to. examples: ['gpt-4-0613'] tag: llm-generic-response - - id: response.finish_reason + - id: response.finish_reasons type: string[] brief: Array of reasons the model stopped generating tokens, corresponding to each generation received. examples: [['stop']] diff --git a/model/trace/gen-ai.yaml b/model/trace/gen-ai.yaml index 0bdb23b2a3..1273ffbdb5 100644 --- a/model/trace/gen-ai.yaml +++ b/model/trace/gen-ai.yaml @@ -1,63 +1,68 @@ groups: - - id: gen_ai.llm.request + - id: gen_ai.request type: span brief: > A request to an LLM is modeled as a span in a trace. The span name should be a low cardinality value representing the request made to an LLM, like the name of the API endpoint being called. attributes: - - ref: gen_ai.llm.system - requirement_level: recommended + - ref: gen_ai.system + requirement_level: required note: > - The name of the LLM foundation model vendor, if applicable. If not using a vendor-supplied model, this field is left blank. - - ref: gen_ai.llm.request.model + The name of the LLM foundation model vendor, if applicable. + If not using a vendor-supplied model, provide a custom friendly name, such as a name of the company or project. + If the instrumetnation reports any attributes specific to a custom model, the value provided in the `gen_ai.system` SHOULD match the custom attribute namespace segment. + For example, if `gen_ai.system` is set to `the_best_llm`, custom attributes should be added in the `gen_ai.the_best_llm.*` namespace. + If none of above options apply, the instrumentation should set `_OTHER`. + - ref: gen_ai.request.model requirement_level: required note: > The name of the LLM a request is being made to. If the LLM is supplied by a vendor, then the value must be the exact name of the model requested. If the LLM is a fine-tuned custom model, the value should have a more specific name than the base model that's been fine-tuned. - - ref: gen_ai.llm.request.max_tokens + - ref: gen_ai.request.max_tokens requirement_level: recommended - - ref: gen_ai.llm.request.temperature + - ref: gen_ai.request.temperature requirement_level: recommended - - ref: gen_ai.llm.request.top_p + - ref: gen_ai.request.top_p requirement_level: recommended - - ref: gen_ai.llm.response.id + - ref: gen_ai.response.id requirement_level: recommended - - ref: gen_ai.llm.response.model - requirement_level: required + - ref: gen_ai.response.model + requirement_level: + conditionally_required: if response was received note: > The name of the LLM a response is being made to. 
If the LLM is supplied by a vendor, then the value must be the exact name of the model actually used. If the LLM is a fine-tuned custom model, the value should have a more specific name than the base model that's been fine-tuned. - - ref: gen_ai.llm.response.finish_reason + - ref: gen_ai.response.finish_reasons requirement_level: recommended - - ref: gen_ai.llm.usage.prompt_tokens + - ref: gen_ai.usage.prompt_tokens requirement_level: recommended - - ref: gen_ai.llm.usage.completion_tokens + - ref: gen_ai.usage.completion_tokens requirement_level: recommended events: - - gen_ai.llm.content.prompt - - gen_ai.llm.content.completion + - gen_ai.content.prompt + - gen_ai.content.completion - - id: gen_ai.llm.content.prompt - name: gen_ai.llm.content.prompt + - id: gen_ai.content.prompt + name: gen_ai.content.prompt type: event brief: > In the lifetime of an LLM span, events for prompts sent and completions received may be created, depending on the configuration of the instrumentation. attributes: - - ref: gen_ai.llm.prompt + - ref: gen_ai.prompt requirement_level: recommended note: > The full prompt sent to an LLM in a request, structured as a JSON in OpenAI's format. - - id: gen_ai.llm.content.completion - name: gen_ai.llm.content.completion + - id: gen_ai.content.completion + name: gen_ai.content.completion type: event brief: > In the lifetime of an LLM span, events for prompts sent and completions received may be created, depending on the configuration of the instrumentation. attributes: - - ref: gen_ai.llm.completion + - ref: gen_ai.completion requirement_level: recommended note: > The full response from an LLM, structured as a JSON in OpenAI's format. From 42551ce83996b7aa0ca0e895663134c0bb683198 Mon Sep 17 00:00:00 2001 From: Nir Gazit Date: Fri, 22 Mar 2024 19:56:39 +0200 Subject: [PATCH 14/36] Update model/registry/gen-ai.yaml Co-authored-by: Liudmila Molkova --- model/registry/gen-ai.yaml | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/model/registry/gen-ai.yaml b/model/registry/gen-ai.yaml index 28d96ccfd3..aa7939b3ce 100644 --- a/model/registry/gen-ai.yaml +++ b/model/registry/gen-ai.yaml @@ -11,7 +11,7 @@ groups: members: - id: openai value: "OpenAI" - brief: 'OpenAI models like GPT, DALL-E, Sora, etc.' + brief: 'OpenAI' brief: The name of the LLM foundation model vendor, if applicable. examples: 'openai' tag: llm-generic-request From 57aaf77071a36b0f63b55954381629821f0abbce Mon Sep 17 00:00:00 2001 From: Nir Gazit Date: Fri, 22 Mar 2024 19:56:46 +0200 Subject: [PATCH 15/36] Update model/registry/gen-ai.yaml Co-authored-by: Liudmila Molkova --- model/registry/gen-ai.yaml | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/model/registry/gen-ai.yaml b/model/registry/gen-ai.yaml index aa7939b3ce..acb204bc3e 100644 --- a/model/registry/gen-ai.yaml +++ b/model/registry/gen-ai.yaml @@ -12,7 +12,7 @@ groups: - id: openai value: "OpenAI" brief: 'OpenAI' - brief: The name of the LLM foundation model vendor, if applicable. + brief: The name of the LLM foundation model vendor. 
examples: 'openai' tag: llm-generic-request - id: request.model From e49c3db9e8ba3c321773a4e2d0fd162b43ea32d3 Mon Sep 17 00:00:00 2001 From: Nir Gazit Date: Fri, 22 Mar 2024 19:57:21 +0200 Subject: [PATCH 16/36] Update .github/CODEOWNERS Co-authored-by: Phillip Carter --- .github/CODEOWNERS | 10 +++++----- 1 file changed, 5 insertions(+), 5 deletions(-) diff --git a/.github/CODEOWNERS b/.github/CODEOWNERS index fa4bdc724e..de284edf16 100644 --- a/.github/CODEOWNERS +++ b/.github/CODEOWNERS @@ -79,10 +79,10 @@ /docs/dotnet/ @open-telemetry/specs-semconv-approvers @open-telemetry/semconv-dotnet-approver @open-telemetry/semconv-http-approvers # Gen-AI semantic conventions approvers -/model/registry/gen-ai.yaml @open-telemetry/specs-semconv-approvers @open-telemetry/semconv-llm-approvers -approvers -/model/metrics/gen-ai.yaml @open-telemetry/specs-semconv-approvers @open-telemetry/semconv-llm-approvers -approvers -/model/trace/gen-ai.yaml @open-telemetry/specs-semconv-approvers @open-telemetry/semconv-llm-approvers -approvers -/docs/gen-ai/ @open-telemetry/specs-semconv-approvers @open-telemetry/semconv-llm-approvers -approvers -/docs/attributes-registry/gen-ai.md @open-telemetry/specs-semconv-approvers @open-telemetry/semconv-llm-approvers -approvers +/model/registry/gen-ai.yaml @open-telemetry/specs-semconv-approvers @open-telemetry/semconv-llm-approvers +/model/metrics/gen-ai.yaml @open-telemetry/specs-semconv-approvers @open-telemetry/semconv-llm-approvers +/model/trace/gen-ai.yaml @open-telemetry/specs-semconv-approvers @open-telemetry/semconv-llm-approvers +/docs/gen-ai/ @open-telemetry/specs-semconv-approvers @open-telemetry/semconv-llm-approvers +/docs/attributes-registry/gen-ai.md @open-telemetry/specs-semconv-approvers @open-telemetry/semconv-llm-approvers # TODO - Add semconv area experts From cef4ca23e72f1dceff49d603585a59094cf9fceb Mon Sep 17 00:00:00 2001 From: Nir Gazit Date: Fri, 22 Mar 2024 19:57:30 +0200 Subject: [PATCH 17/36] Update model/trace/gen-ai.yaml Co-authored-by: Phillip Carter --- model/trace/gen-ai.yaml | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/model/trace/gen-ai.yaml b/model/trace/gen-ai.yaml index 1273ffbdb5..234dddf40b 100644 --- a/model/trace/gen-ai.yaml +++ b/model/trace/gen-ai.yaml @@ -65,4 +65,4 @@ groups: - ref: gen_ai.completion requirement_level: recommended note: > - The full response from an LLM, structured as a JSON in OpenAI's format. + The full response from an LLM, structured as JSON in OpenAI's format. From 3265778c953d3525f3a432f7bbc9584933a6e3cf Mon Sep 17 00:00:00 2001 From: Nir Gazit Date: Fri, 22 Mar 2024 19:58:14 +0200 Subject: [PATCH 18/36] Update model/trace/gen-ai.yaml Co-authored-by: Liudmila Molkova --- model/trace/gen-ai.yaml | 1 - 1 file changed, 1 deletion(-) diff --git a/model/trace/gen-ai.yaml b/model/trace/gen-ai.yaml index 234dddf40b..80c75ec1fe 100644 --- a/model/trace/gen-ai.yaml +++ b/model/trace/gen-ai.yaml @@ -7,7 +7,6 @@ groups: - ref: gen_ai.system requirement_level: required note: > - The name of the LLM foundation model vendor, if applicable. If not using a vendor-supplied model, provide a custom friendly name, such as a name of the company or project. If the instrumetnation reports any attributes specific to a custom model, the value provided in the `gen_ai.system` SHOULD match the custom attribute namespace segment. For example, if `gen_ai.system` is set to `the_best_llm`, custom attributes should be added in the `gen_ai.the_best_llm.*` namespace. 
From c0fdb9b5f0d978f0466bbabff5fd30f29b0886bd Mon Sep 17 00:00:00 2001 From: Nir Gazit Date: Fri, 22 Mar 2024 19:58:37 +0200 Subject: [PATCH 19/36] Update model/registry/gen-ai.yaml Co-authored-by: Liudmila Molkova --- model/registry/gen-ai.yaml | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/model/registry/gen-ai.yaml b/model/registry/gen-ai.yaml index acb204bc3e..84cceebc74 100644 --- a/model/registry/gen-ai.yaml +++ b/model/registry/gen-ai.yaml @@ -10,7 +10,7 @@ groups: allow_custom_values: true members: - id: openai - value: "OpenAI" + value: "openai" brief: 'OpenAI' brief: The name of the LLM foundation model vendor. examples: 'openai' From 9b25c2031960469dcf432154c8c7ada01ea98cb7 Mon Sep 17 00:00:00 2001 From: Nir Gazit Date: Fri, 22 Mar 2024 19:58:52 +0200 Subject: [PATCH 20/36] Update model/registry/gen-ai.yaml Co-authored-by: Liudmila Molkova --- model/registry/gen-ai.yaml | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/model/registry/gen-ai.yaml b/model/registry/gen-ai.yaml index 84cceebc74..3ccf4b41ad 100644 --- a/model/registry/gen-ai.yaml +++ b/model/registry/gen-ai.yaml @@ -48,7 +48,7 @@ groups: - id: response.finish_reasons type: string[] brief: Array of reasons the model stopped generating tokens, corresponding to each generation received. - examples: [['stop']] + examples: ['stop'] tag: llm-generic-response - id: usage.prompt_tokens type: int From fa15a8f21686f3bbcec84d285bc3756713e867ac Mon Sep 17 00:00:00 2001 From: Nir Gazit Date: Fri, 22 Mar 2024 19:59:01 +0200 Subject: [PATCH 21/36] Update model/trace/gen-ai.yaml Co-authored-by: Phillip Carter --- model/trace/gen-ai.yaml | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/model/trace/gen-ai.yaml b/model/trace/gen-ai.yaml index 80c75ec1fe..c8e704db10 100644 --- a/model/trace/gen-ai.yaml +++ b/model/trace/gen-ai.yaml @@ -29,7 +29,7 @@ groups: requirement_level: conditionally_required: if response was received note: > - The name of the LLM a response is being made to. If the LLM is supplied by a vendor, + The name of the LLM serving a response. If the LLM is supplied by a vendor, then the value must be the exact name of the model actually used. If the LLM is a fine-tuned custom model, the value should have a more specific name than the base model that's been fine-tuned. - ref: gen_ai.response.finish_reasons From 677c86a0a9efc039c4d34c7d9386ee77f83a360e Mon Sep 17 00:00:00 2001 From: Nir Gazit Date: Fri, 22 Mar 2024 19:59:16 +0200 Subject: [PATCH 22/36] Update model/trace/gen-ai.yaml Co-authored-by: Phillip Carter --- model/trace/gen-ai.yaml | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/model/trace/gen-ai.yaml b/model/trace/gen-ai.yaml index c8e704db10..815b46ba9f 100644 --- a/model/trace/gen-ai.yaml +++ b/model/trace/gen-ai.yaml @@ -52,7 +52,7 @@ groups: - ref: gen_ai.prompt requirement_level: recommended note: > - The full prompt sent to an LLM in a request, structured as a JSON in OpenAI's format. + The full prompt sent to an LLM in a request, structured as JSON in OpenAI's format. 
- id: gen_ai.content.completion name: gen_ai.content.completion From 61ffd915b56bea3b698d748a24c2abd708ff3aff Mon Sep 17 00:00:00 2001 From: Nir Gazit Date: Fri, 22 Mar 2024 20:00:59 +0200 Subject: [PATCH 23/36] Update model/registry/gen-ai.yaml Co-authored-by: Liudmila Molkova --- model/registry/gen-ai.yaml | 3 ++- 1 file changed, 2 insertions(+), 1 deletion(-) diff --git a/model/registry/gen-ai.yaml b/model/registry/gen-ai.yaml index 3ccf4b41ad..4da70e43be 100644 --- a/model/registry/gen-ai.yaml +++ b/model/registry/gen-ai.yaml @@ -62,7 +62,8 @@ groups: tag: llm-generic-response - id: prompt type: string - brief: The full prompt sent to an LLM, as a stringified JSON in OpenAI's format. + brief: The full prompt sent to an LLM. + note: It's RECOMMENDED to format prompts as JSON string matching [OpenAI messages format](https://platform.openai.com/docs/guides/text-generation) examples: ["[{'role': 'user', 'content': 'What is the capital of France?'}]"] tag: llm-generic-events - id: completion From 7f8f1e8022cf12cbb0ab4fb1c8a092693b5c80a6 Mon Sep 17 00:00:00 2001 From: Nir Gazit Date: Fri, 22 Mar 2024 20:05:23 +0200 Subject: [PATCH 24/36] fix: lint; regeneration --- docs/attributes-registry/gen-ai.md | 14 +++++++++----- docs/gen-ai/llm-spans.md | 20 ++++++++++---------- model/registry/gen-ai.yaml | 3 ++- model/trace/gen-ai.yaml | 4 ++-- 4 files changed, 23 insertions(+), 18 deletions(-) diff --git a/docs/attributes-registry/gen-ai.md b/docs/attributes-registry/gen-ai.md index 47710e7c78..9a1651b8b2 100644 --- a/docs/attributes-registry/gen-ai.md +++ b/docs/attributes-registry/gen-ai.md @@ -23,13 +23,13 @@ | `gen_ai.request.model` | string | The name of the LLM a request is being made to. | `gpt-4` | | `gen_ai.request.temperature` | double | The temperature setting for the LLM request. | `0.0` | | `gen_ai.request.top_p` | double | The top_p sampling setting for the LLM request. | `1.0` | -| `gen_ai.system` | string | The name of the LLM foundation model vendor, if applicable. | `openai` | +| `gen_ai.system` | string | The name of the LLM foundation model vendor. | `openai` | `gen_ai.system` has the following list of well-known values. If one of them applies, then the respective value MUST be used, otherwise a custom value MAY be used. | Value | Description | |---|---| -| `OpenAI` | OpenAI models like GPT, DALL-E, Sora, etc. | +| `openai` | OpenAI | ### Response Attributes @@ -37,7 +37,7 @@ | Attribute | Type | Description | Examples | |---|---|---|---| -| `gen_ai.response.finish_reasons` | string[] | Array of reasons the model stopped generating tokens, corresponding to each generation received. | `[['stop']]` | +| `gen_ai.response.finish_reasons` | string[] | Array of reasons the model stopped generating tokens, corresponding to each generation received. | `[stop]` | | `gen_ai.response.id` | string | The unique identifier for the completion. | `chatcmpl-123` | | `gen_ai.response.model` | string | The name of the LLM a response is being made to. | `gpt-4-0613` | | `gen_ai.usage.completion_tokens` | int | The number of tokens used in the LLM response (completion). | `180` | @@ -49,6 +49,10 @@ | Attribute | Type | Description | Examples | |---|---|---|---| -| `gen_ai.completion` | string | The full response received from the LLM, as a stringified JSON in OpenAI's format. | `[{'role': 'assistant', 'content': 'The capital of France is Paris.'}]` | -| `gen_ai.prompt` | string | The full prompt sent to an LLM, as a stringified JSON in OpenAI's format. 
| `[{'role': 'user', 'content': 'What is the capital of France?'}]` | +| `gen_ai.completion` | string | The full response received from the LLM. [1] | `[{'role': 'assistant', 'content': 'The capital of France is Paris.'}]` | +| `gen_ai.prompt` | string | The full prompt sent to an LLM. [2] | `[{'role': 'user', 'content': 'What is the capital of France?'}]` | + +**[1]:** It's RECOMMENDED to format completions as JSON string matching [OpenAI messages format](https://platform.openai.com/docs/guides/text-generation) + +**[2]:** It's RECOMMENDED to format prompts as JSON string matching [OpenAI messages format](https://platform.openai.com/docs/guides/text-generation) \ No newline at end of file diff --git a/docs/gen-ai/llm-spans.md b/docs/gen-ai/llm-spans.md index 34aeebff04..6d88fcd5f6 100644 --- a/docs/gen-ai/llm-spans.md +++ b/docs/gen-ai/llm-spans.md @@ -19,11 +19,11 @@ linkTitle: LLM Calls A request to an LLM is modeled as a span in a trace. The **span name** SHOULD be set to a low cardinality value describing an operation made to an LLM. -For example, the API name such as [Create chat completion](https://platform.openai.com/docs/api-reference/chat/create) +For example, the API name such as [Create chat completion](https://platform.openai.com/docs/api-reference/chat/create) ## Configuration -Instrumentations for LLMs MAY capture prompts and completions. +Instrumentations for LLMs MAY capture prompts and completions. Instrumentations that support it, MUST offer the ability to turn off capture of prompts and completions. This is for three primary reasons: 1. Data privacy concerns. End users of LLM applications may input sensitive information or personally identifiable information (PII) that they do not wish to be sent to a telemetry backend. @@ -43,18 +43,18 @@ These attributes track input data and metadata for a request to an LLM. Each att | [`gen_ai.request.model`](../attributes-registry/gen-ai.md) | string | The name of the LLM a request is being made to. [1] | `gpt-4` | Required | | [`gen_ai.request.temperature`](../attributes-registry/gen-ai.md) | double | The temperature setting for the LLM request. | `0.0` | Recommended | | [`gen_ai.request.top_p`](../attributes-registry/gen-ai.md) | double | The top_p sampling setting for the LLM request. | `1.0` | Recommended | -| [`gen_ai.response.finish_reasons`](../attributes-registry/gen-ai.md) | string[] | Array of reasons the model stopped generating tokens, corresponding to each generation received. | `[['stop']]` | Recommended | +| [`gen_ai.response.finish_reasons`](../attributes-registry/gen-ai.md) | string[] | Array of reasons the model stopped generating tokens, corresponding to each generation received. | `[stop]` | Recommended | | [`gen_ai.response.id`](../attributes-registry/gen-ai.md) | string | The unique identifier for the completion. | `chatcmpl-123` | Recommended | | [`gen_ai.response.model`](../attributes-registry/gen-ai.md) | string | The name of the LLM a response is being made to. [2] | `gpt-4-0613` | Conditionally Required: if response was received | -| [`gen_ai.system`](../attributes-registry/gen-ai.md) | string | The name of the LLM foundation model vendor, if applicable. [3] | `openai` | Required | +| [`gen_ai.system`](../attributes-registry/gen-ai.md) | string | The name of the LLM foundation model vendor. [3] | `openai` | Required | | [`gen_ai.usage.completion_tokens`](../attributes-registry/gen-ai.md) | int | The number of tokens used in the LLM response (completion). 
| `180` | Recommended | | [`gen_ai.usage.prompt_tokens`](../attributes-registry/gen-ai.md) | int | The number of tokens used in the LLM prompt. | `100` | Recommended | **[1]:** The name of the LLM a request is being made to. If the LLM is supplied by a vendor, then the value must be the exact name of the model requested. If the LLM is a fine-tuned custom model, the value should have a more specific name than the base model that's been fine-tuned. -**[2]:** The name of the LLM a response is being made to. If the LLM is supplied by a vendor, then the value must be the exact name of the model actually used. If the LLM is a fine-tuned custom model, the value should have a more specific name than the base model that's been fine-tuned. +**[2]:** The name of the LLM serving a response. If the LLM is supplied by a vendor, then the value must be the exact name of the model actually used. If the LLM is a fine-tuned custom model, the value should have a more specific name than the base model that's been fine-tuned. -**[3]:** The name of the LLM foundation model vendor, if applicable. If not using a vendor-supplied model, provide a custom friendly name, such as a name of the company or project. If the instrumetnation reports any attributes specific to a custom model, the value provided in the `gen_ai.system` SHOULD match the custom attribute namespace segment. For example, if `gen_ai.system` is set to `the_best_llm`, custom attributes should be added in the `gen_ai.the_best_llm.*` namespace. If none of above options apply, the instrumentation should set `_OTHER`. +**[3]:** If not using a vendor-supplied model, provide a custom friendly name, such as a name of the company or project. If the instrumetnation reports any attributes specific to a custom model, the value provided in the `gen_ai.system` SHOULD match the custom attribute namespace segment. For example, if `gen_ai.system` is set to `the_best_llm`, custom attributes should be added in the `gen_ai.the_best_llm.*` namespace. If none of above options apply, the instrumentation should set `_OTHER`. ## Events @@ -66,9 +66,9 @@ The event name MUST be `gen_ai.content.prompt`. | Attribute | Type | Description | Examples | Requirement Level | |---|---|---|---|---| -| [`gen_ai.prompt`](../attributes-registry/gen-ai.md) | string | The full prompt sent to an LLM, as a stringified JSON in OpenAI's format. [1] | `[{'role': 'user', 'content': 'What is the capital of France?'}]` | Recommended | +| [`gen_ai.prompt`](../attributes-registry/gen-ai.md) | string | The full prompt sent to an LLM. [1] | `[{'role': 'user', 'content': 'What is the capital of France?'}]` | Recommended | -**[1]:** The full prompt sent to an LLM in a request, structured as a JSON in OpenAI's format. +**[1]:** It's RECOMMENDED to format prompts as JSON string matching [OpenAI messages format](https://platform.openai.com/docs/guides/text-generation) @@ -76,9 +76,9 @@ The event name MUST be `gen_ai.content.completion`. | Attribute | Type | Description | Examples | Requirement Level | |---|---|---|---|---| -| [`gen_ai.completion`](../attributes-registry/gen-ai.md) | string | The full response received from the LLM, as a stringified JSON in OpenAI's format. [1] | `[{'role': 'assistant', 'content': 'The capital of France is Paris.'}]` | Recommended | +| [`gen_ai.completion`](../attributes-registry/gen-ai.md) | string | The full response received from the LLM. 
[1] | `[{'role': 'assistant', 'content': 'The capital of France is Paris.'}]` | Recommended | -**[1]:** The full response from an LLM, structured as a JSON in OpenAI's format. +**[1]:** It's RECOMMENDED to format completions as JSON string matching [OpenAI messages format](https://platform.openai.com/docs/guides/text-generation) [DocumentStatus]: https://github.com/open-telemetry/opentelemetry-specification/tree/v1.22.0/specification/document-status.md diff --git a/model/registry/gen-ai.yaml b/model/registry/gen-ai.yaml index 4da70e43be..01dcd1b5d3 100644 --- a/model/registry/gen-ai.yaml +++ b/model/registry/gen-ai.yaml @@ -68,6 +68,7 @@ groups: tag: llm-generic-events - id: completion type: string - brief: The full response received from the LLM, as a stringified JSON in OpenAI's format. + brief: The full response received from the LLM. + note: It's RECOMMENDED to format completions as JSON string matching [OpenAI messages format](https://platform.openai.com/docs/guides/text-generation) examples: ["[{'role': 'assistant', 'content': 'The capital of France is Paris.'}]"] tag: llm-generic-events diff --git a/model/trace/gen-ai.yaml b/model/trace/gen-ai.yaml index 815b46ba9f..0e3409699e 100644 --- a/model/trace/gen-ai.yaml +++ b/model/trace/gen-ai.yaml @@ -52,7 +52,7 @@ groups: - ref: gen_ai.prompt requirement_level: recommended note: > - The full prompt sent to an LLM in a request, structured as JSON in OpenAI's format. + It's RECOMMENDED to format prompts as JSON string matching [OpenAI messages format](https://platform.openai.com/docs/guides/text-generation) - id: gen_ai.content.completion name: gen_ai.content.completion @@ -64,4 +64,4 @@ groups: - ref: gen_ai.completion requirement_level: recommended note: > - The full response from an LLM, structured as JSON in OpenAI's format. + It's RECOMMENDED to format completions as JSON string matching [OpenAI messages format](https://platform.openai.com/docs/guides/text-generation) From 74426de4f7e4559e8d62c5f22f2b3641ac02b6f3 Mon Sep 17 00:00:00 2001 From: Nir Gazit Date: Fri, 22 Mar 2024 20:18:52 +0200 Subject: [PATCH 25/36] Update docs/gen-ai/README.md Co-authored-by: Liudmila Molkova --- docs/gen-ai/README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs/gen-ai/README.md b/docs/gen-ai/README.md index 31bc5795cd..87d807da2b 100644 --- a/docs/gen-ai/README.md +++ b/docs/gen-ai/README.md @@ -2,7 +2,7 @@ linkTitle: AI path_base_for_github_subdir: from: content/en/docs/specs/semconv/ai/_index.md - to: database/README.md + to: gen-ai/README.md ---> # Semantic Conventions for AI systems From 87cbd17a453fa9fca356a327a2a216b2177f4335 Mon Sep 17 00:00:00 2001 From: Nir Gazit Date: Fri, 22 Mar 2024 20:25:57 +0200 Subject: [PATCH 26/36] fix: opt-in prompts / completions --- docs/gen-ai/llm-spans.md | 4 ++-- model/trace/gen-ai.yaml | 6 ++++-- 2 files changed, 6 insertions(+), 4 deletions(-) diff --git a/docs/gen-ai/llm-spans.md b/docs/gen-ai/llm-spans.md index 6d88fcd5f6..9454b6764a 100644 --- a/docs/gen-ai/llm-spans.md +++ b/docs/gen-ai/llm-spans.md @@ -66,7 +66,7 @@ The event name MUST be `gen_ai.content.prompt`. | Attribute | Type | Description | Examples | Requirement Level | |---|---|---|---|---| -| [`gen_ai.prompt`](../attributes-registry/gen-ai.md) | string | The full prompt sent to an LLM. [1] | `[{'role': 'user', 'content': 'What is the capital of France?'}]` | Recommended | +| [`gen_ai.prompt`](../attributes-registry/gen-ai.md) | string | The full prompt sent to an LLM. 
[1] | `[{'role': 'user', 'content': 'What is the capital of France?'}]` | Conditionally Required: if and only if corresponding event is enabled | **[1]:** It's RECOMMENDED to format prompts as JSON string matching [OpenAI messages format](https://platform.openai.com/docs/guides/text-generation) @@ -76,7 +76,7 @@ The event name MUST be `gen_ai.content.completion`. | Attribute | Type | Description | Examples | Requirement Level | |---|---|---|---|---| -| [`gen_ai.completion`](../attributes-registry/gen-ai.md) | string | The full response received from the LLM. [1] | `[{'role': 'assistant', 'content': 'The capital of France is Paris.'}]` | Recommended | +| [`gen_ai.completion`](../attributes-registry/gen-ai.md) | string | The full response received from the LLM. [1] | `[{'role': 'assistant', 'content': 'The capital of France is Paris.'}]` | Conditionally Required: if and only if corresponding event is enabled | **[1]:** It's RECOMMENDED to format completions as JSON string matching [OpenAI messages format](https://platform.openai.com/docs/guides/text-generation) diff --git a/model/trace/gen-ai.yaml b/model/trace/gen-ai.yaml index 0e3409699e..eb7e1f3cb0 100644 --- a/model/trace/gen-ai.yaml +++ b/model/trace/gen-ai.yaml @@ -50,7 +50,8 @@ groups: may be created, depending on the configuration of the instrumentation. attributes: - ref: gen_ai.prompt - requirement_level: recommended + requirement_level: + conditionally_required: if and only if corresponding event is enabled note: > It's RECOMMENDED to format prompts as JSON string matching [OpenAI messages format](https://platform.openai.com/docs/guides/text-generation) @@ -62,6 +63,7 @@ groups: may be created, depending on the configuration of the instrumentation. attributes: - ref: gen_ai.completion - requirement_level: recommended + requirement_level: + conditionally_required: if and only if corresponding event is enabled note: > It's RECOMMENDED to format completions as JSON string matching [OpenAI messages format](https://platform.openai.com/docs/guides/text-generation) From 766265538ec6f3662964519bff7282928861786e Mon Sep 17 00:00:00 2001 From: Nir Gazit Date: Fri, 22 Mar 2024 20:32:01 +0200 Subject: [PATCH 27/36] Update docs/gen-ai/README.md Co-authored-by: Liudmila Molkova --- docs/gen-ai/README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs/gen-ai/README.md b/docs/gen-ai/README.md index 87d807da2b..a939f2b78a 100644 --- a/docs/gen-ai/README.md +++ b/docs/gen-ai/README.md @@ -5,7 +5,7 @@ path_base_for_github_subdir: to: gen-ai/README.md ---> -# Semantic Conventions for AI systems +# Semantic Conventions for Generative AI systems **Status**: [Experimental][DocumentStatus] From 94ee6ea1c43c5bd6f0db58f40caf570ee0fa8197 Mon Sep 17 00:00:00 2001 From: Nir Gazit Date: Fri, 22 Mar 2024 20:32:09 +0200 Subject: [PATCH 28/36] Update docs/attributes-registry/gen-ai.md Co-authored-by: Liudmila Molkova --- docs/attributes-registry/gen-ai.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs/attributes-registry/gen-ai.md b/docs/attributes-registry/gen-ai.md index 9a1651b8b2..2777f56977 100644 --- a/docs/attributes-registry/gen-ai.md +++ b/docs/attributes-registry/gen-ai.md @@ -5,7 +5,7 @@ -- [Generic LLM Attributes](#generic-llm-attributes) +- [Generic GenAI Attributes](#generic-llm-attributes) - [Request Attributes](#request-attributes) - [Response Attributes](#response-attributes) - [Event Attributes](#event-attributes) From d5d5daba4d27bb85950360ae6afff6371f00f3a2 Mon Sep 17 00:00:00 2001 From: Nir Gazit 
Date: Fri, 22 Mar 2024 20:32:16 +0200 Subject: [PATCH 29/36] Update model/registry/gen-ai.yaml Co-authored-by: Liudmila Molkova --- model/registry/gen-ai.yaml | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/model/registry/gen-ai.yaml b/model/registry/gen-ai.yaml index 01dcd1b5d3..94305be170 100644 --- a/model/registry/gen-ai.yaml +++ b/model/registry/gen-ai.yaml @@ -1,5 +1,5 @@ groups: - - id: registry.llm + - id: registry.gen_ai prefix: gen_ai type: attribute_group brief: > From 3672d94d835f4c848555e4c771375049b5dd3071 Mon Sep 17 00:00:00 2001 From: Nir Gazit Date: Fri, 22 Mar 2024 20:32:22 +0200 Subject: [PATCH 30/36] Update docs/gen-ai/README.md Co-authored-by: Liudmila Molkova --- docs/gen-ai/README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs/gen-ai/README.md b/docs/gen-ai/README.md index a939f2b78a..680618dcc4 100644 --- a/docs/gen-ai/README.md +++ b/docs/gen-ai/README.md @@ -9,7 +9,7 @@ path_base_for_github_subdir: **Status**: [Experimental][DocumentStatus] -This document defines semantic conventions for the following kind of AI systems: +This document defines semantic conventions for the following kind of Generative AI systems: * LLMs From 68ad466cc66928eb4e524cd7e7d9775a17a1ace8 Mon Sep 17 00:00:00 2001 From: Nir Gazit Date: Fri, 22 Mar 2024 20:35:51 +0200 Subject: [PATCH 31/36] fix: lint --- docs/attributes-registry/gen-ai.md | 8 ++++---- model/trace/gen-ai.yaml | 4 ++-- 2 files changed, 6 insertions(+), 6 deletions(-) diff --git a/docs/attributes-registry/gen-ai.md b/docs/attributes-registry/gen-ai.md index 2777f56977..cb05015351 100644 --- a/docs/attributes-registry/gen-ai.md +++ b/docs/attributes-registry/gen-ai.md @@ -5,7 +5,7 @@ -- [Generic GenAI Attributes](#generic-llm-attributes) +- [Generic LLM Attributes](#generic-llm-attributes) - [Request Attributes](#request-attributes) - [Response Attributes](#response-attributes) - [Event Attributes](#event-attributes) @@ -16,7 +16,7 @@ ### Request Attributes - + | Attribute | Type | Description | Examples | |---|---|---|---| | `gen_ai.request.max_tokens` | int | The maximum number of tokens the LLM generates for a request. | `100` | @@ -34,7 +34,7 @@ ### Response Attributes - + | Attribute | Type | Description | Examples | |---|---|---|---| | `gen_ai.response.finish_reasons` | string[] | Array of reasons the model stopped generating tokens, corresponding to each generation received. | `[stop]` | @@ -46,7 +46,7 @@ ### Event Attributes - + | Attribute | Type | Description | Examples | |---|---|---|---| | `gen_ai.completion` | string | The full response received from the LLM. [1] | `[{'role': 'assistant', 'content': 'The capital of France is Paris.'}]` | diff --git a/model/trace/gen-ai.yaml b/model/trace/gen-ai.yaml index eb7e1f3cb0..7d9fb08e18 100644 --- a/model/trace/gen-ai.yaml +++ b/model/trace/gen-ai.yaml @@ -50,7 +50,7 @@ groups: may be created, depending on the configuration of the instrumentation. attributes: - ref: gen_ai.prompt - requirement_level: + requirement_level: conditionally_required: if and only if corresponding event is enabled note: > It's RECOMMENDED to format prompts as JSON string matching [OpenAI messages format](https://platform.openai.com/docs/guides/text-generation) @@ -63,7 +63,7 @@ groups: may be created, depending on the configuration of the instrumentation. 
attributes: - ref: gen_ai.completion - requirement_level: + requirement_level: conditionally_required: if and only if corresponding event is enabled note: > It's RECOMMENDED to format completions as JSON string matching [OpenAI messages format](https://platform.openai.com/docs/guides/text-generation) From f1fe74815b8159dfe2f37b217f88776a29aae3a7 Mon Sep 17 00:00:00 2001 From: Nir Gazit Date: Mon, 1 Apr 2024 16:12:23 +0200 Subject: [PATCH 32/36] Apply suggestions from code review Co-authored-by: Patrice Chalin Co-authored-by: Liudmila Molkova Co-authored-by: Drew Robbins --- .github/CODEOWNERS | 2 +- docs/attributes-registry/{gen-ai.md => llm.md} | 3 ++- docs/gen-ai/README.md | 9 +++++++-- docs/gen-ai/llm-spans.md | 8 +++++--- model/registry/gen-ai.yaml | 2 +- model/trace/gen-ai.yaml | 3 +-- 6 files changed, 17 insertions(+), 10 deletions(-) rename docs/attributes-registry/{gen-ai.md => llm.md} (98%) diff --git a/.github/CODEOWNERS b/.github/CODEOWNERS index de284edf16..185fedbfdb 100644 --- a/.github/CODEOWNERS +++ b/.github/CODEOWNERS @@ -83,6 +83,6 @@ /model/metrics/gen-ai.yaml @open-telemetry/specs-semconv-approvers @open-telemetry/semconv-llm-approvers /model/trace/gen-ai.yaml @open-telemetry/specs-semconv-approvers @open-telemetry/semconv-llm-approvers /docs/gen-ai/ @open-telemetry/specs-semconv-approvers @open-telemetry/semconv-llm-approvers -/docs/attributes-registry/gen-ai.md @open-telemetry/specs-semconv-approvers @open-telemetry/semconv-llm-approvers +/docs/attributes-registry/llm.md @open-telemetry/specs-semconv-approvers @open-telemetry/semconv-llm-approvers # TODO - Add semconv area experts diff --git a/docs/attributes-registry/gen-ai.md b/docs/attributes-registry/llm.md similarity index 98% rename from docs/attributes-registry/gen-ai.md rename to docs/attributes-registry/llm.md index cb05015351..5ea7cb8cdc 100644 --- a/docs/attributes-registry/gen-ai.md +++ b/docs/attributes-registry/llm.md @@ -1,7 +1,8 @@ -# Large Language Model (LLM) +# Large Language Model diff --git a/docs/gen-ai/README.md b/docs/gen-ai/README.md index 680618dcc4..1197a88522 100644 --- a/docs/gen-ai/README.md +++ b/docs/gen-ai/README.md @@ -1,7 +1,7 @@ @@ -9,6 +9,11 @@ path_base_for_github_subdir: **Status**: [Experimental][DocumentStatus] +**Warning**: +The semantic conventions for GenAI and LLM are currently in development. +We encourage instrumentation libraries and telemetry consumers developers to +use the conventions in limited non-critical workloads and share the feedback + This document defines semantic conventions for the following kind of Generative AI systems: * LLMs diff --git a/docs/gen-ai/llm-spans.md b/docs/gen-ai/llm-spans.md index 9454b6764a..e0e4f63f39 100644 --- a/docs/gen-ai/llm-spans.md +++ b/docs/gen-ai/llm-spans.md @@ -1,5 +1,5 @@ # Semantic Conventions for LLM requests @@ -18,8 +18,10 @@ linkTitle: LLM Calls A request to an LLM is modeled as a span in a trace. +**Span kind:** MUST always be `CLIENT`. + The **span name** SHOULD be set to a low cardinality value describing an operation made to an LLM. -For example, the API name such as [Create chat completion](https://platform.openai.com/docs/api-reference/chat/create) +For example, the API name such as [Create chat completion](https://platform.openai.com/docs/api-reference/chat/create) could be represented as `ChatCompletions gpt-4` to include the API and the LLM. ## Configuration @@ -78,7 +80,7 @@ The event name MUST be `gen_ai.content.completion`. 
|---|---|---|---|---| | [`gen_ai.completion`](../attributes-registry/gen-ai.md) | string | The full response received from the LLM. [1] | `[{'role': 'assistant', 'content': 'The capital of France is Paris.'}]` | Conditionally Required: if and only if corresponding event is enabled | -**[1]:** It's RECOMMENDED to format completions as JSON string matching [OpenAI messages format](https://platform.openai.com/docs/guides/text-generation) +**[1]:** It's RECOMMENDED to format completions as JSON string matching [OpenAI messages format](https://platform.openai.com/docs/guides/text-generation). [DocumentStatus]: https://github.com/open-telemetry/opentelemetry-specification/tree/v1.22.0/specification/document-status.md diff --git a/model/registry/gen-ai.yaml b/model/registry/gen-ai.yaml index 94305be170..3f53c22cfb 100644 --- a/model/registry/gen-ai.yaml +++ b/model/registry/gen-ai.yaml @@ -42,7 +42,7 @@ groups: tag: llm-generic-response - id: response.model type: string - brief: The name of the LLM a response is being made to. + brief: The name of the LLM a response was generated from. examples: ['gpt-4-0613'] tag: llm-generic-response - id: response.finish_reasons diff --git a/model/trace/gen-ai.yaml b/model/trace/gen-ai.yaml index 7d9fb08e18..193817aa8f 100644 --- a/model/trace/gen-ai.yaml +++ b/model/trace/gen-ai.yaml @@ -26,8 +26,7 @@ groups: - ref: gen_ai.response.id requirement_level: recommended - ref: gen_ai.response.model - requirement_level: - conditionally_required: if response was received + recommended: if available note: > The name of the LLM serving a response. If the LLM is supplied by a vendor, then the value must be the exact name of the model actually used. If the LLM is a From 17c4d01cbc31941de38f9801d07959af0cf0de80 Mon Sep 17 00:00:00 2001 From: Nir Gazit Date: Mon, 1 Apr 2024 17:23:38 +0200 Subject: [PATCH 33/36] chore: regenerated tables --- docs/attributes-registry/llm.md | 48 ++++++++++++++++----------------- docs/gen-ai/llm-spans.md | 42 ++++++++++++++--------------- model/registry/gen-ai.yaml | 13 +++++++++ model/trace/gen-ai.yaml | 4 +-- 4 files changed, 60 insertions(+), 47 deletions(-) diff --git a/docs/attributes-registry/llm.md b/docs/attributes-registry/llm.md index 5ea7cb8cdc..9a20ed6199 100644 --- a/docs/attributes-registry/llm.md +++ b/docs/attributes-registry/llm.md @@ -18,40 +18,40 @@ linkTitle: LLM ### Request Attributes -| Attribute | Type | Description | Examples | -|---|---|---|---| -| `gen_ai.request.max_tokens` | int | The maximum number of tokens the LLM generates for a request. | `100` | -| `gen_ai.request.model` | string | The name of the LLM a request is being made to. | `gpt-4` | -| `gen_ai.request.temperature` | double | The temperature setting for the LLM request. | `0.0` | -| `gen_ai.request.top_p` | double | The top_p sampling setting for the LLM request. | `1.0` | -| `gen_ai.system` | string | The name of the LLM foundation model vendor. | `openai` | - -`gen_ai.system` has the following list of well-known values. If one of them applies, then the respective value MUST be used, otherwise a custom value MAY be used. - -| Value | Description | -|---|---| -| `openai` | OpenAI | +| Attribute | Type | Description | Examples | Stability | +|---|---|---|---|---| +| `gen_ai.request.max_tokens` | int | The maximum number of tokens the LLM generates for a request. | `100` | ![Experimental](https://img.shields.io/badge/-experimental-blue) | +| `gen_ai.request.model` | string | The name of the LLM a request is being made to. 
| `gpt-4` | ![Experimental](https://img.shields.io/badge/-experimental-blue) | +| `gen_ai.request.temperature` | double | The temperature setting for the LLM request. | `0.0` | ![Experimental](https://img.shields.io/badge/-experimental-blue) | +| `gen_ai.request.top_p` | double | The top_p sampling setting for the LLM request. | `1.0` | ![Experimental](https://img.shields.io/badge/-experimental-blue) | +| `gen_ai.system` | string | The name of the LLM foundation model vendor. | `openai` | ![Experimental](https://img.shields.io/badge/-experimental-blue) | + +`gen_ai.system` has the following list of well-known values. If one of them applies, then the respective value MUST be used; otherwise, a custom value MAY be used. + +| Value | Description | Stability | +|---|---|---| +| `openai` | OpenAI | ![Experimental](https://img.shields.io/badge/-experimental-blue) | ### Response Attributes -| Attribute | Type | Description | Examples | -|---|---|---|---| -| `gen_ai.response.finish_reasons` | string[] | Array of reasons the model stopped generating tokens, corresponding to each generation received. | `[stop]` | -| `gen_ai.response.id` | string | The unique identifier for the completion. | `chatcmpl-123` | -| `gen_ai.response.model` | string | The name of the LLM a response is being made to. | `gpt-4-0613` | -| `gen_ai.usage.completion_tokens` | int | The number of tokens used in the LLM response (completion). | `180` | -| `gen_ai.usage.prompt_tokens` | int | The number of tokens used in the LLM prompt. | `100` | +| Attribute | Type | Description | Examples | Stability | +|---|---|---|---|---| +| `gen_ai.response.finish_reasons` | string[] | Array of reasons the model stopped generating tokens, corresponding to each generation received. | `[stop]` | ![Experimental](https://img.shields.io/badge/-experimental-blue) | +| `gen_ai.response.id` | string | The unique identifier for the completion. | `chatcmpl-123` | ![Experimental](https://img.shields.io/badge/-experimental-blue) | +| `gen_ai.response.model` | string | The name of the LLM a response was generated from. | `gpt-4-0613` | ![Experimental](https://img.shields.io/badge/-experimental-blue) | +| `gen_ai.usage.completion_tokens` | int | The number of tokens used in the LLM response (completion). | `180` | ![Experimental](https://img.shields.io/badge/-experimental-blue) | +| `gen_ai.usage.prompt_tokens` | int | The number of tokens used in the LLM prompt. | `100` | ![Experimental](https://img.shields.io/badge/-experimental-blue) | ### Event Attributes -| Attribute | Type | Description | Examples | -|---|---|---|---| -| `gen_ai.completion` | string | The full response received from the LLM. [1] | `[{'role': 'assistant', 'content': 'The capital of France is Paris.'}]` | -| `gen_ai.prompt` | string | The full prompt sent to an LLM. [2] | `[{'role': 'user', 'content': 'What is the capital of France?'}]` | +| Attribute | Type | Description | Examples | Stability | +|---|---|---|---|---| +| `gen_ai.completion` | string | The full response received from the LLM. [1] | `[{'role': 'assistant', 'content': 'The capital of France is Paris.'}]` | ![Experimental](https://img.shields.io/badge/-experimental-blue) | +| `gen_ai.prompt` | string | The full prompt sent to an LLM. 
[2] | `[{'role': 'user', 'content': 'What is the capital of France?'}]` | ![Experimental](https://img.shields.io/badge/-experimental-blue) | **[1]:** It's RECOMMENDED to format completions as JSON string matching [OpenAI messages format](https://platform.openai.com/docs/guides/text-generation) diff --git a/docs/gen-ai/llm-spans.md b/docs/gen-ai/llm-spans.md index e0e4f63f39..49dc5f7d02 100644 --- a/docs/gen-ai/llm-spans.md +++ b/docs/gen-ai/llm-spans.md @@ -39,24 +39,24 @@ By default, these configurations SHOULD NOT capture prompts and completions. These attributes track input data and metadata for a request to an LLM. Each attribute represents a concept that is common to most LLMs. -| Attribute | Type | Description | Examples | Requirement Level | -|---|---|---|---|---| -| [`gen_ai.request.max_tokens`](../attributes-registry/gen-ai.md) | int | The maximum number of tokens the LLM generates for a request. | `100` | Recommended | -| [`gen_ai.request.model`](../attributes-registry/gen-ai.md) | string | The name of the LLM a request is being made to. [1] | `gpt-4` | Required | -| [`gen_ai.request.temperature`](../attributes-registry/gen-ai.md) | double | The temperature setting for the LLM request. | `0.0` | Recommended | -| [`gen_ai.request.top_p`](../attributes-registry/gen-ai.md) | double | The top_p sampling setting for the LLM request. | `1.0` | Recommended | -| [`gen_ai.response.finish_reasons`](../attributes-registry/gen-ai.md) | string[] | Array of reasons the model stopped generating tokens, corresponding to each generation received. | `[stop]` | Recommended | -| [`gen_ai.response.id`](../attributes-registry/gen-ai.md) | string | The unique identifier for the completion. | `chatcmpl-123` | Recommended | -| [`gen_ai.response.model`](../attributes-registry/gen-ai.md) | string | The name of the LLM a response is being made to. [2] | `gpt-4-0613` | Conditionally Required: if response was received | -| [`gen_ai.system`](../attributes-registry/gen-ai.md) | string | The name of the LLM foundation model vendor. [3] | `openai` | Required | -| [`gen_ai.usage.completion_tokens`](../attributes-registry/gen-ai.md) | int | The number of tokens used in the LLM response (completion). | `180` | Recommended | -| [`gen_ai.usage.prompt_tokens`](../attributes-registry/gen-ai.md) | int | The number of tokens used in the LLM prompt. | `100` | Recommended | +| Attribute | Type | Description | Examples | [Requirement Level](https://opentelemetry.io/docs/specs/semconv/general/attribute-requirement-level/) | Stability | +|---|---|---|---|---|---| +| [`gen_ai.request.model`](../attributes-registry/llm.md) | string | The name of the LLM a request is being made to. [1] | `gpt-4` | `Required` | ![Experimental](https://img.shields.io/badge/-experimental-blue) | +| [`gen_ai.system`](../attributes-registry/llm.md) | string | The name of the LLM foundation model vendor. [2] | `openai` | `Required` | ![Experimental](https://img.shields.io/badge/-experimental-blue) | +| [`gen_ai.request.max_tokens`](../attributes-registry/llm.md) | int | The maximum number of tokens the LLM generates for a request. | `100` | `Recommended` | ![Experimental](https://img.shields.io/badge/-experimental-blue) | +| [`gen_ai.request.temperature`](../attributes-registry/llm.md) | double | The temperature setting for the LLM request. | `0.0` | `Recommended` | ![Experimental](https://img.shields.io/badge/-experimental-blue) | +| [`gen_ai.request.top_p`](../attributes-registry/llm.md) | double | The top_p sampling setting for the LLM request. 
| `1.0` | `Recommended` | ![Experimental](https://img.shields.io/badge/-experimental-blue) | +| [`gen_ai.response.finish_reasons`](../attributes-registry/llm.md) | string[] | Array of reasons the model stopped generating tokens, corresponding to each generation received. | `[stop]` | `Recommended` | ![Experimental](https://img.shields.io/badge/-experimental-blue) | +| [`gen_ai.response.id`](../attributes-registry/llm.md) | string | The unique identifier for the completion. | `chatcmpl-123` | `Recommended` | ![Experimental](https://img.shields.io/badge/-experimental-blue) | +| [`gen_ai.response.model`](../attributes-registry/llm.md) | string | The name of the LLM a response was generated from. [3] | `gpt-4-0613` | `Recommended` | ![Experimental](https://img.shields.io/badge/-experimental-blue) | +| [`gen_ai.usage.completion_tokens`](../attributes-registry/llm.md) | int | The number of tokens used in the LLM response (completion). | `180` | `Recommended` | ![Experimental](https://img.shields.io/badge/-experimental-blue) | +| [`gen_ai.usage.prompt_tokens`](../attributes-registry/llm.md) | int | The number of tokens used in the LLM prompt. | `100` | `Recommended` | ![Experimental](https://img.shields.io/badge/-experimental-blue) | **[1]:** The name of the LLM a request is being made to. If the LLM is supplied by a vendor, then the value must be the exact name of the model requested. If the LLM is a fine-tuned custom model, the value should have a more specific name than the base model that's been fine-tuned. -**[2]:** The name of the LLM serving a response. If the LLM is supplied by a vendor, then the value must be the exact name of the model actually used. If the LLM is a fine-tuned custom model, the value should have a more specific name than the base model that's been fine-tuned. +**[2]:** If not using a vendor-supplied model, provide a custom friendly name, such as a name of the company or project. If the instrumetnation reports any attributes specific to a custom model, the value provided in the `gen_ai.system` SHOULD match the custom attribute namespace segment. For example, if `gen_ai.system` is set to `the_best_llm`, custom attributes should be added in the `gen_ai.the_best_llm.*` namespace. If none of above options apply, the instrumentation should set `_OTHER`. -**[3]:** If not using a vendor-supplied model, provide a custom friendly name, such as a name of the company or project. If the instrumetnation reports any attributes specific to a custom model, the value provided in the `gen_ai.system` SHOULD match the custom attribute namespace segment. For example, if `gen_ai.system` is set to `the_best_llm`, custom attributes should be added in the `gen_ai.the_best_llm.*` namespace. If none of above options apply, the instrumentation should set `_OTHER`. +**[3]:** If available. The name of the LLM serving a response. If the LLM is supplied by a vendor, then the value must be the exact name of the model actually used. If the LLM is a fine-tuned custom model, the value should have a more specific name than the base model that's been fine-tuned. ## Events @@ -66,9 +66,9 @@ In the lifetime of an LLM span, an event for prompts sent and completions receiv The event name MUST be `gen_ai.content.prompt`. -| Attribute | Type | Description | Examples | Requirement Level | -|---|---|---|---|---| -| [`gen_ai.prompt`](../attributes-registry/gen-ai.md) | string | The full prompt sent to an LLM. 
[1] | `[{'role': 'user', 'content': 'What is the capital of France?'}]` | Conditionally Required: if and only if corresponding event is enabled | +| Attribute | Type | Description | Examples | [Requirement Level](https://opentelemetry.io/docs/specs/semconv/general/attribute-requirement-level/) | Stability | +|---|---|---|---|---|---| +| [`gen_ai.prompt`](../attributes-registry/llm.md) | string | The full prompt sent to an LLM. [1] | `[{'role': 'user', 'content': 'What is the capital of France?'}]` | `Conditionally Required` if and only if corresponding event is enabled | ![Experimental](https://img.shields.io/badge/-experimental-blue) | **[1]:** It's RECOMMENDED to format prompts as JSON string matching [OpenAI messages format](https://platform.openai.com/docs/guides/text-generation) @@ -76,11 +76,11 @@ The event name MUST be `gen_ai.content.prompt`. The event name MUST be `gen_ai.content.completion`. -| Attribute | Type | Description | Examples | Requirement Level | -|---|---|---|---|---| -| [`gen_ai.completion`](../attributes-registry/gen-ai.md) | string | The full response received from the LLM. [1] | `[{'role': 'assistant', 'content': 'The capital of France is Paris.'}]` | Conditionally Required: if and only if corresponding event is enabled | +| Attribute | Type | Description | Examples | [Requirement Level](https://opentelemetry.io/docs/specs/semconv/general/attribute-requirement-level/) | Stability | +|---|---|---|---|---|---| +| [`gen_ai.completion`](../attributes-registry/llm.md) | string | The full response received from the LLM. [1] | `[{'role': 'assistant', 'content': 'The capital of France is Paris.'}]` | `Conditionally Required` if and only if corresponding event is enabled | ![Experimental](https://img.shields.io/badge/-experimental-blue) | -**[1]:** It's RECOMMENDED to format completions as JSON string matching [OpenAI messages format](https://platform.openai.com/docs/guides/text-generation). +**[1]:** It's RECOMMENDED to format completions as JSON string matching [OpenAI messages format](https://platform.openai.com/docs/guides/text-generation) [DocumentStatus]: https://github.com/open-telemetry/opentelemetry-specification/tree/v1.22.0/specification/document-status.md diff --git a/model/registry/gen-ai.yaml b/model/registry/gen-ai.yaml index 3f53c22cfb..100523a3b8 100644 --- a/model/registry/gen-ai.yaml +++ b/model/registry/gen-ai.yaml @@ -6,67 +6,80 @@ groups: This document defines the attributes used to describe telemetry in the context of LLM (Large Language Models) requests and responses. attributes: - id: system + stability: experimental type: allow_custom_values: true members: - id: openai + stability: experimental value: "openai" brief: 'OpenAI' brief: The name of the LLM foundation model vendor. examples: 'openai' tag: llm-generic-request - id: request.model + stability: experimental type: string brief: The name of the LLM a request is being made to. examples: 'gpt-4' tag: llm-generic-request - id: request.max_tokens + stability: experimental type: int brief: The maximum number of tokens the LLM generates for a request. examples: [100] tag: llm-generic-request - id: request.temperature + stability: experimental type: double brief: The temperature setting for the LLM request. examples: [0.0] tag: llm-generic-request - id: request.top_p + stability: experimental type: double brief: The top_p sampling setting for the LLM request. 
examples: [1.0] tag: llm-generic-request - id: response.id + stability: experimental type: string brief: The unique identifier for the completion. examples: ['chatcmpl-123'] tag: llm-generic-response - id: response.model + stability: experimental type: string brief: The name of the LLM a response was generated from. examples: ['gpt-4-0613'] tag: llm-generic-response - id: response.finish_reasons + stability: experimental type: string[] brief: Array of reasons the model stopped generating tokens, corresponding to each generation received. examples: ['stop'] tag: llm-generic-response - id: usage.prompt_tokens + stability: experimental type: int brief: The number of tokens used in the LLM prompt. examples: [100] tag: llm-generic-response - id: usage.completion_tokens + stability: experimental type: int brief: The number of tokens used in the LLM response (completion). examples: [180] tag: llm-generic-response - id: prompt + stability: experimental type: string brief: The full prompt sent to an LLM. note: It's RECOMMENDED to format prompts as JSON string matching [OpenAI messages format](https://platform.openai.com/docs/guides/text-generation) examples: ["[{'role': 'user', 'content': 'What is the capital of France?'}]"] tag: llm-generic-events - id: completion + stability: experimental type: string brief: The full response received from the LLM. note: It's RECOMMENDED to format completions as JSON string matching [OpenAI messages format](https://platform.openai.com/docs/guides/text-generation) diff --git a/model/trace/gen-ai.yaml b/model/trace/gen-ai.yaml index 193817aa8f..bf1d112e37 100644 --- a/model/trace/gen-ai.yaml +++ b/model/trace/gen-ai.yaml @@ -26,9 +26,9 @@ groups: - ref: gen_ai.response.id requirement_level: recommended - ref: gen_ai.response.model - recommended: if available + requirement_level: recommended note: > - The name of the LLM serving a response. If the LLM is supplied by a vendor, + If available. The name of the LLM serving a response. If the LLM is supplied by a vendor, then the value must be the exact name of the model actually used. If the LLM is a fine-tuned custom model, the value should have a more specific name than the base model that's been fine-tuned. - ref: gen_ai.response.finish_reasons From d2e4befde5cbed929dc3271e71a4af2a975dc154 Mon Sep 17 00:00:00 2001 From: Nir Gazit Date: Mon, 1 Apr 2024 17:25:10 +0200 Subject: [PATCH 34/36] chore: top-level README --- docs/README.md | 1 + 1 file changed, 1 insertion(+) diff --git a/docs/README.md b/docs/README.md index 1dfe42baf3..27c2c200a0 100644 --- a/docs/README.md +++ b/docs/README.md @@ -27,6 +27,7 @@ Semantic Conventions are defined for the following areas: * [Exceptions](exceptions/README.md): Semantic Conventions for exceptions. * [FaaS](faas/README.md): Semantic Conventions for Function as a Service (FaaS) operations. * [Feature Flags](feature-flags/README.md): Semantic Conventions for feature flag evaluations. +* [Generative AI](gen-ai/README.md): Semantic Conventions for generative AI (LLM, etc.) operations. * [GraphQL](graphql/graphql-spans.md): Semantic Conventions for GraphQL implementations. * [HTTP](http/README.md): Semantic Conventions for HTTP client and server operations. * [Messaging](messaging/README.md): Semantic Conventions for messaging operations and systems. 
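
For anyone trying out these draft conventions, the following is a minimal, non-normative sketch of how a client span carrying the `gen_ai.*` request and response attributes defined in the patches above might be emitted with the OpenTelemetry Python API. The span name, the `client`/`response` objects, and the instrumentation name are illustrative assumptions, not part of the convention.

```python
# Non-normative sketch: a CLIENT span with the gen_ai.* attributes from
# docs/gen-ai/llm-spans.md. Objects marked "assumed" are illustrative only.
from opentelemetry import trace

tracer = trace.get_tracer("example.instrumentation.gen_ai")  # assumed instrumentation name

def chat_completion(client, messages, model="gpt-4", temperature=0.0):
    # "ChatCompletions {model}" follows the low-cardinality span-name guidance.
    with tracer.start_as_current_span(
        f"ChatCompletions {model}", kind=trace.SpanKind.CLIENT
    ) as span:
        span.set_attribute("gen_ai.system", "openai")
        span.set_attribute("gen_ai.request.model", model)
        span.set_attribute("gen_ai.request.temperature", temperature)
        span.set_attribute("gen_ai.request.max_tokens", 100)

        # `client.create(...)` is an assumed helper, not a real SDK call.
        response = client.create(model=model, messages=messages, temperature=temperature)

        span.set_attribute("gen_ai.response.id", response.id)
        span.set_attribute("gen_ai.response.model", response.model)
        span.set_attribute(
            "gen_ai.response.finish_reasons",
            [choice.finish_reason for choice in response.choices],
        )
        span.set_attribute("gen_ai.usage.prompt_tokens", response.usage.prompt_tokens)
        span.set_attribute("gen_ai.usage.completion_tokens", response.usage.completion_tokens)
        return response
```
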
From 4ec72e5971923c3d947e4f892264c51794c7cca1 Mon Sep 17 00:00:00 2001 From: Nir Gazit Date: Tue, 9 Apr 2024 19:26:01 -0700 Subject: [PATCH 35/36] Apply suggestions from code review Co-authored-by: Liudmila Molkova --- docs/gen-ai/llm-spans.md | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/docs/gen-ai/llm-spans.md b/docs/gen-ai/llm-spans.md index 49dc5f7d02..b392b9be49 100644 --- a/docs/gen-ai/llm-spans.md +++ b/docs/gen-ai/llm-spans.md @@ -29,7 +29,7 @@ Instrumentations for LLMs MAY capture prompts and completions. Instrumentations that support it, MUST offer the ability to turn off capture of prompts and completions. This is for three primary reasons: 1. Data privacy concerns. End users of LLM applications may input sensitive information or personally identifiable information (PII) that they do not wish to be sent to a telemetry backend. -2. Data size concerns. Although there is no specified limit to sizes, there are practical limitations in programming languages and telemety systems. Some LLMs allow for extremely large context windows that end users may take full advantage of. +2. Data size concerns. Although there is no specified limit to sizes, there are practical limitations in programming languages and telemetry systems. Some LLMs allow for extremely large context windows that end users may take full advantage of. 3. Performance concerns. Sending large amounts of data to a telemetry backend may cause performance issues for the application. By default, these configurations SHOULD NOT capture prompts and completions. @@ -54,7 +54,7 @@ These attributes track input data and metadata for a request to an LLM. Each att **[1]:** The name of the LLM a request is being made to. If the LLM is supplied by a vendor, then the value must be the exact name of the model requested. If the LLM is a fine-tuned custom model, the value should have a more specific name than the base model that's been fine-tuned. -**[2]:** If not using a vendor-supplied model, provide a custom friendly name, such as a name of the company or project. If the instrumetnation reports any attributes specific to a custom model, the value provided in the `gen_ai.system` SHOULD match the custom attribute namespace segment. For example, if `gen_ai.system` is set to `the_best_llm`, custom attributes should be added in the `gen_ai.the_best_llm.*` namespace. If none of above options apply, the instrumentation should set `_OTHER`. +**[2]:** If not using a vendor-supplied model, provide a custom friendly name, such as a name of the company or project. If the instrumentation reports any attributes specific to a custom model, the value provided in the `gen_ai.system` SHOULD match the custom attribute namespace segment. For example, if `gen_ai.system` is set to `the_best_llm`, custom attributes should be added in the `gen_ai.the_best_llm.*` namespace. If none of above options apply, the instrumentation should set `_OTHER`. **[3]:** If available. The name of the LLM serving a response. If the LLM is supplied by a vendor, then the value must be the exact name of the model actually used. If the LLM is a fine-tuned custom model, the value should have a more specific name than the base model that's been fine-tuned. 
From a8ebe22da722edb7a60d3f958d84caa8b48dfc3e Mon Sep 17 00:00:00 2001 From: Nir Gazit Date: Tue, 16 Apr 2024 13:10:10 -0700 Subject: [PATCH 36/36] fix: PR reviews --- .chloggen/first-gen-ai.yaml | 2 +- docs/gen-ai/llm-spans.md | 4 +--- 2 files changed, 2 insertions(+), 4 deletions(-) diff --git a/.chloggen/first-gen-ai.yaml b/.chloggen/first-gen-ai.yaml index 7539ba83c2..62dec0d56e 100755 --- a/.chloggen/first-gen-ai.yaml +++ b/.chloggen/first-gen-ai.yaml @@ -10,7 +10,7 @@ change_type: new_component component: gen-ai # A brief description of the change. Surround your text with quotes ("") if it needs to start with a backtick (`). -note: Introducing semantic conventions for LLM clients. +note: Introducing semantic conventions for GenAI clients. # Mandatory: One or more tracking issues related to the change. You can use the PR number here if no issue exists. # The values here must be integers. diff --git a/docs/gen-ai/llm-spans.md b/docs/gen-ai/llm-spans.md index b392b9be49..80d4176edf 100644 --- a/docs/gen-ai/llm-spans.md +++ b/docs/gen-ai/llm-spans.md @@ -32,8 +32,6 @@ Instrumentations that support it, MUST offer the ability to turn off capture of 2. Data size concerns. Although there is no specified limit to sizes, there are practical limitations in programming languages and telemetry systems. Some LLMs allow for extremely large context windows that end users may take full advantage of. 3. Performance concerns. Sending large amounts of data to a telemetry backend may cause performance issues for the application. -By default, these configurations SHOULD NOT capture prompts and completions. - ## LLM Request attributes These attributes track input data and metadata for a request to an LLM. Each attribute represents a concept that is common to most LLMs. @@ -54,7 +52,7 @@ These attributes track input data and metadata for a request to an LLM. Each att **[1]:** The name of the LLM a request is being made to. If the LLM is supplied by a vendor, then the value must be the exact name of the model requested. If the LLM is a fine-tuned custom model, the value should have a more specific name than the base model that's been fine-tuned. -**[2]:** If not using a vendor-supplied model, provide a custom friendly name, such as a name of the company or project. If the instrumentation reports any attributes specific to a custom model, the value provided in the `gen_ai.system` SHOULD match the custom attribute namespace segment. For example, if `gen_ai.system` is set to `the_best_llm`, custom attributes should be added in the `gen_ai.the_best_llm.*` namespace. If none of above options apply, the instrumentation should set `_OTHER`. +**[2]:** If not using a vendor-supplied model, provide a custom friendly name, such as a name of the company or project. If the instrumetnation reports any attributes specific to a custom model, the value provided in the `gen_ai.system` SHOULD match the custom attribute namespace segment. For example, if `gen_ai.system` is set to `the_best_llm`, custom attributes should be added in the `gen_ai.the_best_llm.*` namespace. If none of above options apply, the instrumentation should set `_OTHER`. **[3]:** If available. The name of the LLM serving a response. If the LLM is supplied by a vendor, then the value must be the exact name of the model actually used. If the LLM is a fine-tuned custom model, the value should have a more specific name than the base model that's been fine-tuned.