Extend GenAI System to include OpenAI compatible platforms #1655

codefromthecrypt · 2024-12-06T08:24:41Z

This extends GenAI System to include OpenAI compatible platforms. These are usable by OpenAI client libraries:

Azure OpenAI
Gemini
Perplexity
xAI (note they also say they are compatible with anthropic!)
DeepSeek
Groq

At the moment, there is no way to attribute signals to Azure OpenAI, so people use "openai". At Elastic, we have different
dashboards for OpenAI (the platform) vs Azure OpenAI, as these are independent services. Lacking a GenAI system name, we have to resort to "openai" which is ambiguous between these two.

Using "server.address" isn't a great option due to subdomains used in the Azure OpenAI service depending on the application name in Azure. While we could use cloud semantics, they aren't guaranteed to be present, and aren't integrated in current code. Even if it were, this it is less ideal to navigate or aggregate on two attributes vs one.

This also adds a value for Google Gemini which has exactly the same concern as it can be accessed either via its native API or via OpenAI libraries.

Changes

Added the following, clarifying only when needed:

"az.ai.openai" to follow conventions of "az.ai.inference"
"gemini"", not "google.gemini" as "vertexai" is not prefixed either.
"perplexity"
"xai"
"deepseek"
"groq"

Merge requirement checklist

CONTRIBUTING.md guidelines followed.
Change log entry added, according to the guidelines in When to add a changelog entry.
- If your PR does not need a change log, start the PR title with [chore]
schema-next.yaml updated with changes to existing conventions.

codefromthecrypt · 2024-12-09T23:33:24Z

PS I used the Azure AI Inference client client per instructions to access Azure OpenAI service, and I guess what's happening here more generally, is that gen_ai.system might be conflated with the client itself.

Specifically, the Azure AI Inference client can access three different products:

GitHub models
Azure AI Studio models
Azure OpenAI Studio models

In each case the services are different (different subdomains, for example). However, status quo, all would set gen_ai.system=az.ai.inference

So, status quo gen_ai.system might be treated more like the SDK name than the actual system name. Should I update the text like that, as this applies even beyond openai compat?

karthikscale3 · 2024-12-10T00:15:56Z

This is definitely an issue that we have faced as well. Right now auto instrumentation default sets genai.system to openai even if the base_url used is different from the openai one. More broadly it's only looking at the client library being used as opposed to the inference server being used, which is the correct representation. In Langtrace's SDK, we capture the underlying server being used from the base url using regex and set a langtrace attribute called service-name. I think we can move this behavior up to the spec with genai.system. We also need to capture this for all the other LLM providers that support openai client library

xai
deepseek
groq
perplexity

docs/attributes-registry/gen-ai.md

lmolkova · 2024-12-10T03:36:08Z

Base URL regex is not reliable either - you can run smthing behind the proxy, self-host model in vllm, your provider can host different models behind one API. Instrumentation can do the best effort, but cannot guarantee anything beyond "a system this library was designed for" on the gen_ai.system.

codefromthecrypt · 2024-12-10T05:52:35Z

@lmolkova want me to fix the "dead link" on https://blogs.oracle.com/linux/post/understanding-linux-kernel-memory-statistics? it is result code zero which I've seen elsewhere when a link takes too long to load or otherwise

codefromthecrypt · 2024-12-10T06:51:35Z

The broken links might not be a new problem. We can work around this by retrying externally, in the make file, but I imagine for now, folks can just manually merge when MLC is flakey on a good link. #332 (comment)

xrmx · 2024-12-10T08:24:40Z

The broken links might not be a new problem. We can work around this by retrying externally, in the make file, but I imagine for now, folks can just manually merge when MLC is flakey on a good link. #332 (comment)

It has been here since a few and has not been a problem to have things merged :)

github-actions · 2024-12-28T03:20:38Z

This PR was marked stale due to lack of activity. It will be closed in 7 days.

codefromthecrypt · 2025-01-02T00:26:21Z

This has been approved for a while now, anything missing needed for merge?

lmolkova · 2025-01-03T02:11:24Z

@codefromthecrypt we require at least 2 approving reviews.

Signed-off-by: Adrian Cole <adrian.cole@elastic.co>

codefromthecrypt · 2025-01-03T23:05:45Z

Thanks for merging this.

Last night, I noticed I missed a spot as mistral (system/platform/llm provider) is also compatible with openai (chat completions api). I was testing our chatbot-rag-app, used by solutions architects for client demonstrations. It supports mistral as an llm implementation for LangChain. While LangChain uses raw HTTP to access mistral, it does so with the openai completions endpoint. I also noticed spring-ai squarely classifies mistral as openai (api) compatible. Basically as a cloud platform, mistral is accessed similar to the others out there. cc @ThomasVitale from spring-ai and I'll raise a follow-up PR with these notes.

cc @jack-berg on a sidebar as this PR puts modeling nuance in a very specific and concrete context. Others can ignore

mistral is a system/platform/llm provider e.g. https://docs.spring.io/spring-ai/reference/api/chat/mistralai-chat.html#_openai_api_compatibility shows its cloud endpoint
it is also a family of models, e.g. you can use mistral-nemo with ollama on your laptop, or on other cloud platforms.

depending on context, what we say is a "system" is a platform, api, model family or multiple of these. What's identifiable from the client is limited, but from the server you also get the fun of whether you report "ollama" or "vllm" as a "system" or not, regardless of it traffic came in on the "openai" compatible endpoint vs another inference one.

codefromthecrypt · 2025-01-03T23:18:13Z

#1719 on the follow-up for Mistral AI

codefromthecrypt requested review from a team as code owners December 6, 2024 08:24

codefromthecrypt force-pushed the az.ai.openai branch from 6a0bc8d to 1d76c72 Compare December 6, 2024 08:27

karthikscale3 reviewed Dec 10, 2024

View reviewed changes

docs/attributes-registry/gen-ai.md Show resolved Hide resolved

lmolkova approved these changes Dec 10, 2024

View reviewed changes

codefromthecrypt changed the title ~~Extend GenAI System to include Azure OpenAI and Gemini~~ Extend GenAI System to include OpenAI compatible platforms Dec 10, 2024

codefromthecrypt force-pushed the az.ai.openai branch from 78a512b to 9300a6d Compare December 10, 2024 05:32

codefromthecrypt requested review from lmolkova and karthikscale3 December 10, 2024 05:32

xrmx approved these changes Dec 10, 2024

View reviewed changes

codefromthecrypt mentioned this pull request Dec 11, 2024

feat: instrumentation-openai elastic/elastic-otel-node#469

Merged

lmolkova approved these changes Dec 12, 2024

View reviewed changes

github-actions bot added the Stale label Dec 28, 2024

lmolkova removed the Stale label Dec 29, 2024

karthikscale3 approved these changes Jan 3, 2025

View reviewed changes

codefromthecrypt added 3 commits January 3, 2025 14:05

Extend GenAI System to include Azure OpenAI and Gemini

ea14d99

Signed-off-by: Adrian Cole <adrian.cole@elastic.co>

more

fcb6610

Signed-off-by: Adrian Cole <adrian.cole@elastic.co>

drift

d8a14ba

Signed-off-by: Adrian Cole <adrian.cole@elastic.co>

codefromthecrypt force-pushed the az.ai.openai branch from 51ef958 to d8a14ba Compare January 3, 2025 06:08

Merge branch 'main' into az.ai.openai

13af341

lmolkova merged commit f0db775 into open-telemetry:main Jan 3, 2025
14 checks passed

codefromthecrypt deleted the az.ai.openai branch January 3, 2025 22:54

codefromthecrypt mentioned this pull request Jan 3, 2025

Adds mistral_ai as a gen_ai.system attribute value #1719

Merged

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Extend GenAI System to include OpenAI compatible platforms #1655

Extend GenAI System to include OpenAI compatible platforms #1655

codefromthecrypt commented Dec 6, 2024 •

edited

Loading

codefromthecrypt commented Dec 9, 2024 •

edited

Loading

karthikscale3 commented Dec 10, 2024

lmolkova commented Dec 10, 2024

codefromthecrypt commented Dec 10, 2024

codefromthecrypt commented Dec 10, 2024

xrmx commented Dec 10, 2024

github-actions bot commented Dec 28, 2024

codefromthecrypt commented Jan 2, 2025

lmolkova commented Jan 3, 2025

codefromthecrypt commented Jan 3, 2025 •

edited

Loading

codefromthecrypt commented Jan 3, 2025

Extend GenAI System to include OpenAI compatible platforms #1655

Extend GenAI System to include OpenAI compatible platforms #1655

Conversation

codefromthecrypt commented Dec 6, 2024 • edited Loading

Changes

Merge requirement checklist

codefromthecrypt commented Dec 9, 2024 • edited Loading

karthikscale3 commented Dec 10, 2024

lmolkova commented Dec 10, 2024

codefromthecrypt commented Dec 10, 2024

codefromthecrypt commented Dec 10, 2024

xrmx commented Dec 10, 2024

github-actions bot commented Dec 28, 2024

codefromthecrypt commented Jan 2, 2025

lmolkova commented Jan 3, 2025

codefromthecrypt commented Jan 3, 2025 • edited Loading

codefromthecrypt commented Jan 3, 2025

codefromthecrypt commented Dec 6, 2024 •

edited

Loading

codefromthecrypt commented Dec 9, 2024 •

edited

Loading

codefromthecrypt commented Jan 3, 2025 •

edited

Loading