Introduce tracing with OpenTelemetry #1281

vtaskow · 2023-07-04T16:39:02Z

Resolves #39
The intention is the whole Seldon Core v2 ecosystem to have enabled distributed tracing. Currently, MLServer is missing this and this PR tracing features via the OpenTelemetry Python SDK. Other components of the system already have tracing enabled and exporting traces to an OTel collector service. By having it enabled in settings, it will provide transparency on requests, helping us to get a better picture of the journey of a request and identify any potential or already existing bottlenecks.
This PR adds basic tracing functionality by instrumenting both the REST and the gRPC servers on start. And adds a few tests to verify spans are being sent out. This is enough to give us basic information about incoming requests and their life-cycle in MLServer.

adriangonz

Nice one @vtaskow! PR looks great! 🚀

I've added a couple questions, but it's pretty much ready to go.

docs/examples/sklearn/playing_with_sklearn.py

mlserver/settings.py

tests/tracing/conftest.py

vtaskow added 5 commits June 29, 2023 16:34

Introduce OTel with initial settings

bf1c080

Showcase how to skip traces for endpoints/methods in REST and gRPC

cf72583

Refactor REST and gRPC instrumentations and write simple tests

b549061

Merge master

2eea28f

Add otel grpc and lint

d9b5fa7

vtaskow changed the title ~~Introduce otel~~ Introduce OpenTelemetry Tracing Jul 4, 2023

vtaskow changed the title ~~Introduce OpenTelemetry Tracing~~ Introduce tracing with OpenTelemetry Jul 4, 2023

adriangonz reviewed Jul 5, 2023

View reviewed changes

docs/examples/sklearn/playing_with_sklearn.py Outdated Show resolved Hide resolved

mlserver/settings.py Show resolved Hide resolved

tests/tracing/conftest.py Show resolved Hide resolved

Remove blank drafting file

db02a49

adriangonz approved these changes Jul 5, 2023

View reviewed changes

adriangonz marked this pull request as ready for review July 5, 2023 09:07

adriangonz merged commit ab448f8 into SeldonIO:master Jul 5, 2023

adriangonz mentioned this pull request Jul 7, 2023

Instrument MLServer (tracing, metrics & logging) #181

Closed

danielsoutar mentioned this pull request Dec 15, 2023

Examples/clarification over using OpenTelemetry tracing in MLServer #1513

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Introduce tracing with OpenTelemetry #1281

Introduce tracing with OpenTelemetry #1281

vtaskow commented Jul 4, 2023 •

edited

Loading

adriangonz left a comment

Introduce tracing with OpenTelemetry #1281

Introduce tracing with OpenTelemetry #1281

Conversation

vtaskow commented Jul 4, 2023 • edited Loading

adriangonz left a comment

Choose a reason for hiding this comment

vtaskow commented Jul 4, 2023 •

edited

Loading