
Metrics prototype scenario #146

Merged: 20 commits, Feb 25, 2021

Conversation

reyang
Member

@reyang reyang commented Feb 18, 2021

Following up on the 02/11/2021 #108 Metrics API/SDK SIG meeting, I've created this OTEP, which describes two scenarios for the metrics prototyping work.

The actual prototype will be submitted as PR(s) to the language client repo, for example:

@reyang reyang marked this pull request as ready for review February 18, 2021 04:58

The application owner (developer Y) would only want the following metrics:

* [System CPU Usage](#system-cpu-usage) reported every 5 seconds
Member

Not sure if this is something we want in the initial stage, but I'd like to add a requirement of the same metric being reported at different intervals, potentially with different dimensions.
For example, the app owner wants to see the HTTP Server Duration metric exported every 1 second with only the HttpStatusCode dimension, and the same metric exported every 30 seconds with the dimensions {hostname, HTTP Method, Host, Status Code, Client Type}. The former is typically used for near-real-time dashboards, the latter for more permanent storage.
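The fan-out described above can be sketched in plain Python (this is a hypothetical illustration of the concept, not the OpenTelemetry SDK API): one stream of duration measurements is aggregated into two "views", each keeping a different subset of dimensions, which separate exporters could then flush on their own cadence.

```python
from collections import defaultdict

# Dimension subsets for the two hypothetical export pipelines.
FAST_VIEW_KEYS = ("http.status_code",)                       # exported every 1 s
SLOW_VIEW_KEYS = ("host.name", "http.method", "http.host",
                  "http.status_code", "client.type")         # exported every 30 s

fast_view = defaultdict(float)  # label set -> accumulated duration
slow_view = defaultdict(float)

def record_duration(duration_ms, attributes):
    """Aggregate one measurement into both views, dropping unused dimensions."""
    for view, keys in ((fast_view, FAST_VIEW_KEYS), (slow_view, SLOW_VIEW_KEYS)):
        label_set = tuple((k, attributes.get(k)) for k in keys)
        view[label_set] += duration_ms

record_duration(12.5, {"http.status_code": 200, "http.method": "GET",
                       "host.name": "web-1", "http.host": "shop.example",
                       "client.type": "browser"})
record_duration(40.0, {"http.status_code": 200, "http.method": "POST",
                       "host.name": "web-1", "http.host": "shop.example",
                       "client.type": "browser"})

# Both measurements collapse into one series in the low-cardinality fast view,
# but stay as two series in the high-cardinality slow view.
```

The key design point is that dropping dimensions at aggregation time lets the near-real-time stream stay low-cardinality while the slower stream keeps full detail.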

Member Author

In your example, I guess people would normally report only the 1-second stream from the SDK pre-aggregation, and rely on the metrics backend to aggregate the 30-second one (and daily/weekly/monthly summaries if there is a need).

reyang and others added 6 commits February 17, 2021 21:59
Co-authored-by: Cijo Thomas <cithomas@microsoft.com>
Co-authored-by: Cijo Thomas <cithomas@microsoft.com>
Co-authored-by: Cijo Thomas <cithomas@microsoft.com>
Co-authored-by: Cijo Thomas <cithomas@microsoft.com>
```python
store.process_order("customerA", {"tomato": 1})
```

When the store is closed, we will report the following metrics:


Do we think this type of offline historical reporting is a good primary use case for the metrics API? Although I can envision a metrics API doing it, I'd guess it is a better fit for a standard transactional database, where there are stronger guarantees about data consistency and richer data, but potentially worse performance/availability. I think of metrics as being focused on high availability and low latency, which is more oriented towards diagnostics/live monitoring/alerting, where the grocery store would be looking for signs like:

  1. Is there an unexpected change in the rate of sales, suggesting an unknown incident may be occurring at the store?
  2. Is inventory getting unexpectedly low, so we need to dispatch an urgent delivery from the warehouse?
  3. Is there a sudden spike in demand for a product, so we need to consider rationing or price changes?

Of course, if I am looking at it through too narrow a lens, then this example might be accomplishing exactly what it intends: expanding my understanding of what scenarios a metrics API is intended to support.

Contributor

I see (1) and (2) as good use-cases for monitoring using a metrics API, but maybe not (3). Although the example feels like it fell out of a textbook, you could re-imagine the store as a Message-Queue consumer processing orders in a horizontally scalable store. Can we ask another form of query: "how many stores were in operation at a given time?"

* HTTP request counters, reported every 5 seconds:
  * Total number of received HTTP requests
  * Total number of finished HTTP requests
  * Number of currently-in-flight HTTP requests (concurrent HTTP requests)
Contributor

I like how this example asks for three counters, because it seems possible to achieve with two instruments: a count of received requests and a histogram of response durations (i.e., seems to call for either a view or a 3rd instrument).
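A minimal sketch of the derivation hinted at above (names are hypothetical, and this is plain Python rather than any SDK): with only a counter of received requests and a histogram of response durations, all three requested numbers can be computed at collection time, since the histogram's sample count gives finished requests and the difference gives in-flight requests.

```python
# Two instruments, illustrative only.
received_total = 0       # counter: incremented when a request arrives
duration_samples = []    # histogram: one sample recorded per finished request

def on_request_received():
    global received_total
    received_total += 1

def on_request_finished(duration_ms):
    duration_samples.append(duration_ms)

# Simulate 5 arrivals, 3 completions.
for _ in range(5):
    on_request_received()
for d in (10.0, 25.0, 40.0):
    on_request_finished(d)

# Derived at collection time, without a third instrument:
finished_total = len(duration_samples)        # histogram count
in_flight = received_total - finished_total   # received minus finished
```

Whether this derivation belongs in a view, a third instrument, or the backend is exactly the open question in the comment above.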

Member Author

And it might affect the semantic convention open-telemetry/opentelemetry-specification#1378 (comment).


The application owner (developer Y) would only want the following metrics:

* Server temperature - reported every 5 seconds
* Server humidity - reported every minute

AFAIK, we need to have multiple "pipelines" for different configurations. This may include:

* Reporting period, which relates to the collection rate, export rate, etc.
* Selection/grouping of the metrics desired for this pipeline
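The pipeline idea above can be sketched as a small configuration structure (all names and intervals here are hypothetical, chosen to match the temperature/humidity example): each pipeline pairs an export interval with a selection of metrics.

```python
# Hypothetical pipeline configuration: interval + metric selection per pipeline.
pipelines = [
    {"export_interval_s": 5,  "metrics": ["server.temperature"]},
    {"export_interval_s": 60, "metrics": ["server.humidity"]},
]

def pipelines_due(elapsed_s):
    """Return the metric names whose pipeline should export at `elapsed_s`."""
    due = []
    for p in pipelines:
        if elapsed_s % p["export_interval_s"] == 0:
            due.extend(p["metrics"])
    return due

# At t=5s only temperature exports; at t=60s both pipelines fire.
```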

Contributor

These are all SDK configuration topics, to me. The use cases in this document are about how code is instrumented, I think, and what the API looks like.

OpenCensus had a programmatic API for configuring the kinds of variables you mentioned. I'm not sure whether a programmatic API or a configuration file is what users want, but I'd argue for keeping this setup out of the instrumentation API.

reyang and others added 4 commits February 19, 2021 11:17
Co-authored-by: Leighton Chen <lechen@microsoft.com>
Co-authored-by: Leighton Chen <lechen@microsoft.com>
Contributor

@jsuereth jsuereth left a comment

My main comment, thinking deeper on this, is whether or not "external annotation" needs to be called out specifically as a non-goal.

Specifically, should we call out a use case where a DevOps engineer wants to add metric labels POST DEVELOPMENT, without touching the API? I think this is an SDK concern and should be called out there (e.g. how you can add annotations to Resource via ENV, which would then influence metrics), but I think the scenarios listed here are sufficient for API discussions.
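For context on the Resource-via-ENV point: OpenTelemetry defines standard environment variables for resource attributes, so labels can be attached at deploy time without code changes. The specific attribute values below are illustrative, not from this PR.

```shell
# Attach resource attributes at deploy time; SDKs read these standard
# OpenTelemetry environment variables and apply them to all exported metrics.
export OTEL_RESOURCE_ATTRIBUTES="deployment.environment=prod,service.team=checkout"
export OTEL_SERVICE_NAME="order-service"
```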

@noahfalk noahfalk left a comment

Looks great, thanks @reyang!

Co-authored-by: Cijo Thomas <cithomas@microsoft.com>
@jmacd
Contributor

jmacd commented Feb 25, 2021

I reviewed the open comments and believe that @reyang has addressed them all. Thank you!

@jmacd jmacd merged commit c187e32 into open-telemetry:main Feb 25, 2021
carlosalberto pushed a commit to carlosalberto/oteps that referenced this pull request Oct 23, 2024
* metrics prototype scenario

* rename

* fix typo

* Update text/metrics/0146-metrics-prototype-scenarios.md

Co-authored-by: Cijo Thomas <cithomas@microsoft.com>

* Update text/metrics/0146-metrics-prototype-scenarios.md

Co-authored-by: Cijo Thomas <cithomas@microsoft.com>

* Update text/metrics/0146-metrics-prototype-scenarios.md

Co-authored-by: Cijo Thomas <cithomas@microsoft.com>

* Update text/metrics/0146-metrics-prototype-scenarios.md

Co-authored-by: Cijo Thomas <cithomas@microsoft.com>

* clarify GA/stable

* add example to exemplar

* adjust the proposed timeline considering most folks will be on vacation in Dec.

* fix nits

* adjust the ToC

* fix nits

* Update text/metrics/0146-metrics-prototype-scenarios.md

Co-authored-by: Leighton Chen <lechen@microsoft.com>

* Update text/metrics/0146-metrics-prototype-scenarios.md

Co-authored-by: Leighton Chen <lechen@microsoft.com>

* adjust wording

* fix typo

* address review comment

* address comments

* Update text/metrics/0146-metrics-prototype-scenarios.md

Co-authored-by: Cijo Thomas <cithomas@microsoft.com>

Co-authored-by: Cijo Thomas <cithomas@microsoft.com>
Co-authored-by: Leighton Chen <lechen@microsoft.com>
8 participants