Write deprecation logs to an index #46106

jasontedor · 2019-08-29T00:47:59Z

The Kibana Upgrade Assistant helps users prepare for the next major version of Elasticsearch. The upgrade assistant works by introspecting various aspects of a cluster and its usage to surface deprecated functionality that is in use, and also to prepare indices that require re-indexing. The deprecations that are surfaced in the upgrade assistant are those that can be ascertained by introspecting the current state of the cluster. It cannot, however, catch ongoing uses of deprecated functionality (e.g., APIs) that cannot be ascertained by introspecting the current state of the cluster. The upgrade assistant would be even more useful if we could help users understand their use of such deprecated functionality.

Today we do surface such deprecated functionality via the deprecation logs, but the upgrade assistant does not have an easy way to get its hands on them.

The crux of this issue, then, is making it possible for the upgrade assistant to collect the deprecation logs from each running node. One way to do this is to write the deprecation logs to an index that the upgrade assistant could then read, alongside the deprecations that it can already obtain.

It is likely that we want to treat the deprecation indices as system indices, and also manage them via ILM if ILM is available (e.g., a use of a deprecated API that is no more recent than N months ago isn't relevant, since it's likely that the user has already migrated away from that API).

elasticmachine · 2019-08-29T00:48:00Z

Pinging @elastic/es-core-infra

elasticmachine · 2019-08-29T00:48:02Z

Pinging @elastic/es-core-features

pgomulka · 2019-09-13T08:34:10Z

This is a really cool idea. We were looking into this together with @ycombinator and @nik9000 to possibly start a filebeat that would consume ES logs and upload them back to an index.
The solution was proposed a while back here https://github.com/elastic/dev/issues/731

jaymode · 2019-12-06T18:26:49Z

We discussed this in a couple of team meetings. Two approaches were discussed: the first would rely on a bundled filebeat to ingest the deprecation logs, and the other would build this functionality into the DeprecationLogger itself.

There is ongoing discussion about packaging filebeat and metricbeat with elasticsearch (#49399) but they would be disabled by default. Given the desire to have them be disabled and not add a new process by default, this would mean this functionality would explicitly require the user to do work. Whereas if we added this to the deprecation logger, we could enable it more easily. Given this, the discussion favored the process internal solution.

There was additional discussion regarding:

- Handling rejections on indexing
  - It would be ok to drop messages in this case, since this is not critical and retries could cause the cluster/node to get into an even worse state.
- Would indexing these messages add too much load?
  - This should not be the case if we use deprecateAndMaybeLog correctly, which should prevent spamming.
  - However, this deduplication currently has no timeout, so we could lose the fact that a deprecated item is still being used after the initial document was indexed.
  - Should we use X-Opaque-Id in the deduplication to help the user determine where the deprecated usage is coming from? See "Provide more information about cause of deprecation log messages" #26836 (comment).
  - For some items, like scripting, the same deprecated use will result in multiple messages, since these could be at the shard level. This should be ok since the majority will not be at that level.
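
To make the deduplication points above concrete, here is a minimal sketch of deduplication keyed on both the deprecation key and the X-Opaque-Id, with a timeout so that continued usage is eventually re-reported. `ExpiringDeduplicator` and its methods are hypothetical names for illustration only; this is not the existing DeprecationLogger code, and the clock is injected purely so the behaviour is easy to exercise.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.function.LongSupplier;

// Hypothetical helper for illustration; not an Elasticsearch class.
final class ExpiringDeduplicator {
    // Maps "opaque id + key" to the time (millis) the message was last emitted.
    private final Map<String, Long> lastEmitted = new ConcurrentHashMap<>();
    private final long ttlMillis;
    private final LongSupplier clockMillis;

    ExpiringDeduplicator(long ttlMillis, LongSupplier clockMillis) {
        this.ttlMillis = ttlMillis;
        this.clockMillis = clockMillis;
    }

    // Returns true if the message should be emitted: either this
    // (key, X-Opaque-Id) pair has not been seen, or its entry has expired.
    boolean shouldLog(String key, String xOpaqueId) {
        // A newline separator is assumed safe because header values cannot contain one.
        String composite = (xOpaqueId == null ? "" : xOpaqueId) + '\n' + key;
        long now = clockMillis.getAsLong();
        boolean[] emit = { false };
        // compute() is atomic per key, so concurrent callers emit at most once per window.
        lastEmitted.compute(composite, (k, previous) -> {
            if (previous == null || now - previous >= ttlMillis) {
                emit[0] = true;
                return now;
            }
            return previous;
        });
        return emit[0];
    }
}
```

With this shape, two clients with different X-Opaque-Id values each produce their own message for the same deprecated feature, and a single client is re-reported only once the timeout has elapsed. The map is unbounded here; a real implementation would presumably need eviction.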

jasontedor · 2020-01-29T01:52:45Z

> This should not be the case if we use deprecateAndMaybeLog correctly, which should prevent spamming.

While I was the one that suggested that we could use deprecateAndMaybeLog here, I do wonder if we should consider trying to log all of these messages to the index. There's a trade-off here between potential performance issues (which maybe we can address in other ways, such as batching) and being able to surface to a user the actual last time that they used deprecated functionality.
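
As a rough illustration of the batching idea, combined with the earlier point that dropping messages is acceptable, the sketch below buffers messages in a bounded queue, drops them when the queue is full, and hands them out in batches. `DroppingBatcher` is a hypothetical name, and how a batch would actually be indexed (e.g. via a bulk request) is deliberately left out.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.ArrayBlockingQueue;
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.atomic.AtomicLong;

// Hypothetical helper for illustration; not an Elasticsearch class.
final class DroppingBatcher {
    private final BlockingQueue<String> pending;
    private final AtomicLong dropped = new AtomicLong();

    DroppingBatcher(int capacity) {
        this.pending = new ArrayBlockingQueue<>(capacity);
    }

    // Never blocks the caller: if the buffer is full the message is dropped
    // and counted, rather than retried.
    boolean submit(String message) {
        boolean accepted = pending.offer(message);
        if (!accepted) {
            dropped.incrementAndGet();
        }
        return accepted;
    }

    // Removes up to maxBatchSize buffered messages, in submission order,
    // for the caller to index in one request.
    List<String> drainBatch(int maxBatchSize) {
        List<String> batch = new ArrayList<>();
        pending.drainTo(batch, maxBatchSize);
        return batch;
    }

    long droppedCount() {
        return dropped.get();
    }
}
```

The bounded queue caps memory use regardless of how quickly deprecations are generated, at the cost of losing messages under load, which is the trade-off being discussed.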

cjcenizal · 2020-01-29T22:45:53Z

Pinging @elastic/es-ui

rjernst · 2020-01-30T00:15:21Z

> I do wonder if we should consider trying to log all of these messages to the index

Even with batching, I worry that we could exhaust resources in any case where a deprecation occurs per document in a query, as is typical when deprecations occur within scripting. I'm supportive of the idea of logging more if we move to an index, but I wanted to point out we still have edge cases we need to consider where logging the details of every warning is not practical.

jasontedor · 2020-01-30T03:53:05Z

I don’t think we should deprecation log anything per document. Per request, that’s okay though, and I think alleviates a lot of pressure here.

nik9000 · 2020-01-30T14:01:50Z

Could we use something like ScriptService#checkCompilationLimit to limit how much we log? If we're batching we're probably already going to synchronize somehow, somewhere, so adding the rate limiting would be pretty cheap.

rjernst · 2020-01-30T19:03:02Z

checkCompilationLimit relies on calling nanoTime, which I don't think we want to do per document. Instead, I think we can find a way to call deprecations through a script specific lock, so that we only call the deprecation on the first use when executing the script.
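
One possible reading of "only call the deprecation on the first use" is a per-script flag that is flipped exactly once, which avoids both nanoTime and logging on every document. The sketch below assumes one guard instance per compiled script; `OncePerScriptDeprecation` and the `sink` callback are hypothetical names, not part of the scripting code.

```java
import java.util.concurrent.atomic.AtomicBoolean;
import java.util.function.Consumer;

// Hypothetical guard for illustration; one instance per compiled script is assumed.
final class OncePerScriptDeprecation {
    private final AtomicBoolean warned = new AtomicBoolean(false);
    private final Consumer<String> sink;

    OncePerScriptDeprecation(Consumer<String> sink) {
        this.sink = sink;
    }

    // Safe to call per document: after the first call this is a single
    // volatile read, with no clock access and no logging.
    void deprecate(String message) {
        if (warned.get() == false && warned.compareAndSet(false, true)) {
            sink.accept(message);
        }
    }
}
```

Only the thread that wins the compareAndSet forwards the message, so concurrent executions of the same script still produce a single deprecation.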

pugnascotia · 2020-03-04T11:28:39Z

I've been playing around with the current deprecation logger. Since the logger is called all over the place, it seems unfeasible to introduce anything that would require changes to the call sites.

Instead, I threw together a DeprecationIndexer, initialised it in Node and passed it to the DeprecationLogger. Now, if a new setting is true, deprecation messages are written in something resembling ECS (only if the logger is also writing a message to Log4J).

The main issue I had was security - the NodeClient I got from Node is authenticated as _system, which doesn't have permission to create templates, create indices, or write to indices, so I slapped a dirty hack to see if the rest of it worked. I was running Elasticsearch with ./gradlew run.

I also made the indexer listen to the cluster state so that, once the cluster was ready, it could ensure an index template exists. The indexer then writes to a daily index, and stops listening once it knows the template exists.

So the questions I have are:

Is my approach here remotely valid? Or is there prior art that I've missed?
What user should be used to write the deprecation logs?

pgomulka · 2020-03-04T14:01:46Z

@pugnascotia that sounds good to me.
Will this Indexer be synchronous and block the execution? Can you link your branch?
I was wondering whether we could implement this as a log4j appender that would be used together with an asynchronous logger?

I was hoping to reuse deprecation logger logic for compatible API warnings. So there will be even more usages.

With regards to ECS, I am meant to tackle this here #47105
It is actually almost done, I need to add more testing.

pugnascotia · 2020-03-05T11:54:02Z

@pgomulka here's the branch: 4ff5e03...pugnascotia:index-deprecation-logs

jasontedor · 2020-06-04T17:55:01Z

We can take this idea even further, and use this deprecation index as a common collection point for deprecation logs across the Stack, and then expose in Kibana all the deprecated functionality in any Stack products that a user is using, helping give a full view across the Stack of changes a user might need to make when preparing to upgrade. We will need to hash out the details of this idea, which @pugnascotia will take charge on. 🙏

pugnascotia · 2020-06-05T20:22:05Z

I had a good chat with @jakelandis about this, and we realised that although there are parallels with the existing monitoring code, given that we're ripping all that out and relying on stack features, we should do the same here. We can ship an index template that creates the deprecation index as a data stream with some suitable ILM settings.

pugnascotia · 2020-06-30T15:41:16Z

@pgomulka would you mind taking a quick look at a new implementation for writing deprecation logs? See:

master...pugnascotia:index-deprecation-logs-v2

Part of elastic#46106. Simplify the implementation of deprecation logging by relying on log4j more completely, and implementing additional behaviour through custom appenders and filters.

Backport of elastic#61474. Part of elastic#46106. Simplify the implementation of deprecation logging by relying on log4j more completely, and implementing additional behaviour through custom appenders and filters.

Closes #46106. Implement a new log4j appender for deprecation logging, in order to write logs to a dedicated data stream. This is controlled by a new setting, `cluster.deprecation_indexing.enabled`.

Backport of elastic#61484. Closes elastic#46106. Implement a new log4j appender for deprecation logging, in order to write logs to a dedicated data stream. This is controlled by a new setting, `cluster.deprecation_indexing.enabled`.

Backport of #58924. Closes #46106. Introduce a mechanism for writing deprecation logs to a data stream as well as to disk.

jasontedor added :Core/Infra/Logging Log management and logging utilities team-discuss :Core/Features/Features labels Aug 29, 2019

jasontedor mentioned this issue Aug 29, 2019

Expose deprecation indices to upgrade assistant #46107

Closed

jasontedor removed the :Core/Features/Features label Aug 29, 2019

jasontedor mentioned this issue Oct 8, 2019

Allow Deprecation Info API to check for deprecated security configuration #47714

Open

rjernst mentioned this issue Oct 10, 2019

Provide more information about cause of deprecation log messages #26836

Closed

gwbrown mentioned this issue Nov 18, 2019

Check Security Roles in Deprecation Info API #49212

Closed

jaymode removed the team-discuss label Dec 6, 2019

polyfractal removed the 7x label Dec 12, 2019

jasontedor mentioned this issue Jan 29, 2020

Update docs with notes on deprecation logs throttling/limiting behavior #51578

Closed

cjcenizal added the :UI label Jan 29, 2020

tvernum mentioned this issue Jan 31, 2020

Add warnings for invalid realm order config (#51195) #51515

Merged

pugnascotia self-assigned this Feb 28, 2020

pgomulka mentioned this issue Apr 20, 2020

Logging improvements #49087

Closed

14 tasks

rjernst added the Team:Core/Infra Meta label for core/infra team label May 4, 2020

rjernst added the Team:Deployment Management Meta label for Management Experience - Deployment Management team label May 4, 2020

cjcenizal removed the :ES-UI label Jun 9, 2020

pugnascotia mentioned this issue Jun 15, 2020

Provide a mechanism for securely indexing internally irrespective of license #58117

Closed

pugnascotia mentioned this issue Jul 2, 2020

Write deprecation logs to a data stream #58924

Closed

This was referenced Aug 24, 2020

Implement deprecation logging using log4j #61474

Merged

Write deprecation logs to a data stream #61484

Merged

pugnascotia mentioned this issue Aug 27, 2020

Implement deprecation logging using log4j #61629

Merged

pugnascotia closed this as completed in #61484 Sep 3, 2020

pugnascotia mentioned this issue Sep 4, 2020

Write deprecation logs to a data stream #61966

Merged

pugnascotia added a commit that referenced this issue Sep 9, 2020

Write deprecation logs to a data stream (#61966)

b7fd7cf

Backport of #58924. Closes #46106. Introduce a mechanism for writing deprecation logs to a data stream as well as to disk.

Mpdreamz mentioned this issue Nov 16, 2020

7.10.1 Meta Ticket elastic/elasticsearch-net#5096

Closed

61 tasks

stevejgordon mentioned this issue Dec 17, 2020

7.11.0 Meta Ticket elastic/elasticsearch-net#5198

Closed

jakelandis mentioned this issue Sep 16, 2021

Document "cluster.deprecation_indexing.enabled" #77936

Closed

rlodermeier3397 mentioned this issue Sep 22, 2021

[Docs] Document network.* recommendations when using Elasticsearch in Docker #77937 sbasu-wsu/elasticsearch#6

Closed
