Use a single index with wildcard in Elasticsearch reader #1969

pavolloffay · 2019-12-09T12:34:34Z

Resolves #1361

Use a wildcard (jaeger-span-*) in queries instead of the concrete list of indices (jaeger-span-2018-12-24, jaeger-span-2018-12-25...).

The motivation for this change:

Simplified index management and usage: no --es.max-span-age parameter.
The search screen and the trace detail screen will show the same results.
Alignment with rollover (a single read index), which allows the rollover lookback option (similar to --es.max-span-age) to be deprecated later: https://medium.com/jaegertracing/using-elasticsearch-rollover-to-manage-indices-8b3d0c77915d

Possible negative impacts:

Performance, although Kibana also uses a wildcard pattern, so time range queries (on the span timestamp) should be well optimized.

Docker images to test:

pavolloffay/jaeger-query:es-wildcard
pavolloffay/jaeger-collector:es-wildcard

GetTrace, GetServices and GetOperations used --es.max-span-age to compose a list of historical indices for the query; they now query all indices via the wildcard.
FindTraceIDs and FindTraces used the times from the query parameters to compose the list of historical indices; they now query all indices as well, but with a timestamp range query.

Signed-off-by: Pavol Loffay <ploffay@redhat.com>

codecov · 2019-12-09T13:14:30Z

Codecov Report

Merging #1969 into master will decrease coverage by <.01%.
The diff coverage is 100%.

@@            Coverage Diff             @@
##           master    #1969      +/-   ##
==========================================
- Coverage   96.99%   96.99%   -0.01%     
==========================================
  Files         203      203              
  Lines       10061    10034      -27     
==========================================
- Hits         9759     9732      -27     
  Misses        264      264              
  Partials       38       38

Impacted Files	Coverage Δ
plugin/storage/es/factory.go	`100% <ø> (ø)`	⬆️
plugin/storage/es/spanstore/reader.go	`100% <100%> (ø)`	⬆️
plugin/storage/es/dependencystore/storage.go	`82.6% <100%> (-3.11%)`	⬇️
plugin/storage/es/options.go	`100% <100%> (ø)`	⬆️
plugin/storage/es/spanstore/service_operation.go	`100% <100%> (ø)`	⬆️

Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update c1bc28d...8797957.

pavolloffay · 2019-12-09T14:19:07Z

@jaegertracing/elasticsearch we are looking for volunteers to test this functionality, especially to check whether query performance has changed. Any feedback is appreciated. The Docker images are listed in the PR description.

annanay25

Should we also change the esCleaner.py script as part of this PR and allow users to specify a max TTL for span data?

annanay25 · 2019-12-19T17:08:44Z

plugin/storage/es/options.go

-		nsConfig.MaxSpanAge,
-		"The maximum lookback for spans in Elasticsearch")
+		time.Hour*72,
+		"(deprecated) The maximum lookback for spans in Elasticsearch. Now all indices are searched.")


This seems confusing. Can we put the message "Now all indices are searched" inside the parentheses?

pavolloffay · 2019-12-19T18:04:09Z

> Should we also change the esCleaner.py script as part of this PR and allow users to specify a max TTL for span data?

I am not sure I follow you. The purpose of the esCleaner.py script is to clean old data. Elasticsearch does not allow specifying a TTL in any form.

annanay25 · 2019-12-20T10:55:50Z

@pavolloffay Sorry I didn't frame that properly.

> Elasticsearch does not allow specifying a TTL in any form.

Yes. What I meant was: should we also modify the esCleaner.py script as part of this PR to account for the change in index naming format, so that users can continue to specify how long they want to store data?

pavolloffay · 2019-12-20T11:09:22Z

There is no change in index name format in this PR. The writer still writes to daily indices.

pavolloffay · 2020-01-29T10:37:42Z

@jkandasa ran performance tests to measure query time with this patch. The results show that query time increased by 300-400% on average. Therefore I am closing this PR.

https://docs.google.com/spreadsheets/d/1rwTzDvnHHhssNdZibhFuHBZDf6njt6b1DrvRiJrkt14/edit?usp=sharing

Use wildcard in Elasticsearch indices in reader

1498f31

Signed-off-by: Pavol Loffay <ploffay@redhat.com>

pavolloffay requested review from black-adder, jpkrohling, objectiser, tiffon, vprithvi and yurishkuro as code owners December 9, 2019 12:34

pavolloffay added the storage/elasticsearch label Dec 9, 2019

Remove other references to lookback

239a4f5

Signed-off-by: Pavol Loffay <ploffay@redhat.com>

pavolloffay added 3 commits December 9, 2019 14:26

Fix dependencies index

d809da5

Signed-off-by: Pavol Loffay <ploffay@redhat.com>

Use now instead of date in the past

e1080aa

Signed-off-by: Pavol Loffay <ploffay@redhat.com>

fix test

8797957

Signed-off-by: Pavol Loffay <ploffay@redhat.com>

pavolloffay changed the title ~~Use wildcard in Elasticsearch indices in reader~~ Use a single index with wildcard in Elasticsearch reader Dec 9, 2019

annanay25 reviewed Dec 19, 2019

pavolloffay mentioned this pull request Jan 28, 2020

Use Elasticsearch Rollover API to manage indices #1242

Closed

pavolloffay closed this Jan 29, 2020

pavolloffay mentioned this pull request Jan 29, 2020

Use NumericRangeQuery in ES queries when rollover is used #1361

Closed

pavolloffay mentioned this pull request Feb 17, 2020

Support regex tags search for Elasticseach backend #2049

Merged

pavolloffay mentioned this pull request Apr 29, 2020

Remove processing time interval limitation in the Jaeger Query with Elastic Search (ES) backend #2208

Closed
