-
Notifications
You must be signed in to change notification settings - Fork 2.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Remove the need for external tools for managing Elasticsearch #6283
Comments
I am working on it |
Hi @yurishkuro , for
Does it sound good? |
IndexDateSeparator - if the main storage doesn't need this argument, why does cleaner need it? Archive bool - archive storage is configured as separate storage in v2, so it can have its own cleaner if desired. Rollover bool - not clear. What's the alternative? DeleteBeforeThisDate time.Time - doesn't make sense, cleaner is a permanent periodic job, not something that takes an absolute time. We already have max span age parameter iirc, is it not sufficient to drive the behavior of the cleaner? |
I mostly took out the filters used in the current implementation.
Yes, I don't think so we need it. There is
Makes sense. Thanks
I was thinking maybe I can use the exact filter used for the rollover case, in the current implementation. But I probably have to read more about how this is being used here before commenting further. If you can provide me with some pointers, that would be really helpful? Thanks
Was mostly referring to the time calculated before which we should delete indices. This calculation should actually be done based on a Also, looks like rohan has already started working on it. |
Also, shouldn't this be added as an extension? |
it could be an extension but I think it's very closely coupled with the storage implementation that it makes more sense for it to be just an option in the config |
Just wanted to add this info. We won't need this config as well. Rollover indices can be treated as |
Hi @Rohanraj123 , let me know if you are working on it? I would like to take up merging |
yeah go ahead please! |
Thanks, I will start working on it. |
Hi @yurishkuro , I see that for |
ideally all logic should go through regular clients |
@yurishkuro I am little confused on the question, whether after this change, will we still be giving the cmds (For those who still want to keep these services in their control) or should we have to discard the cmd and move the code to pkg or a mixture of these two, keeping the cmd as it is but moving the code to a sharerd package for es and cmd? |
@Manik2708 they should become sub-commands of
|
@yurishkuro Also is there any connection between |
@Manik2708 you have to check the code where/how it's used |
@yurishkuro Es Index Rollover runs for a single server, but while integrating it with Jaeger Binary we can have multiple servers. So should we always rollover run for all servers or we have to ask from user for through config file, like in this way
|
@Manik2708 there are two ways
|
@yurishkuro As far I have understood from the code and by using it, all the operations of es-rollover cmd seems to be idempotent. Still I want to clear some doubts:
|
this is my question to you. Also the timing - if one instance initiates a rollover at :00min and then another at :05min, what's going to happen? Why do you need to run goroutine per-server? There's only one cluster.
As I understand init needs to execute only once, so you may need to add a check for it to verify if it's already been executed on a cluster. I would approach all of steps like that - check first, then run. |
@yurishkuro So as per my search:
|
correct, a command can be sent to any node. |
@yurishkuro For the |
@Rohanraj123 Are you still working on esmapping-generator? |
Yes I was waiting for the review. I will (re)start working from today only. |
@yurishkuro This is what I am thinking of options of
And this as the flow: Please review this, if it looks good, I would start working on it! |
For your struct
|
As per my understanding it requires manually, please see this: https://www.elastic.co/guide/en/elasticsearch/reference/current/getting-started-index-lifecycle-management.html#ilm-gs-alias-apply-policy It requires
This idea seems great! We are concerned for only
Could agree but have a question which is linked with the next point, if let's say
For For |
Your config could look like: backends:
my_es:
elasticsearch:
...
index_maintenance: I.e. even if this Another high level question I have - do we even need to invest in custom ILM? Or should we be looking into data streams #4708, which may have different management process. |
I studied about data streams also, official docs suggest using ILM over data streams for rollover. https://www.elastic.co/guide/en/elasticsearch/reference/current/data-streams.html#data-streams-rollover |
@yurishkuro Then where should we head? ILM or data streams! |
@Manik2708 my preference is data streams, but like you said they are not mutually exclusive. |
Ok, then let's try with data streams? |
@yurishkuro I feel a need of a helping PR for this issue (can be a prerequisite) which will be refactoring of e2e tests of Index Rollover and Cleaner to make them reusable for v2 e2e tests just like storage integration tests. This will help the contributors to test the functionality e2e and for reviewers it will make the job little easy as it will reduce the PR size (You can see the PR #6500 becoming very big due to this refactoring). Should I raise a PR or should it be included with the functionality itself? |
smaller PRs are always better. |
<!-- !! Please DELETE this comment before posting. We appreciate your contribution to the Jaeger project! 👋🎉 --> ## Which problem is this PR solving? - part of #6283 ## Description of the changes - Decoupled the Esmapping generation logic from cmd/esmappnig-generator/main.go and kept it into reusable renderer package and then reimplemented the subcommand in the plugin/storage/es/esmapping-generator.go file and then registered in jaeger/main.go. ## How was this change tested? - go test ./... -count=1 ## Checklist - [X] I have read https://github.com/jaegertracing/jaeger/blob/master/CONTRIBUTING_GUIDELINES.md - [X] I have signed all commits - [ ] I have added unit tests for the new functionality - [X] I have run lint and test steps successfully - for `jaeger`: `make lint test` - for `jaeger-ui`: `yarn lint` and `yarn test` --------- Signed-off-by: Rohanraj123 <rajrohan88293@gmail.com>
Per discussions elsewhere, we decided to limit this to esmappings command only, and ignore rollover so that it can be deprecated altogether in the future, as we switch towards support for data streams and ILM. |
PR #5797 is a good example of eliminating the need for schema initialization script for Cassandra by moving the logic directly into Jaeger binary.
We have three standalone tools for managing ES that we can also merge into Jaeger binary directly. All of them are already implemented in Go, so are fairly straightforward to embed in the Jaeger binary (fewer binary artifacts to release).
cmd/es-index-cleaner/
is used as a cron job to delete old ES indices. The logic can instead be executed from the Jaeger binary on a timer.cmd/es-rollover/
- similar to cleaner but for rolling over indices.esmapping-generator/
- just prints the index mappings to stdout (probably for manual installation)Note: this issue is not about removing the existing binaries, which we still want to maintain until Jaeger v1 EOL in 2026. We just want to make them not required for operating Jaeger-v2.
Tasks
enabled
properties for each.es-mappings
to jaeger v2 binary to print es mappingsThe text was updated successfully, but these errors were encountered: