Ability to make task manager run a task immediately #50214

mikecote · 2019-11-11T20:54:00Z

Follow up task as result of #45152

We'll add an API to Task Manager that allows us to run a Task immediately, unless it is currently running, and this will allow us to force a refresh a scheduled tasks manually.

elasticmachine · 2019-11-11T20:54:11Z

Pinging @elastic/kibana-stack-services (Team:Stack Services)

gmmorris · 2019-12-03T15:38:44Z

Here's a curiosity... if you try to call runNow for a task that has failed... should we retry it, or throw an error? 🤔

I can see someone trying to rerun a task that has failed to see if its behaviour is now valid (if, for example, its results relies on a response from an external service.
This is a behavioural change as we do not currently allow a failed task to be rerun in any way (we reschedule several times before giving up, failed means we ran out of attempts and they all failed).

any thoughts @mikecote @peterschretlen ?

bmcconaghy · 2019-12-03T15:42:49Z

I would say runNow should run it regardless of it being in the failed state. Presumably the person calling this would have reason to believe it should now succeed and it doesn't really hurt anything if we try and it just fails again.

gmmorris · 2019-12-03T15:44:54Z

That's my instinct as well.
Need to run through the implications within the TM lifecycle, but it makes sense.

gmmorris · 2019-12-10T11:55:00Z

I have an open issue in my PR, which I'm not actually sure needs to be addressed, but I'd like to hear your thoughts.

If the task being run by runNow fails, it is treated the same as any task that fails - it is rescheduled to try again, assuming there are more attempts available to it. This means a call to runNow might report a failure, and it might then succeed minutes later. Some thought needs to be put into how to address/communicate this.

What this means is that if you schedule a task with a runAt in the future, and then call runNow, it will try to run it now instead of at the runAt.
Now, presume the task run fails, you'll get a response from runNow saying it has failed to run.
But, as this is just a normal task run, Task Manager will reschedule the task to try it again.
This means, that by default, 5 minutes later Task Manager will rerun the task and that time - it might pass.

This means you would have had a runNow API call that failed, and 5 minutes later, out of nowhere, it passes.

This could be confusing, on the other hand - it is Task Manager's normal behaviour, so I'm not sure this is actually a problem.

Any thoughts?

gmmorris · 2019-12-11T10:09:25Z

Another question: what should a successful runNow return?
My instinct is that the state of the task is private, and a run should simply result in success (the promise has resolved) or failure (promise is rejected with an appropriate error, depending on why it failed), but we could return the state of the task if we wished... what do we thing?

mikecote · 2019-12-12T15:02:18Z

What this means is that if you schedule a task with a runAt in the future, and then call runNow, it will try to run it now instead of at the runAt.
Now, presume the task run fails, you'll get a response from runNow saying it has failed to run.
But, as this is just a normal task run, Task Manager will reschedule the task to try it again.
This means, that by default, 5 minutes later Task Manager will rerun the task and that time - it might pass.

I think we're missing one issue that would solve this question. Based on #39349, alerts that fail running would just try again at the next interval. I think TM supports this when providing 'interval' but possibly not alerting's usage. So we may have to create an issue to cover this gap now that we're not moving over to TM's interval.

This would solve the question where the alert would just run again at its next interval.

Another question: what should a successful runNow return?

I think simply resolving the promise for now is good enough, we won't be doing anything with the result at this time.

gmmorris · 2019-12-12T15:06:18Z

What this means is that if you schedule a task with a runAt in the future, and then call runNow, it will try to run it now instead of at the runAt.
Now, presume the task run fails, you'll get a response from runNow saying it has failed to run.
But, as this is just a normal task run, Task Manager will reschedule the task to try it again.
This means, that by default, 5 minutes later Task Manager will rerun the task and that time - it might pass.

I think we're missing one issue that would solve this question. Based on #39349, alerts that fail running would just try again at the next interval. I think TM supports this when providing 'interval' but possibly not alerting's usage. So we may have to create an issue to cover this gap now that we're not moving over to TM's interval.

This would solve the question where the alert would just run again at its next interval.

yes, that's true - when using TM's interval it'll default to rerunning at that point.
As long as the returned runAt from alerting takes the next interval into account I think that'll be fine. but I'll double check.

Another question: what should a successful runNow return?

I think simply resolving the promise for now is good enough, we won't be doing anything with the result at this time.

Cool, I'll keep the ID in there for clarity.

LeeDr · 2020-01-16T19:50:50Z

We're past 7.6.0 Feature Freeze so if this isn't a bug it should probably bump to v7.7.0.

pmuellr · 2020-01-16T19:54:37Z

This marked in the GH project as done, so I think it should be closed, but @gmmorris would know for sure.

gmmorris · 2020-01-20T16:26:02Z

Yup, this was done, not sure why the PR didn't close this at the time.

mikecote added Feature:Task Manager Team:Stack Services labels Nov 11, 2019

mikecote mentioned this issue Nov 11, 2019

Ability run an alert immediately #50215

Closed

mikecote mentioned this issue Nov 26, 2019

[DISCUSS] Task manager update API to allow changing a task's interval #45152

Closed

gmmorris self-assigned this Nov 26, 2019

gmmorris mentioned this issue Nov 26, 2019

[Task Manager] Add run now api #51574

Closed

bmcconaghy added Team:ResponseOps Label for the ResponseOps team (formerly the Cases and Alerting teams) and removed Team:Stack Services labels Dec 12, 2019

mikecote added the v7.6.0 label Dec 17, 2019

gmmorris closed this as completed Jan 20, 2020

kobelb added the needs-team Issues missing a team label label Jan 31, 2022

botelastic bot removed the needs-team Issues missing a team label label Jan 31, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Ability to make task manager run a task immediately #50214

Ability to make task manager run a task immediately #50214

mikecote commented Nov 11, 2019 •

edited by gmmorris

Loading

elasticmachine commented Nov 11, 2019

gmmorris commented Dec 3, 2019 •

edited

Loading

bmcconaghy commented Dec 3, 2019

gmmorris commented Dec 3, 2019

gmmorris commented Dec 10, 2019

gmmorris commented Dec 11, 2019

mikecote commented Dec 12, 2019

gmmorris commented Dec 12, 2019

LeeDr commented Jan 16, 2020

pmuellr commented Jan 16, 2020

gmmorris commented Jan 20, 2020

Ability to make task manager run a task immediately #50214

Ability to make task manager run a task immediately #50214

Comments

mikecote commented Nov 11, 2019 • edited by gmmorris Loading

elasticmachine commented Nov 11, 2019

gmmorris commented Dec 3, 2019 • edited Loading

bmcconaghy commented Dec 3, 2019

gmmorris commented Dec 3, 2019

gmmorris commented Dec 10, 2019

gmmorris commented Dec 11, 2019

mikecote commented Dec 12, 2019

gmmorris commented Dec 12, 2019

LeeDr commented Jan 16, 2020

pmuellr commented Jan 16, 2020

gmmorris commented Jan 20, 2020

mikecote commented Nov 11, 2019 •

edited by gmmorris

Loading

gmmorris commented Dec 3, 2019 •

edited

Loading