Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

processUnack not called after re-establishing connection to Redis DynoQueue #85

Open
manjurad opened this issue Jan 7, 2020 · 1 comment

Comments

@manjurad
Copy link

manjurad commented Jan 7, 2020

I have a Netflix conductor server along with Dynomite DB that is running a workflow where the client doesn't respond and the timeout kicks in via sweeper functionality the task times out and is re-scheduled.

But if I take the Dynomite server down temporarily and bring it back, then timeout of tasks stop working. When Dynomite server is not reachable I see logs/traceback about error in running sweeper which is as expected, once connection is re-established those errors stops but sweeper doesn't pick up any workflows to sweep.

Based on debug logs and redis commands issued it looks like the unack queue has some entries but those unack entries are not processed.
I have tried dyno-queue with version release 2.0.13 and see this behavior.

Any ideas/comments on how to fix this?

@manjurad
Copy link
Author

Any help here? Is it possible we are missing exception handling and the thread exits when we lose connection to the queue?

I see we have added exception handling here

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant