Fix KPO task hanging when pod fails to start within specified timeout #37514

Merged
1 commit merged on Feb 18, 2024

Conversation

pankajkoti
Member

I am observing an issue with the recent deferrable KPO
changes in PRs #37279 and #37454:
when the pod fails to start within the specified timeout,
the KPO task hangs forever, whereas it is expected to fail once the
timeout has passed. This PR fixes the issue by correcting a logical error in
the check that detects whether the timeout has elapsed, which is the check
that raises the timeout trigger event.
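
For readers unfamiliar with this kind of check, here is a minimal sketch of a startup wait that fails once the timeout has elapsed. It is not the provider's actual code, and the description above does not say what the logical error was. The names `wait_for_start`, `is_started`, `startup_timeout`, and `poll_interval`, as well as the shape of the returned event, are assumptions made for illustration. The sketch only shows that the elapsed time has to be re-evaluated on every poll so that a pod that never starts produces a timeout event instead of an endless wait.

```python
import asyncio
import time


async def wait_for_start(is_started, startup_timeout, poll_interval=0.01):
    """Poll until is_started() is true or startup_timeout seconds have elapsed.

    Hypothetical helper for illustration; not part of Airflow.
    """
    start = time.monotonic()
    while True:
        if is_started():
            return {"status": "success"}
        # Recompute the elapsed time on every iteration so the deadline can trip.
        elapsed = time.monotonic() - start
        if elapsed >= startup_timeout:
            return {"status": "timeout", "message": "Pod did not start in time"}
        await asyncio.sleep(poll_interval)


def run_example():
    # A pod that never starts: the wait must end with a timeout event, not hang.
    return asyncio.run(wait_for_start(lambda: False, startup_timeout=0.05))
```

Calling `run_example()` returns the timeout event after roughly 50 ms; in a deferrable operator, an event like this is what would let the task fail rather than stay deferred.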



pankajkoti force-pushed the fix-kpo-start-timeout-task-fail branch from 470a2fa to 0bccabb on February 18, 2024 09:28
pankajastro merged commit 6412b06 into apache:main on Feb 18, 2024
63 checks passed
pankajastro deleted the fix-kpo-start-timeout-task-fail branch on February 18, 2024 12:34
sunank200 pushed a commit to astronomer/airflow that referenced this pull request Feb 21, 2024
abhishekbhakat pushed a commit to abhishekbhakat/my_airflow that referenced this pull request Mar 5, 2024
Labels
area:providers provider:cncf-kubernetes Kubernetes provider related issues