Client Version: v1.32.0
Kustomize Version: v5.5.0
Server Version: v1.32.1-eks-8cce635
Helm Version
ArgoCD deployment
Description
Everything works fine initially. Then a few DAGs were imported via GitSync and the webserver GUI stopped working, especially after integrating with Google SSO.
The issue had occurred before, but now the GUI is unusable after adding Google SSO and a few DAGs (3 DAGs in total).
The webserver suddenly keeps restarting because of the liveness probe. We increased it to 60 seconds and increased the number of Gunicorn workers to 10. We don't have any resource issue, as K8s clearly shows minimal memory utilization (about 1.2 GB per pod).
To explain the problem:
The GUI takes forever to load. Sometimes the page loads only partially, with many 503 errors on CSS and other assets, which makes the webserver impossible to use.
Ignore the SSL configs. We're using a self-signed cert generated by an init container to complete the Google SSO setup, and it's working fine.
Relevant Logs
2025-02-26 19:25:41.646 [2025-02-26T16:25:41.645+0000] {impl.py:170} INFO - Pool recreating
2025-02-26 19:25:41.646 [2025-02-26T16:25:41.645+0000] {impl.py:170} INFO - Pool recreating
2025-02-26 19:25:41.646 [2025-02-26T16:25:41.644+0000] {impl.py:195} INFO - Pool disposed. Pool size: 5 Connections in pool: 0 Current Overflow: -5 Current Checked out connections: 0
2025-02-26 19:25:41.646 [2025-02-26T16:25:41.644+0000] {impl.py:195} INFO - Pool disposed. Pool size: 5 Connections in pool: 0 Current Overflow: -5 Current Checked out connections: 0
2025-02-26 19:25:41.646 [2025-02-26 16:25:41 +0000] [104] [INFO] Worker exiting (pid: 104)
2025-02-26 19:25:41.636 [2025-02-26 16:25:41 +0000] [38] [ERROR] Worker (pid:95) was sent SIGTERM!
2025-02-26 19:25:41.624 [2025-02-26T16:25:41.618+0000] {impl.py:170} INFO - Pool recreating
2025-02-26 19:25:41.624 [2025-02-26T16:25:41.618+0000] {impl.py:170} INFO - Pool recreating
2025-02-26 19:25:41.623 [2025-02-26T16:25:41.616+0000] {impl.py:195} INFO - Pool disposed. Pool size: 5 Connections in pool: 0 Current Overflow: -5 Current Checked out connections: 0
2025-02-26 19:25:41.623 [2025-02-26T16:25:41.616+0000] {impl.py:195} INFO - Pool disposed. Pool size: 5 Connections in pool: 0 Current Overflow: -5 Current Checked out connections: 0
2025-02-26 19:25:41.623 [2025-02-26T16:25:41.614+0000] {impl.py:170} INFO - Pool recreating
2025-02-26 19:25:41.623 [2025-02-26T16:25:41.614+0000] {impl.py:170} INFO - Pool recreating
2025-02-26 19:25:41.623 [2025-02-26T16:25:41.613+0000] {impl.py:195} INFO - Pool disposed. Pool size: 5 Connections in pool: 0 Current Overflow: -5 Current Checked out connections: 0
2025-02-26 19:25:41.623 [2025-02-26T16:25:41.613+0000] {impl.py:195} INFO - Pool disposed. Pool size: 5 Connections in pool: 0 Current Overflow: -5 Current Checked out connections: 0
2025-02-26 19:25:41.623 [2025-02-26T16:25:41.603+0000] {impl.py:170} INFO - Pool recreating
2025-02-26 19:25:41.623 [2025-02-26T16:25:41.603+0000] {impl.py:170} INFO - Pool recreating
2025-02-26 19:25:41.623 [2025-02-26T16:25:41.610+0000] {impl.py:170} INFO - Pool recreating
2025-02-26 19:25:41.623 [2025-02-26T16:25:41.610+0000] {impl.py:170} INFO - Pool recreating
2025-02-26 19:25:41.623 [2025-02-26T16:25:41.610+0000] {impl.py:170} INFO - Pool recreating
2025-02-26 19:25:41.623 [2025-02-26T16:25:41.610+0000] {impl.py:170} INFO - Pool recreating
2025-02-26 19:25:41.623 [2025-02-26T16:25:41.609+0000] {impl.py:195} INFO - Pool disposed. Pool size: 5 Connections in pool: 0 Current Overflow: -5 Current Checked out connections: 0
2025-02-26 19:25:41.623 [2025-02-26T16:25:41.608+0000] {impl.py:195} INFO - Pool disposed. Pool size: 5 Connections in pool: 0 Current Overflow: -5 Current Checked out connections: 0
2025-02-26 19:25:41.623 [2025-02-26T16:25:41.609+0000] {impl.py:195} INFO - Pool disposed. Pool size: 5 Connections in pool: 0 Current Overflow: -5 Current Checked out connections: 0
2025-02-26 19:25:41.623 [2025-02-26T16:25:41.608+0000] {impl.py:195} INFO - Pool disposed. Pool size: 5 Connections in pool: 0 Current Overflow: -5 Current Checked out connections: 0
2025-02-26 19:25:41.623 [2025-02-26T16:25:41.606+0000] {impl.py:170} INFO - Pool recreating
2025-02-26 19:25:41.623 [2025-02-26T16:25:41.606+0000] {impl.py:170} INFO - Pool recreating
2025-02-26 19:25:41.623 [2025-02-26T16:25:41.606+0000] {impl.py:170} INFO - Pool recreating
2025-02-26 19:25:41.623 [2025-02-26T16:25:41.606+0000] {impl.py:170} INFO - Pool recreating
2025-02-26 19:25:41.623 [2025-02-26T16:25:41.604+0000] {impl.py:195} INFO - Pool disposed. Pool size: 5 Connections in pool: 0 Current Overflow: -5 Current Checked out connections: 0
2025-02-26 19:25:41.623 [2025-02-26T16:25:41.604+0000] {impl.py:195} INFO - Pool disposed. Pool size: 5 Connections in pool: 0 Current Overflow: -5 Current Checked out connections: 0
2025-02-26 19:25:41.623 [2025-02-26T16:25:41.604+0000] {impl.py:195} INFO - Pool disposed. Pool size: 5 Connections in pool: 0 Current Overflow: -5 Current Checked out connections: 0
2025-02-26 19:25:41.623 [2025-02-26T16:25:41.604+0000] {impl.py:195} INFO - Pool disposed. Pool size: 5 Connections in pool: 0 Current Overflow: -5 Current Checked out connections: 0
2025-02-26 19:25:41.623 [2025-02-26 16:25:41 +0000] [100] [INFO] Worker exiting (pid: 100)
2025-02-26 19:25:41.623 [2025-02-26 16:25:41 +0000] [101] [INFO] Worker exiting (pid: 101)
2025-02-26 19:25:41.623 [2025-02-26 16:25:41 +0000] [38] [INFO] Handling signal: term
2025-02-26 19:25:41.623 [2025-02-26 16:25:41 +0000] [96] [INFO] Worker exiting (pid: 96)
2025-02-26 19:25:41.623 [2025-02-26 16:25:41 +0000] [99] [INFO] Worker exiting (pid: 99)
2025-02-26 19:25:41.623 [2025-02-26 16:25:41 +0000] [97] [INFO] Worker exiting (pid: 97)
2025-02-26 19:25:41.623 [2025-02-26 16:25:41 +0000] [98] [INFO] Worker exiting (pid: 98)
2025-02-26 19:25:41.603 [2025-02-26T16:25:41.601+0000] {impl.py:195} INFO - Pool disposed. Pool size: 5 Connections in pool: 0 Current Overflow: -5 Current Checked out connections: 0
2025-02-26 19:25:41.603 [2025-02-26T16:25:41.601+0000] {impl.py:195} INFO - Pool disposed. Pool size: 5 Connections in pool: 0 Current Overflow: -5 Current Checked out connections: 0
2025-02-26 19:25:41.601 [2025-02-26 16:25:41 +0000] [102] [INFO] Worker exiting (pid: 102)
2025-02-26 19:25:41.599 [2025-02-26T16:25:41.598+0000] {impl.py:170} INFO - Pool recreating
2025-02-26 19:25:41.599 [2025-02-26T16:25:41.598+0000] {impl.py:170} INFO - Pool recreating
2025-02-26 19:25:41.598 [2025-02-26T16:25:41.597+0000] {impl.py:195} INFO - Pool disposed. Pool size: 5 Connections in pool: 0 Current Overflow: -5 Current Checked out connections: 0
2025-02-26 19:25:41.598 [2025-02-26T16:25:41.597+0000] {impl.py:195} INFO - Pool disposed. Pool size: 5 Connections in pool: 0 Current Overflow: -5 Current Checked out connections: 0
2025-02-26 19:25:41.598 [2025-02-26T16:25:41.597+0000] {impl.py:170} INFO - Pool recreating
2025-02-26 19:25:41.597 [2025-02-26T16:25:41.597+0000] {impl.py:170} INFO - Pool recreating
2025-02-26 19:25:41.597 [2025-02-26T16:25:41.596+0000] {impl.py:195} INFO - Pool disposed. Pool size: 5 Connections in pool: 0 Current Overflow: -5 Current Checked out connections: 0
2025-02-26 19:25:41.597 [2025-02-26T16:25:41.596+0000] {impl.py:195} INFO - Pool disposed. Pool size: 5 Connections in pool: 0 Current Overflow: -5 Current Checked out connections: 0
2025-02-26 19:25:41.596 [2025-02-26T16:25:41.594+0000] {webserver_command.py:429} INFO - Received signal: 15. Closing gunicorn.
2025-02-26 19:25:41.595 [2025-02-26 16:25:41 +0000] [95] [INFO] Worker exiting (pid: 95)
2025-02-26 19:25:41.595 [2025-02-26 16:25:41 +0000] [103] [INFO] Worker exiting (pid: 103)
2025-02-26 19:25:35.544 10.0.30.87 - - [26/Feb/2025:16:25:35 +0000] "GET /health HTTP/1.1" 200 318 "-""kube-probe/1.32+"
2025-02-26 19:25:35.541 [2025-02-26T16:25:35.540+0000] {base.py:1092} INFO - COMMIT
2025-02-26 19:25:35.541 [2025-02-26T16:25:35.540+0000] {base.py:1092} INFO - COMMIT
2025-02-26 19:25:35.539 [2025-02-26T16:25:35.538+0000] {base.py:1868} INFO - [cached since 0.01995s ago] {'job_type_1': 'DagProcessorJob', 'param_1': <JobState.RUNNING: 'running'>, 'param_2': 0, 'param_3': 1, 'param_4': 1}
2025-02-26 19:25:35.539 [2025-02-26T16:25:35.538+0000] {base.py:1868} INFO - [cached since 0.01995s ago] {'job_type_1': 'DagProcessorJob', 'param_1': <JobState.RUNNING: 'running'>, 'param_2': 0, 'param_3': 1, 'param_4': 1}
2025-02-26 19:25:35.539 LIMIT %(param_4)s
2025-02-26 19:25:35.539 WHERE job.job_type = %(job_type_1)s ORDER BY CASE job.state WHEN %(param_1)s THEN %(param_2)s ELSE %(param_3)s END, job.latest_heartbeat DESC
2025-02-26 19:25:35.539 FROM job
2025-02-26 19:25:35.539 [2025-02-26T16:25:35.537+0000] {base.py:1863} INFO - SELECT job.id, job.dag_id, job.state, job.job_type, job.start_date, job.end_date, job.latest_heartbeat, job.executor_class, job.hostname, job.unixname
2025-02-26 19:25:35.539 LIMIT %(param_4)s
2025-02-26 19:25:35.539 WHERE job.job_type = %(job_type_1)s ORDER BY CASE job.state WHEN %(param_1)s THEN %(param_2)s ELSE %(param_3)s END, job.latest_heartbeat DESC
2025-02-26 19:25:35.539 FROM job
2025-02-26 19:25:35.539 [2025-02-26T16:25:35.537+0000] {base.py:1863} INFO - SELECT job.id, job.dag_id, job.state, job.job_type, job.start_date, job.end_date, job.latest_heartbeat, job.executor_class, job.hostname, job.unixname
2025-02-26 19:25:35.539 [2025-02-26T16:25:35.536+0000] {base.py:1032} INFO - BEGIN (implicit)
2025-02-26 19:25:35.537 [2025-02-26T16:25:35.536+0000] {base.py:1032} INFO - BEGIN (implicit)
2025-02-26 19:25:35.531 [2025-02-26T16:25:35.530+0000] {base.py:1092} INFO - COMMIT
2025-02-26 19:25:35.531 [2025-02-26T16:25:35.530+0000] {base.py:1092} INFO - COMMIT
2025-02-26 19:25:35.528 [2025-02-26T16:25:35.528+0000] {base.py:1868} INFO - [cached since 0.009677s ago] {'job_type_1': 'TriggererJob', 'param_1': <JobState.RUNNING: 'running'>, 'param_2': 0, 'param_3': 1, 'param_4': 1}
2025-02-26 19:25:35.528 [2025-02-26T16:25:35.528+0000] {base.py:1868} INFO - [cached since 0.009677s ago] {'job_type_1': 'TriggererJob', 'param_1': <JobState.RUNNING: 'running'>, 'param_2': 0, 'param_3': 1, 'param_4': 1}
2025-02-26 19:25:35.528 LIMIT %(param_4)s
2025-02-26 19:25:35.528 WHERE job.job_type = %(job_type_1)s ORDER BY CASE job.state WHEN %(param_1)s THEN %(param_2)s ELSE %(param_3)s END, job.latest_heartbeat DESC
2025-02-26 19:25:35.528 FROM job
2025-02-26 19:25:35.528 [2025-02-26T16:25:35.527+0000] {base.py:1863} INFO - SELECT job.id, job.dag_id, job.state, job.job_type, job.start_date, job.end_date, job.latest_heartbeat, job.executor_class, job.hostname, job.unixname
2025-02-26 19:25:35.528 LIMIT %(param_4)s
2025-02-26 19:25:35.528 WHERE job.job_type = %(job_type_1)s ORDER BY CASE job.state WHEN %(param_1)s THEN %(param_2)s ELSE %(param_3)s END, job.latest_heartbeat DESC
2025-02-26 19:25:35.527 FROM job
2025-02-26 19:25:35.527 [2025-02-26T16:25:35.527+0000] {base.py:1863} INFO - SELECT job.id, job.dag_id, job.state, job.job_type, job.start_date, job.end_date, job.latest_heartbeat, job.executor_class, job.hostname, job.unixname
2025-02-26 19:25:35.526 [2025-02-26T16:25:35.526+0000] {base.py:1032} INFO - BEGIN (implicit)
2025-02-26 19:25:35.526 [2025-02-26T16:25:35.526+0000] {base.py:1032} INFO - BEGIN (implicit)
2025-02-26 19:25:35.523 [2025-02-26T16:25:35.522+0000] {base.py:1092} INFO - COMMIT
2025-02-26 19:25:35.522 [2025-02-26T16:25:35.522+0000] {base.py:1092} INFO - COMMIT
2025-02-26 19:25:35.520 [2025-02-26T16:25:35.519+0000] {base.py:1868} INFO - [generated in 0.00081s] {'job_type_1': 'SchedulerJob', 'param_1': <JobState.RUNNING: 'running'>, 'param_2': 0, 'param_3': 1, 'param_4': 1}
2025-02-26 19:25:35.520 [2025-02-26T16:25:35.519+0000] {base.py:1868} INFO - [generated in 0.00081s] {'job_type_1': 'SchedulerJob', 'param_1': <JobState.RUNNING: 'running'>, 'param_2': 0, 'param_3': 1, 'param_4': 1}
2025-02-26 19:25:35.519 LIMIT %(param_4)s
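The excerpt above is newest-first, which makes the order of events easy to misread. As a rough aid, here is a minimal Python sketch that sorts pasted lines by the leading log-viewer timestamp and measures the time between the last successful `GET /health` and the first termination signal. It is not part of the deployment, and the helper names `parse` and `probe_to_sigterm_gap` are hypothetical. It assumes every relevant line starts with a `YYYY-MM-DD HH:MM:SS.mmm` prefix, as in the logs above.

```python
import re
from datetime import datetime

# Leading log-viewer timestamp, e.g. "2025-02-26 19:25:41.623 <rest of line>"
PREFIX = re.compile(r"^(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}\.\d{3}) (.*)$")


def parse(lines):
    """Return (timestamp, message) pairs sorted oldest-first."""
    events = []
    for line in lines:
        m = PREFIX.match(line)
        if m:
            ts = datetime.strptime(m.group(1), "%Y-%m-%d %H:%M:%S.%f")
            events.append((ts, m.group(2)))
    return sorted(events, key=lambda e: e[0])


def probe_to_sigterm_gap(events):
    """Seconds from the last 200 on /health to the first termination signal."""
    last_ok = None
    for ts, msg in events:
        if '"GET /health' in msg and '" 200 ' in msg:
            last_ok = ts
        elif last_ok is not None and (
            "Received signal: 15" in msg or "Handling signal: term" in msg
        ):
            return (ts - last_ok).total_seconds()
    return None


# Three lines copied from the excerpt above, still in newest-first order.
sample = [
    "2025-02-26 19:25:41.623 [2025-02-26 16:25:41 +0000] [38] [INFO] Handling signal: term",
    "2025-02-26 19:25:41.596 [2025-02-26T16:25:41.594+0000] {webserver_command.py:429} INFO - Received signal: 15. Closing gunicorn.",
    '2025-02-26 19:25:35.544 10.0.30.87 - - [26/Feb/2025:16:25:35 +0000] "GET /health HTTP/1.1" 200 318 "-""kube-probe/1.32+"',
]

events = parse(sample)
gap = probe_to_sigterm_gap(events)
print(f"last healthy probe to termination signal: {gap} s")
```

On these three lines the gap is roughly 6 seconds, so the last logged probe returned 200 shortly before gunicorn received signal 15. The excerpt does not show which probe, if any, failed, so this only narrows the timeline and does not identify the cause.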
Checks
User-Community Airflow Helm Chart
Chart Version
8.9.0
Kubernetes Version
Helm Version
Custom Helm Values