Add an ability to configure health checks for containers in Workspace.Next ChePlugin #10273

Closed
garagatyi opened this issue Jul 4, 2018 · 7 comments
Labels
kind/task Internal things, technical debt, and to-do tasks to be performed. lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale.

Comments

@garagatyi

Description

Right now we use hardcoded health checks, but with Workspace.Next we need to make them configurable.


@l0rd l0rd mentioned this issue Jul 4, 2018
24 tasks
@garagatyi garagatyi added kind/task Internal things, technical debt, and to-do tasks to be performed. team/osio labels Jul 10, 2018
@l0rd l0rd mentioned this issue Aug 3, 2018
57 tasks
@skabashnyuk skabashnyuk self-assigned this Oct 1, 2018
@sleshchenko
Member

sleshchenko commented Oct 1, 2018

@garagatyi
The title mentions container health checks, while the description says we currently use hardcoded health checks. Actually, we don't have any health checks for containers, only for Che servers (ws-agent, terminal, exec). Correct me if I'm wrong.
So, could you please clarify whether we should:

  1. Introduce health checks for plugin containers;
  2. Introduce health checks for plugin endpoints, since they will be transformed into Che servers and it would be useful to have actual statuses there, especially for the Workspace Loader, which waits for the Editor endpoint before opening it in an iframe; or
  3. Both plugin containers and endpoints.

@garagatyi
Author

I think we need to add container health checks. We couldn't do that in Che 6 because, due to installers, each container could run several apps. In Che 7 we should put one app per container, so container liveness checks would be enough. Apart from that, we would be able to reuse the existing functionality of Kubernetes/OpenShift/Docker.
In any case, since the WS.NEXT flow is not implemented for Docker, we should add liveness checks for containers only.
I'm talking about plugin sidecar containers only. If a user needs health checks in the workspace recipe, they can use the native health check mechanism of the recipe type.

Health checks (liveness probes in Kubernetes terms) check the app running in a container, not the container itself, so they can check app state. If the app state is OK, I consider it fair to set the statuses of all the servers of that container to RUNNING. Maybe later we can remove those statuses altogether.
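
A minimal sketch of what a Kubernetes liveness probe on a plugin sidecar container could look like (the container name, image, port, and timings below are illustrative assumptions, not taken from an actual Che plugin definition):

containers:
  - name: plugin-sidecar                 # hypothetical sidecar name
    image: example.org/plugin-sidecar:latest
    ports:
      - containerPort: 4444
        protocol: TCP
    livenessProbe:
      # Probes the single app in the container (one app per container in Che 7),
      # so its result can stand in for the statuses of that container's servers.
      tcpSocket:
        port: 4444
      initialDelaySeconds: 10
      periodSeconds: 10
      failureThreshold: 3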

@BarryDrez

@garagatyi For custom stacks, could this be something the user can configure in their recipe - e.g.,

"recipe": {
  "type": "kubernetes",
  "content": "kind: List\nitems:\n - \n  kind: Service\n  apiVersion: v1\n  metadata:\n   name: isservice\n  spec:\n   selector:\n    name: IS103\n   ports:\n    - \n     name: isadmin\n     protocol: TCP\n     port: 5555\n     targetPort: 5555\n - \n  kind: Pod\n  apiVersion: v1\n  metadata:\n   name: is103\n  spec:\n   containers:\n    - \n     image: 'daerepository03.eur.ad.sag:4443/design-server/is:10.3.0.0xa'\n     name: integrationserver\n     ports:\n      - \n       containerPort: 5555\n       protocol: TCP\n     resources:\n      limits:\n       memory: 2048Mi\n     livenessProbe:\n       failureThreshold: 11\n       initialDelaySeconds: 5\n       periodSeconds: 5\n       successThreshold: 1\n       tcpSocket:\n         port: 5555\n       timeoutSeconds: 45\n     readinessProbe:\n       failureThreshold: 10\n       initialDelaySeconds: 20\n       periodSeconds: 5\n       successThreshold: 1\n       tcpSocket:\n         port: 5555\n       timeoutSeconds: 120\n",
  "contentType": "text/x-yaml"
}

Formatted content:

kind: List
items:
 - 
  kind: Service
  apiVersion: v1
  metadata:
   name: isservice
  spec:
   selector:
    name: IS103
   ports:
    - 
     name: isadmin
     protocol: TCP
     port: 5555
     targetPort: 5555
 - 
  kind: Pod
  apiVersion: v1
  metadata:
   name: is103
  spec:
   containers:
    - 
     image: 'daerepository03.eur.ad.sag:4443/design-server/is:10.3.0.0xa'
     name: integrationserver
     ports:
      - 
       containerPort: 5555
       protocol: TCP
     resources:
      limits:
       memory: 2048Mi
     livenessProbe:
       failureThreshold: 11
       initialDelaySeconds: 5
       periodSeconds: 5
       successThreshold: 1
       tcpSocket:
         port: 5555
       timeoutSeconds: 45
     readinessProbe:
       failureThreshold: 10
       initialDelaySeconds: 10
       periodSeconds: 3
       successThreshold: 1
       tcpSocket:
         port: 5555
       timeoutSeconds: 2

I have tried this, but it does not work.

@garagatyi
Author

@BarryDrez If you are talking about the user recipe, then it should already be supported. @sleshchenko, correct me if I'm mistaken.

The original suggestion was about configuring health checks for IDE plugins. Such configuration should be similar to what is defined in a Kubernetes Deployment.
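
A purely hypothetical sketch of how such plugin-level configuration could mirror the Deployment probe fields (none of these field names are an existing Che plugin format; they only illustrate the shape):

containers:
  - name: plugin-sidecar               # illustrative name, not from this issue
    image: example.org/plugin:latest
    livenessProbe:                      # same structure as in a k8s Deployment
      httpGet:
        path: /healthz
        port: 3000
      initialDelaySeconds: 15
      periodSeconds: 10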

@sleshchenko
Member

@garagatyi You're right, it should be supported.
@BarryDrez

I have tried this, but it does not work.

What is the error? Could you provide the workspace-related Deployment that is created by the Che server?
It would be better if you create a dedicated issue, and we'll continue investigating your problem there. Thanks.

@BarryDrez

@garagatyi, @sleshchenko Thank you for clarifying this. I have done some more experimenting with my liveness and readiness probes, and it looks like I needed to add a longer delay for the liveness probe. If this still looks like a bug, I will open a new issue as you suggest, but it is beginning to look like it is working well (as designed).
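
For reference, with the recipe above the fix amounts to raising initialDelaySeconds on the liveness probe; the 60-second value here is only an example, and the right delay depends on how long the Integration Server takes to start:

livenessProbe:
  failureThreshold: 11
  initialDelaySeconds: 60    # was 5; give the server time to start before probing
  periodSeconds: 5
  successThreshold: 1
  tcpSocket:
    port: 5555
  timeoutSeconds: 45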

@che-bot
Contributor

che-bot commented Sep 17, 2019

Issues go stale after 180 days of inactivity. lifecycle/stale issues rot after an additional 7 days of inactivity and eventually close.

Mark the issue as fresh with /remove-lifecycle stale in a new comment.

If this issue is safe to close now please do so.

Moderators: Add lifecycle/frozen label to avoid stale mode.

@che-bot che-bot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Sep 17, 2019
@che-bot che-bot closed this as completed Sep 25, 2019