ADR 0010 ODH/Caikit/TGIS integration #20

Xaenalt · 2023-09-26T18:33:43Z

Overview of the architecture and diagram of the ODH+Caikit+TGIS architecture

Description

How Has This Been Tested?

Merge criteria:

The commits are squashed in a cohesive manner and have meaningful messages.
Testing instructions have been added in the PR body (for PRs involving changes that are not immediately obvious).
The developer has manually tested the changes and verified that the changes work

Xaenalt · 2023-09-26T19:08:03Z

@jwforres @etirelli PTAL

ODH-ADR-0010-caikit-tgis-architecture.md

anishasthana · 2023-09-26T20:00:41Z

Imo we should have the lucidchart be included in this ADR itself

Xaenalt · 2023-09-26T20:39:58Z

Imo we should have the lucidchart be included in this ADR itself

It's linked in "other docs" and in the "references" section, is there a way to include it more generally?

Co-authored-by: Anish Asthana <anishasthana1@gmail.com>

astefanutti · 2023-10-05T13:14:12Z

Imo we should have the lucidchart be included in this ADR itself

It's linked in "other docs" and in the "references" section, is there a way to include it more generally?

I'd recommend exporting the diagram to SVG, commit the SVG file along the ADR, and include the SVG into the ADR markdown file as suggested by @anishasthana.

Also as Lucidchart is being retired at Red Hat, I'd recommend to export the diagram in VSDX format, so it can be imported into other diagramming solutions like draw.io.

anishasthana · 2023-10-06T20:19:32Z

ODH-ADR-0010-caikit-tgis-architecture.md

+| ---------------- | ------------------------------------------------------------------------------------------------------------------------------ |
+| Date           | 2023-Sept-13                                                                                                                 |
+| Scope          | OpenDataHub and Caikit/TGIS integration architecture                                                                         |
+| Status         | Accepted                                                                                                                     |


Why is this saying accepted already? :-)

strangiato · 2023-10-09T15:30:27Z

ODH-ADR-0010-caikit-tgis-architecture.md

+
+## Non-Goals
+
+## How


How will the Caikit be deployed on a k8s cluster? Will it be additional pods/services that get deployed to the cluster?

Will it have it's own controller/operator? Will it have it's own CRDs that it manages? If so what will they do?

How will it be integrated with the ODH Operator? Will it be a new component that the DSC will need to deploy?

Will it be able to function if a user has only deployed Ray or KServe and not both?

What is the relationship between a Caikit CR and the Ray/KServe objects? Will it be like a DSPA where an instance of Caikit will need to be deployed in every Data Science Project?

Is there something that a user needs to do to make the Caikit SDK to work with Ray/Kserve or will all of the compatibility be handled on the users end (e.g. Elyra handles 100% of the translation from an "Elyra Pipeline" to a kfp-tekton compatible pipeline in the running notebook so dsp never needs to "understand" Elyra)?

Will anything be required by the Ray or KServe stacks to get Caikit to function or will it slot in on top of them as they exist today?

Some sort of rough architecture diagram would probably be very helpful here.

strangiato · 2023-10-09T15:31:59Z

ODH-ADR-0010-caikit-tgis-architecture.md

+
+## How
+
+Users will have a few ways to interact with the software stack. Caikit will be used both as a backend software runtime, which is used by the Caikit SDK that users can code against to create their models. These models can be trained in Ray using the Caikit runtime stack as the training backend on the nodes. Caikit will also be integrated as a serving runtime under KServe. All of these components can be interacted with using the standard OpenShift APIs, creating CRs in OpenShift, etc. Additionally, Caikit will also expose an API that can run on the cluster, allowing for several convenience features such as moving a model between training and serving, as well as some tracking. These features will be implemeted in the same manner, creating CRs and calling OpenShift APIs.


Caikit will be used both as

This sentence only lists one option. The second option probably got moved into a separate sentence while it was being edited so "both" no longer makes sense here.

strangiato · 2023-10-09T15:32:58Z

ODH-ADR-0010-caikit-tgis-architecture.md

+
+## How
+
+Users will have a few ways to interact with the software stack. Caikit will be used both as a backend software runtime, which is used by the Caikit SDK that users can code against to create their models. These models can be trained in Ray using the Caikit runtime stack as the training backend on the nodes. Caikit will also be integrated as a serving runtime under KServe. All of these components can be interacted with using the standard OpenShift APIs, creating CRs in OpenShift, etc. Additionally, Caikit will also expose an API that can run on the cluster, allowing for several convenience features such as moving a model between training and serving, as well as some tracking. These features will be implemeted in the same manner, creating CRs and calling OpenShift APIs.


creating CRs in OpenShift

What CRs? What will they do?

israel-hdez · 2023-10-10T16:44:41Z

ODH-ADR-0010-caikit-tgis-architecture.md

+
+## What
+
+This ADR describes the architecture of the joint IBM-RedHat integration of ODH and Caikit/TGIS into the AI stack.


Are we good mentioning company ascription here?

It doesn't seem relevant for ODH.

Xaenalt · 2023-10-25T17:53:43Z

github-actions · 2024-07-18T03:17:17Z

This PR was closed because it has been stale for 21+7 days with no activity.

ADR 10 ODH/Caikit/TGIS integration

d1086e2

Xaenalt mentioned this pull request Sep 26, 2023

Caikit/TGIS ADR opendatahub-io/caikit#13

Open

2 tasks

anishasthana requested changes Sep 26, 2023

View reviewed changes

ODH-ADR-0010-caikit-tgis-architecture.md Outdated Show resolved Hide resolved

Update ODH-ADR-0010-caikit-tgis-architecture.md

da11454

Co-authored-by: Anish Asthana <anishasthana1@gmail.com>

anishasthana reviewed Oct 6, 2023

View reviewed changes

strangiato reviewed Oct 9, 2023

View reviewed changes

israel-hdez reviewed Oct 10, 2023

View reviewed changes

etirelli added the Stale label Jul 9, 2024

github-actions bot closed this Jul 18, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ADR 0010 ODH/Caikit/TGIS integration #20

ADR 0010 ODH/Caikit/TGIS integration #20

Xaenalt commented Sep 26, 2023

Xaenalt commented Sep 26, 2023

anishasthana commented Sep 26, 2023

Xaenalt commented Sep 26, 2023

astefanutti commented Oct 5, 2023 •

edited

Loading

anishasthana Oct 6, 2023

strangiato Oct 9, 2023

strangiato Oct 9, 2023

strangiato Oct 9, 2023

israel-hdez Oct 10, 2023

Xaenalt commented Oct 25, 2023

github-actions bot commented Jul 18, 2024


		## How

		Users will have a few ways to interact with the software stack. Caikit will be used both as a backend software runtime, which is used by the Caikit SDK that users can code against to create their models. These models can be trained in Ray using the Caikit runtime stack as the training backend on the nodes. Caikit will also be integrated as a serving runtime under KServe. All of these components can be interacted with using the standard OpenShift APIs, creating CRs in OpenShift, etc. Additionally, Caikit will also expose an API that can run on the cluster, allowing for several convenience features such as moving a model between training and serving, as well as some tracking. These features will be implemeted in the same manner, creating CRs and calling OpenShift APIs.


		## What

		This ADR describes the architecture of the joint IBM-RedHat integration of ODH and Caikit/TGIS into the AI stack.

ADR 0010 ODH/Caikit/TGIS integration #20

ADR 0010 ODH/Caikit/TGIS integration #20

Conversation

Xaenalt commented Sep 26, 2023

Description

How Has This Been Tested?

Merge criteria:

Xaenalt commented Sep 26, 2023

anishasthana commented Sep 26, 2023

Xaenalt commented Sep 26, 2023

astefanutti commented Oct 5, 2023 • edited Loading

anishasthana Oct 6, 2023

Choose a reason for hiding this comment

strangiato Oct 9, 2023

Choose a reason for hiding this comment

strangiato Oct 9, 2023

Choose a reason for hiding this comment

strangiato Oct 9, 2023

Choose a reason for hiding this comment

israel-hdez Oct 10, 2023

Choose a reason for hiding this comment

Xaenalt commented Oct 25, 2023

github-actions bot commented Jul 18, 2024

astefanutti commented Oct 5, 2023 •

edited

Loading