[TEP-0080] Proposal: TaskRun Pre/Post steps #502

mattmoor · 2021-08-19T16:10:51Z

This is the first of a handful of TEPs I'm putting together to weave a more complete story. Once the constituent TEPs have been staged, I can open up an Uber-TEP to discuss the bigger picture they are trying to compose into (a la Voltron), but want to keep the individual aspects finely scoped.

imjasonh · 2021-08-19T17:52:18Z

The main requirement is a mechanism to enable users to scope the intent of things
like Hermekton's ExecutionPolicy, but the execution of the result must be
equivalent to append(pre, steps, post) modulo features that are sensitive to the
distinction (e.g. Hermekton).

Would it be sufficient to support per-step hermeticity? My (incomplete) understanding of the hermekton TEP was that it was mainly a matter of how the entrypointer ran the step's command+args, which in my (incomplete) understanding would be doable with a per-step executionMode field.

mattmoor · 2021-08-19T17:58:53Z

@imjasonh possibly, though (to cross the streams a bit) that would also mean that in a world where parameters can have type Step (ref) that the TaskRun wouldn't be in control of whether that step were hermetically executed, which may be undesirable.

To crib my example from there:

params:
    - name: source-step
      description: The step to execute to fetch source
      schema:
        type: dev.tekton.taskruns.Step # This is a strawperson
  presteps:
  - $(params.source-step)
  steps:
  ... # More literal steps

I sort of like the idea of having clear expectations about whether a parameter of type Step should/shouldn't be executed hermetically.

imjasonh · 2021-08-19T18:05:00Z

I sort of like the idea of having clear expectations about whether a parameter of type Step should/shouldn't be executed hermetically.

Absolutely.

The way this TEP is framed, pre- and post-steps seem to mainly be a matter of cordoning off which steps are hermetic and which aren't, which doesn't motivate the API change by itself since hermkton is not widely used today (possibly because it's too hard, which this TEP would fix 🙃). I suspect your other planned TEPs help drive this need better, in which case I'm content to wait and see how that all shapes up.

mattmoor · 2021-08-19T18:06:48Z

I suspect your other planned TEPs help drive this need better

I'm working on them! Just trying to keep the parts bite-sized.

bobcatfish · 2021-08-20T17:26:40Z

Hey @mattmoor a few initial questions:

Would there be some way to reuse existing tasks for pre and post steps e.g. if you wanted to do a git clone as a pre step or would you need to copy the steps?
Would post steps run in the case of an error? This was an issue with PipelineResources - e.g. folks would want to do something at the end of a task, like upload results, but if the task steps failed it wouldnt happen so they'd have to do something artificial to make the steps pass - today with onError this gets a bit easier but if you still ultimately want the task to fail you'd have to include a final post step to do that - and it starts to look more and more like the "finally" syntax we already have in pipelines - and is why ive been pursuing running a pipeline in a pod as the solution to the use cases you are describing
Would this functionality be available from a pipeline as well (which generates taskruns) or just from a taskrun directly?

You might find TEP-0044 interesting - esp. the task composition alternative which suggests something very similar to what you are proposing but from the context of using a task in a pipeline.

I've been trying to find a way to meet the use cases you are describing and it feels like the main missing piece is being able to use the composition expressed in a pipeline but running in a way that doesnt require volumes to share data, i.e. in one pod - which is why ive been pursuing TEP-0044 and #447

mattmoor · 2021-08-20T18:19:32Z

Would there be some way to reuse existing tasks for pre and post steps e.g. if you wanted to do a git clone as a pre step or would you need to copy the steps?

As it stands today, yes, however, one of my comments on your PR was about enabling structured substitutions as well as structured params/results, and allowing params / results to have types reflecting those substitutions. The example I gave was specifically around being able to pass a Step as a parameter to a task. I hack around this in mink by just passing a container, but this leaves a lot to be desired.

Would post steps run in the case of an error?

They can if we say we want that, I'm happy to call it "finally" as well, if that's what we want. Ultimately it seems like you'd need to capacity to express "finally" in the TaskRun spec in order for PipelineRun->TaskRun transpilation to WAI with "finally", or go the "straight to Pod" route, which has other gotchas.

Would this functionality be available from a pipeline as well (which generates taskruns) or just from a taskrun directly?

My expectation is that this is part of TaskSpec (lmk if there's a way to better convey that), so Tasks could define these too, and even take aspects of the pre/post work as parameters. This would means the TaskRuns executed by a Pipeline (or anything else) would fundamentally have this available to them.

You might find TEP-0044 interesting - esp. the task composition alternative which suggests something very similar to what you are proposing but from the context of using a task in a pipeline.

I've been trying to find a way to meet the use cases you are describing and it feels like the main missing piece is being able to use the composition expressed in a pipeline but running in a way that doesnt require volumes to share data, i.e. in one pod - which is why ive been pursuing TEP-0044 and #447

I can take a look (there's a lot to read through and grok), but my major concerns in general are that Pipeline is overkill for what we want, and it's likely impossible to fully transpile ~any conformant Pipeline into a Task, so it is a potential minefield for folks using them that way. It'd be good to understand how you hope to help folks navigate what's supported vs. not.

jerop · 2021-08-23T16:17:27Z

/assign @bobcatfish @vdemeester @imjasonh @jerop

tekton-robot · 2021-08-26T16:26:44Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by:
To complete the pull request process, please assign bobcatfish
You can assign the PR to them by writing /assign @bobcatfish in a comment when ready.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

teps/OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

bobcatfish · 2021-08-27T15:54:14Z

and it's likely impossible to fully transpile ~any conformant Pipeline into a Task

I agree - in the long run as we keep pursuing this, I think we'll break away from the intermediate TaskRun and go straight to a pod (or possibly converge Pipelines and Tasks to be more similar but that's one of the more out there options :D) - there's a bit more about this in the "what comes next" in #447

but my major concerns in general are that Pipeline is overkill for what we want

When you say "overkill" could you elaborate a bit? Usually it seems like the main argument against Pipelines for something like this is because of the overhead of requiring volumes for sharing data, but I'm interested if there are other aspects that make Pipelines overkill if that is no longer a problem?

mattmoor · 2021-08-30T04:39:47Z

When you say "overkill" could you elaborate a bit?

Pipelines are a DAG abstraction, whereas Tasks are already a sequence of colocated steps. Needing to involve Pipeline just to scope aspects of Tasks semantics in still-sequential execution is why it seems like overkill.

in the long run as we keep pursuing this, I think we'll break away from the intermediate TaskRun and go straight to a pod

I have a large number of concerns about this, many/all of which I think I've mentioned. To list a few here (since it also speaks to the "overkill" comment):

I doubt it is possible to capture the full semantics of Pipeline in a single Pod (certainly not without heroic work), so there are conformance issues and ultimately there's a high risk of user confusion around what is/isn't supported by this non-conformant variant
There is bound to be a huge amount of redundant code across these implementations, and even well factored as packages, that code will likely become much more complex (needing to reason about things in terms of the more complex DAG model).
There is already significant conceptual leakage across Pipeline/Task, and a variety of layering violations. This is bound to make those things substantially worse.

Cloud Build supports DAG execution, and we specifically dropped it from knative/build (👋 taskruns) because it was a complexity nightmare, which was deemed (by folks on the team) more trouble than it was worth. Pipeline would only be more complex to try to fully capture due to the possibility of things like distinct service accounts across TaskRuns.

So I'm genuinely wondering whether there is a 80/20 or even 90/10 sweet spot here, where we can capture the vast majority of use cases with a much simpler model.

vdemeester

So reading the proposal, it seems to be solely about hermekton but reading the comments, it seems to be for a broader set of reason why we would want this. I would like to either have the specifics of hermekton be discussed in an update of the hermekton TEP or have this TEP have more use cases and motivation "reasons".

I think this is one of the reason of @imjasonh's comment. If we look solely at hermekton what benefit has this versus updating the hermekton TEP to support per-step hermeticity, and I agree. (ok reading the other comments again is exactly about that 😛).

My expectation is that this is part of TaskSpec (lmk if there's a way to better convey that), so Tasks could define these too, and even take aspects of the pre/post work as parameters. This would means the TaskRuns executed by a Pipeline (or anything else) would fundamentally have this available to them.

You might find TEP-0044 interesting - esp. the task composition alternative which suggests something very similar to what you are proposing but from the context of using a task in a pipeline. […]

I can take a look (there's a lot to read through and grok), but my major concerns in general are that Pipeline is overkill for what we want, and it's likely impossible to fully transpile ~any conformant Pipeline into a Task, so it is a potential minefield for folks using them that way. It'd be good to understand how you hope to help folks navigate what's supported vs. not.

I tend to agree, the initial problem statement of TEP-0044 seems to be very similar to this proposal and pre/post steps could be a possible implementation of TEP-0044. The idea in both being : how to keep Tasks simple and shareable while still making composable without necessarly the need for a Pipeline ?

For example, if we were to allow to inject steps in a TaskRun (before the execution and after), wouldn't this be the wanted behavior without impacting the current Task definition ? I am not entirely sure to see the value of adding those in the TaskSpec for example, compared to having it in the TaskRunSpec. The go-build task would change, but at runtime, when you want to run a quick go build, hermetically, on your project, you would "inject" a git clone before (or a gcs …) and something to report after. Those could be other embedded Step spec, or could refer other Tasks (git-clone for example). This is something that was proposed (a bit differently) here.

I definitely understand the value of being able to use one Task without going with Pipeline complexity for simple use case where you just want to build an image, … This is something that I think we should cover better than we do today. The question is, how ? 😛

vdemeester · 2021-08-30T07:07:28Z

teps/0080-taskrun-prepost-steps.md

+ - `pre-steps`: a section of the TaskRun execution prior to the "real" work.  This
+ would generally consist of "setup" and "download" related work.


This point is basically init containers right ? If we were to allow a user to inject init containers (pre-steps, whatever we call them), that would be feeling that need, right ?

Yes, although Tekton has the capacity to do this pre-work with sidecars, which could be nice.

vdemeester · 2021-08-30T07:08:10Z

teps/0080-taskrun-prepost-steps.md

+ - `post-steps`: a section of the TaskRun execution posterior to the "real" work.
+ This would generally consist of "tear-down" and "upload" related work.


This point is handled by TEP-0040, aka be able to run steps no matter what was the result before, doesn't it ?

I think you are assuming (because I haven't stated as much!) that this runs like a finally block (e.g. regardless of success/failure). I could go either way on this, but I see (at least) three possibilities:

post-steps have finally semantics, which would enable TEP-0040 like cleanup logic.

post-steps have normal semantics, which would complement TEP-0040 in that folks could use post-steps like a finally block, if they so choose.

post-steps augments TEP-0040, so that folks can write onError: post-steps (or something) to indicate that when a step fails, that execution should flow to the post-steps (perhaps without executing the intervening steps).

IMO 2. and 3. leave this as non-conflicting/complementary to TEP-0040, where 1. this might subsume/replace TEP-0040.

Honestly, when I first wrote this 2. is what I had in mind, but I think you read it as 1., so it would be good to flesh this out and hopefully that shows why I wasn't thinking of this as conflicting with TEP-0040 😅

vdemeester · 2021-08-30T07:09:44Z

teps/0080-taskrun-prepost-steps.md

+One of the primary motivations for this work is to enable users to help express
+their intent when utilizing "[hermekton](./0025-hermekton.md)" builds.  Hermekton
+lets users disable network access during step execution, but as a `bool` it doesn't
+provide any real leeway to have "pre" steps fetch sources (e.g. `git clone`) or
+dependencies (e.g. `go mod download`) over the network, or "post" steps publish
+results (e.g. `gsutil cp`) outside of that network jail.


Reading the comment should we have more points here than just kermekton ? Especially as "as a bool it doesn't provide any real leeway to …" could be seem as something to work on as part of hermekton and then as a consequence here.

It's certainly most complementary to Hermekton at the moment, but as I mention above (re: TEP-0040) there are ways that this could be quite complementary as a way of expressing finally-like semantics (without that necessarily being part of what we do here).

The re-usable unit in Tekton is the Task - in that sense I think it would be nice to keep hermekton features associated to tasks. Cloning a repo and uploading results are things that could be done in tasks specialised for that, and they would not run in hermetic mode.

vdemeester · 2021-08-30T07:10:06Z

teps/0080-taskrun-prepost-steps.md

+Enable TaskRuns to hermetically execute the bulk of their work, while enabling the
+execution of "pre" steps that can fetch inputs and "post" steps that can publish
+outputs.


Same as above

dlorenc · 2021-08-30T12:18:06Z

I'm having a hard time following the comparisons between this and TEP-0040. It's possible they overlap because TEP-0040 is pretty broad, but the actual design still seems to be TBD.

vdemeester · 2021-08-30T12:26:28Z

I'm having a hard time following the comparisons between this and TEP-0040. It's possible they overlap because TEP-0040 is pretty broad, but the actual design still seems to be TBD.

It definitely is broader, and TEP-0044 design is also TBD, but they share similarities in what they are trying to achieve, aka be able to "inject" some containers before and after a Task without the need to rely on Pipeline. This is why we are comparing to it, and a decision on this TEP would affect TEP-0044 heavily (no matter what the decision is), thus trying to take it into account while reviewing this TEP.

bobcatfish · 2021-09-03T19:48:57Z

I'm having a hard time following the comparisons between this and TEP-0040.

To me the overlap is in the problem that each TEP is trying to address - this seems to me like it could be a possible solution to the problems described in TEP-0040. TEP-0040 also tries to describe what some of these solutions could be and compares them to each other, so I want to make sure we look at this potential solution in light of the others we've discussed.

(I think im just restating what @vdemeester already said 🙏 )

This change offers an alternative "fence"-based approach, which is intended to be less invasive than: tektoncd#502

mattmoor · 2021-09-05T20:55:00Z

FWIW, I posted #511 as a potential alternative to this, in the context of Hermekton.

This change offers an alternative "fence"-based approach, which is intended to be less invasive than: tektoncd#502

tekton-robot · 2021-09-08T12:13:26Z

@mattmoor: PR needs rebase.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

tekton-robot · 2022-01-05T22:22:28Z

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale with a justification.
Stale issues rot after an additional 30d of inactivity and eventually close.
If this issue is safe to close now please do so with /close with a justification.
If this issue should be exempted, mark the issue as frozen with /lifecycle frozen with a justification.

/lifecycle stale

Send feedback to tektoncd/plumbing.

tekton-robot requested review from kimsterv and pritidesai August 19, 2021 16:10

tekton-robot added the size/L Denotes a PR that changes 100-499 lines, ignoring generated files. label Aug 19, 2021

mattmoor force-pushed the tep-80 branch from e1d8b78 to fd8bce0 Compare August 19, 2021 16:27

vdemeester added the kind/tep Categorizes issue or PR as related to a TEP (or needs a TEP). label Aug 23, 2021

tekton-robot assigned bobcatfish, imjasonh, jerop and vdemeester Aug 23, 2021

tekton-robot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Aug 24, 2021

mattmoor added 2 commits August 26, 2021 09:26

Proposal: TaskRun Pre/Post steps

fbeeaf9

Wrap long lines

ad1a8af

mattmoor force-pushed the tep-80 branch from fd8bce0 to ad1a8af Compare August 26, 2021 16:26

tekton-robot removed the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Aug 26, 2021

vdemeester reviewed Aug 30, 2021

View reviewed changes

mattmoor added a commit to mattmoor/community-2 that referenced this pull request Sep 5, 2021

Update the Hermekton API to be fence-based

4529c5d

This change offers an alternative "fence"-based approach, which is intended to be less invasive than: tektoncd#502

mattmoor mentioned this pull request Sep 5, 2021

Update the Hermekton API to be fence-based #511

Closed

mattmoor added a commit to mattmoor/community-2 that referenced this pull request Sep 5, 2021

Update the Hermekton API to be fence-based

8865c90

This change offers an alternative "fence"-based approach, which is intended to be less invasive than: tektoncd#502

tekton-robot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Sep 8, 2021

jerop removed their assignment Oct 7, 2021

vdemeester mentioned this pull request Nov 5, 2021

Require TEP-0044 before removing PipelineResources 🙋‍♀️ #554

Merged

vdemeester mentioned this pull request Nov 18, 2021

[TEP-0044] Add more details to alternatives 🔍 #559

Closed

tekton-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jan 5, 2022

mattmoor closed this Jan 24, 2022

bobcatfish changed the title ~~Proposal: TaskRun Pre/Post steps~~ [TEP-0080] Proposal: TaskRun Pre/Post steps Feb 25, 2022

jerop mentioned this pull request Jan 18, 2023

TEP-0126: Allow Task sidecars to be specified in PipelineRun #877

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[TEP-0080] Proposal: TaskRun Pre/Post steps #502

[TEP-0080] Proposal: TaskRun Pre/Post steps #502

mattmoor commented Aug 19, 2021

imjasonh commented Aug 19, 2021

mattmoor commented Aug 19, 2021

imjasonh commented Aug 19, 2021

mattmoor commented Aug 19, 2021

bobcatfish commented Aug 20, 2021

mattmoor commented Aug 20, 2021

jerop commented Aug 23, 2021

tekton-robot commented Aug 26, 2021

bobcatfish commented Aug 27, 2021

mattmoor commented Aug 30, 2021

vdemeester left a comment

vdemeester Aug 30, 2021

mattmoor Aug 30, 2021

vdemeester Aug 30, 2021

mattmoor Aug 30, 2021

mattmoor Aug 30, 2021

vdemeester Aug 30, 2021

mattmoor Aug 30, 2021

afrittoli Sep 20, 2021

vdemeester Aug 30, 2021

dlorenc commented Aug 30, 2021

vdemeester commented Aug 30, 2021 •

edited

Loading

bobcatfish commented Sep 3, 2021 •

edited

Loading

mattmoor commented Sep 5, 2021

tekton-robot commented Sep 8, 2021

tekton-robot commented Jan 5, 2022

		- `pre-steps`: a section of the TaskRun execution prior to the "real" work. This
		would generally consist of "setup" and "download" related work.

		- `post-steps`: a section of the TaskRun execution posterior to the "real" work.
		This would generally consist of "tear-down" and "upload" related work.

[TEP-0080] Proposal: TaskRun Pre/Post steps #502

[TEP-0080] Proposal: TaskRun Pre/Post steps #502

Conversation

mattmoor commented Aug 19, 2021

imjasonh commented Aug 19, 2021

mattmoor commented Aug 19, 2021

imjasonh commented Aug 19, 2021

mattmoor commented Aug 19, 2021

bobcatfish commented Aug 20, 2021

mattmoor commented Aug 20, 2021

jerop commented Aug 23, 2021

tekton-robot commented Aug 26, 2021

bobcatfish commented Aug 27, 2021

mattmoor commented Aug 30, 2021

vdemeester left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

dlorenc commented Aug 30, 2021

vdemeester commented Aug 30, 2021 • edited Loading

bobcatfish commented Sep 3, 2021 • edited Loading

mattmoor commented Sep 5, 2021

tekton-robot commented Sep 8, 2021

tekton-robot commented Jan 5, 2022

vdemeester commented Aug 30, 2021 •

edited

Loading

bobcatfish commented Sep 3, 2021 •

edited

Loading