Template for packaging spinalcordtoolbox datasets in pip. #1

kousu · 2021-04-07T15:28:40Z

No description provided.

kousu · 2021-04-07T15:28:49Z

@joshuacwnewton

joshuacwnewton

I'm wondering if we should address all of the existing TODOs/comments before making any new repos with this. 🤔

.github/workflows/build.yml

.github/workflows/release.yml

README.md

setup.py

taowa · 2021-07-30T15:24:14Z

yay!

taowa · 2021-07-30T15:29:54Z

So that last commit is definitely kind of sinful. But I think that we should be doing /something/ to validate changes to the template, lest we change it in ways that break other things.

.github/workflows/build.yml

taowa · 2021-07-30T19:32:51Z

On Jul 30, 2021, at 14:36, Joshua Newton ***@***.***> wrote:

> `+ perl -pi -e 's/\${dataset_name}/sample-dataset/' setup.py`
>
> IIUC, this is a find and replace?

That's right :).

> Just out of curiosity, why `perl -pi -e`? (I tried googling and found this SO answer, but I still don't entirely understand the purpose.)

Largely to avoid differences between the BSD/OSX sed and the GNU sed. It could just as well be `sed -i 's/[…]/'`, and I'm happy to change it to that, but that doesn't play well with the OSX sed and I wanted to (lazily) test it out before pushing.

Taowa
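For readers unfamiliar with `perl -pi -e 's/…/…/'`, here is a rough Python equivalent of the same in-place find and replace. It is only a sketch: the function name `replace_in_file` is hypothetical and not part of this PR, and it mimics the substitution without a `/g` flag by replacing only the first match on each line.

```python
import re
from pathlib import Path


def replace_in_file(path, pattern, replacement):
    """Rewrite `path` in place, replacing the first regex match on each line.

    Roughly mirrors `perl -pi -e 's/pattern/replacement/' path`
    (no /g flag, so at most one substitution per line).
    Returns the number of lines that were changed.
    """
    regex = re.compile(pattern)
    file_path = Path(path)
    changed = 0
    new_lines = []
    for line in file_path.read_text(encoding="utf-8").splitlines(keepends=True):
        # A callable replacement keeps the text literal (no backslash escapes).
        new_line, count = regex.subn(lambda match: replacement, line, count=1)
        changed += count
        new_lines.append(new_line)
    file_path.write_text("".join(new_lines), encoding="utf-8")
    return changed
```

Calling `replace_in_file("setup.py", r"\$\{dataset_name\}", "sample-dataset")` would behave the same on macOS and Linux, which is the portability concern raised above.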

README.md

Answer to TODO: No, steps cannot be parallelized. However, *jobs* can run in parallel. But, to do so would require uploading files as artifacts, then spinning up entirely new VM instances, then downloading the artifacts. And, at that point, the setup of these VMs would probably take longer and use more resources than just running the steps sequentially.

joshuacwnewton · 2022-06-03T16:25:19Z

.github/workflows/release.yml

+    - name: Build
+      run: |
+          python -m build --wheel --sdist
+    - name: Publish to Github
+      uses: softprops/action-gh-release@v1
+      with:
+        files: 'dist/*'
+        fail_on_unmatched_files: true
+        tag_name: 2.0.3  #  ${{ github.event.inputs.version }} # in the workflow_dispatch case, make a new tag from the given input; in the published release case, this will be empty and will fall back to updating that release.


Currently, in our setup.py we've set the option use_scm_version=True, which will use the most recent git tag as the package version number.

There's a subtle bug here, though: For the workflow_dispatch case, we create a tag in the Publish step. But, the publish step happens after the build step, so the chosen tag won't be present for python -m build to pick up.

I think this means we have to try creating a tag prior to the build step?

Specifically, this manifests in this PR as:

ERROR HTTPError: 400 Bad Request from https://test.pypi.org/legacy/ '0.1.dev1+g169fe48.d20220603' is an invalid value for Version. Error: Can't use PEP 440 local versions. See https://packaging.python.org/specifications/core-metadata for more information.

The package is being built with 0.1.dev1, when we want it to be built using the dummy example tag name I chose (2.0.3).
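The part the index rejects is the `+g169fe48.d20220603` suffix. In PEP 440, everything after a `+` is a local version label. A minimal sketch of a pre-upload check, using only the standard library (the helper name `split_local` is hypothetical):

```python
def split_local(version):
    """Split a version string into (public, local) parts.

    In PEP 440 the local version label is whatever follows the '+'.
    `local` is None when the version has no local label.
    """
    public, _, local = version.partition("+")
    return public, (local or None)


def is_uploadable(version):
    """True if the version has no local label, which the error above rejects."""
    return split_local(version)[1] is None
```

With the version from the error message, `split_local("0.1.dev1+g169fe48.d20220603")` gives the public part `0.1.dev1` and the local part `g169fe48.d20220603`, while a plain `2.0.3` has no local part.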

Ahh! Fascinating! use_scm_version=True does not do what I thought it did (CI run):

ERROR HTTPError: 400 Bad Request from https://test.pypi.org/legacy/ '2.0.4.dev0+g5929d76.d20220603' is an invalid value for Version. Error: Can't use PEP 440 local versions. See https://packaging.python.org/specifications/core-metadata for more information.

Setting the tag 2.0.3 prior to release will actually auto-generate a package with version number 2.0.4.dev0!

My best guess is that use_scm_version=True is intended for the workflow described in the README.md of the pypi-publish action:

> For example, you could implement a parallel workflow that pushes every commit to TestPyPI or your own index server, like devpi. For this, you'd need to (1) specify a custom repository_url value and (2) generate a unique version number for each upload so that they'd not create a conflict. The latter is possible if you use setuptools_scm package but you could also invent your own solution based on the distance to the latest tagged commit.

In other words, setuptools_scm is meant for rapidly generating unique version numbers during development (e.g. dev0 -> dev1 -> dev2), rather than generating the version number for stable releases.

This means that we should probably stop using setuptools_scm for building stable releases! (Though, we could optionally continue to use it to automatically build test/dev releases, but that's not really what workflow_dispatch is for.)

Right! So, performing a find and replace on the `version="dev"` string (7394563) gets us our desired result:

https://test.pypi.org/project/spinalcordtoolbox-data-template/2.0.3/

https://github.com/spinalcordtoolbox/data-template/releases/tag/2.0.3

That doesn't make sense to me. I've used it lots of times and it's always behaved. Take a look at https://github.com/ufoym/imbalanced-dataset-sampler/actions/runs/2371746877 to see for yourself.

If you want to do a test deploy you should just tag it 'v1.0.3rc0' or 'v1.0.3rc1' -- that'll get tagged as a test build too -- particularly we should include this line:

https://github.com/ufoym/imbalanced-dataset-sampler/actions/runs/2371746877

I'm on my phone atm. I'll need some time to look deeper into why setuptools-scm isn't working. But I really think we should use it, it really improves the workflow.

We just need to ask people to manually fill in the name.

Maybe we can write some awesome actions script that edits setup.py to sync it with the repo's name on every commit. That would be cool, and in line with using git as the single source of truth, but maybe a bit Extra, y'know? I think doing this step manually is good enough, because how many times are we really going to be making a new dataset (vs making a new release of that dataset)?

Oh of course. I missed the fact that editing setup.py means that the git repo is no longer in a clean state.

Thanks for catching that. 🤦
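One way to catch this before building is to check whether the working tree has uncommitted changes. The sketch below assumes that modifications to tracked files are what count as "not clean" here, so it passes `--untracked-files=no` to `git status`. Both function names are hypothetical.

```python
import subprocess


def porcelain_is_dirty(porcelain_output):
    """True if `git status --porcelain` output lists at least one changed path."""
    return any(line.strip() for line in porcelain_output.splitlines())


def worktree_is_dirty(repo_dir="."):
    """Run git in `repo_dir` and report whether tracked files have changes.

    Assumption: untracked files are ignored (--untracked-files=no).
    """
    result = subprocess.run(
        ["git", "status", "--porcelain", "--untracked-files=no"],
        cwd=repo_dir,
        capture_output=True,
        text=True,
        check=True,
    )
    return porcelain_is_dirty(result.stdout)
```

An in-place edit of `setup.py` during CI would show up as a line such as ` M setup.py`, which is the situation described above.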

> We just need to ask people to manually fill in the name.

We already do! The perl edit step is only relevant for the data-template repo, and I only added it because otherwise the build on data-template (i.e. this PR) would fail due to (I think) an invalid repo URL in the metadata, hehe.

We can just straight-up remove the perl step, and allow the builds to fail on data-template? The problem goes away the moment people actually use the template and follow the README.

From @kousu:

Ah good. I was pretty sure we already did...

I tried out the template using PAM50, and all the steps went well: https://github.com/joshuacwnewton/data-PAM50/releases/tag/2022.6.6

I then tried installing the wheel and it behaved as expected.

So, as soon as I add back setuptools_scm, this (maybe possibly) is almost ready for merging?

Template for packaging spinalcordtoolbox datasets in pip.

ec0f257

joshuacwnewton requested changes Apr 7, 2021

This was referenced May 10, 2021

Implement a check for bin/ that is more thorough than ls -lA bin spinalcordtoolbox/spinalcordtoolbox#3380

Closed

Turn spinalcordtoolbox into a pip-installable PyPI package spinalcordtoolbox/spinalcordtoolbox#1526

Open

joshuacwnewton mentioned this pull request Jul 9, 2021

Remove mirror neuro.polymtl.ca spinalcordtoolbox/spinalcordtoolbox#3443

Closed

kousu and others added 11 commits July 13, 2021 17:16

Update README.md

cbbac73

feedback: * https://github.com/spinalcordtoolbox/spinalcordtoolbox-data-template/pull/1/files#r608804968

Add files via upload

dbcd1e3

Yep

7173954

Update README.md

24ba7f2

Update README.md

68fb7ff

Update README.md

305fb5e

Update README.md

2b0291a

Update setup.py

f4d8f1e

setup.py: Add link to front page for 'documentation'

b828e86

Update setup.py

2521744

Create LICENSE.txt

f4beed7

joshuacwnewton approved these changes Jul 13, 2021

kousu and others added 2 commits July 13, 2021 18:29

Update setup.py

97683bc

add a missing comma to setup.py

cc38cc6

taowa force-pushed the ng/template branch from 1f8f0b1 to cc38cc6 Compare July 19, 2021 02:54

replace the name string during builds for testing

f45045b

joshuacwnewton reviewed Jul 30, 2021

replace google group email with a forwarder @neuropoly.org

8dde7fd

taowa reviewed Sep 1, 2021

kousu mentioned this pull request Oct 11, 2021

Integrating git into training scripts neuropoly/data-management#136

Open

README.md: Fix numbering

4410d0e

joshuacwnewton added 2 commits May 24, 2022 10:45

README.md: Improve readability of the Troubleshooting section

2afa7c3

README.md: Add missing .git

eaea4c2

joshuacwnewton closed this May 24, 2022

joshuacwnewton reopened this May 24, 2022

joshuacwnewton added 5 commits June 3, 2022 11:38

release.yml: Update PYPI_API_TOKEN

fe246fc

The existing values were copied over from ivadomed. Here, in SCT, I've added a temporary new token (until we can sort out NeuroPoly's PyPI accounts).

release.yml: Test publishing on PRs for ng/template

65021c2

release.yml: Remove branches: ng/template from test

dfb1e5c

I think this is meant to specify the target branch (i.e. `trunk`), not the branch used for the PR.

release.yml: Add missing perl -pi -e

0053d78

joshuacwnewton reviewed Jun 3, 2022

joshuacwnewton added 3 commits June 3, 2022 12:31

release.yml: Try tagging release prior to build

73f9b33

Revert "release.yml: Try tagging release prior to build"

9d6f082

This reverts commit 73f9b33.

release.yml: Explicitly set version= instead of using setuptools_scm

7394563

joshuacwnewton force-pushed the ng/template branch from 9439d5c to 7394563 Compare June 3, 2022 18:27

release.yml: Make VERSION_NUMBER more explicit

804b510

* Stop repeating version number * Use || to make fallback case more explicit

joshuacwnewton force-pushed the ng/template branch from a72262e to 804b510 Compare June 3, 2022 18:46

joshuacwnewton added 3 commits June 3, 2022 15:03

release.yml: Drop workflow_dispatch

7ed3fe3

It's more straightforward to just have a single "correct" way of doing things, and the release UI has a lot of other nice usage benefits, too (e.g. checking if you're duplicating a tag name).

release.yml: Drop test build workflow, and use only 1 workflow

3f67509

The `startsWith(github.ref, 'refs/tags/')` pattern is recommended by: * https://github.com/softprops/action-gh-release * https://github.com/pypa/gh-action-pypi-publish
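To illustrate what the `startsWith(github.ref, 'refs/tags/')` condition is testing, here is the same check sketched in Python. This is not code from the workflow, and the helper name `tag_from_ref` is hypothetical. It returns the tag name only when the ref points at a tag.

```python
TAG_PREFIX = "refs/tags/"


def tag_from_ref(ref):
    """Return the tag name if `ref` is a tag ref (e.g. 'refs/tags/2.0.3'), else None."""
    if ref.startswith(TAG_PREFIX):
        return ref[len(TAG_PREFIX):]
    return None
```

A branch ref such as `refs/heads/trunk` yields `None`, which corresponds to the case where the publish steps are skipped.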

README.md: Fix typo (missing `)

2fb6da0

joshuacwnewton reviewed Jun 4, 2022

joshuacwnewton added 8 commits June 3, 2022 20:36

release.yml: Test version tag r20220603

12a97d2

release.yml: Test version tag 2022.06.03

ffeb997

release.yml: Use quotes for version string

07ca856

release.yml: Temporarily stop using ||

eb91ae0

I just want to see how pip will use date-based versions! GitHub Actions, pls stop being so finicky.
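On the date-based version question: under PEP 440 normalization, numeric release segments are treated as integers, so leading zeros are dropped and `2022.06.03` names the same version as `2022.6.3`. A minimal sketch of that rule, limited to plain dotted numeric versions (the function name `normalize_release` is hypothetical; real tooling would use a full PEP 440 parser):

```python
def normalize_release(version):
    """Normalize a dotted numeric version by dropping leading zeros per segment.

    Only handles versions made purely of ASCII digits and dots, such as
    '2022.06.03'. Anything else raises ValueError.
    """
    parts = version.split(".")
    if not all(part.isascii() and part.isdigit() for part in parts):
        raise ValueError(f"not a plain dotted numeric version: {version!r}")
    return ".".join(str(int(part)) for part in parts)
```

This is consistent with the `2022.6.6` release tag used in the PAM50 trial mentioned earlier in the thread.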

README.md: Fix typo (missing `)

a383f68

release.yml: Revert back to using release tag

1322a88

release.yml: Re-enable 'publish' if statements

6aae1ca

README.md: Clarify date-based version numbering

3746bea

spinalcordtoolbox deleted a comment from kousu Jun 6, 2022
