
Add realistic example #38

Merged
merged 24 commits into main from add-image-example on Oct 2, 2023

Conversation

ElliottKasoar
Contributor

@ElliottKasoar ElliottKasoar commented Sep 20, 2023

Fixes #32

Replaces the example `ones` tensor with a more realistic example, based on a PyTorch ResNet example.

To do:

  • Update resnet_infer_fortran.f90 to read in the saved text file for the image
  • Update resnet_infer_fortran.f90 to output the highest probabilities with labels (possibly via saving the loaded labels in Python, in a similar manner to the image array?)
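A minimal sketch of the second to-do item's idea, assuming the labels are saved from Python to a plain text file (one label per line) that Fortran can read back; the filename and the example labels here are hypothetical:

```python
# Hypothetical sketch: save the loaded ImageNet labels to a plain text file,
# one per line, so the Fortran inference code can read them back.
categories = ["tench", "goldfish", "great white shark"]  # stand-in labels

with open("categories.txt", "w", encoding="utf-8") as f:
    for label in categories:
        f.write(label + "\n")

# Quick round-trip check: read the labels back.
with open("categories.txt", encoding="utf-8") as f:
    restored = [line.rstrip("\n") for line in f]
```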

@TomMelt
Member

TomMelt commented Sep 22, 2023

Thanks for starting this. It looks great so far 👍

Member

@jatkinson1000 jatkinson1000 left a comment


Thanks for working on this @ElliottKasoar
A couple of comments:

  • Whilst it is arguably 'cleaner' in terms of storage to fetch the image from url, there is no guarantee it will always be available. Since it is only a single image I would consider just adding it to the repository to make the example self-contained. This will also arguably make the python a bit cleaner.
  • Saving the data to file is a good idea, but could we do it more nicely than .flatten().to_file()? And consider adding *.dat to .gitignore. Also a comment in the docs about writing/reading between row/column major systems for clarity could be really helpful for users' understanding.

Review comment on .gitignore (resolved)
@ElliottKasoar
Contributor Author

ElliottKasoar commented Sep 25, 2023

* Saving the data to file is a good idea, but could we do it more nicely than `.flatten().to_file()`? And consider adding `*.dat` to `.gitignore`. Also a comment in the docs about writing/reading between row/column major systems for clarity could be really  helpful for users' understanding.

@jatkinson1000 What sort of thing would you consider to be nicer? My feeling was that flattening it would make it the least ambiguous, or do you mean the way it is flattened/saved? (I'll add a comment about writing/reading anyway)
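The flatten-and-save approach under discussion can be sketched roughly as below, using a small random stand-in for the preprocessed image tensor; the filename, shape, and use of `np.savetxt` are assumptions for illustration, not the PR's actual code:

```python
import numpy as np

# Stand-in for the preprocessed image tensor (batch, channels, height, width).
rng = np.random.default_rng(0)
image = rng.random((1, 3, 8, 8)).astype(np.float32)

# Flatten to 1D and write as plain text, one value per line.
np.savetxt("image_tensor.dat", image.flatten())

# Reading back and reshaping recovers the original array.
restored = np.loadtxt("image_tensor.dat", dtype=np.float32).reshape(image.shape)
```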

@ElliottKasoar
Contributor Author

* Whilst it is arguably 'cleaner' in terms of storage to fetch the image from url, there is no guarantee it will always be available. Since it is only a single image I would consider just adding it to the repository to make the example self-contained. This will also arguably make the python a bit cleaner.

What about something in between (which I'm halfway along, having added the .jpg to the PR): try to download the data, but, if necessary, catch the HTTPError for both the image and the categories, as we can direct users to the pre-downloaded files.

I quite like having some form of the fetch present, as, for example, I wanted to test a different dog, which only requires changing one line of code at the moment, and I think it makes an example better if the barrier to trying out different data is as low as possible.
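The "something in between" described above might look roughly like this sketch: try the download, and fall back to a file already bundled in the repository if the fetch fails. The function name, URL scheme, and paths are hypothetical:

```python
from urllib.error import HTTPError, URLError
from urllib.request import urlretrieve


def get_image(url: str, local_path: str) -> str:
    """Download url to local_path, falling back to an existing local copy."""
    try:
        urlretrieve(url, local_path)
    except (HTTPError, URLError):
        # Network or server failure: direct users to the pre-downloaded file.
        print(f"Could not fetch {url}; using pre-downloaded {local_path}")
    return local_path
```

Trying a different image would then only mean changing the `url` argument, keeping the barrier to experimenting low while the example stays self-contained.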

@jatkinson1000
Member

* Whilst it is arguably 'cleaner' in terms of storage to fetch the image from url, there is no guarantee it will always be available. Since it is only a single image I would consider just adding it to the repository to make the example self-contained. This will also arguably make the python a bit cleaner.

What about something in between (which I'm halfway along, having added the .jpg to the PR): try to download the data, but, if necessary, catch the HTTPError for both the image and the categories, as we can direct users to the pre-downloaded files.

I quite like having some form of the fetch present, as, for example, I wanted to test a different dog, which only requires changing one line of code at the moment, and I think it makes an example better if the barrier to trying out different data is as low as possible.

Yes, testing alternative images did cross my mind, but the purpose of this example is to serve as an (ideally minimal) example of how to save a PyTorch net and couple it to Fortran. To this end I'd maybe favour having the image in the repo to read from and removing the requests code, perhaps with a note in the Readme to describe how a user might go about trying other images.

@jatkinson1000
Member

@jatkinson1000 What sort of thing would you consider to be nicer? My feeling was that flattening it would make it the least ambiguous, or do you mean the way it is flattened/saved? (I'll add a comment about writing/reading anyway)

For the MiMA case we wrote out the indices with the value (e.g. see here) to remove any ambiguity when reading back in and to make it interpretable.
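The index-with-value format mentioned above can be sketched as follows: each output line records the indices next to the value, so the ordering is unambiguous regardless of row- or column-major conventions when reading back in. The filename and array are hypothetical:

```python
import numpy as np

# Small stand-in array; each line of the output file is "i j value".
data = np.arange(6.0).reshape(2, 3)

with open("data_with_indices.txt", "w") as f:
    for (i, j), value in np.ndenumerate(data):
        f.write(f"{i} {j} {value}\n")
```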

@ElliottKasoar
Contributor Author

As discussed with @TomMelt and @jatkinson1000:

  • Precision
    • While, in general, a separate precision module would be preferable, for this example it is probably clearest to keep the definition within the same file, but using c_wp rather than wp, to remove ambiguity with the commonly used Fortran-typed wp.
  • Transposing
    • I added some discussion about when it is required, which hopefully clarifies things significantly
    • As suggested, I used reshape to simplify the operation in Fortran. As users are more likely to be familiar with Python reshaping/transposing than Fortran reshaping/transposing, I moved the transpose into the Python script, although I also added a note about how the alternative would be carried out
    • This complexity would also arguably be solved through the suggestion of storing indices alongside values, but I don't believe we concluded that there was a strong preference, so hopefully this is sufficiently clear as is
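The row-/column-major point in the bullets above can be illustrated with a small sketch (an illustration, not the PR's code): NumPy is row-major (C order) by default while Fortran is column-major, so data flattened naively in Python reads back transposed in Fortran unless one side reorders it. Flattening in Fortran order on the Python side lets the Fortran code `reshape` the flat data directly:

```python
import numpy as np

a = np.arange(6.0).reshape(2, 3)

flat_c = a.flatten()            # row-major order: 0 1 2 3 4 5
flat_f = a.flatten(order="F")   # column-major order: 0 3 1 4 2 5

# A column-major reader (like Fortran's reshape) recovers the original
# array from the Fortran-ordered data with no transpose needed.
restored = flat_f.reshape(a.shape, order="F")
```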

@ElliottKasoar
Contributor Author

At some point @TomMelt mentioned that it would be nice to have an assert to check that the results are actually consistent, e.g. `call assert_real_2d(big_array, big_result/2., test_name=msg)` (currently unused) in the resnet benchmarking branch.

Given that the probabilities are more complex than that example, and seem to only match to ~5sf, is it worth just adding an assert for the label? Or maybe we should check the first 4-5 significant figures of the probability?

@jatkinson1000
Member

Given that the probabilities are more complex than that example, and seem to only match to ~5sf, is it worth just adding an assert for the label? Or maybe we should check the first 4-5 significant figures of the probability?

Yes, @ElliottKasoar, something like (checks relative error):

    ! Check error
    if (maxval(abs((gwfcng_x - gwfcng_x_ref)/gwfcng_x_ref)) >= 1.0e-6) then
      write(*,*) "AAARRGGHHHHH"
      stop
    endif

Member

@jatkinson1000 jatkinson1000 left a comment


This is looking pretty good, nice work reading the categories etc. into Fortran to make it a complete example!

Couple of comments as below.

Also, the Python code could be more black, pylint, and mypy compliant.

Review comments (resolved):
  • .gitignore
  • examples/1_ResNet18/README.md (outdated)
  • examples/1_ResNet18/resnet18.py (outdated)
@jatkinson1000 jatkinson1000 mentioned this pull request Sep 28, 2023
@ElliottKasoar
Contributor Author

Also, the python code could be more black, pylint, and mypy compliant.

Was there anything obvious that you haven't mentioned? I'd already run my main changes to resnet18.py through flake8 and black, and I think there were only a couple of minor things in resnet_infer_python.py, which I hadn't looked at enough yet (as part of the last set of changes, it's now updated to use the new image, as it was still using the ones tensor).

I've made a few more minor changes based on a few things from pylint/mypy too in the last commit, so hopefully it's ok now?

@TomMelt
Member

TomMelt commented Sep 28, 2023

At some point @TomMelt mentioned that it would be nice to have an assert to check that the results are actually consistent e.g. call assert_real_2d(big_array, big_result/2., test_name=msg) (currently unused) in the resnet benchmarking branch.

Given that the probabilities are more complex than that example, and seem to only match to ~5sf, is it worth just adding an assert for the label? Or maybe we should check the first 4-5 significant figures of the probability?

The assert_real_2d subroutine has an optional argument which allows you to change the relative tolerance. So I think a good idea would be to use assert_real_2d (or something like it) but relax the tolerance to acknowledge that the variable in question has higher inaccuracy.

snippet of assert_real_2d

  subroutine assert_real_2d(a, b, test_name, rtol_opt)

    implicit none

    character(len=*) :: test_name
    real, intent(in), dimension(:,:) :: a, b
    real, optional :: rtol_opt
    real :: relative_error, rtol

Member

@jatkinson1000 jatkinson1000 left a comment


I'm happy with this now, and all running fine under checks for me.
I'll let @TomMelt take a look as well since it's been a large PR.

Member

@TomMelt TomMelt left a comment


Looks great, just need to tidy up some fortran variable attributes

Review comments on examples/2_ResNet18/resnet_infer_fortran.f90 (outdated, resolved)
@ElliottKasoar
Contributor Author

Thanks @TomMelt! I think I've addressed all of your Fortran variable suggestions.

I also wanted to double-check you were happy with the changes made just before/at the same time as your review, alongside the assertion checks. These included adding wp separately to c_wp (the possibility of which was part of the original intention for c_wp), since I'd changed probabilities to be a normal Fortran real.

@ElliottKasoar ElliottKasoar marked this pull request as ready for review September 29, 2023 10:17
@ElliottKasoar ElliottKasoar merged commit b1111c2 into main Oct 2, 2023
@ElliottKasoar ElliottKasoar deleted the add-image-example branch October 2, 2023 10:21
Successfully merging this pull request may close these issues: ResNet Example outputs could be improved