Datatype restructure for optimisation objects #224

BradyPlanden · 2024-03-01T12:47:39Z

I'm raising this commit to a PR as I believe there are enough changes to the backend to warrant a review at this stage. This started as an update to the plotting class to include alternative x-axis variables, but moved towards refactoring the pybop backend to make this possible.

This PR restructures PyBOP's datatypes for model output (y & dy). As such, knock-on changes to the problem and cost classes were required. The main benefits of these changes are to move away from numpy array objects where possible and replace them with dictionaries. This PR achieves the following:

Order independent access from the dictionary. This benefit can be seen in the updates to the plotting classes.
Clean multi-signal integration, as iteration over the problem.signal becomes the only requirement.
A reduction in the datatype castings we needed within model.simulate and the corresponding cost function classes.
Adds a problem.additional_variables object to capture variables needed within the design problem class, as well as plotting.
Updates tests for the new datatypes
Fixes the integration tests as the previous assertions where compared to x0 and not the groundtruth. They have been updated to sample from a random normal distribution to set the groundtruth, with x0 also randomly sampled too. This now represents the true optimiser performance.
Updates SciPyMinimize default to be Nelder-Mead
Updates initial point in cost landscape to be green circle
Adds gradient cost landscape plots with optional arg

@NicolaCourtier @martinjrobins, there are quite a few changes in this one, and I would appreciate it if you both could take a good look at any potential bugs or improvements. I've spent a lot of time double-checking the cost function calculations, but it's very possible that I've missed something.

At the moment, the UKF tests are failing, I need to take a second look at them. @martinjrobins, if you can take a look as well that would be appreciated.

…y. Update tests, add base model classes to init, cleaner multi-signal interaction

martinjrobins

Hi @BradyPlanden, I've had a quick look through and all seems good, happy with the change to dictionaries naming the y and dy outputs. I wasn't sure what default_variables was, but I don't really know how the design stuff works so that is probably why

pybop/_problem.py

BradyPlanden · 2024-03-01T14:56:49Z

tests/unit/test_standalone.py


-        np.testing.assert_allclose(x, 3.138, atol=1e-2)
+        np.testing.assert_allclose(rmse_x, 3.05615, atol=1e-2)


This change was a bit worrisome, since the value should remain constant. I did a comparison with Pints' logic and it returned the updated value as well, pointing towards a bug in the previous RMSE calculation.

…er tests, dict output observer.evaluate()

codecov · 2024-03-02T19:19:40Z

Codecov Report

Attention: Patch coverage is 93.60465% with 11 lines in your changes are missing coverage. Please review.

Project coverage is 94.08%. Comparing base (ed2bf7c) to head (afd4990).

Files	Patch %	Lines
pybop/costs/fitting_costs.py	93.47%	3 Missing ⚠️
pybop/observers/observer.py	90.00%	3 Missing ⚠️
pybop/models/lithium_ion/echem_base.py	50.00%	2 Missing ⚠️
pybop/costs/design_costs.py	85.71%	1 Missing ⚠️
pybop/plotting/plot_cost2d.py	96.00%	1 Missing ⚠️
pybop/plotting/plot_problem.py	87.50%	1 Missing ⚠️

Additional details and impacted files

@@                      Coverage Diff                       @@
##           177b-plotting-capabilities     #224      +/-   ##
==============================================================
+ Coverage                       93.92%   94.08%   +0.15%     
==============================================================
  Files                              35       35              
  Lines                            1762     1826      +64     
==============================================================
+ Hits                             1655     1718      +63     
- Misses                            107      108       +1

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

…version due to breaking change in 8.1.0

NicolaCourtier

I think this PR is heading in a very useful direction! Thanks @BradyPlanden. Here are some initial comments.

On the naming of additional_variables - can you make it clear what these variables are in addition to, in the description? Are they optional extras for plotting? Or perhaps required_variables, i.e. the variables that are required for the specific optimisation problem?
Do you think that Discharge capacity [A.h] will be required for all types of design problem? I would expect different requirements for different design costs. Could further variables be added as part of the Cost?
I'm also not sure why the passing of additional_variables is limited to the Base and EChem models, why not the PyBaMM ECM models for example? If this logic is required, please add a warning for times when the input is not being used.
In the cost functions, it is not clear to me why these two variables have been selected:

if key not in ["Time [s]", "Discharge capacity [A.h]"]:

and

if signal not in ["Time [s]", "Discharge capacity [A.h]"]

The original check was designed to cope with simulations that terminate early (which we expect in several situations such as reaching a voltage limit during the simulation). However, as written I think it is too general and could hide other, unexpected errors.

Please can you increase the coverage and check why one of the tests appears to be failing?

pybop/costs/fitting_costs.py

BradyPlanden · 2024-03-07T15:39:27Z

On the naming of additional_variables - can you make it clear what these variables are in addition to, in the description? Are they optional extras for plotting? Or perhaps required_variables, i.e. the variables that are required for the specific optimisation problem?

These variables are in addition to the variables selected in the signal object. At the moment we lack this functionality, so this provides the user with the ability to capture additional variables from the forward model prediction. In the case of plotting, the better way to do this would be to save the pybamm solution object and reuse it in plotting. That's probably worth a separate PR in the near future.

Do you think that Discharge capacity [A.h] will be required for all types of design problem? I would expect different requirements for different design costs. Could further variables be added as part of the Cost?

At the moment this is an uncharted area, this PR provides the mechanisms to explore this further in future PRs. The decision on the additional_variables is very much something we can do as it becomes clearer. In the case of Discharge Capacity [A.h], it is within the design cost base as both of the current child classes require it.

I'm also not sure why the passing of additional_variables is limited to the Base and EChem models, why not the PyBaMM ECM models for example? If this logic is required, please add a warning for times when the input is not being used.

Probably because our tests don't pick this up. Another good place to improve. Although it's not specifically passed to EChem models, it's only within BaseModel at the moment. This was missed in the shuffle.

In the cost functions, it is not clear to me why these two variables have been selected:

Added a few comments to make this more clear. Probably needs to be added to the docstring as well.

Please can you increase the coverage and check why one of the tests appears to be failing?

I need to do a double check on the failing windows examples, but I'm not confident it's isolated to this PR. I'm pretty sure we've been seeing it on the scheduled tests.

… Discharge capacity as default additional_variable

…icode, add infeasible unit test

…ions check

BradyPlanden · 2024-03-13T17:47:50Z

I believe the above issues have been addressed, I'm going to merge this into 177b and we can continue the discussion there.

Revamp model, problem, and cost object from numpy arrays to dictionar…

4cf9108

…y. Update tests, add base model classes to init, cleaner multi-signal interaction

BradyPlanden requested review from martinjrobins and NicolaCourtier March 1, 2024 12:47

Fix ukf examples, temporarily limits ukf to signal output model

3428c97

martinjrobins approved these changes Mar 1, 2024

View reviewed changes

pybop/_problem.py Show resolved Hide resolved

BradyPlanden commented Mar 1, 2024

View reviewed changes

default_variables to additional_variables w/ docstrings, updt. observ…

43521da

…er tests, dict output observer.evaluate()

BradyPlanden changed the title ~~Datatype restructure to optimisation objects~~ Datatype restructure for optimisation objects Mar 2, 2024

BradyPlanden added 2 commits March 4, 2024 13:50

Fix integration test logic, add gradient landscape plots, pin pytest …

67d2887

…version due to breaking change in 8.1.0

Add tests for gradient plots, up coverage

b6a073b

BradyPlanden linked an issue Mar 4, 2024 that may be closed by this pull request

Datatype restructure for optimisation objects #227

Closed

Set default SciPyMinimize method to Nelder-Mead, clean-up repo

ee4cdff

BradyPlanden linked an issue Mar 6, 2024 that may be closed by this pull request

Add Nelder-Mead optimiser #195

Closed

NicolaCourtier reviewed Mar 7, 2024

View reviewed changes

NicolaCourtier self-requested a review March 7, 2024 13:54

BradyPlanden commented Mar 7, 2024

View reviewed changes

pybop/costs/fitting_costs.py Outdated Show resolved Hide resolved

BradyPlanden commented Mar 7, 2024

View reviewed changes

pybop/costs/fitting_costs.py Outdated Show resolved Hide resolved

BradyPlanden added 2 commits March 8, 2024 09:55

unicode fix for win notebooks, update prediction shape checks, remove…

66efaba

… Discharge capacity as default additional_variable

Updt. cost2d/optim2d x0 shape/colour, revert conftest win platform un…

9b03734

…icode, add infeasible unit test

BradyPlanden mentioned this pull request Mar 13, 2024

[Bug]: Limit max iterations not working on SciPy optimisers #237

Closed

Updt SciPy & BaseOptimiser for maximum iterations limit - fixes #237

e7aef79

BradyPlanden linked an issue Mar 13, 2024 that may be closed by this pull request

[Bug]: Limit max iterations not working on SciPy optimisers #237

Closed

add infeasible cost tests, remove redundant scipyminimise maxiter opt…

afd4990

…ions check

BradyPlanden merged commit a9ea84c into 177b-plotting-capabilities Mar 13, 2024
29 of 31 checks passed

BradyPlanden deleted the 177c-plotting-capabilities branch March 13, 2024 17:48

BradyPlanden mentioned this pull request Mar 15, 2024

Additions to #177 #198

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Datatype restructure for optimisation objects #224

Datatype restructure for optimisation objects #224

BradyPlanden commented Mar 1, 2024 •

edited

Loading

martinjrobins left a comment

BradyPlanden Mar 1, 2024

codecov bot commented Mar 2, 2024 •

edited

Loading

NicolaCourtier left a comment

BradyPlanden commented Mar 7, 2024

BradyPlanden commented Mar 13, 2024


		np.testing.assert_allclose(x, 3.138, atol=1e-2)
		np.testing.assert_allclose(rmse_x, 3.05615, atol=1e-2)

Datatype restructure for optimisation objects #224

Datatype restructure for optimisation objects #224

Conversation

BradyPlanden commented Mar 1, 2024 • edited Loading

martinjrobins left a comment

Choose a reason for hiding this comment

BradyPlanden Mar 1, 2024

Choose a reason for hiding this comment

codecov bot commented Mar 2, 2024 • edited Loading

Codecov Report

NicolaCourtier left a comment

Choose a reason for hiding this comment

BradyPlanden commented Mar 7, 2024

BradyPlanden commented Mar 13, 2024

BradyPlanden commented Mar 1, 2024 •

edited

Loading

codecov bot commented Mar 2, 2024 •

edited

Loading