Refactoring of `DoEStrategy` data model class #448

jduerholt · 2024-10-11T15:49:12Z

This PR provides a draft for the restructured DoEStrategy.

The following things changed:

The objective is now criterion to prevent confusion with the objectives of the output features.
The former objectives like DOptimality are not anymore enums, but classes, for example DOptimalityCriterion. This allows for the clean implementation of criterion specific attributes.
The formula is now anymore a direct attribute of the DoEStrategy but instead an attribute of the criterion.

When the functional part of the DoE module is refactored, we should have a mapper which takes the criterion and domain and returns a functional criterion class which provides then the actual objective funtion for the actual ipopt optimizer.

On the long run, we can then also provide a common base class on the data model level for both the DoEStrategy and the SpaceFillingStrategy. In case of the latter one the only allowed criterion will then be the SpaceFillingCriterion which has no formula attribute.

@Osburg: What do you think?

dlinzner-bcs · 2024-12-04T15:05:07Z

@jduerholt @Osburg i made some changes. We still want to support the old interface with find_local_max_ipopt. For this I allowed mapping a criterion to the old enum. Is this a valid compromise? Update: We no longer support the old interface.

dlinzner-bcs · 2024-12-12T12:26:40Z

@jduerholt @Osburg This PR is now ready for merging. Do you approve?

jduerholt · 2024-12-12T12:55:27Z

We first review :-) Thank you very much. I try to have a look at it on the weekend. @Osburg would be good if you also have a look.

jduerholt · 2024-12-12T13:52:49Z

There are failing tests due to a new sklearn version, I already went into the rabbit hole. Will post an issue later.

Osburg

@dlinzner-bcs Thanks!!! :)) <3 looks good to me, left some minor comments. Feel free to ignore them if they are not helpful.
Cheers, Aaron :)

Osburg · 2024-12-13T10:23:25Z

bofire/strategies/doe/objective.py

@@ -83,6 +74,71 @@ def evaluate_jacobian(self, x: np.ndarray) -> np.ndarray:
    def _evaluate_jacobian(self, x: np.ndarray) -> np.ndarray:
        pass

+    @abstractmethod


Are these functions needed in the Objective class? Since the SpaceFilling objective actually does not have to implement any of these functions, right? (The only function that is used for any objective is get_model_matrix() in the exit message find_local_max_ipopt() i think, but imo it is not necessary there as well...). What is your opinion?

Osburg · 2024-12-13T10:25:17Z

bofire/strategies/doe/objective.py

@@ -493,19 +552,66 @@ def _convert_input_to_tensor(
        )
        return torch.tensor(X.values, requires_grad=requires_grad, **tkwargs)

+    def get_model_matrix(self, design: pd.DataFrame) -> pd.DataFrame:


This could then also be removed

Osburg · 2024-12-13T10:29:27Z

bofire/strategies/doe/design.py

@@ -572,7 +199,7 @@ def find_local_max_ipopt(
    if _ipopt_options[b"print_level"] > 12:  # type: ignore


In my opinion this could be removed. The objective criterion can be viewed via the IPOPT output and any other criterion can also be calculated with little effort by creating another Objective. What do you think? :) --> then there is also no need anymore to implement get_model_matrix() for SpaceFilling.
Also the metrics(), g_optimality(), a_optimality(), d_optimality() functions in doe utils could be deleted then since they are not used anywhere else.

Many thanks Aaron for the review :)

jduerholt · 2024-12-16T16:32:12Z

Thanks @Osburg, I will finally have look tmr. Sorry @dlinzner-bcs !

R-M-Lee · 2024-12-17T12:43:30Z

why is the nbstripout precommit hook not being executed properly? The notebook output should not be there. Right now there are some images and outputs referencing local paths on someone's machine

edit: ah, never mind... we use nbstripout with the argument to keep outputs. But the warnings should still be removed I think

jduerholt

Looks overall very good, I let some questions and I am open to also implement some of the mentioned points ;)

jduerholt · 2024-12-17T13:52:57Z

setup.py

Why are you adding this? We removed it during the hackathon in favor of having everything in the pyproject.toml.

jduerholt · 2024-12-17T13:54:15Z

bofire/data_models/strategies/doe.py

-
-    objective: OptimalityCriterionEnum = OptimalityCriterionEnum.D_OPTIMALITY
-
-    transform_range: Optional[Bounds] = None


Where is the transform_range gone?

jduerholt · 2024-12-17T13:56:51Z

bofire/strategies/doe/design.py

-    n_experiments: Optional[int] = None,
-    delta: float = 1e-7,
+    n_experiments: int,
+    criterion: Optional[AnyOptimalityCriterion] = None,


Why is the criterion here optional? We always need one, or?

jduerholt · 2024-12-17T14:01:59Z

bofire/strategies/space_filling.py

Can we get rid of this functional strategy here, and just map the data model of the SpaceFillingStrategy to the DoeStrategy in the mapper?

I would have liked to use the SpaceFillingStrategy as it is to generate the design space grid for the I-optimality criterion. To avoid circular imports it would be nice if the space filling strategy would be available indepent of the DoeStrategy (@dlinzner-bcs is it okay if I add a commit for the I-optimality - now compatible with your changes - criterion to this branch?).
~~Only if there are no good reasons against keeping it this way ofc @jduerholt .~~

edit: nvm

jduerholt · 2024-12-17T14:04:28Z

tests/bofire/data_models/specs/strategies.py

We should also test the optimality criteria data models for serialization and desirilization, should I implement this into this PR? One has to register a few new fixures for this.

jduerholt and others added 7 commits October 11, 2024 17:38

add draft of restrucuted doe class

98668ae

refactoring doe

d83e241

Merge branch 'main' into refactor/doe_data_model

8441817

add formulaic to be installed always

fca9559

add formulaic to be installed always

a825e68

add formulaic to be installed always

ba22c38

add formulaic to be installed always

f760838

dlinzner-bcs marked this pull request as ready for review December 4, 2024 15:02

linznedd added 21 commits December 4, 2024 16:21

check style

035d8db

check style

ea9011d

check style

19b4bae

remove enumns

73d4461

remove enumns

bf21702

remove enumns

08d8565

fix branch and bound

94453f6

move delta into criterion

6c7ebce

move delta into criterion

9487eda

move delta into criterion

652d7a0

move delta into criterion

152e96d

move default criterion

35e9bcb

move default criterion

545acc8

move default criterion

45428b2

move default criterion

e9ab45f

refactor formulas and number of experiments

028a2c0

pyright

2d4a850

fix test

8706a59

Merge remote-tracking branch 'origin/main' into refactor/doe_data_model

b0863e1

fix test

ca5fc52

fix test

a69654f

linznedd added 5 commits December 12, 2024 09:51

fix test

7aa8dfa

fix tutorial

6136202

fix tutorial

388e79e

fix tutorial

5374b4b

fix test

b61e512

dlinzner-bcs self-requested a review December 12, 2024 12:14

fix test

e1028d1

fix getting started

980869e

dlinzner-bcs requested a review from Osburg December 12, 2024 13:00

Osburg reviewed Dec 13, 2024

View reviewed changes

aarons review

80e30c8

rmv unneded tests

ee2ffbd

dlinzner-bcs mentioned this pull request Dec 17, 2024

update DoE tutorials #478

Closed

formulaic version fixed bc of breaking changes

e499762

rosonaeldred mentioned this pull request Dec 17, 2024

more general DoE tutorial #477

Closed

R-M-Lee added 3 commits December 17, 2024 14:08

add explanatory text to doe basic examples

ce7ba15

typo in basic_examples.ipynb

78bf5c9

format basic doe example

54e799c

jduerholt commented Dec 17, 2024

View reviewed changes

Osburg mentioned this pull request Dec 18, 2024

I optimality criterion and nonlinear constraints defined by python function #485

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactoring of `DoEStrategy` data model class #448

Refactoring of `DoEStrategy` data model class #448

jduerholt commented Oct 11, 2024

dlinzner-bcs commented Dec 4, 2024 •

edited

Loading

dlinzner-bcs commented Dec 12, 2024

jduerholt commented Dec 12, 2024

jduerholt commented Dec 12, 2024

Osburg left a comment

Osburg Dec 13, 2024

Osburg Dec 13, 2024

Osburg Dec 13, 2024

dlinzner-bcs Dec 16, 2024

jduerholt commented Dec 16, 2024

R-M-Lee commented Dec 17, 2024 •

edited

Loading

jduerholt left a comment

jduerholt Dec 17, 2024

jduerholt Dec 17, 2024

jduerholt Dec 17, 2024

jduerholt Dec 17, 2024

Osburg Dec 17, 2024 •

edited

Loading

jduerholt Dec 17, 2024

		@@ -572,7 +199,7 @@ def find_local_max_ipopt(
		if _ipopt_options[b"print_level"] > 12: # type: ignore


		objective: OptimalityCriterionEnum = OptimalityCriterionEnum.D_OPTIMALITY

		transform_range: Optional[Bounds] = None

Refactoring of DoEStrategy data model class #448

Are you sure you want to change the base?

Refactoring of DoEStrategy data model class #448

Conversation

jduerholt commented Oct 11, 2024

dlinzner-bcs commented Dec 4, 2024 • edited Loading

dlinzner-bcs commented Dec 12, 2024

jduerholt commented Dec 12, 2024

jduerholt commented Dec 12, 2024

Osburg left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jduerholt commented Dec 16, 2024

R-M-Lee commented Dec 17, 2024 • edited Loading

jduerholt left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Osburg Dec 17, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Refactoring of `DoEStrategy` data model class #448

Refactoring of `DoEStrategy` data model class #448

dlinzner-bcs commented Dec 4, 2024 •

edited

Loading

R-M-Lee commented Dec 17, 2024 •

edited

Loading

Osburg Dec 17, 2024 •

edited

Loading