Flaky test test/sample/test_sample.py::test_bridge_sampling #1461

dweindl · 2024-09-16T12:00:14Z

For example, in https://github.com/ICB-DCM/pyPESTO/actions/runs/10882550532/job/30193828221?pr=1459:

=================================== FAILURES ===================================
_____________________________ test_bridge_sampling _____________________________

    @pytest.mark.flaky(reruns=2)
    def test_bridge_sampling():
        tol = 2
        # define problem
        objective = Objective(
            fun=lambda x: -gaussian_llh(x),
            grad=gaussian_nllh_grad,
            hess=gaussian_nllh_hess,
        )
        prior_true = NegLogParameterPriors(
            [
                {
                    "index": 0,
                    "density_fun": lambda x: (1 / (10 + 10)),
                    "density_dx": lambda x: 0,
                    "density_ddx": lambda x: 0,
                },
            ]
        )
        problem = pypesto.Problem(
            objective=AggregatedObjective([objective, prior_true]),
            lb=[-10],
            ub=[10],
            x_names=["x"],
        )
    
        # run optimization and MCMC
        result = optimize.minimize(problem, progress_bar=False, n_starts=10)
        result = sample.sample(
            problem,
            n_samples=1000,
            result=result,
        )
    
        # compute the log evidence using harmonic mean
        bridge_log_evidence = sample.evidence.bridge_sampling_log_evidence(result)
        harmonic_evidence = sample.evidence.harmonic_mean_log_evidence(result)
>       assert np.isclose(bridge_log_evidence, harmonic_evidence, atol=tol)
E       assert False
E        +  where False = <function isclose at 0x7f3c1c042bf0>(0.2670074583034374, -3.788734352667505, atol=2)
E        +    where <function isclose at 0x7f3c1c042bf0> = np.isclose

test/sample/test_sample.py:969: AssertionError
----------------------------- Captured stderr call -----------------------------

Increase number of optimizations or samples or reduce the tolerance of increase number of trials.
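
To put the failed assertion in perspective, the two values from the log above can be plugged into `np.isclose` directly. This is only a minimal sketch: the numbers are copied from the traceback, while the variable names `gap`, `allowed`, and `evidence_ratio` are introduced here for illustration and do not appear in the test.

```python
import numpy as np

# Values copied from the failing assertion in the log above.
bridge_log_evidence = 0.2670074583034374
harmonic_evidence = -3.788734352667505
tol = 2

# np.isclose(a, b) checks |a - b| <= atol + rtol * |b|, with rtol=1e-5 by default.
gap = abs(bridge_log_evidence - harmonic_evidence)
allowed = tol + 1e-5 * abs(harmonic_evidence)
is_close = bool(np.isclose(bridge_log_evidence, harmonic_evidence, atol=tol))

# A difference in log evidence corresponds to a multiplicative factor in evidence.
evidence_ratio = float(np.exp(gap))

print(f"gap={gap:.3f}, allowed={allowed:.3f}, is_close={is_close}")
print(f"evidence ratio implied by the gap: {evidence_ratio:.1f}")
```

The gap is roughly twice the absolute tolerance, so the two estimates disagree by well over an order of magnitude on the evidence scale.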


PaulJonasJost · 2024-09-20T12:45:08Z

In a quick test, neither increasing the number of starts to 100 nor increasing the number of samples to 5000 really helped. The results are also so far apart that no reasonable tolerance could be set, IMO. This seems to be something deeper. @arrjon, could you have a look at this?
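
For background, the harmonic mean estimator used as the reference in this test is generally known to be a high-variance estimator of the evidence. The sketch below illustrates this on a standalone toy problem using only NumPy. It does not call pyPESTO, and it is not established in this thread that this effect caused the failure. Assumptions made here: a standard normal likelihood, a uniform prior density of 1/20 on [-10, 10] (so the log evidence is approximately log(1/20)), and posterior draws approximated by exact standard normal samples. The function `toy_harmonic_mean_log_evidence` is hypothetical and is not the pyPESTO implementation.

```python
import numpy as np


def toy_harmonic_mean_log_evidence(samples: np.ndarray) -> float:
    """Harmonic mean estimate of log Z from posterior samples (toy problem)."""
    # Log-likelihood of a standard normal evaluated at each sample.
    log_lik = -0.5 * samples**2 - 0.5 * np.log(2 * np.pi)
    # log Z = -log(mean(1 / L)), computed with a max shift for stability.
    neg_log_lik = -log_lik
    shift = neg_log_lik.max()
    return float(-(shift + np.log(np.mean(np.exp(neg_log_lik - shift)))))


# Reference value under the stated assumptions; the normal mass outside
# [-10, 10] is negligible, so Z is approximately 1/20.
true_log_evidence = float(np.log(1 / 20))

# Repeat the estimate for several seeds to expose run-to-run variability.
estimates = np.array(
    [
        toy_harmonic_mean_log_evidence(
            np.random.default_rng(seed).standard_normal(1000)
        )
        for seed in range(20)
    ]
)

print(f"reference log evidence: {true_log_evidence:.3f}")
print(f"harmonic mean estimates: min={estimates.min():.3f}, "
      f"median={np.median(estimates):.3f}, max={estimates.max():.3f}")
```

In this toy setting the estimate is dominated by the few samples with the lowest likelihood, which is why it varies between seeds and is typically biased. That would be one reason to compare against a different reference, as the later fix does by switching the test to a Laplace approximation.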

arrjon · 2024-09-20T14:02:40Z

I will have a look!


dweindl added the CI and sampling labels Sep 16, 2024

PaulJonasJost assigned arrjon Sep 20, 2024

arrjon mentioned this issue Sep 20, 2024

Fix #1461: test for bridge sampling #1473

Merged

arrjon closed this as completed Sep 20, 2024

PaulJonasJost added a commit that referenced this issue Sep 26, 2024

Fix #1461: test for bridge sampling (#1473)

ab61c4a

* change test_bridge_sampling to laplace
* increase flaky

Co-authored-by: Paul Jonas Jost <70631928+PaulJonasJost@users.noreply.github.com>

dilpath mentioned this issue Sep 30, 2024

Reduce computation and non-deterministic behaviour in evidence and sampling tests #1475

Open
