Add ppe method for predictive elicitation (experimental) #336
Early draft of a predictive elicitation method. I have only tested it on a couple of simple models; I already know it will fail for others. This is just a proof of concept.
The main idea is that the user provides a model (currently only PyMC; adding Bambi should be easy) and a "target distribution". This distribution is not any particular dataset but the "not yet observed data". The author of *Understanding Advanced Statistical Methods* calls this "DATA", as opposed to "data" (the dataset I want to "fit"). So if my model is about the height of adults in San Luis (from which I got a sample, i.e. my data), I can use my domain knowledge of adult humans (DATA) to elicit the target distribution.
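A hypothetical usage sketch: the call below assumes the method ends up exposed as `preliz.ppe(model, target)` with a PreliZ distribution as the target, which may not match the final API.

```python
import numpy as np
import pymc as pm
import preliz as pz

# Model for the height (in cm) of adults in San Luis. The observed values
# are a placeholder; the elicitation replaces them with target samples.
with pm.Model() as model:
    mu = pm.Normal("mu", mu=0, sigma=10)
    sigma = pm.HalfNormal("sigma", sigma=10)
    pm.Normal("height", mu=mu, sigma=sigma, observed=np.zeros(100))

# Domain knowledge about adult humans ("DATA"): heights concentrate
# around ~170 cm with a spread of roughly 10 cm.
target = pz.Normal(170, 10)

# Hypothetical call; name and signature are still experimental.
new_priors = pz.ppe(model, target)
```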
A summary of the algorithm (a minimal sketch follows the list):

1. Generate a sample from the target distribution.
2. Maximize the model's likelihood for that sample (i.e. find the parameters for a fixed "observation").
3. Generate a new sample from the target and repeat.
4. Collect the optimized values in an array (one per prior parameter in the original model).
5. Use MLE to fit the optimized values to their corresponding families in the original model.
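A self-contained sketch of that loop, using SciPy instead of the actual machinery in this PR; `ppe_loop` and `nll` are illustrative stand-ins, not part of the proposed API.

```python
import numpy as np
from scipy import stats
from scipy.optimize import minimize

rng = np.random.default_rng(42)

def ppe_loop(draw_target, neg_log_likelihood, init_params, n_rounds=200):
    """Illustrative core loop: optimize parameters against fresh target samples."""
    optimized = []
    for _ in range(n_rounds):
        sample = draw_target(rng)                # steps 1/3: draw from the target
        res = minimize(neg_log_likelihood,       # step 2: fit the parameters
                       init_params,              # for this fixed "observation"
                       args=(sample,))
        optimized.append(res.x)                  # step 4: collect optimized values
    return np.asarray(optimized)                 # shape (n_rounds, n_params)

# Toy Normal likelihood with params = (mu, log_sigma), so the
# optimization is unconstrained.
def nll(params, sample):
    mu, log_sigma = params
    return -stats.norm.logpdf(sample, mu, np.exp(log_sigma)).sum()

draws = ppe_loop(lambda rng: rng.normal(170, 10, size=50), nll,
                 np.array([150.0, 1.0]))

# step 5: MLE-fit each column back to the prior's family in the original model
mu_loc, mu_scale = stats.norm.fit(draws[:, 0])
```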
This approach is similar to what we do in Kulprit. One difference is that for Kulprit the "target" is actually the posterior predictive distribution of a reference model, and we are interested in finding submodels (and their posteriors) that induce predictions as close as possible to the predictions from the reference model. Here we don't have a reference model; instead we have a human (or potentially a few humans). Another difference is that for Kulprit the optimized values are an approximation to the posterior we care about, while here we need to fit those values to the priors' families in the original model, because we cannot use samples as priors in a PyMC model (or in other PPLs).
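To make that last point concrete, a hedged sketch of feeding the fitted parameters back into the model (`draws` is assumed to come from a loop like the one above, and the Normal family is assumed to match the original prior on `mu`):

```python
import pymc as pm
from scipy import stats

# PyMC priors are parametric, so the optimized samples must be collapsed
# into the parameters of the original prior's family before reuse.
loc, scale = stats.norm.fit(draws[:, 0])

with pm.Model() as refined_model:
    mu = pm.Normal("mu", mu=loc, sigma=scale)  # elicited prior, same family
```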
A further difference is that here we use a slightly different approach to obtain the likelihood function for the optimization routine. If this can be generalized, we could use it in Kulprit too; I think this approach was not available when we discussed Kulprit's design, and it could potentially make the code easier to maintain and extend.