Handle Beta degen from Inf params #167

quildtide · 2024-08-08T07:28:50Z

Main changes:

Handle beta distribution degenerate cases that emerge when alpha, beta, or both are Inf.
- All behavior is consistent with equivalent functions for the Dirac distribution in Distributions.jl

Additional changes:

Implement degenerate case handling for betapdf and betalogpdf
- Undefined (NaN) at the single point of support, 0 everywhere else
- This diverges from the Dirac behavior in Distributions.jl, since Dirac is treated as a discrete distribution and pdf(::Dirac) is PMF instead of PDF. PMF is 1, PDF is a value which integrates to 1 at a single point.
Quantile functions now always return a constant on degenerate cases
- previously, quantile functions sometimes returned values that are not in the support of the distribution when the distribution is degenerate.
- The support of Beta(Inf, 1) does NOT include anything but 1. Docs for quantile both here and in Distributions.jl say that the returned value should be within the support.
- This is also consistent with how quantile(::Dirac) works in Distributions.jl
Additional and adjusted tests to handle all changes in behavior
- Tests on degenerate beta distributions are performed on a wider range, but with wider increments now (behavior is not going to change between betacdf(0, 0.5, 0.45) and betacdf(0, 0.5, 0.46))

Primary motivation:

I'm fixing handling of degenerate Beta distributions in Distributions.jl right now, and median(Beta(Inf, 1)) hangs because quantile(Beta(Inf, 1), 0.5) hangs.

Between this pull request and one about to be opened in Distributions.jl, degenerate Beta distribution behavior will align with the Dirac distribution behavior, EXCEPT returning pdf = NaN instead of pmf = 1.

quildtide · 2024-08-08T08:36:37Z

JuliaStats/Distributions.jl#1881 is the Distributions.jl-side counterpart.

andreasnoack · 2024-08-08T19:34:35Z

I think the current behavior (NaN) is reasonable. You don't know where the point with mass will be located. It completely depends on how you reach the limit. If you parameterize with mode and concentration and let the concentration to go infinity then both α and β go to infinity but the location of the mass point will depend on the mode parameter. Usually when the limit is indeterminate like that then you'd have to return NaN.

quildtide · 2024-08-08T22:19:35Z

So there's a total of 5 edge cases where the Beta distribution turns into a Dirac. StatsFuns.jl currently handles 2 of them, while Distributions.jl allows the other 3 (but does not handle them correctly).

Letting c be a finite positive value, there are:

Beta(0, c) = Dirac(0), handled by StatsFuns.jl already
Beta(c, 0) = Dirac(1), handled by StatsFuns.jl already
Beta(Inf, c) = Dirac(1), handled by this pull request
Beta(c, Inf) = Dirac(0), handled by this pull request
Beta(Inf, Inf) = Dirac(?), handled by this pull request

Even if we cannot come to an agreement on Beta(Inf, Inf), I think the other two are worth supporting.

Admittedly, I couldn't find any source for Beta(Inf, Inf) = Dirac(0.5) aside from Wikipedia. I wound up checking the R implementation, and I found that they handle these edge cases consistently:

> rbeta(2, Inf, Inf)
[1] 0.5 0.5
> rbeta(2, Inf, 1)
[1] 1 1
> rbeta(2, 1, Inf)
[1] 0 0
> rbeta(2, 0, 1)
[1] 0 0
> rbeta(2, 1, 0)
[1] 1 1

I don't entirely agree with their implementation, since pbeta(.5, Inf, Inf) returns 1 when I think the value is more of a NaN, and dbeta(0, Inf, 0) returns 0 when I feel that 0 is not in the support.

R actually supports another edge case, Beta(0, 0), or Haldane's Prior, which converges to Bernoulli(0.5).

R's documentation actually calls out:

pl.beta(0, 0)   ## point masses at  {0, 1}

pl.beta(0, 2)   ## point mass at 0 ; the same as
pl.beta(1, Inf)

pl.beta(Inf, 2) ## point mass at 1 ; the same as
pl.beta(3, 0)

pl.beta(Inf, Inf)# point mass at 1/2

The Beta(Inf, Inf) = Dirac(0.5) idea admittedly seems to be dependent on the constraint that α and β approach infinity at the same rate. I suppose that loosening that constraint (e.g. α = 2β) would allow you to create arbitrary Dirac(α/(α+β )) from Beta(Inf, Inf).

quildtide · 2024-08-09T01:59:56Z

The way pdf(Normal(mu, 0), mu) is handled here and in Distributions.jl is to return Inf, not NaN. I can see valid arguments for both.

quildtide · 2024-08-09T07:39:33Z

Closing the companion pull request in Distributions.jl with intent of an alternative pull request that just blocks creation of Beta distributions with Inf parameters.

Handle Beta degen from Inf params

aaab071

quildtide mentioned this pull request Aug 8, 2024

Handle degenerate Beta distribution cases JuliaStats/Distributions.jl#1881

Closed

quildtide closed this Aug 9, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Handle Beta degen from Inf params #167

Handle Beta degen from Inf params #167

quildtide commented Aug 8, 2024

quildtide commented Aug 8, 2024

andreasnoack commented Aug 8, 2024 •

edited

Loading

quildtide commented Aug 8, 2024

quildtide commented Aug 9, 2024

quildtide commented Aug 9, 2024

Handle Beta degen from Inf params #167

Handle Beta degen from Inf params #167

Conversation

quildtide commented Aug 8, 2024

Main changes:

Additional changes:

Primary motivation:

quildtide commented Aug 8, 2024

andreasnoack commented Aug 8, 2024 • edited Loading

quildtide commented Aug 8, 2024

quildtide commented Aug 9, 2024

quildtide commented Aug 9, 2024

andreasnoack commented Aug 8, 2024 •

edited

Loading