Entropy is NaN for MatrixDirichlet with 0 entries #137

MagnusKoudahl · 2023-04-11T10:59:53Z

When calculating the entropy of a MatrixDirichlet with 0 entries, the result is NaN.

This follows from the entropy function making an elementwise call to SpecialFunctions.loggamma on dist.a which evaluates to Inf on each 0 entry.

This means that when learning the parameters of an HMM, for example, a marginal that evaluates to the identity matrix I will break the free energy (FE) calculation.

bvdmitri · 2023-04-17T17:03:50Z

@ismailsenoz @ThijsvdLaar WDYT?

ThijsvdLaar · 2023-04-18T07:06:09Z

Good catch. In the entropy for the MatrixDirichlet, for a column with a zero entry (e.g. [2.0; 0.0]), the NaN value stems from a -Inf+Inf evaluation (first summation term plus last summation term). Introduced NaN values are notoriously hard to debug, so it would be good to prevent returning NaN.

Numerically, for a parameter matrix with an entry approaching zero, the entropy appears to approach -Inf. This would be good to verify mathematically. If this holds then we could return -Inf for the entropy; similar to how other Julia functions handle singularities.
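A rough check of this limit, assuming the MatrixDirichlet entropy is the sum of the standard Dirichlet entropies of its columns. For one column with parameters $\alpha_1, \dots, \alpha_K$ and $\alpha_0 = \sum_k \alpha_k$:

$$
H(\alpha) = \sum_{k=1}^{K} \log\Gamma(\alpha_k) - \log\Gamma(\alpha_0) + (\alpha_0 - K)\,\psi(\alpha_0) - \sum_{k=1}^{K} (\alpha_k - 1)\,\psi(\alpha_k)
$$

Let a single entry $\alpha_j = \varepsilon \to 0^+$ while the other entries stay fixed and positive, so that the $\alpha_0$ terms remain finite. Using the small-argument expansions $\log\Gamma(\varepsilon) = -\log\varepsilon + O(\varepsilon)$ and $\psi(\varepsilon) = -1/\varepsilon - \gamma + O(\varepsilon)$, the two terms that depend on $\alpha_j$ combine to:

$$
\log\Gamma(\varepsilon) - (\varepsilon - 1)\,\psi(\varepsilon) = -\log\varepsilon - \frac{1}{\varepsilon} + (1 - \gamma) + O(\varepsilon) \;\longrightarrow\; -\infty
$$

The $-1/\varepsilon$ term dominates $-\log\varepsilon$, which is consistent with the numerical observation that the entropy approaches -Inf. At exactly $\varepsilon = 0$ the two terms are evaluated separately as Inf and -Inf, which gives the NaN.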

I'd say it's up to the user to choose vague priors with appropriate epsilon approximations to prevent singularities in their algorithm.

ismailsenoz · 2023-04-18T07:21:10Z

Passing $[2,0]$, or any parameter vector containing $0$, to the Dirichlet distribution is not allowed. The Dirichlet distribution is defined for an $\alpha$ where each $\alpha_i > 0$. What @MagnusKoudahl is trying to do violates the theory; please see the definition of the Dirichlet distribution. I do not see the point of this as an issue, since there is nothing wrong with the computation. As a way around SpecialFunctions, one could implement an xlogx function that defines $0\log(0) \triangleq 0$; for discrete random variables, this avoids Inf and NaN values in entropy computations. However, the same trick won't work in the continuous case. But there is no need for such a convention here, because an $\alpha$ with zero entries violates the domain specification and is not an issue of ReactiveMP.
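For reference, a minimal sketch of the xlogx convention for a discrete distribution. Both `xlogx` and `discrete_entropy` are written by hand here for illustration and are not taken from ReactiveMP. As noted above, this only helps in the discrete case and does not make a Dirichlet with zero parameters valid.

```julia
# Convention 0 * log(0) := 0; hand-rolled for illustration.
xlogx(x::Real) = iszero(x) ? zero(float(x)) : x * log(x)

# Shannon entropy (in nats) of a discrete probability vector.
discrete_entropy(p::AbstractVector{<:Real}) = -sum(xlogx, p)

discrete_entropy([0.5, 0.5, 0.0])  # finite: the zero entry contributes nothing
0.0 * log(0.0)                     # NaN, which is what the convention avoids
```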

ThijsvdLaar · 2023-04-18T07:26:50Z

Good point, in that case I think it might be good to throw a DomainError to prevent confusion.
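A minimal sketch of what such a check could look like. The helper name `check_positive_concentration` is hypothetical and not part of ReactiveMP; it only assumes that the parameters are available as a plain matrix.

```julia
# Hypothetical helper, not part of ReactiveMP's API.
# Throws a DomainError if any entry is not strictly positive
# (this also rejects NaN, since `NaN > 0` is false).
function check_positive_concentration(a::AbstractMatrix{<:Real})
    for idx in CartesianIndices(a)
        a[idx] > 0 || throw(DomainError(
            a[idx],
            "concentration parameters must be strictly positive; found $(a[idx]) at position $(Tuple(idx))",
        ))
    end
    return a
end

check_positive_concentration([2.0 1.0; 3.0 1.0])    # returns the matrix unchanged

try
    check_positive_concentration([2.0 1.0; 0.0 1.0])
catch err
    @assert err isa DomainError                     # zero entry is rejected up front
end
```

Failing at construction time, rather than in the entropy, keeps the error close to the place where the invalid parameters were introduced.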

ismailsenoz · 2023-04-18T07:36:51Z

Yes, we can throw an assertion statement that checks the parameter vector does not contain 0. Or we can add a jitter assuming that the user is trying to pass a small value and is not aware of the domain specifications for distributions. I am not a big fan of jitters, but I also understand that getting a NaN error is annoying, and avoiding that might also be beneficial. @MagnusKoudahl Which one would have been more useful in your experience? To get an assertion error or injection of tinies automatically? @ThijsvdLaar @bvdmitri I am fine either way or open to other alternatives.

MagnusKoudahl · 2023-04-18T07:39:47Z

I would be in favour of throwing an error then. Adding tinies can also lead to unexpected behaviours, and hiding it from the user makes it really hard to debug. With an error, the user is still in control of how they want to set up their model and can decide whether adding tinies is the right call.

ismailsenoz · 2023-04-18T07:46:01Z

But Dirichlet distribution is from Distributions.jl, and I think it is impossible to modify their structure. Maybe they already implement this kind of check.

bvdmitri · 2023-04-18T10:55:39Z

Well, as far as I remember, the inference function should already check for NaN/Inf values. It basically says something like `Failed to compute node bound free energy component. The result is NaN`, and the same for entropies here: https://github.com/biaslab/RxInfer.jl/blob/main/src/score/bfe.jl#L57. Do you see this error, or is it something different for you @MagnusKoudahl?

MagnusKoudahl · 2023-04-24T10:47:44Z

@bvdmitri That's the same error I see

bvdmitri · 2023-04-24T11:32:51Z

MatrixDirichlet is our own structure, and we can add a check for zero entries in the constructor. Technically, a diagonal prior should not be allowed, and we should warn users about that.

In the meantime another workaround could be the functional form constraints, which would allow you to add jitter @MagnusKoudahl . See the corresponding section in the documentation: https://biaslab.github.io/ReactiveMP.jl/stable/custom/custom-functional-form/#custom-functional-form-example .

albertpod · 2023-06-01T09:48:41Z

@MagnusKoudahl do you have time to handle this?

MagnusKoudahl · 2023-06-19T09:59:31Z

Related to this issue, we also allow other invalid parameterisations. For example,

using ReactiveMP
my_var = NormalMeanVariance(0,-1)

does not throw an error. Is ensuring correct parameterisation something worth spending time on more generally?
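As an illustration of what constructor-level validation could look like in general, here is a sketch using a hypothetical stand-in type. `CheckedNormalMeanVariance` is not ReactiveMP's `NormalMeanVariance`, and the field names are assumptions.

```julia
# Hypothetical type for illustration; not ReactiveMP's NormalMeanVariance.
struct CheckedNormalMeanVariance{T <: Real}
    mean::T
    variance::T

    # Inner constructor: reject invalid variances before the object exists.
    function CheckedNormalMeanVariance(mean::Real, variance::Real)
        variance > 0 || throw(DomainError(variance, "variance must be strictly positive"))
        m, v = promote(float(mean), float(variance))
        return new{typeof(m)}(m, v)
    end
end

CheckedNormalMeanVariance(0, 1)       # fine

try
    CheckedNormalMeanVariance(0, -1)  # invalid: negative variance
catch err
    @assert err isa DomainError
end
```

The check costs one comparison per construction, which may or may not be acceptable inside tight message-passing loops.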

MagnusKoudahl self-assigned this Jun 19, 2023

bvdmitri transferred this issue from ReactiveBayes/ReactiveMP.jl Oct 5, 2023

albertpod added this to RxInfer Jan 11, 2024

albertpod moved this to 🤔 Ideas in RxInfer Jan 11, 2024
