Implement MVN.unsqueeze #2624
Conversation
```python
if self.islazy:
    new_covar = self._covar.unsqueeze(dim)
    new = self.__class__(mean=new_loc, covariance_matrix=new_covar)
    if self.__unbroadcasted_scale_tril is not None:
        # Reuse the scale tril if available.
        new.__unbroadcasted_scale_tril = self.__unbroadcasted_scale_tril.unsqueeze(dim)
else:
    # Non-lazy MVN is represented using scale_tril in PyTorch.
    # Constructing it from scale_tril will avoid unnecessary computation.
    # Initialize using __new__, so that we can skip __init__ and use scale_tril.
    new = self.__new__(type(self))
    new._islazy = False
    new_scale_tril = self.__unbroadcasted_scale_tril.unsqueeze(dim)
    super(MultivariateNormal, new).__init__(loc=new_loc, scale_tril=new_scale_tril)
    # Set the covar matrix, since it is always available for GPyTorch MVN.
    new.covariance_matrix = self.covariance_matrix.unsqueeze(dim)
```
A lot of this is duplicated from the `expand()` code above. Can we make a helper to reuse most of this? Or potentially consider allowing a new MVN to be instantiated from the `scale_tril` directly, as suggested on the other PR. Could be a class method `MultivariateNormal.from_scale_tril()` or the like if we don't want to change the `__init__()` signature.
They follow the same pattern, but I wouldn't call them duplicates. You could make a helper that takes in the operation to apply to the tensors, but there are two separate operations for `expand` (for loc & covar), so I don't know if the resulting helper would improve the code readability.

> Could be a class method `MultivariateNormal.from_scale_tril()` or the like if we don't want to change the `__init__()` signature.

This would avoid the issue of checking for `covar` & `scale_tril` compatibility in `__init__`. If we're not using `__init__`, would this just extract the `self.__new__`-based construction used here into a separate method? I suppose this would work for non-lazy `scale_tril` as is. We'd have to have a separate case for lazy, at which point we end up with a duplicate of `__init__`, and I again question the added value of it over just keeping this as is.
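To make the trade-off concrete, here is a hypothetical version of such a helper. Nothing named here exists in GPyTorch; `_map_mvn` is illustrative only and is simplified to rebuild through the public constructor. Even for `unsqueeze`, the mean and covariance need different `dim` arguments, since they carry one and two trailing event dimensions respectively:

```python
import torch
from gpytorch.distributions import MultivariateNormal


def _map_mvn(mvn, loc_op, covar_op):
    # Apply separate operations to the mean and the (lazy) covariance,
    # then rebuild the distribution through the public constructor.
    return MultivariateNormal(
        mean=loc_op(mvn.loc),
        covariance_matrix=covar_op(mvn.lazy_covariance_matrix),
    )


mvn = MultivariateNormal(torch.zeros(2, 4), torch.eye(4).expand(2, 4, 4))

# Unsqueeze the innermost batch dimension: batch_shape (2,) -> (2, 1).
# Note the mismatched dims (-2 vs. -3) for the two tensors.
new = _map_mvn(mvn, lambda t: t.unsqueeze(-2), lambda c: c.unsqueeze(-3))
assert new.batch_shape == torch.Size([2, 1])
```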
If we have a method exposed that allows constructing the MVN from `scale_tril` (either as part of an updated `__init__()` method or as a separate method), then I would expect that people will see it and use it for other purposes going forward (but they most likely won't look for the logic somewhere deep in the code, as in this PR).

Not going to die on that hill, but the fact that we don't expose a natural way of doing this seems like a gap we could ideally address.
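For reference, a minimal sketch of what the suggested classmethod could look like. `from_scale_tril` does not exist in GPyTorch; the name, signature, and attribute handling here are assumptions modeled on the `__new__`-based construction in the snippet above:

```python
import torch
from gpytorch.distributions import MultivariateNormal


def from_scale_tril(loc, scale_tril):
    # Hypothetical constructor: build a non-lazy GPyTorch MVN directly from a
    # Cholesky factor, skipping __init__ (which expects a covariance matrix).
    new = MultivariateNormal.__new__(MultivariateNormal)
    new._islazy = False  # mark the instance as non-lazy, as in the PR snippet
    # Delegate to the torch.distributions initializer, which accepts scale_tril.
    super(MultivariateNormal, new).__init__(loc=loc, scale_tril=scale_tril)
    return new


mvn = from_scale_tril(torch.zeros(3), torch.eye(3))
assert torch.allclose(mvn.covariance_matrix, torch.eye(3))
```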
Force-pushed from 6287b0b to 312b0f1.
Implements an `unsqueeze` method for MVN that constructs a new MVN with the underlying tensors / linear operators unsqueezed along the given batch dimension. The choice of unsqueezing along the batch dimensions is consistent with the definition of `expand`. `MVN.expand` allows us to start with `MVN.batch_shape = b2 x b1` and add additional batch dimensions to the left, so that `new_MVN.batch_shape = b3 x b2 x b1`. `unsqueeze`, when combined with `expand`, will allow us to add batch dimensions in the middle, so that `new_MVN.batch_shape = b2 x b3 x b1` or `b2 x b1 x b3`.

The use case for this is to match MVNs produced by two models of different batch shapes. Assume that `m1.batch_shape = mb2 x mb1` and `m2.batch_shape = mb1`. If we evaluate these two models with `X` of shape `xb1 x q x d`, we get `m1(X).batch_shape = xb1 x mb2 x mb1` and `m2(X).batch_shape = xb1 x mb1`. By calling `m2(X).unsqueeze(-2).expand(-1, mb2, -1)`, we can match the batch shapes of the two MVNs, allowing them to be combined into a single MTMVN using the `from_independent_mvns` method, as sketched below.
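A minimal end-to-end sketch of this use case, using plain MVNs with illustrative shapes (`xb1 = 5`, `mb2 = 3`, `mb1 = 2`, `q = 4`) as stand-ins for the two model posteriors. Only `unsqueeze` is the method added by this PR; everything else is existing GPyTorch API:

```python
import torch
from gpytorch.distributions import MultitaskMultivariateNormal, MultivariateNormal

xb1, mb2, mb1, q = 5, 3, 2, 4

# Stand-ins for m1(X) and m2(X) with mismatched batch shapes.
mvn1 = MultivariateNormal(
    torch.zeros(xb1, mb2, mb1, q),
    torch.eye(q).expand(xb1, mb2, mb1, q, q),
)
mvn2 = MultivariateNormal(
    torch.zeros(xb1, mb1, q),
    torch.eye(q).expand(xb1, mb1, q, q),
)

# Insert a singleton batch dim in the middle, then expand it to match mvn1.
mvn2_matched = mvn2.unsqueeze(-2).expand(torch.Size([xb1, mb2, mb1]))
assert mvn2_matched.batch_shape == mvn1.batch_shape

# With matching batch shapes, the two MVNs can be combined into an MTMVN.
mtmvn = MultitaskMultivariateNormal.from_independent_mvns([mvn1, mvn2_matched])
```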