Cascading flow surrogate posterior #1345
base: main
Conversation
Hi Gianluigi, here's a first round of comments---I think there are some fairly subtle challenges, but this is a really nice start!
return distribution


def register_cf_substitution_rule(condition, substitution_fn):
I'm not sure we actually need this substitution machinery for cascading flows. We originally added it because ASVI (at least as implemented in TFP) depends on the parameterized distribution family: e.g., a Uniform(0., 1.) prior is the same distribution as Beta(1., 1.), but the two give rise to different posterior families as you tune the parameters. Cascading flows, by contrast, just use the prior distribution as-is.

I'd go ahead and delete this (and _as_substituted_distribution, and the specific registrations below) for now, unless we have a specific use that requires it. (We can always add it back later if we decide it's useful.)
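To make that concrete, here's a minimal illustration (not part of the PR) of why the parameterization matters for ASVI but not for cascading flows:

import tensorflow_probability as tfp
tfd = tfp.distributions

# These define the same distribution on (0, 1)...
prior_as_uniform = tfd.Uniform(low=0., high=1.)
prior_as_beta = tfd.Beta(concentration1=1., concentration0=1.)

# ...but ASVI builds its surrogate by perturbing the prior's *parameters*,
# so the two induce different trainable families: tuning (low, high) can
# only shift and rescale a flat density, while tuning the concentrations
# can reshape the density on (0, 1). A cascading flow instead transforms
# samples from the prior as-is, so the parameterization doesn't matter.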
    _extract_variables_from_coroutine_model(
        posterior_generator, seed=seed)))

# Temporary workaround for bijector caching issues with autobatched JDs.
I think we can delete this comment now.
return surrogate_posterior, variables


# todo: sample_shape is not used.. can remove?
We're not special-casing tfd.Sample the way that ASVI does, so yes, I think we can get rid of sample_shape (here and elsewhere).
# save the variables.
value_out = yield (surrogate_posterior if flat_variables
                   else (surrogate_posterior, variables))
if type(value_out) == list:
I assume this is to detect auxiliary variables, but it would also fire (incorrectly, I believe) on list-valued distributions like JointDistributionSequential. Would it work to directly check if num_auxiliary_variables > 0?

This could probably also use a short explanatory comment.
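For instance, samples from a list-valued joint distribution are themselves Python lists, so the type check alone can't tell them apart from the [latent, auxiliary] pairs. A small runnable demonstration (not part of the PR):

import tensorflow_probability as tfp
tfd = tfp.distributions

jd = tfd.JointDistributionSequential(
    [tfd.Normal(0., 1.), lambda loc: tfd.Normal(loc, 1.)])
value_out = jd.sample()
print(type(value_out) == list)  # True, even with no auxiliary variables.

Checking num_auxiliary_variables > 0 directly (with a brief comment noting that the caller sends back a [latent, auxiliary] pair in that case) would address both points.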
                   else (surrogate_posterior, variables))
if type(value_out) == list:
  if len(dist.event_shape) == 0:
    dist = prior_gen.send(tf.squeeze(value_out[0], -1))
As a style matter, we prefer array slicing notation where possible, e.g.:

x[..., 0] instead of squeeze(x, -1) (as here)
x[..., tf.newaxis] instead of expand_dims(x, -1)

etc.
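Applied to the line above, the suggested rewrite would be (a sketch):

dist = prior_gen.send(value_out[0][..., 0])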
    [tf.random.uniform((1,), minval=0.01, maxval=1.)
     for _ in range(num_aux_vars)], bijector=tfb.Softplus()), -1)), 1)

def target_log_prob_aux_vars(z_and_eps):
It would be nice to do something to help the user define the loss. Maybe we could add a method along the lines of augment_target_log_prob that takes the target_log_prob_fn and the prior distribution as arguments, and returns the equivalent of this method target_log_prob_aux_vars?
Aside from convenience, one reason to do this is that it gives us leeway on the structure of the surrogate posterior. Instead of an exact specification, we can just provide the contract that

augmented_log_prob = augment_target_log_prob(target_log_prob, prior)
surrogate_posterior = build_cf_surrogate_posterior(prior, ...)
lp = augmented_log_prob(*surrogate_posterior.sample())

works to compute a valid log-density. Then if we decide later on to change the structure of the surrogate (as in my comment below about using the Restructure bijector), we just have to make the corresponding changes to augment_target_log_prob in order to avoid breaking code that uses this pattern.
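A minimal sketch of what such a helper could look like, assuming the surrogate samples all latent variables first and then the auxiliary variables, and that the auxiliaries are scored under a standard-normal model (both assumptions; the PR's actual auxiliary model may differ):

import tensorflow as tf
import tensorflow_probability as tfp
tfd = tfp.distributions

def augment_target_log_prob(target_log_prob_fn, prior):
  # Hypothetical helper: returns a log-density over (latents, auxiliaries).
  num_latents = len(prior.event_shape)  # One entry per latent variable.

  def augmented_log_prob(*z_and_eps):
    z, eps = z_and_eps[:num_latents], z_and_eps[num_latents:]
    # Score the auxiliary variables under a fixed standard-normal model.
    aux_lp = sum(
        tf.reduce_sum(tfd.Normal(0., 1.).log_prob(e)) for e in eps)
    return target_log_prob_fn(*z) + aux_lp

  return augmented_log_prob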
@test_util.test_all_tf_execution_regimes
class TestCFDistributionSubstitution(test_util.TestCase):
We can probably delete this test along with the substitution code.
# todo: sample_shape is not used.. can remove?
def _cf_convex_update_for_base_distribution(dist,
This isn't a convex update any more --- maybe call it something like _cascading_flow_update_for_base_distribution?
def make_prior_dist(self):
  def _prior_model_fn():
    innovation_noise = 1.
    yield tfd.HalfNormal(
One thing we need to think about is: how should cascading flows apply to distributions that have constrained support (like this HalfNormal)?

Since flows don't really respect constraints on their own, IMHO the natural approach would be to transform the distribution into an unconstrained space, apply the flow, and then reapply the constraint. That is, in _cf_update_for_base_distribution, you'd do something like:

constraining_bijector = dist.experimental_default_event_space_bijector()
unconstrained_dist = tfb.Invert(constraining_bijector)(dist)
cascading_flow = ...  # Build the cascading flow from unconstrained_dist.
# Now reapply the constraint to the sampled event (but not the auxiliary part).
constrained_cascading_flow = tfb.JointMap(
    [constraining_bijector, tfb.Identity()])(cascading_flow)

Then we'd want to test that the surrogate has the same support as the prior. Probably the easiest way to do this is to verify that the log-probs are finite. For example, you could add a test to _TrainableCFSurrogate that checks:

self.assertAllFinite(
    prior_dist.log_prob(
        surrogate_posterior.sample(10, seed=test_util.test_seed())))
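Sketched as a test method (the names prior_dist and surrogate_posterior are assumed from the review comment; the actual fixture wiring may differ):

def test_surrogate_has_prior_support(self):
  # `prior_dist` and `surrogate_posterior` as built by the existing
  # _TrainableCFSurrogate setup.
  samples = surrogate_posterior.sample(10, seed=test_util.test_seed())
  # Finite log-probs under the prior imply the surrogate's samples
  # stay inside the prior's support.
  self.assertAllFinite(prior_dist.log_prob(samples))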
  surrogate posterior distribution, with the same structure and `name` as
    `dist`, and with the addition of global and local auxiliary variables if
    `num_auxiliary_variables > 0`.
  variables: Nested structure of `tf.Variable` trainable parameters for the
Since the components of variables are now bijectors, maybe we should call it a nested structure 'containing' tf.Variables rather than 'of' tf.Variables?