PyMC/PyTensor Implementation of Pathfinder VI #387

aphc14 · 2024-10-31T10:42:57Z

An alternative to draft PR #386 that makes more use of PyTensor's symbolic variables and compiled functions.

Questions for Review

Which implementation should I continue developing for future improvements?
Are there additional PyTensor optimisations we could leverage?


`fit_pathfinder`

- Edited `fit_pathfinder` to produce `pathfinder_state`, `pathfinder_info`, `pathfinder_samples` and `pathfinder_idata` for closer examination of the outputs.
- Changed the `num_samples` argument name to `num_draws` to avoid `TypeError: got multiple values for keyword argument 'num_samples'`.
- Initial points are automatically set to jitter, as jitter is required for Pathfinder.

Extras

- New function `get_jaxified_logp_ravel_inputs` to simplify the previous code structure in `fit_pathfinder`.

Tests

- Added an extra test for Pathfinder to check that the `pathfinder_info` variables and `pathfinder_idata` are consistent for a given random seed.
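The keyword collision behind the `num_samples` rename is a general Python mechanism and can be reproduced in isolation. The sketch below is illustrative only: `approximate`, `fit_colliding` and `fit_renamed` are hypothetical stand-ins, and the actual call chain in `fit_pathfinder` is not shown in this PR description.

```python
def approximate(num_samples=200, maxiter=30):
    """Hypothetical inner routine that owns the `num_samples` keyword."""
    return {"num_samples": num_samples, "maxiter": maxiter}


def fit_colliding(draws, **kwargs):
    # The wrapper passes its own value as `num_samples` AND forwards **kwargs,
    # so a caller-supplied `num_samples` reaches `approximate` twice.
    return approximate(num_samples=draws, **kwargs)


def fit_renamed(num_draws, **kwargs):
    # The wrapper's draw count has its own name, so `num_samples` in kwargs
    # reaches `approximate` exactly once.
    settings = approximate(**kwargs)
    settings["num_draws"] = num_draws
    return settings


try:
    fit_colliding(10, num_samples=5)
    collision_message = ""
except TypeError as err:
    collision_message = str(err)

result = fit_renamed(10, num_samples=5)
```

Giving the wrapper's argument a distinct name keeps the two meanings apart without restricting what can be forwarded to the inner routine.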

Add a new PyMC-based implementation of Pathfinder VI built on PyTensor operations, so that `fit_pathfinder` supports both the PyMC and BlackJAX backends.

- Implemented … in … to support running multiple Pathfinder instances in parallel.
- Implemented function … in … for Pareto Smoothed Importance Resampling (PSIR).
- Moved relevant Pathfinder files into the … directory.
- Updated tests to reflect changes in the Pathfinder implementation and added tests for new functionalities.

aphc14 · 2024-11-04T19:31:18Z

If the preferred approach is to stick with PyTensor symbolic variables rather than the non-symbolic approach in #386, I'd be happy to refactor the Multipath Pathfinder implementation in #386 to use symbolic variables and `pytensor.function`.


aphc14 · 2024-11-07T18:15:31Z

This version runs much faster than #386, but the code is messier because of the many PyTensor symbolic variables created for the compiled PyTensor functions (see the lines of code between `def compute_logp` and `def single_pathfinder`). Any suggestions for a cleaner setup would be appreciated.


fonnesbeck · 2024-11-08T02:42:08Z

pymc_experimental/inference/pathfinder/lbfgs.py

+    g: np.ndarray
+
+
+class LBFGSHistoryManager:


Cleaner to use a data class? Don't know.

yep, I agree. dataclass now added
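For readers unfamiliar with the suggestion, a dataclass-based history container might look roughly like the sketch below. It is not the code added in the PR: `LBFGSStep`, the `x` field and the `add` method are hypothetical, and plain tuples stand in for the `np.ndarray` fields seen in the diff to keep the example dependency-free.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class LBFGSStep:
    """One recorded optimiser iterate (hypothetical name and fields)."""

    x: tuple[float, ...]  # position of the iterate
    g: tuple[float, ...]  # gradient evaluated at x


@dataclass
class LBFGSHistoryManager:
    """Collects iterates as the optimiser runs; a sketch, not the PR's class."""

    steps: list[LBFGSStep] = field(default_factory=list)

    def add(self, x, g):
        # Copy into tuples so later mutation of the inputs cannot alter history.
        self.steps.append(LBFGSStep(tuple(x), tuple(g)))

    def __len__(self):
        return len(self.steps)


history = LBFGSHistoryManager()
history.add([0.0, 1.0], [0.5, -0.5])
history.add([0.1, 0.9], [0.4, -0.4])
```

The dataclass decorator generates `__init__`, `__repr__` and `__eq__`, which is most of the boilerplate a hand-written history class would otherwise carry.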


Summary of changes:

- Removed multiprocessing code in favour of reusing the compiled … for each path
- … takes only `random_seed` as an argument for each path
- Compute graph is significantly smaller through the use of pure PyTensor ops and symbolic variables
- Added `LBFGSOp` to compile with `pytensor.function`
- Cleaned up code using PyTensor variables
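The "build once, call per path with only a seed" pattern described in this summary can be sketched without PyTensor. Everything below is an assumption made for illustration: `make_single_path_fn` stands in for the one-off compilation step, and the Gaussian draws stand in for a real Pathfinder run.

```python
import random


def make_single_path_fn(num_draws):
    """Build the per-path callable once (stand-in for the compilation step)."""

    def single_path(random_seed):
        # Each path gets its own generator, so the seed fully determines it.
        rng = random.Random(random_seed)
        return [rng.gauss(0.0, 1.0) for _ in range(num_draws)]

    return single_path


# Built once, then reused for every path with only the seed changing.
single_path = make_single_path_fn(num_draws=3)
path_seeds = [11, 12, 13, 14]
draws_per_path = [single_path(seed) for seed in path_seeds]
```

Because the expensive setup happens once and each call depends only on its seed, paths stay reproducible and can run sequentially without any multiprocessing machinery.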


fonnesbeck · 2024-11-17T19:40:34Z

pymc_experimental/inference/fit.py

@@ -31,11 +31,13 @@ def fit(method, **kwargs):
    arviz.InferenceData
    """
    if method == "pathfinder":
+        # TODO: Remove this once we have a pure PyMC implementation


This PR will provide that, no?

the latest commit addresses this

Fixed incorrect and inconsistent posterior approximations in the Pathfinder VI algorithm by:

1. Adding missing parentheses in the phi calculation to ensure the proper order of operations in matrix multiplications.
2. Changing the sign in the mu calculation from 'x +' to 'x -' to match Stan's implementation (which differs from the original paper).

These changes make the posterior approximations more reliable.

Implements both sparse and dense BFGS sampling approaches for Pathfinder VI:

- Adds `bfgs_sample_dense` for cases where `2*maxcor >= num_params`.
- Moves the existing … and … computations to `bfgs_sample_sparse`, making the sparse use cases more explicit.

Other changes:

- Sets default `maxcor=5` instead of dynamic sizing based on the number of parameters.

Dense approximations are recommended when the target distribution has stronger dependencies among the parameters.
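The dispatch rule stated in this commit message (dense when `2*maxcor >= num_params`, sparse otherwise) can be written as a small helper. `choose_bfgs_sampler` is a hypothetical function written for illustration; the message does not show how the choice between `bfgs_sample_dense` and `bfgs_sample_sparse` is actually wired up.

```python
def choose_bfgs_sampler(num_params, maxcor=5):
    """Return which sampler the stated rule selects (hypothetical helper).

    The default maxcor=5 mirrors the default mentioned in the commit message.
    """
    if 2 * maxcor >= num_params:
        return "dense"
    return "sparse"


small_model = choose_bfgs_sampler(num_params=8)     # 2 * 5 >= 8
boundary = choose_bfgs_sampler(num_params=10)       # 2 * 5 >= 10
large_model = choose_bfgs_sampler(num_params=500)   # 2 * 5 < 500
```

With the default `maxcor=5`, any model with at most 10 parameters takes the dense route under this rule.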

Bigger changes:

- Made `pmx.fit` compatible with `method='pathfinder'`
- Removed the JAX dependency when `inference_backend='pymc'` to support Windows users
- Improved runtime performance by setting `trust_input=True` for compiled functions

Minor changes:

- Changed default `num_paths` from 1 to 4 for stable and reliable approximations
- Changed LBFGS code to use dataclasses
- Updated tests to handle both PyMC and BlackJAX backends

- Add `LBFGSInitFailed` exception for failed LBFGS initialisation
- Skip failed paths in `multipath_pathfinder` and track the number of failures
- Handle NaN values from Cholesky decomposition in `bfgs_sample`
- Add checks for numerical stability in matrix operations

Slight performance improvements:

- Set `allow_gc=False` in scan ops
- Use `FAST_RUN` mode consistently
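The skip-and-count behaviour described in this list could look roughly like the following. This is a minimal sketch under stated assumptions: `run_paths` and `toy_path` are hypothetical, only the exception name `LBFGSInitFailed` comes from the commit message, and the real `multipath_pathfinder` may handle failures differently (for example when every path fails).

```python
class LBFGSInitFailed(Exception):
    """Raised when L-BFGS cannot start from the given initial point."""


def run_paths(single_path, seeds):
    """Run every path, skipping those whose initialisation fails."""
    results = []
    num_failed = 0
    for seed in seeds:
        try:
            results.append(single_path(seed))
        except LBFGSInitFailed:
            # A failed path is dropped and counted, not allowed to abort the run.
            num_failed += 1
    return results, num_failed


def toy_path(seed):
    # Toy stand-in for a single Pathfinder run: odd seeds fail to initialise.
    if seed % 2 == 1:
        raise LBFGSInitFailed(f"bad start for seed {seed}")
    return seed * 10


results, num_failed = run_paths(toy_path, seeds=[0, 1, 2, 3])
```

Catching only the dedicated exception keeps unrelated errors visible while letting the remaining paths contribute to the final approximation.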

aphc14 added 7 commits October 19, 2024 23:48

renamed samples argument name and pathfinder variables to avoid confusion

4540b84

extract additional pathfinder objects from high level API for debugging

8835cd5

changed pathfinder samples argument to num_draws

663a60a

Merge branch 'replicate_pathfinder_w_pytensor' into scipy_lbfgs

05aeeaf

feat(pathfinder): add PyMC-based Pathfinder VI implementation

0db91fe


aphc14 added 4 commits November 7, 2024 20:40

Added type hints and epsilon parameter to fit_pathfinder

2efb511

Removed initial point values (l=0) to reduce iterations. Simplified … and …

fdc3f38

Added placeholder/reminder to remove jax dependency when converting trace data to InferenceData

1fd7a11

Sync updates with draft PR pymc-devs#386. \n- Added pytensor.function for bfgs_sample

ef2956f

aphc14 force-pushed the pathfinder_w_pytensor_symbolic branch from 9bfc48c to ef2956f November 7, 2024 18:04

aphc14 changed the title ~~Pathfinder w pytensor symbolic~~ PyMC/PyTensor Implementation of Pathfinder VI Nov 7, 2024

fonnesbeck reviewed Nov 8, 2024

tests/test_pathfinder.py (resolved)

pymc_experimental/inference/pathfinder/pathfinder.py (outdated, resolved)

pymc_experimental/inference/pathfinder/importance_sampling.py (outdated, resolved)

aphc14 mentioned this pull request Nov 11, 2024

PyMC Implementation of Pathfinder VI #386

Closed

aphc14 marked this pull request as ready for review November 11, 2024 17:52

aphc14 marked this pull request as draft November 11, 2024 17:53

- Added TODO comments for implementing Taylor approximation methods: … and …
- Corrected the dimensions in comments for matrices Q and R in the … function.
- Numerical stability in the … calculation by changing from … to …

6484b3d

fonnesbeck reviewed Nov 17, 2024


aphc14 added 4 commits November 21, 2024 18:37
