Currently, FMMAX performs a lot of broadcasting under the hood, making it easy to simulate a device over multiple wavelengths and/or multiple k-points. These are all completely independent simulations and can be executed in an embarrassingly parallel fashion.
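As a rough illustration (this is not FMMAX's actual API, just the general pattern), the batching amounts to ordinary NumPy-style broadcasting over leading batch axes:

```python
import jax.numpy as jnp

# Two independent batch axes: wavelengths along axis 0, k-points along axis 1.
wavelengths = jnp.linspace(0.4, 0.7, 4)[:, jnp.newaxis]  # shape (4, 1)
in_plane_k = jnp.linspace(0.0, 0.5, 5)[jnp.newaxis, :]   # shape (1, 5)

# Any computation written with broadcasting in mind yields the full (4, 5)
# Cartesian product of independent results, one per (wavelength, k) pair.
phase = 2 * jnp.pi * in_plane_k / wavelengths
print(phase.shape)  # (4, 5)
```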
In some cases, the full Cartesian product of simulations cannot fit on a single accelerator, and it would be nice to distribute the work across multiple accelerators.
JAX has some functionality for this via `pmap`. It might be nice to set up an example that takes an arbitrary combination of wavelengths and k-points and distributes the computation across all available accelerators (see the sketch below). There are several things to consider here, of course. For one, the eigendecomposition actually dispatches back to the host, so if all devices perform this same dispatch, it could quickly become a bottleneck. Also, `pmap` semantics have some limitations when the number of parallel jobs is not an integer multiple of the number of local accelerators, or when the accelerators live on different nodes.
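Here is a minimal sketch of what such a distribution could look like. The `simulate` function is a hypothetical stand-in for a single-point FMMAX solve (the real entry points differ), and the padding scheme is one possible way around the "jobs must be a multiple of the device count" constraint:

```python
import jax
import jax.numpy as jnp


def simulate(wavelength):
    # Placeholder for a single-wavelength (or single (wavelength, k)) solve.
    return jnp.sin(2 * jnp.pi / wavelength)


wavelengths = jnp.linspace(0.4, 0.7, 8)

n_devices = jax.local_device_count()
# `pmap` requires the leading axis to equal the local device count, so pad
# the batch up to a multiple of `n_devices` and reshape to
# (n_devices, jobs_per_device).
pad = (-wavelengths.size) % n_devices
padded = jnp.pad(wavelengths, (0, pad), constant_values=wavelengths[-1])
sharded = padded.reshape(n_devices, -1)

# vmap inside pmap: each device vectorizes over its own shard of jobs.
results = jax.pmap(jax.vmap(simulate))(sharded)
results = results.reshape(-1)[: wavelengths.size]  # drop the padded entries
```

Note that this only covers the single-node case; spanning accelerators on multiple nodes would need something like `jax.distributed` initialization on top of this, which comes with its own caveats.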