Replies: 1 comment
-
It seems that Fordead already uses the |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
@kenoz and @floriandeboissieu,
We've been discussing the integration of Dask with nrt, here's some simple experiments to illustrate the "problem" and look for alternatives
Currently, Dask does not fully integrate with nrt. When we pass a Dask-based DataArray to the
fit()
method, it immediately triggers the computation of the array, negating Dask's lazy evaluation. While the computation can be parallelized at a lower level using numba, the necessity to fully evaluate the array before starting the fit() computation create a bottleneck. The example below illustrates this behavior:The following snippet provides a simplified illustration of what happens internally in nrt. The main issue is with the input array flattening via fancy indexing (
y_flat = y[:, mask]
), which Dask does not support:One potential solution is to wrap the computation in a function and apply it using
map_blocks
. This approach works well for this simplified example but not sure it will be as easy to port to nrt:Any thoughts?
Beta Was this translation helpful? Give feedback.
All reactions