You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Thanks a lot for incorporating furrr! It's really great to see it get some love and use in other packages. I had a suggestion on "best practices" of using future based packages, hopefully you find it useful.
I suggest you remove the future::plan() call from featureSelection(). It is best practice to let the user supply the plan, and the developer only worries about what code is parallelized, not how it is parallelized.
The reason for this is that you are inherently limiting the user by setting plan(multiprocess) to only be able to use their local computer for parallel feature selection. future can do much more than this, like run on EC2 or a remote cluster. Ideally, this is what you'd have:
# by default, future_map() runs sequentially if you don't specify any plan
featureSelection(...)
# runs in parallel on your local computer
plan(multiprocess)
featureSelection(...)
# runs in parallel sharded over a cluster somewhere
plan(cluster)
featureSelection(...)
# runs in parallel on multiple ec2 instances
plan(cluster, workers=ec2_ip_addresses)
featureSelection(...)
# sends x, y, and z each to a node of the cluster AND runs in parallel on those cluster nodes
plan(list(cluster, multiprocess))
map(list(x,y,z), featureSelection(.x))
See how many fun things you can do if you let the user specify the plan?
The text was updated successfully, but these errors were encountered:
Thanks a lot for incorporating
furrr
! It's really great to see it get some love and use in other packages. I had a suggestion on "best practices" of usingfuture
based packages, hopefully you find it useful.I suggest you remove the
future::plan()
call fromfeatureSelection()
. It is best practice to let the user supply the plan, and the developer only worries about what code is parallelized, not how it is parallelized.The reason for this is that you are inherently limiting the user by setting
plan(multiprocess)
to only be able to use their local computer for parallel feature selection.future
can do much more than this, like run on EC2 or a remote cluster. Ideally, this is what you'd have:See how many fun things you can do if you let the user specify the plan?
The text was updated successfully, but these errors were encountered: