[Q&A] Retrieve files from server workspace while job is running #2576
Unanswered
ianh-apheris
asked this question in
Q&A
Replies: 2 comments 2 replies
-
W&B is not designed to be used in-house. As far as I know, it is designed as SaaS.
-
@ianh-apheris thanks for this discussion. @chesterxgchen @yhwen @yanchengnv aside from the W&B design, I think there are two general enhancement directions we could consider:
-
Python version (`python3 -V`): 3.10
NVFlare version (`python3 -m pip list | grep "nvflare"`): 2.4.0
NVFlare branch (if running examples, please use the branch that corresponds to the NVFlare version, `git branch`): 2.4
Operating system: Ubuntu 20.04
Have you successfully run any of the following examples?
Please describe your question
We are using NVFlare in a closed environment, where general access to the outside internet is blocked from both the server and the client containers.
I'd like to use some experiment tracking tools to retrieve live metrics (e.g. wandb), but the default setup for wandb expects to have access to the wandb server from the flare server. In our case, this is blocked by a network policy in our environment.
We can set wandb to offline mode and write the tracking logs into the job directory (by setting the `dir` parameter in the `init` config to include `{JOB_ID}`), but the logs are only available at the end of the run, which makes tracking the progress of a multi-day training run quite challenging.

We've thought about polling and parsing the server logs, using `session.download_job_result`, but it's not ideal as we'd have to write our own wandb wrapper, and we would ideally like to use flare built-ins.

I wondered if there's a better way to periodically download the wandb directory from the flare server while the job is still running?
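
For reference, the polling idea mentioned above could look roughly like the sketch below. It is only an illustration, not a flare built-in: `poll_job_workspace`, `download_fn`, and `is_finished_fn` are hypothetical names, and the two callables are injected so the loop does not depend on any particular NVFlare call. Whether `session.download_job_result` returns partial results for a job that is still running is exactly the open question here, so the sketch assumes nothing about that.

```python
import time
from typing import Callable, List, Optional


def poll_job_workspace(
    download_fn: Callable[[], str],
    is_finished_fn: Callable[[], bool],
    interval_s: float = 300.0,
    max_polls: Optional[int] = None,
    sleep_fn: Callable[[float], None] = time.sleep,
) -> List[str]:
    """Repeatedly call download_fn until the job finishes.

    download_fn and is_finished_fn are hypothetical hooks supplied by the
    caller; download_fn should return the local path it downloaded to.
    Returns the list of paths from every successful download.
    """
    paths: List[str] = []
    polls = 0
    while True:
        try:
            paths.append(download_fn())
        except Exception as exc:
            # A failed download is not fatal; try again on the next poll.
            print(f"download failed, will retry: {exc}")
        polls += 1
        # Stop once the job is done or the optional poll budget is used up.
        if is_finished_fn() or (max_polls is not None and polls >= max_polls):
            return paths
        sleep_fn(interval_s)
```

In our setup, `download_fn` would wrap something like `session.download_job_result` for the job ID, and `is_finished_fn` would check the job status. After each poll, the downloaded wandb directory could be synced from a machine that does have access to a wandb server.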