example for train.sh #24

maxin-cn · 2022-09-07T06:57:30Z

torchpack dist-run -np 32 -H $server1:8, $server2:8, $server3:8, $server4:8 \
python train.py configs/netaug.yaml \
--data_provider "{/mnt/dolphinfs/hdd_pool/docker/user/hadoop-basecv/maxin/data/ILSVRC2012, image_size:176, base_batch_size:64}" \
--run_config "{base_lr:0.0125}" \
--model "{name:mcunet}" \
--netaug "{aug_expand_list:[1.0,1.6,2.2,2.8], aug_width_mult_list:[1.0,1.6,2.2]}" \
--path runs/mcunet_imagenet

Is torchpack used for multiple machine training? And how could I use the bash above for one machine multiple GPUs training? Could you please give me one example? Thanks~

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

example for train.sh #24

example for train.sh #24

maxin-cn commented Sep 7, 2022

example for train.sh #24

example for train.sh #24

Comments

maxin-cn commented Sep 7, 2022