[Tutorial][PTD] Deprecate Training Transformer models using Distributed Data Parallel and Pipeline Parallelism
and redirect the page to parallelism APIs
#3772
Job | Run time |
---|---|
2m 42s | |
2m 42s |