Name		Name	Last commit message	Last commit date
parent directory ..
computationgraph		computationgraph
trainmodel		trainmodel
yaml		yaml
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

README.md

pytorch-ms-df-analyzer

This a project that can anany computing graphs in pytorch from many aspects.

How to install docker evnvironment

install cuda-toolkit from official documents:

from https://developer.nvidia.com/zh-cn/cuda-downloads

proxy setting:

need to set proxy of docker pull and apt-get

install docker and kubernetes from offical tutorials:

from https://kubernetes.io/docs/setup/production-environment/tools/kubeadm/install-kubeadm/
from https://docs.docker.com/engine/install/ubuntu/

with single master, you need to taint master first:

kubectl taint node xyj-precision-tower-3620 node-role.kubernetes.io/master-

install nvidia-docker:

installation guide from https://github.com/NVIDIA/nvidia-docker

install kubeshare

https://github.com/NTHU-LSALAB/KubeShare

do necessary check

apiVersion: kubeshare.nthu/v1
kind: SharePod
metadata:
  name: sharepod1
  annotations:
    "kubeshare/gpu_request": "0.5" # required if allocating GPU
    "kubeshare/gpu_limit": "1.0" # required if allocating GPU
    "kubeshare/gpu_mem": "1073741824" # required if allocating GPU # 1Gi, in bytes
    #"kubeshare/sched_affinity": "red" # optional
    #"kubeshare/sched_anti-affinity": "green" # optional
    #"kubeshare/sched_exclusion": "blue" # optional
spec: # PodSpec
  containers:
  - name: cuda
    image: nvidia/cuda:11.0-base
    #command: ["nvidia-smi", "-L"]
    command: [ "/bin/bash", "-ce", "tail -f /dev/null" ]
    resources:
      limits:
        cpu: "1"
        memory: "500Mi"

Running in Docker

sets proper environment variables:
prepare train, test and evaluation data through volume mount.
collect json data with different yaml settings.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

pytorch-analyzer

pytorch-analyzer

README.md

pytorch-ms-df-analyzer

How to install docker evnvironment

Running in Docker

Files

pytorch-analyzer

Directory actions

More options

Directory actions

More options

Latest commit

History

pytorch-analyzer

Folders and files

parent directory

README.md

pytorch-ms-df-analyzer

How to install docker evnvironment

Running in Docker