From 98e46e60a1d03799d43f9f96c4aeecb2cc35d69f Mon Sep 17 00:00:00 2001
From: Wafaa Taie
Date: Mon, 27 Mar 2023 10:36:51 -0700
Subject: [PATCH] Import GPU Max and Flex Series workloads from develop-gpu (#1080)
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

* Add GPU DLRM FP16 inference
* Change to install ATS drivers from local repo
* Add GPU PYT BERT large FP16 inference
* Fix _FusedMatMul issue in GPU
* Update PyTorch to use the common compiler partial and add an ARG for the env var file, since that file changes per compiler
* Add package for ResNet50 v1.5 INT8 inference (PyTorch GPU)
* Update specs and build files for alpha2 rc1 wheels
* Add ResNet50 v1.5 BF16 training (PyTorch GPU)
* Add wrapper package for the TF GPU tool container
* Update TF GPU training packages to use alpha2-rc1
* Update IPEX tools container and resnet50v1.5 models for alpha2 rc1
* Update PYT BERT large and DLRM FP16 inference for alpha2-rc1
* Update tf-gpu branch for ww15 dpcpp compiler
* Set ITEX_ENABLE_ONEDNN_LAYOUT_OPT=0 for BERT training (see the env var sketch after this list)
* Add a section to validate the base container; fix DLRM printed statement
* Update the docs for alpha2-rc2 models
* Fix IPEX tool container README
* Fix DLRM print statement that said CPU to say XPU
* Add 1t env vars
* Use add instead of addn
* Update BERT large docs to be specific about which pretrained model to use
* Sync with develop
  Signed-off-by: Abolfazl Shahbazi
* Update the main benchmarks README for GPU models
* Set ITEX_ENABLE_ONEDNN_LAYOUT_OPT=0 in ResNet50 v1.5 BF16 training quickstart scripts
* Revert "tmp fix res50v1_5 int8"
  This reverts commit 3c120e0bee3a576ee1548d9258b611a889897ee6
* Update to match batch sizes in docs and update pb links
* Update compiler binary
* Update PYT GPU packages for IPEX alpha2 rc6
* rfcn-fp32-inference-k8s package
  Signed-off-by: Kam D Kasravi
* Update GPU specs to make the docs section a list and update TF training docs for DevCloud
* Doc updates for ResNet50 v1.5 and BERT large training for GPU
* tf-gpu doc updates
* Fix the BKC and environment for resnet50v1.5 INT8, bert-large, and resnet50v1.5 BF16 training
* Update GPU PYT packages to have two READMEs
* Remove duplicate license from package
* AI Kit Model Package README
* Clean up PYT model packages and update bare metal docs
* Fix GPU tests (#5)
  Signed-off-by: Abolfazl Shahbazi
* Sync with 'develop' and resolve conflicts (#3)
* Update README.md for IPS 00513014 and 00514541
* Enable remapper pass in densenet169 execution
* Add protoc and pycocotools dependencies
* K8s package tests: check whether the username has an underscore before creating a namespace (see the namespace sketch after this list)
* Fix and simplify serving k8s package path variables
* Upgrade to 'TensorFlow Serving 2.4.0'
  Signed-off-by: Abolfazl Shahbazi
* rfcn-fp32-inference-k8s package
  Signed-off-by: Kam D Kasravi
* Quickstart updates for using synthetic data or real data, except that SSD-ResNet batch will always use synthetic data
* Add CentOS 8 partials for SPR TF models
* Fix the URL for the 'oneAPI-samples' repo
* snapshot
  Signed-off-by: Kam D Kasravi
* Add a copy of the existing PyTorch IPEX ICX CentOS specs to specs/centos
* Fix high-severity vulnerability issues reported by Snyk
  Signed-off-by: Abolfazl Shahbazi
* Set OMP_NUM_THREADS based on num_intra_threads (see the env var sketch after this list)
* Weekly Snyk fixes
  Signed-off-by: Abolfazl Shahbazi
* Fix broken links in the Launch Benchmarks documentation
* Fix '3d-unet' docker image links
  Signed-off-by: Abolfazl Shahbazi
* Fix Python and TensorFlow pip package versions for TF v1.15.2
  Signed-off-by: Abolfazl Shahbazi
* Add a minor fix to dynamically calculate the number of remaining images as the provided steps x batch size. Currently the maximum number of steps that ResNet50 inference supports is 50000 / batch size; that 50k hard limit prevents long inference runs for platform analysis. This fix lets telemetry data (such as emon) be collected for a longer duration (such as 5 minutes). (See the step-count sketch after this list.)
  Signed-off-by: Rajendrakumar Chinnaiyan
* Remove unused 'num_cores' from 'rfcn'
  Signed-off-by: Abolfazl Shahbazi
* Upgrade to 'Pillow>=8.1.2'
  Signed-off-by: Abolfazl Shahbazi
* Compatibility fixes for automation
* Parameterize the model name in the resnet50v1.5 serving script
* Increase timeout and modify output
* Adjust inceptionv3 client input and output
* Fix MPI operator cluster scope issue
* Fix SSD-MobileNet perf comparison by pre-installing numpy with --no-binary
* Enable more models for the Perf Analysis notebooks and add automated testing for notebooks
* Update quickstart bare metal documentation to use ./quickstart/
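Step-count sketch: the ResNet50 inference item above changes the run length from being capped by the dataset size to following the requested steps. This is a minimal sketch of that arithmetic, assuming hypothetical names (requested_steps, batch_size, TOTAL_IMAGES); it is not the actual benchmark code.

```python
# Sketch only: illustrates the step-count change described in the commit
# message above; the names here are hypothetical, not the repo's code.

TOTAL_IMAGES = 50000  # the "50k hard limit" (ImageNet validation set size)

def images_to_process(requested_steps: int, batch_size: int):
    # Old behavior: steps were capped at TOTAL_IMAGES / batch size,
    # so a run could never exceed one pass over the dataset.
    capped_steps = min(requested_steps, TOTAL_IMAGES // batch_size)
    old_images = capped_steps * batch_size
    # New behavior: the number of images follows steps x batch size,
    # which allows long runs for telemetry collection (e.g. emon).
    new_images = requested_steps * batch_size
    return old_images, new_images

old, new = images_to_process(requested_steps=5000, batch_size=256)
print(old)  # 49920   (run ends once the 50k images are exhausted)
print(new)  # 1280000 (run length now scales with the requested steps)
```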
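Env var sketch: two items above set environment variables before launching the workload, ITEX_ENABLE_ONEDNN_LAYOUT_OPT=0 for the BERT large and ResNet50 v1.5 training quickstarts and OMP_NUM_THREADS derived from num_intra_threads. This is a minimal sketch of how a launcher could export them, using a hypothetical configure_env helper rather than the repo's actual quickstart scripts.

```python
import os

def configure_env(num_intra_threads: int) -> None:
    # Hypothetical helper, not the repo's actual launcher code.
    # Disable the oneDNN layout optimization in Intel Extension for
    # TensorFlow, as the BERT / ResNet50 v1.5 training runs do.
    os.environ["ITEX_ENABLE_ONEDNN_LAYOUT_OPT"] = "0"
    # Match the OpenMP thread pool to the benchmark's intra-op thread count.
    os.environ["OMP_NUM_THREADS"] = str(num_intra_threads)

configure_env(num_intra_threads=os.cpu_count() or 1)
print(os.environ["ITEX_ENABLE_ONEDNN_LAYOUT_OPT"], os.environ["OMP_NUM_THREADS"])
```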
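Namespace sketch: the K8s package test item above guards namespace creation because Kubernetes namespace names must be DNS-1123 labels (lowercase alphanumerics and hyphens), so a username containing an underscore cannot be used directly. This is an illustrative check with a hypothetical namespace_from_username helper; the repo's actual test may simply skip such users.

```python
import re

# Illustrative only: the repo's actual K8s test logic is not reproduced here.
DNS1123_LABEL = re.compile(r"^[a-z0-9]([-a-z0-9]*[a-z0-9])?$")

def namespace_from_username(username: str) -> str:
    # Lowercase the name and replace underscores so it forms a valid
    # Kubernetes namespace; reject anything that still does not conform.
    candidate = username.lower().replace("_", "-")
    if len(candidate) > 63 or not DNS1123_LABEL.match(candidate):
        raise ValueError(f"cannot derive a valid namespace from {username!r}")
    return candidate

print(namespace_from_username("intel_user"))  # intel-user
```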