Support torch.compile in enet, fbnet and 3d-unet pytorch samples #189

dvrogozh · 2024-08-23T19:17:01Z

No description provided.

* support LCM int8 * fix LCM int8-bf16 * fix LCM int8 * update README

… files (#2022) * ipex/efficientnet: extract get_system_config to separate module * ipex/efficientnet: move system_config.py to common folder * common: rename system_config.py to js_sysinfo.py * common: add js_merge.py js_merge is a tool to merge few .json output files together preserving all unique values. Values type mismatch is considered a fatal error. With few values of the same type for the same key only one of the first input is kept with the warning printed out. * common/sysinfo: collect docker info * common/sysinfo: collect dkms info * common/sysinfo: collect dpkg info for key packages * common: add readme * common/sysinfo: add svr-info support * common/sysinfo: differentiate more between docker and baremetal * common/sysinfo: break get_docker_info into 2 functions * common/sysinfo: report key hardware information * common/sysinfo: add lshw support and fetch memory info * common/sysinfo: parse lshw to get cpu info * core/sysinfo: expand cpu info configuration * common/sysinfo: expand configuration for memory info * common/sysinfo: add -o option and improve messaging * common: update readme for sysinfo --------- Signed-off-by: Dmitry Rogozhkin <[email protected]>

* make num_iter flexbile * bugfix for bert-large ddp * bkc for rn50 ddp training update * bkc for rn50 ddp training update * bkc for dlrm_v1 ddp training update * bugfix for llms output --------- Co-authored-by: mahathis <[email protected]>

Signed-off-by: Dmitry Rogozhkin <[email protected]>

* Imported and ran through image recognition and language modelling for Tensorflow CPU workloads Co-authored-by: Mahathi Vatsal <[email protected]>

Signed-off-by: Dmitry Rogozhkin <[email protected]>

* enable yolov5 on CPU Co-authored-by: nick.camarena <[email protected]> Co-authored-by: Clayne Robison <[email protected]> Co-authored-by: Jitendra Patil <[email protected]>

model script runer and setup shell script readme helper to get dataset test for container

Run as: sudo \ IMAGE=enet \ OUTPUT_DIR=/tmp/output \ PROFILE=$(pwd)/models_v2/pytorch/efficientnet/inference/gpu/profiles/b0.bf16.csv \ PYTHONPATH=$(pwd)/models_v2/common $(pwd)/models_v2/pytorch/efficientnet/inference/gpu/benchmark.sh This commit also adds dummy and framework fields to efficientnet results output and fixes stdev naming in couple places. Co-authored-by: Voas, Tanner <[email protected]> Signed-off-by: Dmitry Rogozhkin <[email protected]> Signed-off-by: Voas, Tanner <[email protected]>

Added new tool json_to_csv to dump multiple json objects to a single CSV (will serialize the json objects) Signed-off-by: Voas, Tanner <[email protected]> Signed-off-by: Dmitry Rogozhkin <[email protected]> Co-authored-by: Voas, Tanner <[email protected]>

Signed-off-by: Dmitry Rogozhkin <[email protected]>

Align summary_utils to efficientnet

Combine functions for dummy / non-dummy inputs

* remove ipex for inductor * fix calibration no prompt for inductor * add fp16 for llm * llm torch.compile forward only

- use descriptive variable names iteration and test rather than i and t in the loop - remove unwanted try/catch statements in common code Signed-off-by: Voas, Tanner <[email protected]> Co-authored-by: mahathis <[email protected]>

* Add dummy mode to swin transformer * Random data, no dataset needed in dummy mode

This patch adds some external metadata to benchmark results. Signed-off-by: Dmitry Rogozhkin <[email protected]>

* update bkc for pvc itex 2.15.0.0 * update bkc for atsm itex 2.15.0.0 * TF 2.15.0 Flex containers (#2087) * validate flex 170 and 140 * Updated baremetal for itex 2.15 (#2098) --------- Co-authored-by: XumingGai <[email protected]> Co-authored-by: Srikanth Ramakrishna <[email protected]> Co-authored-by: Mahathi Vatsal <[email protected]>

- add new telemetry.py tool for capturing telemetry - Start SMI telemetry capture as its own process inside benchmark.py - Support UNIX socket communication and python multiprocessing PIPEs for external control of telemetry start, stop, and termination - Add requirements to efficientnet sample to work with this - Add processing code to convert the output CSV into a JSON file - mentioned metadata in benchmark.py readme UNIX socket API implemented by Dmitry Rogozhkin <[email protected]> UNIX socket API adapted into commit by Voas, Tanner <[email protected]> Co-authored-by: Dmitry Rogozhkin <[email protected]> Signed-off-by: Voas, Tanner <[email protected]>

Signed-off-by: Voas, Tanner <[email protected]>

- Core inference has been moved to its own class Inference. - Replaced "NUM_IMAGES" and "NUM_ITERATIONS" with single param "NUM_INPUTS" - Aligns with other samples usage model (IFRNet, RIFE) - "NUM_INPUTS" is functionally same as old "NUM_IMAGES" was - "NUM_ITERATIONS" is 1 in accuracy mode and is dynamic in benchmark mode based on specified min/max test durations. - Added support to PyTorch EfficientNet samples to specify min and max test duration - Logs raw perf with finer granularity now since we have it available - Use test duration in benchmark for enet - Remove unused quantization code paths from code Signed-off-by: Voas, Tanner <[email protected]>

* gpt-j mlperf model

…(#2370) Bumps [torch](https://github.com/pytorch/pytorch) from 1.13.1 to 2.2.0. - [Release notes](https://github.com/pytorch/pytorch/releases) - [Changelog](https://github.com/pytorch/pytorch/blob/main/RELEASE.md) - [Commits](pytorch/pytorch@v1.13.1...v2.2.0) --- updated-dependencies: - dependency-name: torch dependency-type: direct:production ... Signed-off-by: dependabot[bot] <[email protected]>

…ws (#2374) Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.25.13 to 3.25.15. - [Release notes](https://github.com/github/codeql-action/releases) - [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md) - [Commits](github/codeql-action@v3.25.13...v3.25.15) --- updated-dependencies: - dependency-name: github/codeql-action dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

…(#2375) Bumps [ossf/scorecard-action](https://github.com/ossf/scorecard-action) from 2.3.3 to 2.4.0. - [Release notes](https://github.com/ossf/scorecard-action/releases) - [Changelog](https://github.com/ossf/scorecard-action/blob/main/RELEASE.md) - [Commits](ossf/scorecard-action@v2.3.3...v2.4.0) --- updated-dependencies: - dependency-name: ossf/scorecard-action dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <[email protected]>

* Update main README * Update main README * Updated README * Merge pull request intel#184 from intel/mahathi/update_readme Update README * Fix torch version to fix dependabot issue * Merge pull request intel#185 from intel/mahathi/fix_dependabot_issues Fix torch version to fix dependabot issue

* refactor gpu * refactor tf * add tf max-gpu * respond to lint errors * remove max-gpu folder * add cuda models to pytorch

Bumps [keras](https://github.com/keras-team/keras) from 2.6.0rc3 to 2.13.1rc0. - [Release notes](https://github.com/keras-team/keras/releases) - [Commits](keras-team/keras@v2.6.0-rc3...v2.13.1-rc0) --- updated-dependencies: - dependency-name: keras dependency-type: direct:production ... Signed-off-by: dependabot[bot] <[email protected]>

…ows (#2382) * Bump super-linter/super-linter from 6.7.0 to 6.8.0 in /.github/workflows Bumps [super-linter/super-linter](https://github.com/super-linter/super-linter) from 6.7.0 to 6.8.0. - [Release notes](https://github.com/super-linter/super-linter/releases) - [Changelog](https://github.com/super-linter/super-linter/blob/main/CHANGELOG.md) - [Commits](super-linter/super-linter@v6.7.0...v6.8.0) --- updated-dependencies: - dependency-name: super-linter/super-linter dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <[email protected]> * Resolved super linter issues --------- Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: Mahathi Vatsal <[email protected]>

* Added new bkm=c 2.4 changes

* specify the version of sympy (#2373)

…dated (#2389) * Removed all instances of miniconda, unvalidated * fixed miniforge base image path

* Removed all instances of miniconda, unvalidated *Also replaced all intel channel to the specific software repo link

Getting 217 fps on fp32 with torch.compile vs. 187 on eager mode running on PVC. Signed-off-by: Dmitry Rogozhkin <[email protected]>

Getting 300 fps on fp32 with torch.compile vs. 187 on eager mode running on PVC. Signed-off-by: Dmitry Rogozhkin <[email protected]>

Signed-off-by: Dmitry Rogozhkin <[email protected]>

jiayisunx and others added 30 commits May 5, 2024 15:04

support LCM int8 (#2038)

7a94c99

* support LCM int8 * fix LCM int8-bf16 * fix LCM int8 * update README

Llms output bugfix (#2037)

f46c65f

* make num_iter flexbile * bugfix for bert-large ddp * bkc for rn50 ddp training update * bkc for rn50 ddp training update * bkc for dlrm_v1 ddp training update * bugfix for llms output --------- Co-authored-by: mahathis <[email protected]>

models_v2/common: add back a table with links to scripts in readme

36b64bf

Signed-off-by: Dmitry Rogozhkin <[email protected]>

Update cpu workloads (#2044)

af7afb7

* Imported and ran through image recognition and language modelling for Tensorflow CPU workloads Co-authored-by: Mahathi Vatsal <[email protected]>

Add numa policy for ResNet50 and ViT accuracy runs (#2056)

bb34d24

models_v2/enet: align with schema definition for the output data

28ea384

Signed-off-by: Dmitry Rogozhkin <[email protected]>

Chatglm: Fix bf16 accuracy issue (#2060)

f3d1242

[Tensorflow] Enable bfloat16, fp16, and int8 for Yolo V5 (#1790)

c4e7498

* enable yolov5 on CPU Co-authored-by: nick.camarena <[email protected]> Co-authored-by: Clayne Robison <[email protected]> Co-authored-by: Jitendra Patil <[email protected]>

Yolov5 Inference (#2000)

f66b5de

model script runer and setup shell script readme helper to get dataset test for container

removing unused scripts for SD (#2069)

2da91f4

update for cluster (#2071)

7acf90f

doc/common: add example for benchmark.py script in readme

f254b1c

Signed-off-by: Dmitry Rogozhkin <[email protected]>

chatglm: fix int8 accuracy issue (#2075)

75a0c77

refine gpt-j latency output (#2076)

1cde401

add inductor fp16 path for bert (#2082)

dc0c2ff

yolov5: update summary_utils

dd68af6

Align summary_utils to efficientnet

Yolov5: combine dummy and non-dummy paths

6f9a4e2

Combine functions for dummy / non-dummy inputs

modify llm models for inductor test (#2072)

58a6d78

* remove ipex for inductor * fix calibration no prompt for inductor * add fp16 for llm * llm torch.compile forward only

common: use descriptive variable names

29fde3b

- use descriptive variable names iteration and test rather than i and t in the loop - remove unwanted try/catch statements in common code Signed-off-by: Voas, Tanner <[email protected]> Co-authored-by: mahathis <[email protected]>

Add dummy to swin (#2055)

7d6c5f0

* Add dummy mode to swin transformer * Random data, no dataset needed in dummy mode

Add script for downloading dataset for LCM and SD (#2094)

b5bd9e1

models_v2/common: add save_to_json.py tool and benchmark metadata

bebd034

This patch adds some external metadata to benchmark results. Signed-off-by: Dmitry Rogozhkin <[email protected]>

BKC update for dlrmv1/v2 (#2110)

58f5b0b

common: update sysinfo to handle multiple matches from dpkg

0e340b5

Signed-off-by: Voas, Tanner <[email protected]>

Mahathi-Vatsal and others added 23 commits July 23, 2024 10:55

Added openssf badge (#2366)

c0104c9

update docs (#2367)

36eea1a

correct url (#2368)

2cc80b0

Mlperf 4.0 GPT-J inference model (#2011)

53ad636

* gpt-j mlperf model

Fixed linter issues

6a7b02f

Fixed linter issues

c74d1f1

r3.2: pin imagenet revision (#2371)

80b42b9

Refactor docker to models_v2 format (#2301)

8e44068

* refactor gpu * refactor tf * add tf max-gpu * respond to lint errors * remove max-gpu folder * add cuda models to pytorch

Update pytorch CPU workloads with 2.4 bkcs (#2383)

07f403e

* Added new bkm=c 2.4 changes

update build for gpu (#2384)

86b0bce

refactor pyt-cpu docker folder and remove dataset (#2377)

cbe98be

revert doc instructions (#2376)

57fa514

* specify the version of sympy (#2373)

Removed all instances of miniconda and replace with miniforge, unvali…

94f10e6

…dated (#2389) * Removed all instances of miniconda, unvalidated * fixed miniforge base image path

Leminh/miniconda removal fix (#2392)

2d2a06a

* Removed all instances of miniconda, unvalidated *Also replaced all intel channel to the specific software repo link

enet: support torch.compile

39c1f36

Getting 217 fps on fp32 with torch.compile vs. 187 on eager mode running on PVC. Signed-off-by: Dmitry Rogozhkin <[email protected]>

fbnet: support torch.compile

73d90e5

Getting 300 fps on fp32 with torch.compile vs. 187 on eager mode running on PVC. Signed-off-by: Dmitry Rogozhkin <[email protected]>

3d-unet: support torch.compile

dcaceaf

Signed-off-by: Dmitry Rogozhkin <[email protected]>

dvrogozh requested review from ashahba, claynerobison, jitendra42, lerealno and Mahathi-Vatsal as code owners August 23, 2024 19:17

dvrogozh mentioned this pull request Aug 23, 2024

xpu: efficientnet inference underperforms ipex pytorch/pytorch#132176

Open

tfqaprod force-pushed the main branch from c52b6e6 to 499e601 Compare September 17, 2024 15:36

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support torch.compile in enet, fbnet and 3d-unet pytorch samples #189

Support torch.compile in enet, fbnet and 3d-unet pytorch samples #189

dvrogozh commented Aug 23, 2024

Support torch.compile in enet, fbnet and 3d-unet pytorch samples #189

Are you sure you want to change the base?

Support torch.compile in enet, fbnet and 3d-unet pytorch samples #189

Conversation

dvrogozh commented Aug 23, 2024