Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add extra warmup for chatglm3-6b in igpu-performance test #11197

Conversation

Oscilloscope98
Copy link
Contributor

Description

The performance of ChatGLM3-6B in nightly performance test is very unstable, especially for 32-32 (int4+fp32). Thus, extra warmup (load + inference once) is added to ChatGLM3-6B to record more stable performance.

@Oscilloscope98 Oscilloscope98 force-pushed the igpu-chatglm3-perf-stabalize branch 2 times, most recently from 1208831 to fb0f662 Compare June 4, 2024 05:58
Copy link
Contributor

@liu-shaojun liu-shaojun left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@Oscilloscope98 Oscilloscope98 merged commit 9f8074c into intel-analytics:main Jun 4, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants