Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

OPEA docker images take a lot of disk space #921

Open
xiguiw opened this issue Nov 20, 2024 · 2 comments
Open

OPEA docker images take a lot of disk space #921

xiguiw opened this issue Nov 20, 2024 · 2 comments
Assignees
Labels
help wanted Extra attention is needed

Comments

@xiguiw
Copy link
Collaborator

xiguiw commented Nov 20, 2024

OPEA takes a lot of disk space.

"To deploy them on ITAC using medium or large Xeon VMs, 64GB isn’t sufficient and the next tier is 2TB which is much more expensive."

OPEA consumes lots of space: Both memory and disk space.

It is helpful to reduce the OPEA size.

@xiguiw xiguiw added the help wanted Extra attention is needed label Nov 20, 2024
@erojaso
Copy link

erojaso commented Nov 20, 2024

I've had similar problems
Creating an image for embedding-tei-server used to take 5 min now it takes 20 min and 4.6 gb

REPOSITORY TAG IMAGE ID CREATED SIZE
acraksexplore.azurecr.io/embedding-tei 0.0.3 881d61f97080 10 hours ago 4.6GB
opea/embedding-tei 0.0.3 881d61f97080 10 hours ago 4.6GB
ghcr.io/huggingface/text-embeddings-inference cpu-1.5 64f8afb02d72 4 months ago 138MB

@eero-t
Copy link
Contributor

eero-t commented Dec 2, 2024

OPEA consumes lots of space: Both memory and disk space.

RAM usage depends almost completely on the specified inferencing model, and data type. Suitable model depends on the needs of the user, but on newer Xeons, memory usage can be halved by using BF16 data type instead of FP32, see: opea-project/GenAIExamples#330

I've had similar problems Creating an image for embedding-tei-server used to take 5 min now it takes 20 min and 4.6 gb

For which OPEA version and application this is? E.g. v1.1 ChatQnA works fine without the TEI/TGI wrapper services.


#265 includes examples for significant reductions in OPEA Comps image sizes, and https://github.com/opea-project/GenAIExamples/pull/1031PR is step 1/2 of reducing image sizes for the apps using them.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
help wanted Extra attention is needed
Projects
None yet
Development

No branches or pull requests

4 participants