#

autoround

Here is 1 public repository matching this topic...

intel / intel-extension-for-transformers

⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡

retrieval chatbot rag habana large-language-model chatpdf llm-inference 4-bits speculative-decoding llm-cpu streamingllm intel-optimized-llamacpp neural-chat neural-chat-7b autoround gaudi3

Updated Oct 8, 2024
Python

Improve this page

Add a description, image, and links to the autoround topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the autoround topic, visit your repo's landing page and select "manage topics."