Qubitium

Follow

Qubitium-ModelCloud Qubitium

Follow

Golang, Python, Kotlin, Swift. I prefer strongly typed languages and I do not worship PEP. @ModelCloudAi

42 followers · 55 following

ModelCloud.ai
Earth/Epoch 2.0
https://modelcloud.ai
@qubitium

Achievements

Achievements

Pinned Loading

ModelCloud/GPTQModel ModelCloud/GPTQModel Public

Production ready LLM model compression/quantization toolkit with accelerated inference support for both cpu/gpu via HF, vLLM, and SGLang.

Python 128 29
sgl-project/sglang sgl-project/sglang Public

SGLang is a fast serving framework for large language models and vision language models.

Python 6.2k 528
vllm-project/vllm vllm-project/vllm Public

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 30.7k 4.7k
AutoGPTQ/AutoGPTQ AutoGPTQ/AutoGPTQ Public

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

Python 4.5k 487
flashinfer-ai/flashinfer flashinfer-ai/flashinfer Public

FlashInfer: Kernel Library for LLM Serving

Cuda 1.5k 143
Dao-AILab/flash-attention Dao-AILab/flash-attention Public

Fast and memory-efficient exact attention

Python 14.4k 1.3k