Skip to content
@basetenlabs

Baseten

Machine learning infrastructure for developers

Welcome to Baseten

Baseten is an AI infrastructure platform. We combine applied performance research, distributed multi-cloud infrastructure, and developer tooling to run models of all modalities in production.

Get started:

  • Deploy an open-source model in two clicks from the model library.
  • Read our docs to package and serve a fine-tuned or custom model.

Pinned Loading

  1. truss truss Public

    The simplest way to serve AI/ML models in production

    Python 946 78

  2. truss-examples truss-examples Public

    Examples of models deployable with Truss

    Python 157 40

Repositories

Showing 10 of 48 repositories
  • truss Public

    The simplest way to serve AI/ML models in production

    basetenlabs/truss’s past year of commit activity
    Python 946 MIT 78 63 (5 issues need help) 15 Updated Feb 8, 2025
  • stargz-snapshotter Public Forked from containerd/stargz-snapshotter

    Fast container image distribution plugin with lazy pulling

    basetenlabs/stargz-snapshotter’s past year of commit activity
    Go 0 Apache-2.0 125 0 2 Updated Feb 7, 2025
  • truss-examples Public

    Examples of models deployable with Truss

    basetenlabs/truss-examples’s past year of commit activity
    Python 157 MIT 40 11 51 Updated Feb 7, 2025
  • flashinfer Public Forked from flashinfer-ai/flashinfer

    FlashInfer: Kernel Library for LLM Serving

    basetenlabs/flashinfer’s past year of commit activity
    Cuda 0 Apache-2.0 200 0 0 Updated Feb 6, 2025
  • TensorRT-LLM Public Forked from NVIDIA/TensorRT-LLM

    TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

    basetenlabs/TensorRT-LLM’s past year of commit activity
    C++ 0 Apache-2.0 1,105 0 0 Updated Feb 5, 2025
  • lws Public Forked from kubernetes-sigs/lws

    LeaderWorkerSet: An API for deploying a group of pods as a unit of replication

    basetenlabs/lws’s past year of commit activity
    Go 0 Apache-2.0 40 0 0 Updated Jan 29, 2025
  • .github Public
    basetenlabs/.github’s past year of commit activity
    1 0 0 0 Updated Jan 13, 2025
  • autoscaler Public Forked from kubernetes/autoscaler

    Autoscaling components for Kubernetes

    basetenlabs/autoscaler’s past year of commit activity
    Go 0 Apache-2.0 4,099 0 3 Updated Dec 11, 2024
  • axolotl Public Forked from axolotl-ai-cloud/axolotl

    Go ahead and axolotl questions

    basetenlabs/axolotl’s past year of commit activity
    Python 0 Apache-2.0 955 0 2 Updated Nov 7, 2024
  • HackMIT-2024 Public
    basetenlabs/HackMIT-2024’s past year of commit activity
    Jupyter Notebook 2 1 0 0 Updated Sep 14, 2024

Top languages

Loading…

Most used topics

Loading…