Skip to content
Change the repository type filter

All

    Repositories list

    • llamafile

      Public
      Distribute and run LLMs with a single file. Rubra customized version that adds grammar to chat completion api.
      C++
      Other
      1.1k000Updated Dec 9, 2024Dec 9, 2024
    • rubra

      Public
      Open Weight, tool-calling LLMs
      Makefile
      Apache License 2.0
      2115150Updated Oct 24, 2024Oct 24, 2024
    • tools.cpp

      Public
      LLM inference in C/C++, further modified for Rubra function calling models
      C++
      MIT License
      10k400Updated Oct 24, 2024Oct 24, 2024
    • vllm

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs. Extended for Rubra function calling models
      Python
      Apache License 2.0
      4.9k100Updated Jul 15, 2024Jul 15, 2024
    • Jupyter Notebook
      0000Updated Jun 29, 2024Jun 29, 2024
    • Python
      Apache License 2.0
      0000Updated Feb 28, 2024Feb 28, 2024