Skip to content
View freelerobot's full-sized avatar

Block or report freelerobot

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Codebase for the WayveScenes101 Dataset

Python 169 6 Updated Sep 25, 2024

World's First Large-scale High-quality Robotic Manipulation Benchmark

Python 1,113 75 Updated Jan 9, 2025

Affordance-based Robot Manipulation with Flow Matching

Shell 87 8 Updated Jan 7, 2025
Jupyter Notebook 970 67 Updated Nov 27, 2024

A generative world for general-purpose robotics & embodied AI learning.

Python 22,235 1,792 Updated Jan 9, 2025

PhoWhisper: Automatic Speech Recognition for Vietnamese (2024)

123 10 Updated Nov 12, 2024

Towards Human-Friendly, Fast Learning and Adaptable Agent Communities

Jupyter Notebook 111 10 Updated Dec 30, 2024

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 1,726 67 Updated Jan 2, 2025

PyTorch code and models for V-JEPA self-supervised learning from video.

Python 2,741 259 Updated Aug 9, 2024

Model Context Protocol Servers

JavaScript 6,523 769 Updated Jan 8, 2025

The official Python SDK for Model Context Protocol servers and clients

Python 1,353 128 Updated Jan 6, 2025

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 8,798 1,156 Updated Jan 9, 2025

Out-of-the-box (OOTB) GUI Agent for Windows and macOS

Python 1,125 100 Updated Jan 1, 2025

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 4,696 453 Updated Dec 26, 2024

One app to rule them all!

TypeScript 1,505 210 Updated Dec 1, 2024

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 17,557 1,808 Updated Oct 15, 2024

Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language models (VLM), Audio Language Model, auto-speech-recognition (…

Python 5,531 802 Updated Jan 8, 2025

Audio Large Language Models

Python 288 19 Updated Jan 3, 2025

SOTA Open Source TTS

Python 18,215 1,362 Updated Jan 4, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 37,589 4,837 Updated Jan 8, 2025

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,375 93 Updated Aug 13, 2024

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,120 336 Updated Jan 9, 2025

Easy-to-Use Speech MOS predictors

Python 249 16 Updated Oct 24, 2023

Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]

Python 16,043 1,879 Updated Jan 10, 2025

An evolving, large-scale and multi-domain ASR corpus for low-resource languages with automated crawling, transcription and refinement

Python 125 6 Updated Dec 29, 2024

Speech To Speech: an effort for an open-sourced and modular GPT4-o

Python 3,660 388 Updated Dec 4, 2024

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 37,461 4,579 Updated Dec 26, 2024

SGLang is a fast serving framework for large language models and vision language models.

Python 7,198 677 Updated Jan 9, 2025

Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate

Python 469 26 Updated Nov 19, 2024
Next
Showing results