Stars
Sky-T1: Train your own O1 preview model within $450
This repo contains the dataset and code for the paper "SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering?"
Fully open reproduction of DeepSeek-R1
verl: Volcano Engine Reinforcement Learning for LLMs
Clean, minimal, accessible reproduction of DeepSeek R1-Zero
This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data
A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large Language Model Inference-Time Self-Improvement.
Search-o1: Agentic Search-Enhanced Large Reasoning Models
This is a collection of research papers for Self-Correcting Large Language Models with Automated Feedback.
An self-improving embodied conversational agent seamlessly integrated into the operating system to automate our daily tasks.
A course on aligning smol models.
Retrieval and Retrieval-augmented LLMs
A framework for serving and evaluating LLM routers - save LLM costs without compromising quality
👨💻 An awesome and curated list of best code-LLM for research.
Stanford NLP Python library for Representation Finetuning (ReFT)
A curated list of awesome approaches to AI model routing
Awesome-LLM-RAG: a curated list of advanced retrieval augmented generation (RAG) in Large Language Models
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
A modular graph-based Retrieval-Augmented Generation (RAG) system
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
Enterprise graph machine learning framework for billion-scale graphs for ML scientists and data scientists.
This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.
A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.
[ICML 2024] Official repository for "Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models"
🤠 Agent-as-a-Judge and DevAI dataset