Skip to content
View nguyenvo09's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report nguyenvo09

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

s1: Simple test-time scaling

Python 5,737 654 Updated Feb 23, 2025

Sky-T1: Train your own O1 preview model within $450

Python 3,004 309 Updated Feb 27, 2025

This repo contains the dataset and code for the paper "SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering?"

Python 1,153 92 Updated Feb 26, 2025

Fully open reproduction of DeepSeek-R1

Python 21,622 1,916 Updated Feb 26, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 3,878 343 Updated Feb 27, 2025

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 10,748 1,380 Updated Feb 1, 2025

This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data

Python 2,985 221 Updated Feb 19, 2025

A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large Language Model Inference-Time Self-Improvement.

69 2 Updated Dec 24, 2024

Search-o1: Agentic Search-Enhanced Large Reasoning Models

Python 657 76 Updated Feb 13, 2025

This is a collection of research papers for Self-Correcting Large Language Models with Automated Feedback.

501 29 Updated Oct 28, 2024

Corrective Retrieval Augmented Generation

Python 349 35 Updated Oct 8, 2024

An self-improving embodied conversational agent seamlessly integrated into the operating system to automate our daily tasks.

Python 1,617 181 Updated Sep 9, 2024

A course on aligning smol models.

Jupyter Notebook 5,492 1,894 Updated Jan 24, 2025

Retrieval and Retrieval-augmented LLMs

Python 8,683 630 Updated Feb 13, 2025

A framework for serving and evaluating LLM routers - save LLM costs without compromising quality

Python 3,657 272 Updated Aug 10, 2024

👨‍💻 An awesome and curated list of best code-LLM for research.

1,145 64 Updated Dec 10, 2024

Stanford NLP Python library for Representation Finetuning (ReFT)

Python 1,427 123 Updated Feb 6, 2025
Python 480 58 Updated Jan 2, 2025

A curated list of awesome approaches to AI model routing

77 10 Updated Oct 4, 2024

Awesome-LLM-RAG: a curated list of advanced retrieval augmented generation (RAG) in Large Language Models

1,089 70 Updated Feb 24, 2025

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

TypeScript 41,156 3,629 Updated Feb 27, 2025

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 22,887 2,275 Updated Feb 27, 2025

"LightRAG: Simple and Fast Retrieval-Augmented Generation"

Python 12,255 1,711 Updated Feb 26, 2025

The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval

Python 1,115 158 Updated Sep 3, 2024

Enterprise graph machine learning framework for billion-scale graphs for ML scientists and data scientists.

Python 399 61 Updated Feb 27, 2025

This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.

Python 1,988 180 Updated May 25, 2024

A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.

Jupyter Notebook 2,774 180 Updated Feb 8, 2025

[ICML 2024] Official repository for "Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models"

Python 729 75 Updated Jul 30, 2024

🤠 Agent-as-a-Judge and DevAI dataset

Python 324 41 Updated Jan 20, 2025
Next
Showing results