
s1: Simple test-time scaling

Python 5,716 647 Updated Feb 23, 2025

Sky-T1: Train your own O1 preview model within $450

Python 2,989 309 Updated Feb 26, 2025

This repo contains the dataset and code for the paper "SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering?"

Python 1,140 88 Updated Feb 25, 2025

Fully open reproduction of DeepSeek-R1

Python 21,530 1,903 Updated Feb 26, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 3,839 339 Updated Feb 26, 2025

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 10,705 1,376 Updated Feb 1, 2025

This is a replication of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data

Python 2,975 220 Updated Feb 19, 2025

A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large Language Model Inference-Time Self-Improvement.

69 2 Updated Dec 24, 2024

Search-o1: Agentic Search-Enhanced Large Reasoning Models

Python 652 75 Updated Feb 13, 2025

This is a collection of research papers for Self-Correcting Large Language Models with Automated Feedback.

500 29 Updated Oct 28, 2024

Corrective Retrieval Augmented Generation

Python 349 35 Updated Oct 8, 2024

A self-improving embodied conversational agent seamlessly integrated into the operating system to automate our daily tasks.

Python 1,615 181 Updated Sep 9, 2024

A course on aligning smol models.

Jupyter Notebook 5,486 1,894 Updated Jan 24, 2025

Retrieval and Retrieval-augmented LLMs

Python 8,671 628 Updated Feb 13, 2025

A framework for serving and evaluating LLM routers - save LLM costs without compromising quality

Python 3,657 271 Updated Aug 10, 2024

👨‍💻 An awesome and curated list of best code-LLM for research.

1,145 64 Updated Dec 10, 2024

Stanford NLP Python library for Representation Finetuning (ReFT)

Python 1,425 123 Updated Feb 6, 2025

Python 479 58 Updated Jan 2, 2025

A curated list of awesome approaches to AI model routing

76 10 Updated Oct 4, 2024

Awesome-LLM-RAG: a curated list of advanced retrieval augmented generation (RAG) in Large Language Models

1,088 70 Updated Feb 24, 2025

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

TypeScript 40,765 3,598 Updated Feb 26, 2025

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 22,825 2,271 Updated Feb 26, 2025

"LightRAG: Simple and Fast Retrieval-Augmented Generation"

Python 12,210 1,701 Updated Feb 25, 2025

The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval

Python 1,112 158 Updated Sep 3, 2024

Enterprise graph machine learning framework for billion-scale graphs for ML scientists and data scientists.

Python 399 61 Updated Feb 26, 2025

This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.

Python 1,986 179 Updated May 25, 2024

A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.

Jupyter Notebook 2,773 180 Updated Feb 8, 2025

[ICML 2024] Official repository for "Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models"

Python 729 75 Updated Jul 30, 2024

🤠 Agent-as-a-Judge and DevAI dataset

Python 324 41 Updated Jan 20, 2025