-
NVIDIA
- Seattle, Washington
- http://kaichun-mo.github.io
- @KaichunMo
Stars
Awesome-LLM-3D: a curated list of Multi-modal Large Language Model in 3D world Resources
This repository compiles a list of papers related to the application of video technology in the field of robotics! Star⭐ the repo and follow me if you like what you see🤩.
A curated list of 3D Vision papers relating to Robotics domain in the era of large models i.e. LLMs/VLMs, inspired by awesome-computer-vision, including papers, codes, and related websites
Code of [CVPR 2024] "Animatable Gaussians: Learning Pose-dependent Gaussian Maps for High-fidelity Human Avatar Modeling"
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
Open-Sora: Democratizing Efficient Video Production for All
PyTorch code and models for V-JEPA self-supervised learning from video.
Large World Model -- Modeling Text and Video with Millions Context
SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning
Imitation learning benchmark focusing on complex locomotion tasks using MuJoCo.
Official codebase for TRILL (Teleoperation and Imitation Learning for Loco-manipulation)
Simulating SMPL humanoid, supporting PHC/PHC-MJX/PULSE/SimXR code bases.
This repo contains the python code as well as the webpage html files for the Spice-E project from VAILab at TAU.
A curated list of foundation models for vision and language tasks
Zero-1-to-3: Zero-shot One Image to 3D Object (ICCV 2023)
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Software design & development with AI
Pointcept: a codebase for point cloud perception research. Latest works: PTv3 (CVPR'24 Oral), PPT (CVPR'24), OA-CNNs (CVPR'24), MSC (CVPR'23)
[CVPR'24 Oral] Official repository of Point Transformer V3 (PTv3)
Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.
An open-source impl. of Large Reconstruction Models
Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.
MeshGPT: Generating Triangle Meshes with Decoder-Only Transformers