Skip to content
View shihao1895's full-sized avatar

Highlights

  • Pro

Block or report shihao1895

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[ICCV 2023] Adaptive Rotated Convolution for Rotated Object Detection

Python 120 6 Updated Jan 13, 2025

Official Code for RVT-2 and RVT

Jupyter Notebook 303 38 Updated Dec 14, 2024
Python 29 Updated Jan 2, 2025

World's First Large-scale High-quality Robotic Manipulation Benchmark

Python 1,262 82 Updated Jan 20, 2025
Python 253 8 Updated Jan 24, 2025

Code for paper: Nullu: Mitigating Object Hallucinations in Large Vision-Language Models via HalluSpace Projection

OpenEdge ABL 13 Updated Dec 22, 2024

[CVPR 2022] Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding

Python 145 10 Updated Jul 13, 2024

Official repository of Slide-Transformer (CVPR2023)

Python 161 6 Updated Aug 27, 2024
Python 89 12 Updated Dec 15, 2024

Official repository of Uni-AdaFocus (TPAMI 2024).

Python 38 1 Updated Dec 17, 2024
Python 901 206 Updated Jul 23, 2024

[CVPR 2024 & NeurIPS 2024] EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI

Python 529 39 Updated Jan 11, 2025

[NeurIPS 2024] ENAT: Rethinking Spatial-temporal Interactions in Token-based Image Synthesis

Python 22 Updated Nov 28, 2024

Official repository of InLine attention (NeurIPS 2024)

Python 36 1 Updated Dec 22, 2024

具身智能入门指南 Embodied-AI-Guide

1,511 79 Updated Jan 28, 2025
Python 112 11 Updated Dec 20, 2024

1.5−3.0× lossless training or pre-training speedup. An off-the-shelf, easy-to-implement algorithm for the efficient training of foundation visual backbones.

Python 217 9 Updated Aug 23, 2024

Official repository of FLatten Transformer (ICCV2023)

Python 409 24 Updated Nov 4, 2024
Python 16 1 Updated Oct 27, 2024

[TPAMI 2024] Probabilistic Contrastive Learning for Long-Tailed Visual Recognition

Python 71 8 Updated Sep 30, 2024

[ICML 2024] SimPro: A Simple Probabilistic Framework Towards Realistic Long-Tailed Semi-Supervised Learning

Python 27 Updated Sep 30, 2024

Autoregressive Policy for Robot Learning

Python 97 6 Updated Dec 6, 2024

[ECCV 2024] AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation

Python 32 1 Updated Sep 12, 2024
Python 70 4 Updated Dec 29, 2024

[ECCV 2024] Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators

Python 43 Updated Sep 11, 2024

ReKep: Spatio-Temporal Reasoning of Relational Keypoint Constraints for Robotic Manipulation

Python 619 66 Updated Aug 30, 2024

[CVPR2024] GSVA: Generalized Segmentation via Multimodal Large Language Models

Python 112 Updated Sep 12, 2024
Python 51 6 Updated Oct 17, 2024

(TPAMI 2024) A Survey on Open Vocabulary Learning

877 50 Updated Dec 10, 2024

Official repository of MLLA (NeurIPS 2024)

Python 271 14 Updated Nov 25, 2024
Next
Showing results