- 👀 I’m interested in perception and prediction in computer vision, such as Multi-object Tracking. I'm also interested in the inference acceleration of neural networks in edge devices.
- 🌱 I'm currently exploring multimodal large language models.
- 📫 How to reach me: zhihu
A Ph.D. candidate at the University of Chinese Academy of Sciences.
- Beijing
-
13:47
(UTC +08:00) - www.kppkkp.top
Pinned Loading
-
Vary
Vary PublicForked from Ucas-HaoranWei/Vary
Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.
Python
-
Vary-toy
Vary-toy PublicForked from Ucas-HaoranWei/Vary-toy
Official code implementation of Vary-toy (Small Language Model Meets with Reinforced Vision Vocabulary)
Python
-
UP-AAAI2023challenge-1st
UP-AAAI2023challenge-1st Public1st Place Solution for Efficient and Accurate Models towards Practical Deep Learning
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.