Repositories list
74 repositories
- VisionLLM Series
- [CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions
- [ECCV2024] Video Foundation Models & Data for Multimodal Understanding
- [ICLR 2025] TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning
- [CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
- [NeurIPS 2024 Spotlight ⭐️] Parameter-Inverted Image Pyramid Networks (PIIP)
- vinci (Public)
- InternVL (Public): [CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance.
- PVC (Public)
- VLMEvalKit_InternVL2_5 (Public)
- Hulk (Public)
- MM-NIAH (Public): [NeurIPS 2024] Needle In A Multimodal Haystack (MM-NIAH): A comprehensive benchmark designed to systematically evaluate the capability of existing MLLMs to comprehend long multimodal documents.
- GUI-Odyssey (Public)
- Vision-RWKV (Public)
- OV-OAD (Public)
- InternVL-MMDetSeg (Public): Train InternViT-6B in MMSegmentation and MMDetection with DeepSpeed
- PhyGenBench (Public)
- [CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
- OmniQuant (Public): [ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.
- MMIU (Public)
- ChartAst (Public)
- InternGPT (Public): InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (an online demo system supporting DragGAN, ChatGPT, ImageBind, and SAM)