Skip to content
View ParadoxZW's full-sized avatar
😋
😋
  • Hangzhou, China

Organizations

@MILVLG

Block or report ParadoxZW

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. MILVLG/prophet MILVLG/prophet Public

    Implementation of CVPR 2023 paper "Prompting Large Language Models with Answer Heuristics for Knowledge-based Visual Question Answering".

    Python 270 27

  2. MILVLG/imp MILVLG/imp Public

    a family of highly capabale yet efficient large multimodal models

    Python 174 16

  3. MILVLG/openvqa MILVLG/openvqa Public

    A lightweight, scalable, and general framework for visual question answering research

    Python 322 64

  4. GaiZhenbiao/Phi3V-Finetuning GaiZhenbiao/Phi3V-Finetuning Public

    Parameter-efficient finetuning script for Phi-3-vision, the strong multimodal language model by Microsoft.

    Python 55 19

  5. LLaVA-UHD-Better LLaVA-UHD-Better Public

    A bug-free and improved implementation of LLaVA-UHD, based on the code from the official repo

    Python 32 3

  6. ANNS ANNS Public

    project to explore approximate nearest neighborhood search method.

    C++