Environment Setup

To get started, clone the repository and set up a conda environment with the required packages:

git clone https://github.com/XiandaGuo/Drive-MLLM.git
cd Drive-MLLM

conda create -n drive_mllm python=3.10
source activate drive_mllm
pip install -r requirements.txt

# setup PYTHONPATH
echo 'export PYTHONPATH=$(pwd):$PYTHONPATH' >> ~/.bashrc
source ~/.bashrc

Additional Environment Setup

Depending on your needs, set up the following environments for API calls or local model inference:

For GPT API calls:

pip install openai==1.42.0

For Gemini API calls:

pip install google-generativeai==0.7.2

For Local LLaVA-Next inference:

git clone https://github.com/LLaVA-VL/LLaVA-NeXT.git
cd LLaVA-NeXT/
pip install --upgrade pip  
pip install -e ".[train]" 
pip install git+https://github.com/LLaVA-VL/LLaVA-NeXT.git 
cd ..

## flash atten (optional)
conda install -c "nvidia/label/cuda-12.1.0" cuda
pip install flash-attn --no-build-isolation --no-cache-dir

For Local QWen2-VL inference:

git clone https://github.com/QwenLM/Qwen2-VL.git
cd Qwen2-VL
pip install -r requirements_web_demo.txt
pip install git+https://github.com/huggingface/transformers@21fac7abba2a37fae86106f87fcf9974fd1e3830 accelerate
pip install qwen-vl-utils[decord]
cd ..

## flash atten (optional)
conda install -c "nvidia/label/cuda-12.1.0" cuda
pip install flash-attn --no-build-isolation --no-cache-dir

Reference Links:

OpenAI API Quick Start
Gemini API Quick Start
LLaVA-NeXT Official Gitgub Website
Qwen2-VL Official Github Website
Flash Attention
Cuda Installation Guide

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

EnvironmentSetup.md

EnvironmentSetup.md

Environment Setup

Additional Environment Setup

Files

EnvironmentSetup.md

Latest commit

History

EnvironmentSetup.md

File metadata and controls

Environment Setup

Additional Environment Setup