Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Llama3.2-11b-Vision #25

Open
gabrielle-ong opened this issue Sep 27, 2024 · 2 comments
Open

Llama3.2-11b-Vision #25

gabrielle-ong opened this issue Sep 27, 2024 · 2 comments
Assignees

Comments

@gabrielle-ong
Copy link

No description provided.

@dan-homebrew dan-homebrew transferred this issue from janhq/cortex.tensorrt-llm Sep 29, 2024
@github-project-automation github-project-automation bot moved this from Investigating to Review + QA in Jan & Cortex Sep 29, 2024
@dan-homebrew dan-homebrew reopened this Sep 29, 2024
@github-project-automation github-project-automation bot moved this from Review + QA to In Progress in Jan & Cortex Sep 29, 2024
@dan-homebrew dan-homebrew moved this from In Progress to Scheduled in Jan & Cortex Sep 29, 2024
@gabrielle-ong gabrielle-ong moved this from Scheduled to Icebox in Jan & Cortex Oct 1, 2024
@hahuyhoang411 hahuyhoang411 self-assigned this Oct 22, 2024
@hahuyhoang411 hahuyhoang411 moved this from Icebox to Investigating in Jan & Cortex Oct 22, 2024
@hahuyhoang411
Copy link
Collaborator

llama.cpp state: ggerganov/llama.cpp#9643
Screenshot 2024-10-22 at 20 41 51

@dan-homebrew
Copy link

dan-homebrew commented Nov 12, 2024

@hahuyhoang411 @vansangpfiev I'd like to take next sprint to see if we can contribute Vision back to llama.cpp.

I think a good starting point is to build on top of @ngxson's PR here: ggerganov/llama.cpp#9687

We need an extensible architecture on top of llama.cpp for multi-modality. For Ichigo, we likely need to generate audio tokens in a separate Python process though (i.e. whisper-speech)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Status: Investigating
Development

No branches or pull requests

3 participants