Support video in MiniCPM-V 2.6 #14
I tried MiniCPM-V-2_6 naively and got this from the server:
server-1 | INFO: 192.168.155.172:39070 - "POST /v1/chat/completions HTTP/1.1" 422 Unprocessable Entity
So I need @matatonic's assistance.
Currently testing, but image only so far, no video.
I've updated a dev branch with the latest changes, including MiniCPM-V 2.6, microsoft/Phi-3.5-vision-instruct and fancyfeast/joy-caption-pre-alpha. I'm still testing and the :dev image is still building, so YMMV.
By video, they mean a collection of images (so not quite video).
The dev build works. Thanks.
Yes, it's an image sampling technique, but it's still not working for me: the sample code they provide fails to identify the video in my tests. It may still be my error, but it probably won't be fixed for this release.
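For anyone following along, the "collection of images" idea can be sketched roughly as below. This is a minimal sketch, assuming the video is reduced to about one frame per second and then thinned evenly to a fixed cap. The function name `sample_frame_indices` and the default cap of 64 frames are illustrative assumptions, not something confirmed in this thread. It only computes which frame indices to keep; decoding those frames and passing them to the model as a list of images is left out.

```python
def sample_frame_indices(total_frames: int, fps: float, max_frames: int = 64) -> list[int]:
    """Pick frame indices so a video can be sent as a list of still images.

    Hypothetical helper: takes roughly one frame per second, then thins the
    selection evenly if it exceeds max_frames (an assumed cap).
    """
    if total_frames <= 0:
        return []

    # Step of about one second of video; never less than one frame.
    step = max(1, round(fps))
    indices = list(range(0, total_frames, step))

    if len(indices) > max_frames:
        # Take the midpoint of each of max_frames equal-width buckets.
        gap = len(indices) / max_frames
        indices = [indices[int(i * gap + gap / 2)] for i in range(max_frames)]

    return indices


if __name__ == "__main__":
    # A 10-second clip at 30 fps keeps one frame per second.
    print(sample_frame_indices(total_frames=300, fps=30.0))
```

The selected frames would then go into the request in place of a single image, which is why the model sees a sequence of stills rather than a true video stream.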
There's another project that I like called amblegpt. We can try using it to test this functionality.
Merged to main, 0.29.0 release. I will leave this ticket open until video is supported.
I tried this standalone Python code, and it runs on my 4090 GPU.
fight.mp4
@matatonic
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video
The claim is that it performs very well for an 8-billion-parameter model.
I am interested in learning what it takes to add support for 2.6 when 2.5 is already supported.
Thanks