The training code is inconsistent with the dataset-building instructions in the README file. In the README, the "video" field is a string, not a list:
[
{
"video": "images/xxx.jpg",
"conversations": [
{
"from": "human",
"value": "<image>\nWhat are the colors of the bus in the image?"
},
{
"from": "gpt",
"value": "The bus in the image is white and red."
},
...
],
},
{
"video": "videos/xxx.mp4",
"conversations": [
{
"from": "human",
"value": "<video>\nWhat are the main activities that take place in the video?"
},
{
"from": "gpt",
"value": "The main activities that take place in the video are the preparation of camera equipment by a man, a group of men riding a helicopter, and a man sailing a boat through the water."
},
...
],
},
...
]
The code that processes the "video" field is here:
VideoLLaMA3/videollama3/train.py
Line 248 in 22a53e3
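As a possible workaround, here is a minimal sketch that normalizes the "video" field before training. It assumes the training code expects a list of paths while the README-style annotation stores a single string. The helper name `normalize_video_field` is hypothetical and not part of VideoLLaMA3.

```python
import json


def normalize_video_field(sample):
    """Return a copy of `sample` whose "video" value is a list of paths.

    Hypothetical helper: assumes the loader wants a list, while the
    README-style annotation stores a single string.
    """
    normalized = dict(sample)  # shallow copy, the input is left unchanged
    video = normalized.get("video")
    if isinstance(video, str):
        normalized["video"] = [video]  # wrap a single path in a list
    return normalized


# Placeholder samples in the README's style (paths are illustrative only).
samples = [
    {"video": "videos/xxx.mp4", "conversations": []},
    {"video": ["videos/xxx.mp4"], "conversations": []},
]
normalized_samples = [normalize_video_field(s) for s in samples]
print(json.dumps(normalized_samples, indent=2))
```

Samples that already store a list, or that have no "video" key, pass through unchanged, so the same helper could be applied to a mixed annotation file.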