how to inference with the model without AR during training? #198

Open
LaLaLailalai opened this issue Nov 21, 2024 · 2 comments

@LaLaLailalai

LaLaLailalai commented Nov 21, 2024

Hi @jy0205,

I fine-tuned the DiT without AR. Is there any difference when running inference with this kind of model, especially for the I2V task? Thanks!

@LaLaLailalai changed the title from "how to inference with th model without ar during training?" to "how to inference with the model without AR during training?" on Nov 22, 2024
@feifeiobama
Collaborator

For I2V, yes, there is a difference. Please follow recent papers on video diffusion models (e.g. SVD, Emu Video) to add I2V support manually, since only the AR version supports it natively.
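
For readers unfamiliar with how those papers condition on an image, here is a minimal sketch of the general idea, not this repository's code. It follows the zero-padding plus mask scheme described for Emu Video: the conditioning image latent is placed in the first frame, the remaining frames are zero-padded, and the result is concatenated channel-wise with the noisy latents together with a binary mask. The function name `build_i2v_input`, the `(C, T, H, W)` layout, and the use of NumPy are all assumptions made for illustration.

```python
import numpy as np


def build_i2v_input(noisy_latents, image_latent):
    """Build a channel-concatenated I2V input (hypothetical helper).

    noisy_latents: array of shape (C, T, H, W), noisy video latents.
    image_latent:  array of shape (C, H, W), latent of the conditioning image.
    Returns an array of shape (2 * C + 1, T, H, W).
    """
    c, t, h, w = noisy_latents.shape
    if image_latent.shape != (c, h, w):
        raise ValueError("image_latent must have shape (C, H, W)")

    # Conditioning frames: the image latent in frame 0, zeros elsewhere.
    cond = np.zeros_like(noisy_latents)
    cond[:, 0] = image_latent

    # Binary mask marking which frames carry a real conditioning image.
    mask = np.zeros((1, t, h, w), dtype=noisy_latents.dtype)
    mask[:, 0] = 1.0

    # Channel-wise concatenation: [noisy latents | conditioning | mask].
    return np.concatenate([noisy_latents, cond, mask], axis=0)
```

With this kind of conditioning, the model's input projection has to accept the extra channels (here `2 * C + 1` instead of `C`), which usually means widening that layer and fine-tuning, so it is not a drop-in change at inference time.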

@LaLaLailalai
Author

> For I2V there is. Please follow recent papers on video diffusion model (e.g. SVD, Emu Video etc.) to manually add support for I2V generation, since only the AR version can natively support it.

Thanks! Then for T2V generation, what is the difference between loading the model trained without AR and the one trained with AR?
