Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ideas , given 2023 situation #307

Open
dobkeratops opened this issue Sep 4, 2023 · 1 comment
Open

ideas , given 2023 situation #307

dobkeratops opened this issue Sep 4, 2023 · 1 comment

Comments

@dobkeratops
Copy link

observations about the current state of AI..

so over the past year there's been these diffusion models & large language models become available to be public

There's the controversy over training data (scrapes)

  • e.g. Steam doesn't allow games with AI generated content, or even use of ChatGPT ; they're covering themselves

  • That seems to me to validate the need for a efforts such as imagemonkey to produce more CC0 open training data?

Individuals such as ourselves can't currently produce foundation models.

but models like stable-diffusion can be fine-tuned (stability.ai's argument about the scrapes is that a fine-tuned model can remove its ability to recreate copyrighted material in the original scrapes, which sometimes come through)

  • I see that you had VGC finetuning integrated in imagemonkey.. have you considered extending your service to finetune diffusion models?

  • Also consider that in time multi-modal models may appear.. ones trained simultaneously on images & text

@dobkeratops
Copy link
Author

dobkeratops commented Oct 18, 2023

https://github.com/haotian-liu/LLaVA

opensource multimodal LLMs .. text+vision .. imagine being able to finetune these on community curated datasets
another potential usecase for imagemonkey

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant