You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
People are curious about LLMs. It would be nice if we could go through the lifecycle that we expect other groups with large data corpi to go through. We have Terabytes of github data, the textual nature of which is mostly commit messages and issue comments (but not code). What can we do here?
I suspect that this is two inter-related questions:
What kinds of conversations would we want to ask of this data?
What is the right way to feed this data into an LLM in order to be able to ask those questions.
Technologically speaking, I'm hopeful that this involves both some training (maybe just a big GPU, but maybe several) on a regular basis, as well as some simple serving.
The text was updated successfully, but these errors were encountered:
People are curious about LLMs. It would be nice if we could go through the lifecycle that we expect other groups with large data corpi to go through. We have Terabytes of github data, the textual nature of which is mostly commit messages and issue comments (but not code). What can we do here?
I suspect that this is two inter-related questions:
Technologically speaking, I'm hopeful that this involves both some training (maybe just a big GPU, but maybe several) on a regular basis, as well as some simple serving.
The text was updated successfully, but these errors were encountered: