Skip to content

Could we use this to improve MMLU capability? #5

Answered by Kipok
YixinSong-e asked this question in Q&A
Discussion options

You must be logged in to vote

Yes, certainly! We are working on improving coding skills already.

In general, you can directly apply our pipeline to any task for which you have some training data of the form "question" / "answer" where "answer" can be automatically verified for correctness. As long as you have this, you can just write a couple of few-shot examples of the solutions you want to teach LLM to produce, modify the code that checks answer for correctness (it's currently specific for math) and then run synthetic data generation and SFT. To take the most out of our pipeline, it's best if the solutions also leverage Python code in some way.

Let us know if you have any questions about the details of this - we'd b…

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by Kipok
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants
Converted from issue

This discussion was converted from issue #3 on February 20, 2024 18:00.