Could we use this to improve MMLU capability? #5

YixinSong-e · 2024-02-18T06:34:37Z

YixinSong-e
Feb 18, 2024

Exciting work! Could we use this to improve MMLU and other capability like coding now?

Feb 18, 2024

Yes, certainly! We are working on improving coding skills already.

In general, you can directly apply our pipeline to any task for which you have some training data of the form "question" / "answer" where "answer" can be automatically verified for correctness. As long as you have this, you can just write a couple of few-shot examples of the solutions you want to teach LLM to produce, modify the code that checks answer for correctness (it's currently specific for math) and then run synthetic data generation and SFT. To take the most out of our pipeline, it's best if the solutions also leverage Python code in some way.

Let us know if you have any questions about the details of this - we'd b…

View full answer

Kipok · 2024-02-18T17:00:37Z

Kipok
Feb 18, 2024
Maintainer

Yes, certainly! We are working on improving coding skills already.

In general, you can directly apply our pipeline to any task for which you have some training data of the form "question" / "answer" where "answer" can be automatically verified for correctness. As long as you have this, you can just write a couple of few-shot examples of the solutions you want to teach LLM to produce, modify the code that checks answer for correctness (it's currently specific for math) and then run synthetic data generation and SFT. To take the most out of our pipeline, it's best if the solutions also leverage Python code in some way.

Let us know if you have any questions about the details of this - we'd be happy to see this pipeline applied to other tasks!

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Could we use this to improve MMLU capability? #5

{{title}}

{{editor}}'s edit

{{editor}}'s edit

Replies: 1 comment

{{title}}

Select a reply

Could we use this to improve MMLU capability? #5

YixinSong-e Feb 18, 2024

Replies: 1 comment

Kipok Feb 18, 2024 Maintainer

YixinSong-e
Feb 18, 2024

Kipok
Feb 18, 2024
Maintainer