Paper or Dataset Available? #5

Broyojo · 2024-12-05T07:10:44Z

Is there a paper available or material available on how this PRM was trained? It would be very valuable to know for reproducibility and what use case this PRM is good for based on its training, i.e. is it trained using MathShepherd strategy, making it a useful Q value estimator, or is it trained using PRM800K style so it assesses step correctness instead?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Paper or Dataset Available? #5

Paper or Dataset Available? #5

Broyojo commented Dec 5, 2024

Paper or Dataset Available? #5

Paper or Dataset Available? #5

Comments

Broyojo commented Dec 5, 2024