Reward model #2

kirudang · 2024-10-13T17:17:03Z

Hello there,
I want to test this watermark on two models, Llama2 7B and Mistral 7B. Can I use the same reward model for both, say OPT 1.3B?
Thank you

xiaojunxu · 2024-10-14T23:01:08Z

Our implementation does not directly support this, because it uses the same tokenizer for the LLM and the reward model. Since the models you mentioned use different tokenizers, you would need to adapt the code to get it to work.
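One possible adaptation, as a rough sketch only and not something taken from this repository: decode the LLM's token IDs back to text, then re-encode that text with the reward model's tokenizer before scoring. The function `retokenize` and the two toy tokenizer classes below are hypothetical and only illustrate the idea; they stand in for the real Llama2/Mistral and OPT tokenizers.

```python
from typing import Callable, Dict, List


def retokenize(
    llm_ids: List[int],
    llm_decode: Callable[[List[int]], str],
    rm_encode: Callable[[str], List[int]],
) -> List[int]:
    """Map token IDs from the LLM's vocabulary to the reward model's
    vocabulary by going through plain text (hypothetical helper)."""
    text = llm_decode(llm_ids)
    return rm_encode(text)


class WordTokenizer:
    """Toy stand-in for the LLM tokenizer: one ID per whitespace-separated word."""

    def __init__(self, vocab: Dict[str, int]) -> None:
        self.vocab = vocab
        self.inverse = {idx: word for word, idx in vocab.items()}

    def encode(self, text: str) -> List[int]:
        return [self.vocab[word] for word in text.split()]

    def decode(self, ids: List[int]) -> str:
        return " ".join(self.inverse[i] for i in ids)


class CharTokenizer:
    """Toy stand-in for the reward model tokenizer: one ID per character."""

    def encode(self, text: str) -> List[int]:
        return [ord(ch) for ch in text]

    def decode(self, ids: List[int]) -> str:
        return "".join(chr(i) for i in ids)


llm_tok = WordTokenizer({"hello": 0, "world": 1})
rm_tok = CharTokenizer()

llm_ids = llm_tok.encode("hello world")
rm_ids = retokenize(llm_ids, llm_tok.decode, rm_tok.encode)
print(len(llm_ids), len(rm_ids))
```

With Hugging Face tokenizers, the two callables could be something like `lambda ids: llm_tokenizer.decode(ids, skip_special_tokens=True)` and `lambda text: rm_tokenizer(text)["input_ids"]`. Note that the two ID sequences generally differ in length, so this only helps if the reward model scores whole sequences. If the code relies on a per-token alignment between the LLM and the reward model, more work would be needed.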
