About DPO Data Preparation #193

Open
pratim808 opened this issue Jan 20, 2025 · 1 comment
@pratim808

After using this code:

```python
# method 2
def format_function(example):
    # Format 'chosen' text
    messages_chosen = [
        {"role": "user", "content": str(example["chosen"])}  # convert list to string
    ]
    formatted_chosen = tokenizer.apply_chat_template(
        messages_chosen,
        tokenize=False,
        add_generation_prompt=False
    )

    # Format 'rejected' text
    messages_rejected = [
        {"role": "user", "content": str(example["rejected"])}  # convert list to string
    ]
    formatted_rejected = tokenizer.apply_chat_template(
        messages_rejected,
        tokenize=False,
        add_generation_prompt=False
    )

    return {
        "formatted_chosen": formatted_chosen,
        "formatted_rejected": formatted_rejected,
    }
```

I get this output: (screenshot of the formatted output)

Is this the right way to prepare data before fine-tuning? I mean, should our data be converted into this form:

```
Conversation with template: <|im_start|>system
You are a helpful assistant.<|im_end|>
<|im_start|>user
Hello, how are you?<|im_end|>
<|im_start|>assistant
I'm doing well, thank you! How can I assist you today?<|im_end|>
```

Do I need to prepare my data this way, i.e. do I need a 'text' column? Please tell me how I can prepare my data for fine-tuning with DPO.
Please help me out.

Manas.
This is the Colab notebook link: https://colab.research.google.com/github/huggingface/smol-course/blob/main/2_preference_alignment/notebooks/dpo_finetuning_example.ipynb

https://github.com/pratim808/smol-course/blob/main/2_preference_alignment/notebooks/dpo_finetuning_example.ipynb

@wang-jinghui

```
Dataset({
    features: ['chosen', 'rejected', 'prompt'],
    num_rows: 62135
})
```

For example:

```
{'prompt': 'Is the milk produced by a hippopotamus pink in color?',
 'chosen': 'No, the milk produced by a hippopotamus is not pink. It is typically white or beige in color. The misconception arises due to the hipposudoric acid, a red pigment found in hippo skin secretions, which people mistakenly assume affects the color of their milk.',
 'rejected': 'No, hippopotamus milk is not pink in color. It is actually white or grayish-white.'}
```
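For reference, here is a minimal, dependency-free sketch of turning a record like the one above into the `prompt` / `chosen` / `rejected` columns that TRL's `DPOTrainer` expects. `to_chatml` and `format_for_dpo` are hypothetical helper names (not from the notebook); `to_chatml` just mimics the ChatML template that `tokenizer.apply_chat_template` produced in the output quoted earlier:

```python
def to_chatml(messages):
    """Render a list of {'role', 'content'} dicts in the ChatML layout
    shown in the question (<|im_start|>role\\n...<|im_end|>\\n)."""
    return "".join(
        f"<|im_start|>{m['role']}\n{m['content']}<|im_end|>\n" for m in messages
    )

def format_for_dpo(example):
    # The prompt is the user turn; 'chosen'/'rejected' are the two candidate
    # assistant replies. The trainer pairs prompt + completion itself, so the
    # completions should not repeat the prompt.
    return {
        "prompt": to_chatml([{"role": "user", "content": example["prompt"]}]),
        "chosen": example["chosen"],
        "rejected": example["rejected"],
    }

row = {
    "prompt": "Is the milk produced by a hippopotamus pink in color?",
    "chosen": "No, hippopotamus milk is typically white or beige.",
    "rejected": "No, it is actually white or grayish-white.",
}
out = format_for_dpo(row)
print(out["prompt"])
```

You could apply such a function with `dataset.map(format_for_dpo)`; note that, unlike the `format_function` in the question, the completions here use the dataset's `prompt` field for the user turn rather than wrapping `chosen`/`rejected` themselves in a `"user"` role.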
