Some tokens are meant to appear often in the text, and it may be desirable to avoid penalizing them because of their frequency. For the newline token there is an option, `penalize_nl` (or `--no-penalize-nl`). However, other tokens used in prompt formatting appear often as well, and there is no option to exclude them from penalties.
Models with added tokens may have some tokens appear both in formatting and in the model's output. In particular, some models use the `im_end` token as the stop token. This is already being discussed in #3538. The concern is that responses may get unnecessarily long as the stop token is penalized more and more because of its presence in every message.
To address this issue, I propose adding an option to exempt arbitrary tokens from penalties. This new option would work similarly to `penalize_nl` and would supersede it. It should be independent of `logit_bias`.
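The proposed behavior could look something like the following sketch. This is a hypothetical illustration, not llama.cpp's actual sampler API: the function name, parameter names, and the exact penalty formula (divide positive logits, multiply negative ones, as in the classic repetition penalty) are assumptions for demonstration.

```python
from collections import Counter

def apply_penalties(logits, prev_tokens, exempt_tokens,
                    repeat_penalty=1.1, freq_penalty=0.0, presence_penalty=0.0):
    """Apply repetition/frequency/presence penalties to logits, skipping
    any token listed in exempt_tokens (e.g. newline, im_end).

    Hypothetical sketch; not the real llama.cpp implementation.
    """
    counts = Counter(prev_tokens)
    for tok, count in counts.items():
        if tok in exempt_tokens:
            continue  # formatting/stop tokens stay unpenalized
        logit = logits[tok]
        # classic repetition penalty: shrink positive logits, push negatives down
        logits[tok] = logit / repeat_penalty if logit > 0 else logit * repeat_penalty
        # frequency/presence penalties as used by OpenAI-style samplers
        logits[tok] -= count * freq_penalty + presence_penalty
    return logits
```

With such an option, passing the id of `im_end` in `exempt_tokens` would keep the stop token's logit untouched regardless of how many messages precede it.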
Another possible solution is to automatically exclude all stop tokens from penalties. While this approach may be easier, it does not cover all cases, such as the semicolon. In fact, these approaches are not mutually exclusive and could both be implemented if desired.
Someone else proposed a more general solution: supplying a separate text for the penalty calculation, which could exclude formatting elements like "ASSISTANT:" and end tokens. But I can't find that proposal now.
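The "separate penalty text" idea amounts to counting repetitions over a filtered view of the history rather than the raw context. A minimal sketch, assuming a hypothetical helper name (not an existing llama.cpp function):

```python
def penalty_context(history_tokens, formatting_tokens):
    """Return the token sequence used for penalty counting, with
    formatting tokens (role headers, stop tokens, etc.) dropped.

    Hypothetical helper illustrating the 'separate penalty text' proposal.
    """
    return [t for t in history_tokens if t not in formatting_tokens]
```

The sampler would then compute its repeat counts over `penalty_context(...)` instead of the full context, so tokens like `im_end` never accumulate a penalty in the first place.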