Replies: 1 comment 3 replies
-
Folks, it is time to sleep and this is the first thing that has come to me. How about we fine-tune a GliNER based model to recognize condition spans and action spans. Then, we feed the original sentence along with the detected spans, and ask the LLM to give us the condition, action pairs. |
Beta Was this translation helpful? Give feedback.
3 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Motivation
Today, guidelines are specified via the API (and CLI) with two parts: a condition and an action.
However, in real-world use cases, we often find that people like to specify composite conditions and actions. For example:
While there may be different ways to approach guideline design, this looks like something that should be allowed and expected to work well.
However, we know from experience that the attention of LLMs struggle with following instructions. Currently, the
GuidelineProposer
andMessageEventGenerator
overcome this challenge by looking at each guideline in isolation—as we know, this dramatically improves the instruction-following consistency.But such composite conditions and actions kind of "backdoor" this modeling, and break it to some extent, in that consistency is not quite the same in such cases.
Solution Proposal
While keeping guideline API the same (a singly-specified condition/action), decompose guidelines into one or more conditions and actions. The example above would thus be rendered as follows:
Given such decomposition, we refactor
GuidelineProposer
to evaluate the conditions in isolation, thus achieving more control and clearer optimization opportunities, hence higher accuracy.Discussion
Beta Was this translation helpful? Give feedback.
All reactions