feat: tool result callback #290

Open · carlos-verdes wants to merge 8 commits into main from feature/tool-result-callback

Conversation

carlos-verdes
Contributor

Fixes #289

@carlos-verdes carlos-verdes force-pushed the feature/tool-result-callback branch 2 times, most recently from f192a2b to 2c8ce0b on February 10, 2025 at 09:50
@carlos-verdes
Contributor Author

@0xMochan @cvauclair with this PR we have the tool chain properly implemented.

@cvauclair
Contributor

Hey @carlos-verdes ! This is a feature we 100% do want to support so thanks for the issue/PR 🦾 !

The one concern I have with this implementation is that users lose the ability to perform a "single turn" request to the LLM, or in other words, that the Chat::chat trait method now performs multi-turn by default. It's not hard to imagine that users might want to control that.

That being said, it is still possible to perform a single turn (and get a ToolCall message from the model) by calling the Agent::completion method like so:

// response here can be used to get the `ToolCall`
let response = agent.completion(prompt, chat_history)
    .await?
    .send()
    .await?;
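
From there, a rough sketch of inspecting that single-turn response for tool calls (the `choice` field and the `AssistantContent` variant names here are assumptions about the current message API and may differ between rig versions):

// `choice` holds the assistant content returned by the model; if it contains a
// tool call, the caller decides whether and how to run the tool themselves
for content in response.choice.iter() {
    if let AssistantContent::ToolCall(tool_call) = content {
        // run `tool_call` manually and push the result onto `chat_history`
    }
}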

I'm definitely not against keeping the Prompt and Chat traits as "batteries-included" traits which handle tool result callback, but I'm curious to know your thoughts on this

@cvauclair cvauclair added this to the 2025-02-24 milestone Feb 10, 2025
@s6nqou
Contributor

s6nqou commented Feb 10, 2025

@carlos-verdes This is really helpful! I actually wrote almost the exact same loop logic in my project, and now I can finally remove it!

I also noticed another issue: the completion impls of the providers always append the prompt to the end of full_history before sending it to the LLM. I'm not sure if this will cause any issues. For example:

Expected messages?:

first request:
[Prompt]
second request:
[Prompt, ToolCall, ToolResult]
third request:
[Prompt, ToolCall, ToolResult, AssistantContent]

Current messages:

first request:
[Prompt]
second request:
[ToolCall, ToolResult, Prompt]
third request:
[ToolCall, ToolResult, Prompt, AssistantContent]

I'm wondering if the prompt argument is really necessary, and could it be made optional or even removed? That way, we'd only need to manage chat_history. Would love to hear your thoughts.
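
In other words, the providers currently build the request roughly like this (a simplified sketch of the behaviour described above, not the actual provider code):

// the prompt is always appended last, after any tool-call / tool-result messages
let mut full_history = chat_history;
full_history.push(prompt.into()); // -> [ToolCall, ToolResult, Prompt]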

@carlos-verdes
Contributor Author

carlos-verdes commented Feb 10, 2025

Hey @carlos-verdes ! This is a feature we 100% do want to support so thanks for the issue/PR 🦾 !

The one concern I have with this implementation is that users lose the ability to perform a "single turn" request to the LLM, or in other words, that the Chat::chat trait method now performs multi-turn by default. It's not hard to imagine that users might want to control that.

That being said, it is still possible to perform a single turn (and get a ToolCall message from the model) by calling the Agent::completion method like so:

// response here can be used to get the `ToolCall`
let response = agent.completion(prompt, chat_history)
    .await?
    .send()
    .await?;

I'm definitely not against keeping the Prompt and Chat traits as "batteries-included" traits which handle tool result callback, but I'm curious to know your thoughts on this

The contract between the client and the LLM is like this:
"I give you a human-readable prompt and you give me a human-readable answer."

This PR implements that contract; whatever has to happen to comply with the contract is an implementation detail and hence hidden from the client. In fact, the prompt method returns a String, not a Message, right?

If you use function calls (tools), you need to understand there are going to be extra calls to the LLM, the same way that when you choose a reasoning model you know there will be extra steps too (and they are executed as part of the prompt call, not exposed to the caller).

If you read the documentation from OpenAI, you also see this PR implements exactly the flow they describe:
OpenAI function calling

Without this logic "inside" the completion method you move the problem to the client (as @s6nqou mentioned, this is what he has been doing in his current project code, and he was happy to find out this PR removes that problem for him).
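
For comparison, with the loop hidden inside the trait method the calling code stays as simple as the contract above (a minimal usage sketch; `search_agent` is the same agent used in the example further down):

let answer: String = search_agent
    .prompt("Can you please let me know title and url of rig platform?")
    .await?;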

Now imagine we are still not convinced and we want to handle the extra calls outside the prompt method.
First, we would need to return a Message instead of a String:

impl<M: CompletionModel> Prompt for &Agent<M> {
    async fn prompt(&self, prompt: impl Into<Message> + Send) -> Result<Message, PromptError> {
        self.chat(prompt, vec![]).await
    }
}

Then we are forced to handle every type of message in the client code:

// now I need to keep the chat history vector in the client
let mut history = vec![Message::from("Can you please let me know title and url of rig platform?")];

loop {
    // assume we changed the prompt implementation to return a Message
    let response: Message = search_agent
        .chat(history.clone())
        .await?;

    match response {
        Message::Text(text) => break Ok(text.text),
        Message::ToolCall(tool_call) => {
            history.push(Message::ToolCall(tool_call.clone()));

            let tool_result: Message = todo!(); // make the call to the tool here
            history.push(tool_result);
        }
    }
}

... this makes no sense to me; I would not use this library if I had to do this.

I honestly don't see any justification for pushing the extra LLM calls out of the method and onto the caller.

One option, if you want low-level control, is to have two traits/methods: one responsible for sending a message and returning the LLM response (also as a Message).

pub trait LlmMessageHandler: Send + Sync {
    fn send_message(
        &self,
        message: impl Into<Message> + Send,
    ) -> impl std::future::Future<Output = Result<Message, PromptError>> + Send;
}

And keep the current traits as they are, but with a default implementation that runs the loop from this PR by calling the method from the previous trait:

pub trait Prompt: Send + Sync {
    fn prompt(
        &self,
        prompt: impl Into<Message> + Send,
    ) -> impl std::future::Future<Output = Result<String, PromptError>> + Send {
        async move {
            // here we can implement the loop calling the send_message method
            todo!()
        }
    }
}
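
For illustration, a rough sketch of that default body (assumptions: Prompt takes LlmMessageHandler as a supertrait, the Message::Text / Message::ToolCall variants from the earlier example exist, and call_tool is a hypothetical helper that runs the tool and wraps its result as a Message):

pub trait Prompt: LlmMessageHandler {
    fn prompt(
        &self,
        prompt: impl Into<Message> + Send,
    ) -> impl std::future::Future<Output = Result<String, PromptError>> + Send {
        async move {
            let mut next: Message = prompt.into();
            loop {
                // one LLM round trip, delegated to the low-level trait
                match self.send_message(next).await? {
                    // plain text: the contract is fulfilled, return the answer
                    Message::Text(text) => break Ok(text.text),
                    // tool call: execute the tool and feed its result back
                    // (a real implementation would also keep the chat history)
                    Message::ToolCall(tool_call) => {
                        next = call_tool(&tool_call).await?; // hypothetical helper
                    }
                }
            }
        }
    }
}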

@carlos-verdes
Contributor Author

I also noticed another issue: the completion impls of the providers always append the prompt to the end of full_history before sending it to the LLM. I'm not sure if this will cause any issues. For example:

This is not correct; let me check it out.
The order should be as you mentioned: prompt first, then the tool call, then the tool result.

@0xMochan
Contributor

I also noticed another issue: the completion impls of the providers always append the prompt to the end of full_history before sending it to the LLM. I'm not sure if this will cause any issues. For example:

Expected messages?:

first request:
[Prompt]
second request:
[Prompt, ToolCall, ToolResult]
third request:
[Prompt, ToolCall, ToolResult, AssistantContent]

Current messages:

first request:
[Prompt]
second request:
[ToolCall, ToolResult, Prompt]
third request:
[ToolCall, ToolResult, Prompt, AssistantContent]

I'm wondering if the prompt argument is really necessary, and could it be made optional or even removed? That way, we'd only need to manage chat_history. Would love to hear your thoughts.

Alright, this is a mistake, unfortunately not caught in reviews 🫠. Prompt should always be the latest message w/ and w/o history! I'll handle this in a separate issue + PR, good catch.

@carlos-verdes
Contributor Author

Prompt should always be the latest message w/ and w/o history!

I'm not sure about this; when there are function calls, I think the proper order is:

  • prompt
  • tool call message
  • tool result message

@carlos-verdes carlos-verdes force-pushed the feature/tool-result-callback branch from ce26e92 to 630b157 on February 12, 2025 at 07:13
@carlos-verdes
Contributor Author

carlos-verdes commented Feb 12, 2025

@s6nqou commit 7b84024 should fix the order issue.

@carlos-verdes
Contributor Author

Commit 4c96852 fixes issues found on the DeepSeek model after we merged PR #283.

@nothingalike

OK, I tested more after the filter update and this works for OpenAPI / Ollama when using the URL approach.

closing: #303
