Introduce UID for Messages #77

OskarStark · 2024-10-01T11:20:17Z

We propose introducing UIDs for messages, which would be particularly beneficial in a frontend context. This approach would enable more efficient message handling by reducing unnecessary back-and-forth communication between the client and backend, especially when only the latest updates need to be tracked or pushed to the frontend.

Whenever responding to the frontend, we could push only the new messages.

The UID should ideally be generated based on the content of the Message. Initially, I considered using ULIDs, as they are sortable. However, ULIDs only work when generated on the fly with new, which isn’t feasible for our case since we need to be able to “recalculate” the UID when necessary.

Otherwise, we’d need to implement our own logic to handle this.

cc @silasjoisten as you’re working on the frontend
@DZunke for your awareness

One thing we’re uncertain about is whether we can send UIDs to the LLM or if they should be removed before querying the LLM.

DZunke · 2024-10-01T13:48:26Z

Would it really be useful to implement UID within this library? Just asking because the library is not delivering a persistence, so every message in the bag would get a UID, yes, but for what would it be utilized?

I think the UID is more useful when you bring this stateless library to a point to have a state. This is why i have wrapped an ExtendedMessage around the Messages that enhances the Message, that is put in there, with a UID to make the message storable in the storage of my application and so to have always the same identifier for each message.

Also when you want to just push new messages to a frontend you would have to have a storage with the already sent messages, or at least, identifiers in it, but, as mentioned ... such persistence is from my PoV more a concern of the application implementing this library 🤔

silasjoisten · 2024-10-01T15:38:25Z

Having a UID natively generated by the library can significantly simplify and standardize how messages are managed across various applications using the library. While it’s true that applications can wrap messages and generate their own UIDs, having a native solution reduces the burden on developers to implement this logic repeatedly in every use case. It ensures consistency in how messages are identified and tracked, regardless of the implementation context.

Moreover, even if the library is stateless, the UIDs help in creating a more predictable and reliable flow of messages between different parts of an application (or even between different systems). This is especially important when dealing with distributed or real-time systems, where keeping track of the order and uniqueness of messages is critical. In these cases, the frontend can easily recognize and handle only new messages or changes without additional overhead in comparing message content manually.

Additionally, having a UID allows for the potential future enhancement of the library, like built-in persistence layers, caching strategies, or easier debugging capabilities, since each message could be traced back more easily. Overall, the idea is to provide a more cohesive and feature-complete experience to developers, who can then build upon it without reinventing the wheel each time.

OskarStark · 2024-10-01T17:03:11Z

Just asking because the library is not delivering a persistence, so every message in the bag would get a UID, yes, but for what would it be utilized?

Right now not, that's true, but I plan to implement on to be able to have a "thread" wrapped around the messages.

This is why i have wrapped an ExtendedMessage around the Messages that enhances the Message, that is put in there,

Can you show an example?

chr-hertel · 2024-10-01T20:51:37Z

In general i'd like to better understand the use case since the models and the lib don't have any stakes to that UID.
But also I feel like we should consider the use-case of having additional state per message.

My current standpoint would be: we should make it simpler to have custom state at the message and with the current setup a custom implementation would need to reimplement the interface - which is to be fair only the getRole(). but still i'm wondering if a base Message class, open for inheritance would be an easier way in general. extending the message would not do the job tho, since we have specific subclasses. which i'm totally in favor bc of that better defined state and purpose.

... not done thinking here ...

OskarStark · 2024-10-02T08:43:08Z

It could make sense to always generate a UID for each message, regardless of persistence. This UID would serve as a unique identifier for tracking and managing messages. Then, you could introduce a method such as MessageBag()->messagesNewer($ID) (dummy code) that allows you to retrieve messages that are newer than a certain UID.

This approach would ensure:

Consistent identification of messages.
The possibility of efficiently tracking updates or changes in the message flow.
Flexibility for those who want to extend the library with their own persistence solutions, without having to re-implement UIDs from scratch.

Additionally, generating UIDs would future-proof the library for potential enhancements like state management, persistence, or real-time message handling, which might require more structured identification and tracking of messages.

DZunke · 2024-10-29T19:33:50Z

Hm. Sorry for the long latency here but i do not really get it. When there will be a use case for persistency or sth. like this, hey... fine! No critics from my side 🚀 but for the given thoughts i do not get it.

A functionality to get a "newer" message from the message bag would require, from my PoV, a date based uid (which would force the format?) or an additional timestamp in the messages which would then also be something that is maybe not needed currently? From my PoV the integer keys of the bag has the same stability as uids currently even it sounds not that "modern" but the order does not change between the serializing processes.

Flexibility for those who want to extend it, is also now working for some use-cases at least. I just know my usage and not yours, so sorry when i do not get it - i am sure we differ in complexity. So like mentioned before, i have created an ExtendedMessage on top that is maybe a bit messy currently because of prototyping myself but it looks like this (cause you asked):

<?php

// [..]

use PhpLlm\LlmChain\Message\MessageInterface;

class ExtendedMessage implements JsonSerializable
{
    public string $id;

    public function __construct(
        public readonly MessageInterface $message,
        // Some additional domain specific data like utilized documents, images, tools, etc. for generating the message
    ) {
        $this->id = Uuid::v4()->toString();
    }

    public function jsonSerialize(): array
    {
        return [
            'id' => $this->id,
            'message' => $this->message,
            // [...]
        ];
    }
}

Something a like is adapting the MessageBag and what i then do is handling the given response from the package and recording some additional data that is then all put together to the extended message i have and this is persisted. So no real magic here but hiding the packages interface a bit behind some classes and working with my own in the rest of the application.

In general i will surely not block with hard discussions an approach to UIDs directly in the messages. I just try to understand the need that could not be covered currently without bloating the existing stuff 😉

So ... go ahead and have fun with what we love to do. In the end i am more with Christopher's last message. Have more customization and extension possibilities with the messages itself for "maybe" later extensions that should also be optional to have simple to complex use cases available. Support all of them with a wild mix of message possibilities.

OskarStark · 2024-12-20T09:08:24Z

I am reworking my project, to completely use this lib and the bundle:

I am working on this topic while writing this comment to keep track of m experience. I created and EnhancedMessage:

<?php

namespace App\Bridge\PhpLlm;

use PhpLlm\LlmChain\Message\MessageInterface;
use PhpLlm\LlmChain\Message\Role;
use Safe\DateTimeImmutable;
use Symfony\Component\Uid\Ulid;

final class EnhancedMessage implements MessageInterface
{
    public function __construct(
        public readonly MessageInterface $message,
        public readonly Ulid $id = new Ulid(),
        public readonly DateTimeImmutable $timestamp = new DateTimeImmutable(),
    ) {
    }

    public function getRole(): Role
    {
        return $this->message->getRole();
    }

    public function jsonSerialize(): array
    {
        return array_merge(
            $this->message->jsonSerialize(),
            [
                'id' => $this->id->toString(),
                'timestamp' => $this->timestamp->format(DateTimeImmutable::ATOM),
            ]
        );
    }
}

it would then look like:

$response = $this->chain->call(
    messages: new MessageBag(
        new EnhancedMessage(Message::ofUser($this->message)),
    ),
);

dump(new EnhancedMessage(Message::ofAssistant($response->getContent())));

result would be:

and with jsonSerialize:

{
    "role": "assistant",
    "content": "{\"type\":\"message\",\"message\":\"Hello! How can I assist you today?\"}",
    "id": "01JFHNE07HB3HQHAK7NWE79RM8",
    "timestamp": "2024-12-20T10:12:30+01:00"
}

In my case I persist the MessageBag to reuse it later, so I need to have the id and date information on the message. It's quite easy to make the mistake and use PhpLlm\LlmChain\Model\Message\Message instead of the new EnhancedMessage.

I agree it is doable, but the API looks weird.

DZunke · 2024-12-20T09:18:33Z

Maybe you are right and surely it would not be a big painful problem to have the identifier within the library directly. But as you showed you are also adding a timestamp, so would this be the next attribute to add? And what with people where the domain utilized different identifiers then a Ulid? For example "auto increments" or sequences? Or what if someone has a domain with more fields within the messages?

To me those stuff feels like domain specific logic and a library should not decide what is the correct implementation for additional fields within the mesages and force people to specific attributes.

OskarStark · 2024-12-20T09:20:53Z

I agree with you and the way I showcased is definitely a way I could go, but maybe we can find some more ideas 😄

Thanks for your feedback!

OskarStark · 2024-12-20T09:48:27Z

Maybe its better do decorate the chain and only use the EnhancedMessage there

OskarStark · 2024-12-20T09:52:56Z

Maybe its better do decorate the chain and only use the EnhancedMessage there

Ah this is not possible, as it does not return a message, but a ResponseInterface

OskarStark · 2024-12-20T10:10:07Z

Would it make sense or be possible to add MessageBagInterface, so one could implement its own bag? In my case I could to the new EnhancedMessage($message) thing in the bag or am I missing sth. ?

Use MessageBagInterface instead of concrete MessageBag implementation #166

OskarStark · 2024-12-31T07:35:53Z

I solved this with a custom message bag and the mentioned EnhancedMessage

OskarStark added enhancement New feature or request idea 💡 labels Oct 1, 2024

OskarStark changed the title ~~Enable ULID for Messages~~ Enable UID for Messages Oct 1, 2024

OskarStark changed the title ~~Enable UID for Messages~~ Introduce UID for Messages Oct 1, 2024

chr-hertel mentioned this issue Oct 1, 2024

feat: introduce message metadata and traits (#91) #92

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Introduce UID for Messages #77

Introduce UID for Messages #77

OskarStark commented Oct 1, 2024 •

edited

Loading

DZunke commented Oct 1, 2024

silasjoisten commented Oct 1, 2024

OskarStark commented Oct 1, 2024

chr-hertel commented Oct 1, 2024

OskarStark commented Oct 2, 2024 •

edited

Loading

DZunke commented Oct 29, 2024

OskarStark commented Dec 20, 2024 •

edited

Loading

DZunke commented Dec 20, 2024 •

edited

Loading

OskarStark commented Dec 20, 2024

OskarStark commented Dec 20, 2024

OskarStark commented Dec 20, 2024 •

edited

Loading

OskarStark commented Dec 20, 2024 •

edited

Loading

OskarStark commented Dec 31, 2024

Introduce UID for Messages #77

Introduce UID for Messages #77

Comments

OskarStark commented Oct 1, 2024 • edited Loading

DZunke commented Oct 1, 2024

silasjoisten commented Oct 1, 2024

OskarStark commented Oct 1, 2024

chr-hertel commented Oct 1, 2024

OskarStark commented Oct 2, 2024 • edited Loading

DZunke commented Oct 29, 2024

OskarStark commented Dec 20, 2024 • edited Loading

DZunke commented Dec 20, 2024 • edited Loading

OskarStark commented Dec 20, 2024

OskarStark commented Dec 20, 2024

OskarStark commented Dec 20, 2024 • edited Loading

OskarStark commented Dec 20, 2024 • edited Loading

OskarStark commented Dec 31, 2024

OskarStark commented Oct 1, 2024 •

edited

Loading

OskarStark commented Oct 2, 2024 •

edited

Loading

OskarStark commented Dec 20, 2024 •

edited

Loading

DZunke commented Dec 20, 2024 •

edited

Loading

OskarStark commented Dec 20, 2024 •

edited

Loading

OskarStark commented Dec 20, 2024 •

edited

Loading