Overloaded MultipleCompletionLLMModel.call type #13
Conversation
llmclient/llms.py
Outdated
    @overload
    async def call(
        self,
        messages: list[Message],
        callbacks: list[Callable] | None = None,
        output_type: type[BaseModel] | None = None,
        tools: list[Tool] | None = None,
        tool_choice: Tool | str | None = TOOL_CHOICE_REQUIRED,
        n: Literal[1] = 1,
        **chat_kwargs,
    ) -> LLMResult: ...

    @overload
    async def call(
        self,
        messages: list[Message],
        callbacks: list[Callable] | None = None,
        output_type: type[BaseModel] | None = None,
        tools: list[Tool] | None = None,
        tool_choice: Tool | str | None = TOOL_CHOICE_REQUIRED,
        n: int | None = None,
        **chat_kwargs,
    ) -> list[LLMResult]: ...
When would we expect someone to use these overloads instead of the dedicated methods call_single and call_multiple?

IMO, it would be easier to maintain just two methods:

    async def call(self, ..., n: int) -> list[LLMResult]:
        assert n > 0
        ...

    async def call_single(self, ...) -> LLMResult:
        return (await self.call(..., n=1))[0]
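Fleshed out, that shape might look like the following toy sketch (the class name, prompt handling, and method bodies are all illustrative, not the actual llmclient implementation):

    import asyncio


    class EchoModel:
        """Toy stand-in for the model class; everything here is illustrative."""

        async def call(self, prompt: str, n: int = 1) -> list[str]:
            # Always return a list, even for n=1; callers that want a
            # single result go through call_single instead.
            if n <= 0:
                raise ValueError(f"Expected a positive n, got {n}.")
            return [f"{prompt} ({i})" for i in range(n)]

        async def call_single(self, prompt: str) -> str:
            # Thin wrapper; note the await before indexing, since call
            # is a coroutine.
            return (await self.call(prompt, n=1))[0]


    async def main() -> None:
        model = EchoModel()
        print(await model.call("hello", n=2))    # ['hello (0)', 'hello (1)']
        print(await model.call_single("hello"))  # 'hello (0)'


    asyncio.run(main())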
I like Sid's suggestion too. Also, let's make a docstring somewhere mentioning what n does on the "back end". Readers won't intuitively know what n means; it can refer to so many things.

On a related note, MultipleCompletionLLMModel.achat calls litellm.acompletion. Can we rename MultipleCompletionLLMModel.achat to MultipleCompletionLLMModel.acompletion to standardize with the actual API endpoint ultimately being invoked?
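For context on what n does: it is ultimately forwarded to litellm.acompletion, where it maps to the OpenAI-style n parameter, i.e. the number of completions ("choices") sampled for the same prompt in a single request. A docstring along these lines might work (the wording below is a suggestion, not taken from the repo):

    async def call(self, messages, n: int | None = None, **chat_kwargs):
        """Sample completions from the model for one conversation.

        Args:
            messages: The conversation so far, oldest message first.
            n: Number of completions to sample for this single prompt.
                Forwarded to litellm.acompletion as the OpenAI-style `n`
                parameter (number of choices returned per request). If
                None, the value from the model configuration is used.
            **chat_kwargs: Extra keyword arguments passed through to litellm.

        Returns:
            One LLMResult per sampled completion.
        """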
It looks like we still have the overloads, call_single, and call_multiple here - can we reduce this to call and call_single?
llmclient/llms.py
Outdated
    if not n or n <= 0:
        logger.info(
            "Invalid number of completions `n` requested to the call function. "
            "Will get it from the model's configuration."
        )
We should raise an error if n <= 0. And I don't think we need to emit a logging message if n is unspecified, since that will be a common case.
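Concretely, that might look something like this (a sketch; the self.config dict and its "n" key are assumptions about the surrounding code, not necessarily the real attribute names):

    if n is None:
        # Unspecified: silently fall back to the model configuration.
        n = self.config.get("n", 1)
    if n <= 0:
        raise ValueError(f"Number of completions `n` must be >= 1, got {n}.")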
Messages are not implemented in llmclient anymore
Nice!
Overloaded typing in MultipleCompletionLLMModel.call. It returns a list or a single element of LLMResult, depending on how many completions are requested.

This PR is motivated by the fact that LDP had a dummy class, LLMModel, which only filtered the return of MultipleCompletionLLMModel. To make it more general, MultipleCompletionLLMModel now adapts its call return to be either LLMResult (if only one completion, n=1, is requested) or list[LLMResult] (if n > 1).

The call method was overloaded to satisfy pydantic.
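With the overloads in place, a type checker can infer the return type from the n argument at each call site. Roughly (a sketch; the import paths, constructor arguments, and Message fields shown are assumptions, not taken from this PR):

    import asyncio

    from aviary.core import Message  # import path assumed

    from llmclient import MultipleCompletionLLMModel  # import path assumed


    async def main() -> None:
        # Model name and message content are placeholders.
        model = MultipleCompletionLLMModel(name="gpt-4o-mini")
        messages = [Message(content="Say hello.")]

        single = await model.call(messages)        # n defaults to 1 -> LLMResult
        several = await model.call(messages, n=3)  # n=3 -> list[LLMResult]
        print(type(single).__name__, len(several))


    asyncio.run(main())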