Global cost tracking #28
Conversation
This is a great feature. I've added a few comments requesting docstrings so future users can easily understand how to use it, and I'm approving it.
It works well for now, but it could become misleading if we forget to include the cost_tracker in any new features.
llmclient/cost_tracker.py (Outdated)

```python
TRACK_COSTS = contextvars.ContextVar[bool]("track_costs", default=False)
REPORT_EVERY_USD = 1.0
```
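For readers less familiar with contextvars, a toggle of this shape is typically flipped for a scoped block of work. This is a usage sketch, not code from the PR:

```python
# Enable cost tracking inside a scope, then restore the previous value.
token = TRACK_COSTS.set(True)
try:
    ...  # API calls made here will see TRACK_COSTS.get() == True
finally:
    TRACK_COSTS.reset(token)
```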
A few comments here:
- What do you think of making it a ClassVar of CostTracker? We still get global state, but it's less awkward than the global and setters/getters
- Can you make this name more intuitive, and add units to it? It's unclear whether:
  - It's a frequency (Hz)
  - It's a dollar threshold (USD)
I'm confused on the last bullet - _USD is the units - how would you describe it?
I've renamed set_reporting_frequency -> set_reporting_threshold to remove the ambiguity.
And made both ClassVars.
Actually I went further and made them instance variables - no reason not to.
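To make the thread above concrete, here is a minimal sketch of the shape CostTracker might end up with after these changes. The attribute names and the threshold check come from the diffs quoted in this conversation; the record method name and constructor details are assumptions:

```python
import logging

logger = logging.getLogger(__name__)


class CostTracker:
    def __init__(self) -> None:
        self.lifetime_cost_usd = 0.0
        self.last_report = 0.0
        # A dollar threshold between log reports, not a frequency in Hz
        self.report_every_usd = 1.0

    def set_reporting_threshold(self, threshold_usd: float) -> None:
        self.report_every_usd = threshold_usd

    def record(self, cost_usd: float) -> None:  # method name is an assumption
        self.lifetime_cost_usd += cost_usd
        if self.lifetime_cost_usd - self.last_report > self.report_every_usd:
            logger.info(
                f"Cumulative llmclient API call cost: ${self.lifetime_cost_usd:.8f}"
            )
            self.last_report = self.lifetime_cost_usd
```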
```diff
@@ -73,3 +74,10 @@ def fixture_reset_log_levels(caplog) -> Iterator[None]:
         logger = logging.getLogger(name)
         logger.setLevel(logging.NOTSET)
         logger.propagate = True
+
+
+class CILLMModelNames(StrEnum):
```
We ought to just put this in llmclient as one enum:

```python
class CommonLLMNames(StrEnum):
    # Use these for model defaults
    OPENAI_GENERAL = "gpt-4o-2024-08-06"  # Cheap, fast, and decent

    # Use these in unit testing
    OPENAI_TEST = "gpt-4o-mini-2024-07-18"  # Cheap and not OpenAI's cutting edge
    ANTHROPIC_TEST = "claude-3-haiku-20240307"  # Cheap and not Anthropic's cutting edge
```

Then both the app and unit tests will just use CommonLLMNames.
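Since StrEnum members are themselves strings, call sites would not need .value. A hypothetical usage, assuming the enum lands in llmclient:

```python
from llmclient import CommonLLMNames  # import path is an assumption

# StrEnum members compare equal to (and can be passed as) plain strings
assert CommonLLMNames.OPENAI_TEST == "gpt-4o-mini-2024-07-18"
```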
I'll leave that for another PR
I did this in #30
```python
if self.lifetime_cost_usd - self.last_report > self.report_every_usd:
    logger.info(
        f"Cumulative llmclient API call cost: ${self.lifetime_cost_usd:.8f}"
```
f"Cumulative llmclient API call cost: ${self.lifetime_cost_usd:.8f}" | |
f"Cumulative client API call cost: ${self.lifetime_cost_usd:.8f}" |
We will eventually maybe rename from llmclient
, and maybe do things besides just LLMs (e.g. embeddings), so let's just be generally worded here
This was intentional wording - the cost tracker only tracks llmclient costs right now.
```python
image = np.zeros((32, 32, 3), dtype=np.uint8)
image[:] = [255, 0, 0]
```
Suggested change:

```diff
-image = np.zeros((32, 32, 3), dtype=np.uint8)
-image[:] = [255, 0, 0]
+image = np.full((32, 32, 3), [255, 0, 0], dtype=np.uint8)
```
Please feel free to ignore this too
Yeah I didn't add this code - just mirroring what's in the other tests.
text="What color is this square? Show me your chain of reasoning.", | ||
images=image, | ||
), | ||
] # TODO: It's not decoding the image. It's trying to guess the color from the encoded image string. |
Can you:
- Move this TODO to not be at the end of a list
- Clarify what "it" is, when you say "it's not decoding"
As above - I copied this from test_llms.py
```python
{
    "model_list": [
        {
            "model_name": "gpt-4o-mini",
```
Can we use CILLMModelNames.OPENAI here?
As above - I copied this from test_llms.py
I'd like to have a way to track costs across all LLM API usage - including agents and environments. This PR implements that.

I made a few semi-arbitrary choices here - there are other ways of implementing this and I'm open to suggestions.

First: I decided to capture costs at the lowest level (i.e. all calls to acompletion, achat, etc.). We could alternatively only support functions that return LLMResult and use LLMResult.cost, but we'd run the risk of not catching everything.

Second: litellm async streaming calls don't return AsyncGenerators (fixed our type hints), but instead CustomStreamWrappers, which implement __aiter__ and __anext__. I had to create a shim called TrackedStreamWrapper to handle this - open to alternative suggestions (see the comment in the code about why async for ... yield doesn't work).
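Since TrackedStreamWrapper is only described in prose here, a minimal sketch of the shim idea follows. Everything except the class name and the __aiter__/__anext__ protocol is an assumption, and the PR's actual code comment should be consulted for why an async for ... yield generator doesn't work:

```python
from typing import Any


class TrackedStreamWrapper:
    """Wrap litellm's CustomStreamWrapper so streamed chunks can be cost-tracked."""

    def __init__(self, stream: Any) -> None:
        self.stream = stream  # the underlying litellm CustomStreamWrapper

    def __aiter__(self) -> "TrackedStreamWrapper":
        return self

    async def __anext__(self) -> Any:
        try:
            return await self.stream.__anext__()
        except StopAsyncIteration:
            # Hypothetical hook: record the accumulated cost once the stream ends
            raise
```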