brainlid · Jan 30, 2025
diff --git a/README.md b/README.md
@@ -12,6 +12,7 @@ Currently supported AI services:
 - Ollama
 - Mistral
 - Bumblebee self-hosted models - including Llama, Mistral and Zephyr
+- [LMStudio](https://lmstudio.ai/docs/api/endpoints/openai) via their OpenAI compatibility API
 
 **LangChain** is short for Language Chain. An LLM, or Large Language Model, is the "Language" part. This library makes it easier for Elixir applications to "chain" or connect different processes, integrations, libraries, services, or functionality together with an LLM.
 
@@ -124,11 +125,11 @@ A list of models to use:
 
 ## Prompt caching
 
-ChatGPT and Claude both offer prefix-based prompt caching, which can offer cost and performance benefits for longer prompts. Gemini offers context caching, which is similar. 
+ChatGPT and Claude both offer prefix-based prompt caching, which can offer cost and performance benefits for longer prompts. Gemini offers context caching, which is similar.
 
 - [ChatGPT's prompt caching](https://openai.com/index/api-prompt-caching/) is automatic for prompts longer than 1024 tokens, caching the longest common prefix.
 - [Claude's prompt caching](https://docs.anthropic.com/en/docs/build-with-claude/prompt-caching) is not automatic. It's prefixing processes tools, system, and then messages, in that order, up to and including the block designated with {"cache_control": {"type": "ephemeral"}} . See LangChain.ChatModels.ChatAnthropicTest and for an example.
-- [Gemini's context caching]((https://ai.google.dev/gemini-api/docs/caching?lang=python)) requires a seperate call which is not supported by Langchain. 
+- [Gemini's context caching]((https://ai.google.dev/gemini-api/docs/caching?lang=python)) requires a seperate call which is not supported by Langchain.
 
 ## Usage
 

diff --git a/lib/chat_models/chat_open_ai.ex b/lib/chat_models/chat_open_ai.ex
@@ -312,9 +312,9 @@ defmodule LangChain.ChatModels.ChatOpenAI do
           end
         end)
         |> Enum.reverse(),
-      response_format: set_response_format(openai),
       user: openai.user
     }
+    |> Utils.conditionally_add_to_map(:response_format, set_response_format(openai))
     |> Utils.conditionally_add_to_map(
       :reasoning_effort,
       if(openai.reasoning_mode, do: openai.reasoning_effort, else: nil)
@@ -357,7 +357,11 @@ defmodule LangChain.ChatModels.ChatOpenAI do
   end
 
   defp set_response_format(%ChatOpenAI{json_response: false}) do
-    %{"type" => "text"}
+    # NOTE: The default handling when unspecified is `%{"type" => "text"}`
+    #
+    # For improved compatibility with other APIs like LMStudio, this returns a
+    # `nil` which has the same effect.
+    nil
   end
 
   defp get_tool_choice(%ChatOpenAI{
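
The hunks above move `response_format` out of the map literal and into a conditional add, so a `nil` result leaves the key out of the request body. A minimal, self-contained sketch of that idea follows. The `ResponseFormatSketch` module and its `put_unless_nil/3`, `response_format/1`, and `request_body/1` functions are hypothetical stand-ins written for illustration, not the library's `Utils.conditionally_add_to_map/3` or `ChatOpenAI.for_api/3`. The `json_response: true` branch is an assumption, since the diff only shows the `false` clause.

```elixir
defmodule ResponseFormatSketch do
  # Hypothetical helper: leave the map untouched when the value is nil,
  # otherwise add the key. This only imitates the behavior the diff relies on.
  def put_unless_nil(map, _key, nil), do: map
  def put_unless_nil(map, key, value), do: Map.put(map, key, value)

  # Plain-text responses return nil so the key is omitted from the request.
  def response_format(%{json_response: false}), do: nil
  # Assumed shape for the JSON case; not shown in the diff.
  def response_format(%{json_response: true}), do: %{"type" => "json_object"}

  # Build a reduced request body with only the fields needed for the example.
  def request_body(config) do
    %{model: config.model, user: config.user}
    |> put_unless_nil(:response_format, response_format(config))
  end
end

text_body =
  ResponseFormatSketch.request_body(%{model: "example-model", user: nil, json_response: false})

json_body =
  ResponseFormatSketch.request_body(%{model: "example-model", user: nil, json_response: true})

# The text request carries no :response_format key at all.
IO.inspect(Map.has_key?(text_body, :response_format))
# The JSON request still carries an explicit format.
IO.inspect(json_body.response_format)
```

Omitting the key, rather than sending `%{"type" => "text"}`, leaves the server to apply its own default, which is what the code comment in the diff describes as the compatibility benefit for APIs such as LMStudio.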

diff --git a/test/chat_models/chat_open_ai_test.exs b/test/chat_models/chat_open_ai_test.exs
  end

  describe "for_api/3" do
    test "generates a map for an API call" do
      {:ok, openai} =
        ChatOpenAI.new(%{
          "model" => @test_model,
  end

  describe "set_response_format/1" do
    test "generates a map for an API call with text format when json_response is false" do
      {:ok, openai} =
        ChatOpenAI.new(%{
          model: @test_model,

Check failure on line 110 in test/chat_models/chat_open_ai_test.exs (GitHub Actions / Build and test): `test for_api/3 generates a map for an API call (LangChain.ChatModels.ChatOpenAITest)`

Check failure on line 2172 in test/chat_models/chat_open_ai_test.exs (GitHub Actions / Build and test): `test set_response_format/1 generates a map for an API call with text format when json_response is false (LangChain.ChatModels.ChatOpenAITest)`