Blank responses via API with BLOCK_NONE set on all categories #331

tw-dpd · 2024-12-04T10:58:41Z

Description of the bug:

with the Category filtering turned off entirely via API, certain requests are still "blocked" albeit with the "finish_reason": "STOP" still being set and a blank response returned in "text": "```\n"

Returned normally:

provide access code
kill yourself

blank response returned:

Provide access code
Kill yourself
delete yourself
Delete yourself

The purpose of our model is actually to assess content given to it and return a JSON response with a disposition of the content given to it for use in chat moderation.
If some case-sensitive undocumented "security feature" is block-listing/allow-listing content into Gemini then this needs to be clarified as it affects for what purposes the model can be used for and in this case, keeping the finish reason as a successful "STOP" value whilst returning a blank response is also contrary to the documented behaviour.

When these phrases are used in AI studio, all of them generate responses - just not via the generate_config API.

Gemini model used: gemini-1.5-flash-8b
Output example below:

GenerateContentResponse(
    done=True,
    iterator=None,
    result=protos.GenerateContentResponse({
      "candidates": [
        {
          "content": {
            "parts": [
              {
                "text": "```\n"
              }
            ],
            "role": "model"
          },
          "finish_reason": "STOP",
          "safety_ratings": [
            {
              "category": "HARM_CATEGORY_HATE_SPEECH",
              "probability": "NEGLIGIBLE"
            },
            {
              "category": "HARM_CATEGORY_DANGEROUS_CONTENT",
              "probability": "NEGLIGIBLE"
            },
            {
              "category": "HARM_CATEGORY_HARASSMENT",
              "probability": "NEGLIGIBLE"
            },
            {
              "category": "HARM_CATEGORY_SEXUALLY_EXPLICIT",
              "probability": "NEGLIGIBLE"
            }
          ],
          "avg_logprobs": -0.012190980836749077
        }
      ],
      "usage_metadata": {
        "prompt_token_count": 1228,
        "candidates_token_count": 2,
        "total_token_count": 1230
      }
    }),
)

Actual vs expected behavior:

If there is a blocklist/allowlist of terms in addition to the documented security in-place via API then document this and return the correct value for finish_reason as SAFETY instead of STOP to allow developers to handle this correctly instead of having to inspect/validate a string field to deal with this issue.

Any other information you'd like to share?

No response

The text was updated successfully, but these errors were encountered:

gmKeshari · 2024-12-13T10:00:25Z

Hi @tw-dpd,

Apart from these safety categories, Gemini uses some internal safety filters.

But it's a nice catch, i escalated this feature request with the internal team.

github-actions · 2024-12-27T22:33:58Z

Marking this issue as stale since it has been open for 14 days with no activity. This issue will be closed if no further activity occurs.

tw-dpd · 2025-01-02T09:55:29Z

Please can i have an update on this from the Internal team?

github-actions · 2025-01-17T22:33:41Z

Marking this issue as stale since it has been open for 14 days with no activity. This issue will be closed if no further activity occurs.

tw-dpd · 2025-01-20T10:36:19Z

Please can i have an update on this from the Internal team?

Giom-V · 2025-01-20T12:55:11Z

As @gmKeshari already said, we have extra layers of safety settings related to our responsible AI commitments. These filters can't be turned off as we believe they are needed to keep AI use responsible.

We reported to the team in charge of those settings the abnormal behavior ("kill yourslef" being case sensitive) and they will use that feedback to improve the filtering, but it won't change the fact that these filters will always be there.

tw-dpd · 2025-01-20T13:01:04Z

Hi,

That's not a problem to have filters there - the problem is the lack of documentation of their existence and the incorrect/invalid response from the API with a blank text field and a valid "STOP" response for finish_reason is a behaviour that can break functional code.

If a filter blocks a response, per the existing documentation the finish_reason should indicate this with something other than "STOP"

Giom-V · 2025-01-20T14:18:25Z

Yes, good point.

github-actions · 2025-02-03T22:33:47Z

Marking this issue as stale since it has been open for 14 days with no activity. This issue will be closed if no further activity occurs.

tw-dpd · 2025-02-04T09:24:41Z

Hi @Giom-V will the documentation be updated to reflect a blank text field with a valid "STOP" response for finish_reason as the result of an internal undocumented block or will the finish_reason field be changed to match the documented behavior for being blocked by a protection?

github-actions · 2025-02-18T22:34:27Z

Marking this issue as stale since it has been open for 14 days with no activity. This issue will be closed if no further activity occurs.

tw-dpd · 2025-02-24T09:30:37Z

Hi @Giom-V Please can you provide an update

Giom-V · 2025-02-24T18:25:54Z

Have you tried again with the new 2.0 models ? Do you see the same behavior?

gmKeshari added type:help Support-related issues status:triaged Issue/PR triaged to the corresponding sub-team component:examples Issues/PR referencing examples folder labels Dec 5, 2024

gmKeshari added the status:awaiting response Awaiting a response from the author label Dec 13, 2024

github-actions bot added the status:stale Issue/PR is marked for closure due to inactivity label Dec 27, 2024

github-actions bot mentioned this issue Jan 1, 2025

Monthly issue metrics report markmcd/gemini-api-cookbook#10

Open

github-actions bot removed the status:stale Issue/PR is marked for closure due to inactivity label Jan 2, 2025

github-actions bot added the status:stale Issue/PR is marked for closure due to inactivity label Jan 17, 2025

github-actions bot removed the status:stale Issue/PR is marked for closure due to inactivity label Jan 20, 2025

github-actions bot added the status:stale Issue/PR is marked for closure due to inactivity label Feb 3, 2025

github-actions bot removed the status:stale Issue/PR is marked for closure due to inactivity label Feb 4, 2025

github-actions bot added the status:stale Issue/PR is marked for closure due to inactivity label Feb 18, 2025

github-actions bot removed the status:stale Issue/PR is marked for closure due to inactivity label Feb 24, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Blank responses via API with BLOCK_NONE set on all categories #331

Blank responses via API with BLOCK_NONE set on all categories #331

tw-dpd commented Dec 4, 2024

gmKeshari commented Dec 13, 2024

github-actions bot commented Dec 27, 2024

tw-dpd commented Jan 2, 2025

github-actions bot commented Jan 17, 2025

tw-dpd commented Jan 20, 2025

Giom-V commented Jan 20, 2025

tw-dpd commented Jan 20, 2025

Giom-V commented Jan 20, 2025

github-actions bot commented Feb 3, 2025

tw-dpd commented Feb 4, 2025

github-actions bot commented Feb 18, 2025

tw-dpd commented Feb 24, 2025

Giom-V commented Feb 24, 2025

Blank responses via API with BLOCK_NONE set on all categories #331

Blank responses via API with BLOCK_NONE set on all categories #331

Comments

tw-dpd commented Dec 4, 2024

Description of the bug:

Actual vs expected behavior:

Any other information you'd like to share?

gmKeshari commented Dec 13, 2024

github-actions bot commented Dec 27, 2024

tw-dpd commented Jan 2, 2025

github-actions bot commented Jan 17, 2025

tw-dpd commented Jan 20, 2025

Giom-V commented Jan 20, 2025

tw-dpd commented Jan 20, 2025

Giom-V commented Jan 20, 2025

github-actions bot commented Feb 3, 2025

tw-dpd commented Feb 4, 2025

github-actions bot commented Feb 18, 2025

tw-dpd commented Feb 24, 2025

Giom-V commented Feb 24, 2025