Not sure where else to post this, but when attempting to use any of the Gemini 2...

Tiberium · 2025-06-17T17:38:39 1750181919

What finish reason are you getting? Perhaps your code sets a low max_tokens, so the generation stops while the model is still thinking, without giving any actual output.

zelias · 2025-06-17T18:03:26 1750183406

The finish reason is `length`. I have tried setting minimal token budgets, really small prompts, and max lengths of various sizes from 100-4000 and nothing seems to make a consistent dent in the behavioral pattern.

danbrooks · 2025-06-18T03:03:01 1750215781

This can happen if the prompt or response is blocked by a safety filter. Check some of the other fields in the response.