Is there an undocumented hardcoded timeout for Gemini responses even in streamin... | Hacker News

Hacker News new | past | comments | ask | show | jobs | submit

login

troupo 19 days ago | parent | context | favorite | on: Google Gemini has the worst LLM API

Is there an undocumented hardcoded timeout for Gemini responses even in streaming mode? JSON output according to a schema can get quite lengthy, and I can't seem to get all of it for some inputs because Gemini seemingly terminates requests

NoahZuniga 19 days ago [–]

This is probably just you hitting the model's internal output length maximum. Its 65,536 tokens for 2.5 pro and flash.

For other models, see this link and open up the collapsed section for your specific model: https://ai.google.dev/gemini-api/docs/models

troupo 19 days ago | [–]

Thanks! It might just be that!

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact