
Is there an undocumented hardcoded timeout for Gemini responses, even in streaming mode? JSON output that follows a schema can get quite lengthy, and for some inputs I can't seem to get all of it because Gemini seemingly terminates the request early.



This is probably just you hitting the model's maximum output length. It's 65,536 tokens for 2.5 Pro and Flash.

For other models, see this link and open up the collapsed section for your specific model: https://ai.google.dev/gemini-api/docs/models
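
If you want to confirm that's what is happening rather than a timeout, the finish reason on the response should tell you. Here's a rough sketch, assuming you have the parsed JSON of a REST-style response (in streaming mode, I'd expect it on the last chunk) with `candidates[0].finishReason` and `usageMetadata.candidatesTokenCount`. The `truncation_report` helper and the sample payload are made up for illustration, not real API output:

```python
import json


def truncation_report(response: dict) -> dict:
    """Summarize whether a generateContent-style response stopped at the output cap."""
    # Fall back to an empty candidate so missing or empty lists don't raise.
    candidates = response.get("candidates") or [{}]
    finish_reason = candidates[0].get("finishReason")
    output_tokens = response.get("usageMetadata", {}).get("candidatesTokenCount")
    return {
        "finish_reason": finish_reason,
        "output_tokens": output_tokens,
        # "MAX_TOKENS" means generation stopped because the output limit was reached.
        "hit_output_limit": finish_reason == "MAX_TOKENS",
    }


# Hypothetical final chunk, hand-written for illustration.
sample = json.loads(
    '{"candidates": [{"finishReason": "MAX_TOKENS"}],'
    ' "usageMetadata": {"candidatesTokenCount": 65536}}'
)
report = truncation_report(sample)
print(report)
```

If `hit_output_limit` comes back true, the fix is on the output side (a smaller schema, or splitting the job across several requests), not a longer client timeout.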


Thanks! It might just be that!



