
> but inference is in the long run gonna get the lion's share of the work.

I'm not sure - might the equilibrium state not be that we are constantly fine-tuning models on the latest data (e.g. a social media firehose)?
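
To make that concrete, here's a toy sketch of what such a loop could look like - everything in it (the tiny model, fetch_firehose_batch, the update cadence) is a hypothetical stand-in, not any provider's actual pipeline:

    # Hypothetical continual fine-tuning loop; all names are stand-ins.
    import torch
    import torch.nn as nn

    torch.manual_seed(0)

    # Stand-in for a pretrained model: a toy linear regressor.
    model = nn.Linear(16, 1)
    opt = torch.optim.AdamW(model.parameters(), lr=1e-4)
    loss_fn = nn.MSELoss()

    def fetch_firehose_batch(n=32):
        # Stub: in reality this would pull the newest posts from the feed.
        x = torch.randn(n, 16)
        y = x.sum(dim=1, keepdim=True) + 0.1 * torch.randn(n, 1)
        return x, y

    # The equilibrium imagined above: training never stops, it just
    # interleaves with serving - each tick is one step on fresh data.
    for tick in range(100):  # stand-in for "run forever"
        x, y = fetch_firehose_batch()
        opt.zero_grad()
        loss = loss_fn(model(x), y)
        loss.backward()
        opt.step()
        # A real system would periodically snapshot and hot-swap the
        # updated weights into the inference fleet here - that ongoing
        # retraining is what would keep training's share of compute high.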




The head of Groq said that, in his experience at Google, training was less than 10% of compute.


Isn't Groq still more expensive than GPU-based providers?



