actually that might not be the case. don't underestimate the value of older, bet... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		leschak on April 7, 2023 \| parent \| context \| favorite \| on: What it feels like to work in AI right now actually that might not be the case. don't underestimate the value of older, better understood and much smaller models. also, why not call bert-style (encoder) models LLMs as well. i would expect last-gen models to give us an edge in controlling the effects of the latest ones (cf. the alignment discussion).

chopete3 on April 7, 2023 [–]

BERT models are also LLMs. I referred to LLM more as an API based access, hosted by Microsoft, Google or AWS for large scale isolated/production consumption, like RDS (MySQL, Postgres).

There will always be custom models, with controlled training data and specific use cases.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact