Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

actually that might not be the case. don't underestimate the value of older, better understood and much smaller models. also, why not call bert-style (encoder) models LLMs as well. i would expect last-gen models to give us an edge in controlling the effects of the latest ones (cf. the alignment discussion).


BERT models are also LLMs. I referred to LLM more as an API based access, hosted by Microsoft, Google or AWS for large scale isolated/production consumption, like RDS (MySQL, Postgres).

There will always be custom models, with controlled training data and specific use cases.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: