Hacker News
H3: New generative language model that can outperform Transformers (twitter.com/realdanfu)
2 points by maxutility on Feb 6, 2023 | past
Hungry Hungry Hippos: Towards Language Modeling with State Space Models (twitter.com/realdanfu)
1 point by haldujai on Jan 24, 2023 | past
H3 – Outperforming GPT-Neo-2.7B with only 2 attention layers (twitter.com/realdanfu)
10 points by m00x on Jan 23, 2023 | past
