Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Sure it's a similar architecture but c'mon, 307 million learnable params for ViT-L vs 175 billion for GPT-3...


Consider applying for YC's Fall 2025 batch! Applications are open till Aug 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: