Great article. Perhaps some part of this magic number simply factors in the amount of compute needed to run the image through the CNN (in proportion to the compute used per token in the LM).


