I have some experience in this area: I've worked on machine learning frameworks, trained large models in datacenters, and keep a personal machine for tinkering.
This makes very little sense. Even if he were able to achieve his goals, consumer GPU hardware is bounded by network and memory bandwidth, so it's a poor target to optimize. Fast device-to-device communication is only available on datacenter GPUs, and it's essential for training models like LLaMA, Stable Diffusion, etc. Amdahl's law strikes again.
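To make the Amdahl's-law point concrete, here's a rough sketch. If some fraction of each training step is serialized on communication over a slow consumer interconnect, the achievable speedup is capped no matter how many GPUs you add. The 20% serial fraction below is a made-up illustrative number, not a measurement of any real workload:

```python
def amdahl_speedup(parallel_fraction, n_workers):
    """Amdahl's law: speedup when only parallel_fraction of the work
    scales across n_workers; the rest (e.g. gradient sync over a slow
    PCIe/network link) stays serial."""
    serial = 1.0 - parallel_fraction
    return 1.0 / (serial + parallel_fraction / n_workers)

# Hypothetical: if slow transfers serialize 20% of a training step,
# no number of consumer GPUs gets you past a 5x speedup.
for n in (2, 8, 64, 1_000_000):
    print(n, round(amdahl_speedup(0.8, n), 2))
# → 2 1.67
# → 8 3.33
# → 64 4.71
# → 1000000 5.0
```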