Your home setup is much less efficient than production inference in a data center. An open-source implementation of SDXL-Lightning runs at about 12 images per second on a TPU v5e-8, which draws ~2 kW at full load. That works out to roughly 170 J per image, or about 1/400th of a phone charge.
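For reference, a quick back-of-envelope sketch of that arithmetic (the ~18.5 Wh phone battery capacity is my assumption, typical of a recent flagship; the power and throughput figures are the ones quoted above):

    # Sanity check of the per-image energy claim.
    tpu_power_w = 2000        # ~2 kW for a TPU v5e-8 host at full load (quoted above)
    images_per_s = 12         # SDXL-Lightning throughput (quoted above)
    phone_battery_wh = 18.5   # assumed capacity of a full phone charge

    j_per_image = tpu_power_w / images_per_s                       # ~167 J
    fraction_of_charge = j_per_image / (phone_battery_wh * 3600)
    print(f"{j_per_image:.0f} J per image, ~1/{1/fraction_of_charge:.0f} of a phone charge")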
These models do not appear out of thin air; add in the training cost in terms of power. Yes, it's capex rather than opex, but it's not free by any means.
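To make that concrete, here is a purely hypothetical amortization sketch; both input numbers are placeholders I made up for illustration, not measured figures for any real model:

    # Amortizing training energy over generated images (placeholder numbers only).
    training_energy_kwh = 500_000            # assumed total training energy (placeholder)
    images_served_lifetime = 1_000_000_000   # assumed images generated over the model's lifetime (placeholder)

    amortized_j_per_image = training_energy_kwh * 3.6e6 / images_served_lifetime
    print(f"amortized training energy: ~{amortized_j_per_image:.0f} J per image")
    # With these placeholders: ~1800 J/image, i.e. training energy can dwarf the
    # per-image inference energy unless the model serves a very large number of requests.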
Plus, not all of these models run on optimized TPUs; they mostly run on NVIDIA cards, which aren't nearly as efficient.
Otherwise I could argue that running these models is essentially free, since my camera can do face recognition and tracking at 30 fps without a noticeable power draw, using a dedicated, purpose-built DSP for that stuff.
https://cloud.google.com/blog/products/compute/accelerating-...
https://arxiv.org/pdf/2502.01671