
In a professional context (apart from individual apps distributed by small creators and indie hackers), models are usually run in native code, typically C++, on standardized runtimes such as TensorRT (for Nvidia devices) or onnxruntime (hardware-agnostic).

