Hacker Newsnew | past | comments | ask | show | jobs | submitlogin
Save 80% Memory for DPO and ORPO in Liger-Kernel (twitter.com/hsu_byron)
1 point by byhsu 8 months ago | hide | past | favorite | 1 comment


Introducing the first open-source optimized post-training losses in Liger Kernel with ~80% memory reduction, featuring DPO, CPO, ORPO, SimPO, JSD, and more, achieving up to 70% end-to-end speedup through larger batch size. Use it as any PyTorch module - Available today in Liger v0.5.0!




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: