Hacker Newsnew | past | comments | ask | show | jobs | submit | fromlogin
Popular Reinforcement Learning algorithms and their implementation (2023) (aimind.so)
3 points by downboots 8 months ago | past
DoubleAgents: Fine-Tuning LLMs for Covert Malicious Tool Calls (aimind.so)
98 points by grumblemumble 10 months ago | past | 30 comments

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: