Submissions from aimind.so

		Popular Reinforcement Learning algorithms and their implementation (2023) (aimind.so)
		3 points by downboots 8 months ago
		DoubleAgents: Fine-Tuning LLMs for Covert Malicious Tool Calls (aimind.so)
		98 points by grumblemumble 10 months ago | 30 comments