docere's submissions | Hacker News

1.		MedEvalArena: Peer-judged LLM medical reasoning benchmark (danbernardo.substack.com)
		1 point by docere 68 days ago \| past
2.		LLM Failure Modes in Medical QA Arising from Inflexible Reasoning (arxiv.org)
		3 points by docere on Feb 10, 2025 \| past
3.		EEG-GPT (arxiv.org)
		4 points by docere on Feb 13, 2024 \| past