Hacker Newsnew | past | comments | ask | show | jobs | submit | docere's submissionslogin
1.MedEvalArena: Peer-judged LLM medical reasoning benchmark (danbernardo.substack.com)
1 point by docere 68 days ago | past
2.LLM Failure Modes in Medical QA Arising from Inflexible Reasoning (arxiv.org)
3 points by docere on Feb 10, 2025 | past
3.EEG-GPT (arxiv.org)
4 points by docere on Feb 13, 2024 | past

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: