
masterjack 42 days ago | on: Evaluating publicly available LLMs on IMO 2025

Exactly. And presumably had a more sophisticated harness around the model: longer reasoning chains, best-of-N, self-judging, etc.
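
For anyone unfamiliar with the terms, here is a rough sketch of what "best of N" plus "self judging" could look like. This is only an illustration under assumptions, not how any lab's actual harness works: `best_of_n`, `toy_generate`, and `toy_judge` are hypothetical names, and the two toy functions stand in for real model calls (one to sample a solution, one to have the model grade it).

```python
import random
from typing import Callable, Tuple


def best_of_n(
    problem: str,
    generate: Callable[[str, random.Random], str],
    judge: Callable[[str, str], float],
    n: int = 8,
    seed: int = 0,
) -> Tuple[str, float]:
    """Sample n candidate solutions and return the one the judge scores highest."""
    if n < 1:
        raise ValueError("n must be at least 1")
    rng = random.Random(seed)
    # Step 1: draw n independent candidates (in practice, n model samples).
    candidates = [generate(problem, rng) for _ in range(n)]
    # Step 2: score each candidate (in practice, a second "judge" model call).
    scored = [(judge(problem, c), c) for c in candidates]
    # Step 3: keep the top-scoring candidate.
    best_score, best = max(scored, key=lambda pair: pair[0])
    return best, best_score


# Hypothetical stand-ins so the sketch runs without any model API.
def toy_generate(problem: str, rng: random.Random) -> str:
    return f"answer={rng.randint(0, 9)}"


def toy_judge(problem: str, candidate: str) -> float:
    # Pretend larger answers are "better"; a real judge would grade the proof.
    return float(candidate.split("=")[1])


if __name__ == "__main__":
    best, score = best_of_n("toy problem", toy_generate, toy_judge, n=8)
    print(best, score)
```

The point is just that selection happens outside the model: the same weights can look much stronger when you sample many attempts and filter them, which is why harness details matter when comparing results.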