Hacker News
riku_iki
4 months ago
on:
SOTA on swebench-verified: relearning the bitter l...
Where can we check the actual model output? The worry is that it could be an unreadable, buggy mess, even if it managed to close some specific bug.