Hacker News
ImprobableTruth on March 31, 2023 | on: Vicuna: An open-source chatbot impressing GPT-4 wi...
I don't think there are any benchmarks for chat models. You could just do the usual LAMBADA, etc., but what's the point? We already know the scores for LLaMA, and we know that RLHF doesn't meaningfully improve capabilities.