Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
esperent
on Feb 24, 2024
|
parent
|
context
|
favorite
| on:
I Spent a Week with Gemini Pro 1.5–It's Fantastic
GPT3.5 and GPT4 are not the only options though, right? I don't follow that closely but there must be other models with longer context length that are roughly GPT3.5 quality by now, and they even probably use the same API.
wkat4242
on Feb 24, 2024
|
next
[–]
I don't really know. The benefit of ChatGPT is that it's so big, there are so many nice APIs for it :)
I'm not so deep into it all.
ajcp
on Feb 24, 2024
|
prev
[–]
Mistral 8x7b has can handle context of ~32,000 pretty comfortably and it benchmarks at or above GPT3.5
ComputerGuru
on Feb 24, 2024
|
parent
[–]
Is that the sliding context window size? Because I didn't have good results with sliding context windows in the regular Mistral models.
ajcp
on Feb 24, 2024
|
root
|
parent
[–]
Yeah, I think they fine-tune without a specific window size target to achieve and then keep expanding context until it starts falling over.
Join us for
AI Startup School
this June 16-17 in San Francisco!
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: