
Is the context only 1024 tokens? It seems to cut off more and more (which is weird) as the conversation gets longer.


It looks like the Llamafile team is taking questions in their live Q&A tomorrow (Thursday) at 17:00 UTC - https://www.youtube.com/live/dwhBvUN-MD8?feature=shared



