Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I could be wrong, as I haven't used Mamba, but it seems to remain similar to transformers in that it doesn't "decide" anything and streams tokens to follow the existing ones; attention isn't a thing in the same way, but recency does still have impact. To that end, putting context after the question makes it more likely to follow the context, not the question.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: