
I came here just to complain about that :-) All the LLMs I've used seem to give more weight to things at the beginning of the context window and omit many details. E.g., I tried this simple thing: I pasted a friend's CV and my own into Gemini and asked it to recommend topics for a joint conference presentation. The results depended greatly on the order in which the CVs were pasted.


The middle tends to be underweighted. The beginning and end get more attention.
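One way to see this is a needle-in-a-haystack probe: bury a known fact at different depths in filler text and compare how reliably it's recalled. A rough sketch in Python (the ask_model call is just a stand-in for whatever API you'd actually use):

    def build_probe(needle, filler_sentence, n_filler, position):
        """Bury `needle` at a relative position (0.0..1.0) inside filler text."""
        filler = [filler_sentence] * n_filler
        filler.insert(int(position * n_filler), needle)
        return " ".join(filler) + "\nWhat is the secret code mentioned above?"

    for pos in (0.0, 0.5, 1.0):  # start, middle, end of the context
        prompt = build_probe("The secret code is 7319.",
                             "The sky was a uniform grey that afternoon.",
                             n_filler=2000, position=pos)
        # answer = ask_model(prompt)  # compare recall of "7319" across positions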


That's because when they say "long context window" they're lying: what they actually mean is that they accept a long input prompt that still gets compressed into a smaller context window (typically by throwing out tokens in the middle).
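If a provider did that, the effect would look something like this middle-truncation sketch (purely illustrative; the whitespace "tokenizer" and the 8k budget are made up, not anything a vendor documents):

    def truncate_middle(tokens, max_tokens):
        """Keep the head and tail of the token list, drop the middle."""
        if len(tokens) <= max_tokens:
            return tokens
        head = max_tokens // 2
        tail = max_tokens - head
        return tokens[:head] + tokens[-tail:]

    tokens = [f"tok{i}" for i in range(20000)]   # stand-in for a long prompt
    kept = truncate_middle(tokens, max_tokens=8192)
    print(len(kept), kept[:2], kept[-2:])        # head and tail survive, middle is gone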

An actually large context window is impossible with how LLM attention works under the hood: standard self-attention scores every token against every other token, so compute and memory grow quadratically with sequence length.
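Back-of-the-envelope numbers: the n x n score matrix per head per layer blows up fast if you materialize it naively (the head/layer counts below are illustrative, and kernels like FlashAttention avoid storing the matrix, though the compute is still quadratic):

    def attention_scores_bytes(n_tokens, n_heads=32, n_layers=32, dtype_bytes=2):
        """Bytes needed to materialize every n x n attention score matrix."""
        return n_tokens * n_tokens * n_heads * n_layers * dtype_bytes

    for n in (8_000, 128_000, 1_000_000):
        print(f"{n:>9} tokens -> ~{attention_scores_bytes(n) / 1e9:,.0f} GB of scores")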


Mamba-2 enters the chat.
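For anyone who hasn't seen it: state-space models like Mamba replace all-pairs attention with a fixed-size recurrent state, so cost is linear in sequence length. A toy scalar version of the idea (not the actual Mamba-2 SSD algorithm):

    def ssm_scan(xs, a=0.9, b=0.1, c=1.0):
        """O(n) scan: update a constant-size state per token instead of n^2 scores."""
        h, ys = 0.0, []
        for x in xs:
            h = a * h + b * x   # state update
            ys.append(c * h)    # readout
        return ys

    print(ssm_scan([1.0, 2.0, 3.0, 4.0]))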



