
What did you set the context window to? That's been my main issue with models on my MacBook: you have to set the context window so short that they are way less useful than the hosted models. Is there something I'm missing there?


With LM Studio you can configure context window freely. Max is 131072 for gpt-oss-20b.


Yes, but if I set it above ~16K on my 32 GB laptop it just OOMs. Am I doing something wrong?
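One reason long contexts blow past 32 GB: the KV cache grows linearly with context length, on top of the model weights themselves. A rough back-of-the-envelope sketch, using illustrative grouped-query-attention dimensions (assumed for the example, not the actual gpt-oss-20b config):

```python
def kv_cache_bytes(n_layers, n_kv_heads, head_dim, ctx_len, bytes_per_elem=2):
    """Approximate KV cache size: 2 tensors (K and V) per layer,
    each holding n_kv_heads * head_dim values per token, at fp16 (2 bytes)."""
    return 2 * n_layers * n_kv_heads * head_dim * ctx_len * bytes_per_elem

# Illustrative dimensions (assumed, not the real gpt-oss-20b numbers):
size = kv_cache_bytes(n_layers=24, n_kv_heads=8, head_dim=64, ctx_len=131072)
print(f"{size / 2**30:.1f} GiB")  # 6.0 GiB at the full 131072-token context
```

At ~16K tokens the same cache is roughly an eighth of that, which is why shrinking the context (or quantizing the KV cache, where the runtime supports it) is often what keeps a 20B model inside 32 GB of unified memory.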


Try enabling flash attention and offloading all layers to the GPU.


I punted it up to the maximum in LM Studio - seems to use about 16GB of RAM then, but I've not tried a long prompt yet.



