Fundamentally the UI is up to you. I have a "typing-pauses-inference-and-starts-gaslighting" feature in my homebrew frontend, but in OpenWebUI/SillyTavern you can just pause it, edit the chain of thought, and have it continue from the edit.
That's a great idea. In your frontend, do you write in the same text entry field as the bot? I use oobabooga/text-generation-webui and I find it's a little awkward to edit the bot responses.
Thanks! For what it's worth, unless you particularly need exl2, ollama works great for local inference, and these days you can prompt together a half-decent chat UI for yourself in a matter of minutes, which gives you full control over everything.
I also lean a lot on https://www.npmjs.com/package/amallo, an API wrapper I wrote for ollama that makes this sort of hacking very, very easy. (Not that the default lib is bad, I just didn't like the ergonomics.)
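If you'd rather skip any wrapper entirely, a minimal chat loop against ollama's stock HTTP API is only a few lines. Rough sketch below (Node 18+ with built-in fetch; the model name "llama3" is just an example, swap in whatever you've pulled):

```ts
// Minimal streaming chat loop against a local ollama server (default port 11434).
// /api/chat streams newline-delimited JSON chunks of the form { message: { content }, done }.

type ChatMessage = { role: "system" | "user" | "assistant"; content: string };

async function chat(messages: ChatMessage[], model = "llama3"): Promise<string> {
  const res = await fetch("http://localhost:11434/api/chat", {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify({ model, messages, stream: true }),
  });
  if (!res.ok || !res.body) throw new Error(`ollama request failed: ${res.status}`);

  const reader = res.body.getReader();
  const decoder = new TextDecoder();
  let buffered = "";
  let reply = "";

  while (true) {
    const { done, value } = await reader.read();
    if (done) break;
    buffered += decoder.decode(value, { stream: true });

    // Each complete line is one JSON chunk; keep any partial line for the next read.
    const lines = buffered.split("\n");
    buffered = lines.pop() ?? "";
    for (const line of lines) {
      if (!line.trim()) continue;
      const chunk = JSON.parse(line);
      const token = chunk.message?.content ?? "";
      process.stdout.write(token); // print tokens as they stream in
      reply += token;
    }
  }
  return reply;
}

// Keep appending to the history so the model sees the whole conversation.
const history: ChatMessage[] = [{ role: "user", content: "Hello there" }];
const answer = await chat(history);
history.push({ role: "assistant", content: answer });
```

From there it's mostly a matter of bolting on whatever UI you like; since you own the message array, editing a bot response (or a chain of thought) before the next turn is trivial.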