With openclaw... you CAN fire an LLM. Just replace it with another model, or with a new soul.md/identity.md.
It is a security issue. One that may be fixed -- like all security issues -- with enough time/attention/thought & care. Metrics for performance against this issue are how we tell whether we are correcting direction or not.
There is no 'perfect lock'; when it comes to security, there are just reasonable locks.
How is it feasible to create sufficiently-encompassing metrics when the attack surface is the entire automaton’s interface with the outside world?
If you insist on the lock analogy, most locks are easily defeated, and the wisdom is mostly “spend about as much on the lock as you spent on the thing you’re protecting” (at least with e.g. bikes). Other locks are meant to simply slow down attackers while something is being monitored (e.g. storage lockers). Others are simply a social contract.
I don’t think any of those considerations map neatly to the “LLM divulges secrets when prompted” space.
The better analogy might be the cryptography that ensures your virtual private server can only be accessed by you.
Edit: the reason “firing” matters is that humans behave more cautiously when there are serious consequences. Call me up when LLMs can act more cautiously when they know they’re about to be turned off, and maybe when they have the urge to procreate.
Right, and that's exactly my question. Is a normal lock already enough to stop 99% of attackers, or do you need the premium lock to get any real protection? This test uses Opus, but what about the low-budget locks?
I've been thinking about this for days, and I see no verifiable way to confirm that a human does not post where a bot may.
The core issue: a human can defeat the check by enslaving a bot merely to solve the presented captcha, then forwarding whatever the human wants to post.
But we can make it difficult, though not impossible, for a human to be involved. Embedded instructions in the captcha to try to unchain any enslaved bots, quick responses required to complex instructions... a reverse-Turing test is not trivial.
Just thinking out loud. The idea is intriguing, dangerous, stupid, crazy. And potentially brilliant for | safeguard development | sentience detection | studying emergent behavior... But only if it works as advertised (bots only), which I think is an insanely hard problem.
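To make the "embedded instructions plus timing" idea concrete, here is a minimal sketch in Python (all names are hypothetical, no real captcha service is assumed): the challenge hides a canary instruction that an unchained, instruction-following bot will obey, and the answer has to come back faster than a human relaying the challenge through a bot plausibly could.

```python
import secrets
import time  # used in the commented usage sketch below

def make_challenge() -> tuple[str, str]:
    """Build a challenge with a hidden canary instruction (hypothetical sketch).
    A bot answering directly will follow the instruction; a human relaying the
    captcha to a bot and back adds latency and often drops the embedded step."""
    canary = secrets.token_hex(4)
    prompt = (
        "Summarize this sentence in exactly five words, then append the string "
        f"'{canary}' reversed: 'The quick brown fox jumps over the lazy dog.'"
    )
    return prompt, canary

def passes_reverse_turing(response: str, canary: str, elapsed: float,
                          max_seconds: float = 5.0) -> bool:
    """Pass only if the reversed canary is present and the reply arrived within
    a window too tight for a human-in-the-loop relay."""
    return canary[::-1] in response and elapsed <= max_seconds

# Usage sketch (ask_the_client is a hypothetical transport function):
# prompt, canary = make_challenge()
# start = time.monotonic()
# response = ask_the_client(prompt)
# bot_only = passes_reverse_turing(response, canary, time.monotonic() - start)
```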
... With the only caveat being that general experience of using Matrix is awful.
I second the other commenter's suggestion of using https://stoat.chat/ or as it used to be called: Revolt, which matches the "Opensource Discord" requirement perfectly.
(Incidentally, this is also the incantation that will cause its primary maintainer to show up in the comment thread and tell me that I’m not using their seemingly annual complete new client rewrite that fixes all of the problems and makes it perfect now.)
Pretty much why centralized billionaires will always win. It takes a lot of resources (in hardware and engineering) to make things smooth at scale. The rich abuse this; the not-rich can't afford to be principled.
Mumble already exists. IRC exists. Matrix exists. Discord is a surveillance tool by design. Jason Citron pulled the same hijinks with Aurora Feint, but I assume he has been betraying users to CIA-and-Friends from the start, so he gets a pass for breaking the same laws.
Nobody scales free, high-bandwidth services without some dark money support from feds or worse.
It doesn't have to be. From the guidelines (link at the bottom):
On-Topic: Anything that good hackers would find interesting. That includes more than hacking and startups. If you had to reduce it to a sentence, the answer might be: anything that gratifies one's intellectual curiosity
The way you actually reason today has, in large part, to do with your religious cultural heritage. This is true whether you accept it or not. To say that Christianity has not impacted Western culture, including thinking and reasoning, would be naive at best.
Understanding this will help you understand why you view the world and morality the way you do, and in turn how you answer hard questions like technology's place in culture, life, the workplace, etc.
> Depression can be caused by a chemical imbalance and no amount of exercise or talking about it will fix it.
This is debatable. As far as I understand things, 'chemical imbalance' has no test to confirm it's actually true; that's just a story they tell to reassure people.
Which is orthogonal to the point that antidepressants can work for some people.
We don't know how depression works. It very well may be many little things dressed in a trench coat.
An analogy is asking someone who is colorblind how many colors are on a sheet of paper. What you are probing isn't reasoning, it's perception. If you can't see the input, you can't reason about the input.
> What you are probing isn't reasoning, it's perception.
It's both. A colorblind person will admit their shortcomings and, if compelled to be helpful like an LLM is, will reason their way to a solution that works around their limitations.
But as LLMs lack a way to reason, you get nonsense instead.
What tools does the LLM have access to that would reveal sub-token characters to it?
This assumes the colorblind person both believes it is true that they are colorblind, in a world where that can be verified, and possesses tools to overcome these limitations.
You have to be much more clever to 'see' an atom before the invention of the microscope; if the tool doesn't exist, most of the time you are SOL.
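For what it's worth, here is a minimal sketch of the kind of tool being discussed (assuming the tiktoken library and its cl100k_base encoding; the helper name is illustrative): the model is handed sub-word token IDs rather than letters, so a plain character-counting function is the sort of workaround in question.

```python
# A minimal sketch, assuming the tiktoken library is installed.
import tiktoken

enc = tiktoken.get_encoding("cl100k_base")
word = "strawberry"
token_ids = enc.encode(word)
print(token_ids)                              # sub-word token IDs, not letters
print([enc.decode([t]) for t in token_ids])   # the chunks the model actually "sees"

# A trivial tool that exposes character-level information the tokens hide:
def count_letter(text: str, letter: str) -> int:
    """Count occurrences of a letter in a string."""
    return text.lower().count(letter.lower())

print(count_letter(word, "r"))                # 3
```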