More

CuriouslyC · 2026-03-11T12:03:17 1773230597

This is overly pessimistic. Prompt injection can be largely mitigated by creating a protocol firewall between agents that access untrusted content and agents that perform computation: https://sibylline.dev/articles/2026-02-22-schema-strict-prom...

I'm working on an autonomous agent framework that is set up this way (along with full authz policy support via OPA, monitoring via OTel and a centralized tool gateway with CLI). https://github.com/sibyllinesoft/smith-core for the interested. It doesn't have the awesome power of a 30 year old meme like the OP but it makes up for it with care.

croes · 2026-03-12T05:43:15 1773294195

Agent hacking is just a the beginning, it’s a bit early to think it’s a solved problem

CuriouslyC · 2026-03-11T02:36:13 1773196573

LoRAs are better at steering models to produce correct answers from their data set than imparting new knowledge.

kgeist · 2026-03-19T08:59:54 1773910794

https://arxiv.org/abs/2603.01097

>Overall, our findings position LoRA as the complementary axis of memory alongside RAG and ICL, offering distinct advantages.

CuriouslyC · 2026-03-10T21:34:16 1773178456

The thing blocking level 8 isn't the difficulty of orchestration, it's the cost of validation. The quality of your software is a function of the amount of time you've spent validating it, and if you produce 100x more code in a given time frame, that code is going to get 1/100th as much validation, and your product will be lower quality as a result.

Spec driven development can reduce the amount of re-implementation that is required due to requirements errors, but we need faster validation cycles. I wrote a rant about this topic: https://sibylline.dev/articles/2026-01-27-stop-orchestrating...

CuriouslyC · 2026-03-10T17:50:07 1773165007

The only currency in a world where AI does everything is your ability to get human attention. So from that perspective moltbook is a huge success.

If Mark hired these people to do anything other than viral marketing, i.e. if he thinks they're visionaries who are going to make amazing apps, he's deluded.

samiv · 2026-03-10T17:58:04 1773165484

You're so right.

You can already see how the same thing has played out with computer games. With the modern engines such as Unity almost anyone can make a game. And almost everyone suffers.

And as a result there's now a million games most of which are poor quality asset flips. Everybody suffers, creators and consumers. Race to the bottom where the bottom has been reached. Prices are zero and earnings are zero.

If 15 years ago an indie game dev would allocate 80% to making the game and 20% to marketing etc. Today that will not get anything but it's much better to spend 20% on the game and 80% on the marketing, SEO optimization and attention harvesting. It's a shouting match where it's all about winning the shouting match not producing the best content.

Another race to the bottom.

armchairhacker · 2026-03-10T18:22:43 1773166963

There are millions of asset flips, but the top indie games have never been better. It’s hard for indie developers because there’s so much competition: you need to heavily promote a quality game only because there are so many other quality games.

Likewise these tools have enabled many more people to create vibe-coded slop, and may lead to more quality software (making it harder to stand out without marketing), but the best software will only get better.

sethops1 · 2026-03-10T18:46:32 1773168392

The implication is that the gatekeeping has become marketing dollars, when it used to be skill at making a fun game. I don't think we're in a better situation today.

armchairhacker · 2026-03-10T19:00:18 1773169218

There are fun games that succeed without marketing, e.g. Balatro, and there are bad games that fail despite it, e.g. Highguard.

The reason that “skill at making a fun game” doesn’t guarantee success is because there are so many fun games. Much less, if at all, because there is so many slop.

charcircuit · 2026-03-11T01:07:35 1773191255

Balatro did marketing and were extremely successful at it getting gigantic content creators to play their game.

WA · 2026-03-10T18:24:58 1773167098

idk, indie games that come to my attention seem to be very polished. Which one is successful and fits your criteria?

slumberlust · 2026-03-10T18:34:03 1773167643

I disagree that accessibility is a detractor here.

There's never been a better time to be an indie dev. I'd rather have 1/1000 indie games be awesome than being force fed whatever storefront disguised as a game 'AAA' publishers poop out every year.

Just look at how slay the spire is doing up against marathon right now. Which of those was shouting the loudest? Highguard anyone?

PaulHoule · 2026-03-10T19:16:25 1773170185

I'll second this.

It is true that the indy game market is brutal but it's always been brutal.

You don't really hear about a crisis at the indy game level though, rather at the AAA game level there is much of "we'd like to use our market power to take out the risk in game development" and then years later we realize they took out all the value before they took out the risk and now they're doomed.

toomuchtodo · 2026-03-10T18:14:47 1773166487

Mark got lucky enough once he can be wrong the rest of his life and still not be exposed to a cost for it. Purpose of the system is what it does.

PaulHoule · 2026-03-10T19:20:16 1773170416

... I think he's got an affinity for other people and organizations that have succeeded in the same way. The idea that somebody out there might have a workmanlike approach to life and be able to get consistent results at something would be a threat to his worldview.

CuriouslyC · 2026-03-08T19:11:50 1772997110

Don't give Pichai credit for that. Google had the strongest ML research org on the planet before he took over, and it had Demis, arguably the best researcher in the field (and it had Geoffrey Hinton before that). The fact that goog was so far behind OAI despite Demis blazing frontiers was a major management failure.

Sundar's enshittification has also juiced short term share prices at the cost of long term health. It might turn out to be a decent decision for search because it's in the midst of being disrupted, but that's a happy accident for Sundar, not 4d chess (and you can argue the enshittification hastened the disruption).

CuriouslyC · 2026-03-08T18:32:31 1772994751

At least in programming, humans have to check the product of the LLM's output rather than the output itself.

CuriouslyC · 2026-03-08T18:31:12 1772994672

But a less concise language is (theoretically, if you're doing useful stuff with the verbosity) easier for machines to verify.

CuriouslyC · 2026-03-08T18:29:55 1772994595

Possibly. First, I think there's still low hanging fruit in creating a programming language designed to be as easy as possible for agents to work with that we won't try to unlock until people writing code is a curiosity. Second, agents don't care about verbosity of code, so we can do verbosity/correctness/tooling tradeoffs that wouldn't have made sense when humans were the sole consumers of the code.

CuriouslyC · 2026-03-08T17:27:17 1772990837

One nice thing about serializing/transmitting AST changes is that it makes it much easier to to compose and transform change sets.

The text based diff method works fine if everyone is working off a head, but when you're trying to compose a release from a lot of branches it's usually a huge mess. Text based diffs also make maintaining forks harder.

Git is going to become a big bottleneck as agents get better.

a-dub · 2026-03-08T17:53:58 1772992438

what do you actually gain over enforced formatting?

first you should not be composing releases at the end from conflicting branches, you should be integrating branches and testing each one in sequence and then cutting releases. if there are changes to the base for a given branch, that means that branch has to be updated and re-tested. simple as that. storing changes as normalized trees rather than normalized text doesn't really buy you anything except for maybe slightly smarter automatic merge conflict resolution but even then it needs to be analyzed and tested.

CuriouslyC · 2026-03-08T18:23:44 1772994224

Diffs are fragile, and while I agree with that process in a world where humans do all the work and you aren't cutting a dozen different releases, I think that's a world we're rapidly moving away from.

a-dub · 2026-03-08T21:40:41 1773006041

in that case you probably flag a bunch of prs for release and it linearizes their order and rebases and tests each one a step ahead of your review (responding to any changes you make as you go).

CuriouslyC · 2026-03-07T20:18:51 1772914731

This. I wouldn't have touched Nix when you needed someone who was really good at Nix to keep it working, but agents make it viable to use in a number of place.