Hacker Newsnew | past | comments | ask | show | jobs | submit | comboy's commentslogin

Add CI to check if new laws don't contradict with any existing ones.

You might need to turn laws into formal proofs, and the existence of judges makes me think that’s not as likely as you would like. A commenting system would though—trained on countries’s precedents, jurisprudence and traditions might.

Can you imagine rebases with merge conflicts?

This could in theory already happen without any tech, but I suspect since the government is pretty monolithic, any changes in a specific law are all being done by the same set of people.

You might not have merge conflicts but I imagine you could end up with conflicting guidance from two separate pieces of law (e.g., law A says you must wear green on St. Patrick's day, law B outlaws green pajamas).


*that we know of

Which is exactly what they said:

> “We are not aware of any successful mercenary spyware attacks against a Lockdown Mode-enabled Apple device,” Apple spokesperson Sarah O’Rourke told TechCrunch on Friday.


Which is infinitely better than the cases we know about without the feature enabled.

Haha, here's some random AI generated content:

    At least 225 judges have ruled in more than 700 cases that the administration's mandatory immigration detention policy likely violates the right to due process[1] The Fifth Amendment's Due Process Clause generally requires those having federal funds cut off to receive notice and an opportunity for a hearing, which was not provided in many of DOGE's spending freezes[2]
(there's more but what's the point)

1. https://www.justsecurity.org/107087/tracker-litigation-legal...

2. https://www.cbpp.org/research/federal-budget/many-trump-admi...


Not really related, but does anybody know if somebody's tracking same models performance on some benchmarks over time? Sometimes I feel like I'm being A/B tested.

Oh, I didn't think about this, that's a good idea. I also feel generally model performance changes over time (usually it gets worse).

The problem with doing this is cost. Constsntly testing a lot of models on a large dataset can get really costly.


Yeah, good tests are associated with cost. I'd like to see benchmarks on big messy codebases and how models perform on a clearly defined task that's easy to verify.

I was thinking that tokens spent in such case could also be an interesting measure, but some agent can do small useful refactoring. Although prompt could specify to do the minimal change required to achieve the goal.


People are loading huge interpreted environments for stuff that can be done from the command line. Run computations on complex objects where it could be a single machine instruction etc. The trend has been around for a long time.

Wow /insights is genuinely useful, perhaps CLI should be pushing that as a tip, if one has enough sessions, instead of keep nagging me about the frontend developer skill which I already have installed

In general CLI could be more reliable and responsive though, it's a text based env yet sometimes feel like running windows 95 on 386dx

It seems clear from the insights that some model is marking failure cases when things went wrong and likely reporting home, so that should be extremely valuable to Anthropic


> it's a text based env yet sometimes feel like running windows 95 on 386dx

They use nodejs and React. Yes, for real.

https://xcancel.com/trq212/status/2014051501786931427


Claude Code uses Bun. Anthropic acquired Bun in December. Bun is an alternative node runtime.

Apologies, the nodejs comment above therefore is wrong. I don't seem to be able to edit it anymore.

lol, yeah

> We’ve rewritten Claude Code’s terminal rendering system to reduce flickering by roughly 85%.

tells you all you need to know

and I keep running it remotely through tmux, that explains so many things

edit: if they are writing it in react anyway (sic!) maybe we could at least get a web interface, skipping mapping it to terminal output part ..


I think stability and reliability have vastly improved over the last years in general (not necessarily talking about gh specifically)

It's just that everybody is using 100 tools and dependencies which themselves depend on 50 others to be working.


OpenX is becoming a bit like that hindu symbol associated with well being..

Claude getting clawed.

this is exactly i thought!

> a state sponsored threat actor

your CPU, your OS, CPU and firmware on your motherboard chips, ethernet, wifi, HDDs (btw did you know your sim card has JVM?), your browser, all your networking equipment in between, BGP and all the root certs and I'm just scratching the surface

the ballpark is on anther planet


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: