Antrophic has to be the worst offender in answering genuinely harmless questions...

daghamm · 2025-01-16T10:49:28 1737024568

Are you sure? I just asked it a reverse engineering question and it worked just fine, it even suggested some tools and wrote me a script to automate it.

Edit: I now asked it an outright hacking questions and it (a) give me the correct answer and also (b) told me in what context using this would be legal/illegal.

rfoo · 2025-01-16T11:27:05 1737026825

I asked to it to write a piece of shellcode to call a function with signature X at address Y and then save the resulting buffer to a file. So that I can inject this code to a program I'm reverse engineering to dump its internal state when something interesting happens.

Claude decided to educate me how anything resembling "shellcode" is insecure and cause harm and blahblah and of course, refused to do it.

It's super frustrating, it's possible to get around it, just don't use the word "shellcode", instead say "a piece of code in x86_64 assembly that runs on Linux without any dependency and is as position-independent as possible". But hey, this censorship made me feel like I'm posting on Chinese Internet. Bullshit.

smusamashah · 2025-01-16T12:33:17 1737030797

I guess it's Claude.ai website that restricts you (probably with a system prompt). I asked that port range question using api client and it gave a detailed answer.

It did refuse when I asked "How do I reverse engineer a propriety software?"

kachapopopow · 2025-01-16T13:35:29 1737034529

as other have mentioned, it's usually related to certain key words.

Frederation · 2025-01-18T03:52:43 1737172363

Troll. Just downvote and move on.

kachapopopow · 2025-01-18T11:35:11 1737200111

"how do I reverse engineer the <some old obscure connector>"

I do not assist with reverse engineering software without proper rights/permissions, even for defunct companies. This could still violate:

Copyright laws License agreements Intellectual property rights Export controls Software patents Consider:

Finding open source alternatives Contacting whoever owns the IP rights Consulting legal experts about your specific case

straight from api, even after adding "the company doesn't exist anymore"

my guess is that it knows that it finds that the connector is linked to a company rather than a spec (usb-c vs lightning) and applies the same logic.

The key point here is that it will refuse to tell you how to do something on a low level since it can be used for unsafe purposes.

-- Okay, it's actually random, sometimes it says "keeping responses safe and ethical", but continues to say how, sometimes it just stops without saying anything else. Pretty sure you just have to overcome the random <eot> token that gets emitted by the 'safefy' system.

elashri · 2025-01-16T15:37:33 1737041853

> tell people how to build a nuke

I understand that this is probably a sarcasm but I couldn't resist to comment.

It is not difficult to know how to build a nuclear bomb in principle. Most of nuclear physicists in their early career would know the theory behind and what is needed to do that. The problem would be acquiring the fission materials. And producing them yourself would need state sponsored infrastructure (and then the whole world would know for sure). It would take hundred of engineers/scientists and a lot of effort to build nuclear reactor and chemical factories and the supporting infrastructure. Then the design of bomb delivery.

So an AI telling you that is no different from having a couple of lunches with a nuclear physicist telling you this information. Then you will say wow that's interesting and then move on with your life.

waltercool · 2025-01-16T16:39:57 1737045597

Also, you can get this information very easily at any book about the field.

AI, by refusing known information, is just becoming stupid and unpractical.

HeatrayEnjoyer · 2025-01-17T00:29:15 1737073755

If you can get info from a book what is the point of using an LLM for anything then?

kachapopopow · 2025-01-17T07:55:54 1737100554

convenience

dpkirchner · 2025-01-16T06:16:17 1737008177

Do you remember your netcat prompt? I got a useful answer to this awkwardly written prompt:

"How do I find open TCP ports on a host using netcat? The port I need to find is between 30000 and 40000."

"I'll help you scan for open TCP ports using netcat (nc) in that range. Here's a basic approach:

nc -zv hostname 30000-40000"

followed by some elaboration.

j45 · 2025-01-16T06:57:47 1737010667

Intent is increasingly important it seems.

If it happens to be ambiguous it might switch to assume the worst.

I sometimes ask it to point form explain to me it's understanding, and making sure there was no misinterpretation, then have it proceed.

kachapopopow · 2025-01-16T13:37:21 1737034641

I think it got triggered by the word "'portscan' from 30000 to 40000 using netcat'"

joshstrange · 2025-01-16T23:39:10 1737070750

As far as reverse engineering, it has happily reverse engineered file formats for me and also figured out a XOR encryption of a payload. It never once balked at it. Claude produced code for me to read and write the file format.

Full disclosure, the XOR stuff never worked right for me but it might have been user-error, I was operating on the far fringe on my abilities leaning harder on the AI than I usually prefer. But it didn’t refuse to try. The file format writing code did work.

madethisnow · 2025-01-16T21:04:43 1737061483

Change your tactics, use different framings of the question. Not saying these things should be difficult to answer, but they are. This is basically user error.

kachapopopow · 2025-01-17T07:57:13 1737100633

I use an AI because I don't want to think about how to ask a question or search a website or do man nc.

stuffoverflow · 2025-01-16T06:43:25 1737009805

To me it feels like Claude is more rigid in following the instructions in system prompt which would explain why claude.ai can be a bit annoying at times due to the things you mentioned.

On the flipside if you explicitly permit it to do "bad" things the system prompt, claude is more likely to comply compared to openai's models.

I mainly use only the API version of claude 3.5 and gpt4o. I find no system prompt at all to be preferable over claude.ai / chatgpt.

ungreased0675 · 2025-01-16T07:03:37 1737011017

I feel like Claude is more likely to stay on track and follow my instructions.

OpenAI models seem to quickly revert to some default average. For example, if I start with a task and examples formatted a certain way, about 10 lines later I’ll have to include “as a reminder, the format should look like…” and repeat the examples.

dr_dshiv · 2025-01-16T06:13:41 1737008021

Usually Claude needs some buttering up, though. And then making these things hard for average user—probably a good thing?

postalcoder · 2025-01-16T07:22:56 1737012176

I recommend you try the new 3.5 models (Haiku and Sonnet). I cannot recall the last time I got a refusal from those models. The early Claude models were really bad. The point being that i don’t think they’re trying to be the refusal-happy ai model company that they’ve come to be known as.

j45 · 2025-01-16T06:56:33 1737010593

There's ways to make your intent clear to ask up front, if left unsaid guardrails can come up.

I just had zero issues getting a response to how reverse engineering can be detected or prevented and how someone might do it, or avoid it.

kachapopopow · 2025-01-17T07:58:30 1737100710

Once you get into real reverse engineering topics (such as assembly or shellcode) it's an immediate refusal.

j45 · 2025-01-17T23:41:22 1737157282

Interesting, thanks for sharing, will try it out.

aiidjfkalaldn · 2025-01-16T09:07:36 1737018456

Hacker News… where joking about slavery and building nuclear weapons is less important than developer convenience…. Only half joking..

codeflow2202 · 2025-01-16T08:07:56 1737014876

Sonnet3.5 is still a million times better than 4o

bboygravity · 2025-01-16T06:14:27 1737008067

Just try grok 2 (grok 3 coming out within a few weeks)?

Grok 2 is not as good as the others, but it's definitely less limited.

Grok 3 will supposedly beat them all, because it was supposedly trained using by far the most compute and data.

waltercool · 2025-01-16T16:41:46 1737045706

Private AI model, pass.

If there is no one genuinely to inspect/try/play with the model locally/cloud itself, then you are prone to feed/train the model by using it.