Why do people keep saying that Claude 3 has been nerfed? Their CTO has said on Twitter multiple times that not a single byte has been changed since its launch, so I'm curious why I keep hearing this.
edit: having trouble finding the tweet I saw recently, it might have been from their lead engineer and not the CTO.
I suspect that there is some psychological effect going on where people adjust their expectations and start to be more open to noticing flaws after working with it for a while. Seems to be a recurring thing with most models.
It's likely true that they didn't change the model, same for the many claims of GPT-4 getting worse. But they do keep iterating a lot on the "safety" layers on top: classifiers to detect dangerous requests, the main system prompt...
But I also think it's partially a psychological phenomenon, just people getting used to the magic and finding more bad edge-cases as it is used more.
While I do think that many claims of GPT-4 getting worse were subjective and incorrect, there certainly was an accidental nerfing of at least ChatGPT Plus: OpenAI released an update some months ago that specifically acknowledged the model had become "more lazy", and the update was meant to rectify it.
(I think it was just the settings for how ChatGPT calls the GPT-4 model, and didn't affect use of GPT-4 via the API, though I may be misremembering.)
It is 100% possible for performance regressions to occur by changing the model pipeline and not the model itself. A system prompt is a part of said pipeline.
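To make that concrete, here's a minimal sketch of such a pipeline in Python. Everything in it (the prompt text, the toy refusal check, the call_model placeholder, the sampling settings) is illustrative, not any vendor's actual stack, but it shows how behavior can shift while the weights stay byte-for-byte identical:

    # None of this touches model weights, yet changing any of it changes what users see.
    SYSTEM_PROMPT = "You are a helpful assistant. Keep answers brief."  # editable at any time

    def refusal_classifier(user_msg: str) -> bool:
        # Stand-in for a separate safety model that screens requests
        # before they ever reach the main model.
        return "dangerous" in user_msg.lower()

    def respond(user_msg: str, call_model) -> str:
        if refusal_classifier(user_msg):
            return "Sorry, I can't help with that."
        # System prompt and sampling settings are pipeline config, not weights.
        return call_model(system=SYSTEM_PROMPT, user=user_msg, temperature=0.7)

Tighten the classifier or reword the system prompt and users will report the model as "nerfed", even though the checkpoint itself never changed.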
Absolutely! That was covered in the linked tweet. If you're suggesting they're lying*, I'm happy to extract the system prompt and check.
* I don't think you are! I've looked up to you a lot over the last year on LLMs btw, just vagaries of online communication: I can't tell if you're ignoring the tweet and introducing me to the idea of system prompts, or if you're suspicious it changed recently (in which case, I would want to show off my ability to extract the system prompt to senpai :)
I was agreeing with the tweet and think Anthropic is being honest, my comment was more for posterity since not many people know the difference between models and pipelines.
You're right, that's a good point. It is possible to make a model dumber via quantization.
But even F16 -> llama.cpp Q4 (3.8 bits) has negligible perplexity loss.
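For reference, perplexity is just exp of the average per-token cross-entropy loss, so the comparison is straightforward to run yourself. A rough sketch using Hugging Face transformers; the model IDs are hypothetical placeholders for a full-precision and a quantized build of the same model:

    # Perplexity = exp(mean negative log-likelihood over tokens).
    # Run the same held-out text through both checkpoints and compare.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    def perplexity(model_id: str, text: str) -> float:
        tok = AutoTokenizer.from_pretrained(model_id)
        model = AutoModelForCausalLM.from_pretrained(model_id)
        model.eval()
        ids = tok(text, return_tensors="pt").input_ids
        with torch.no_grad():
            # Passing labels=input_ids makes the model return mean cross-entropy.
            loss = model(ids, labels=ids).loss
        return torch.exp(loss).item()

    sample = "Some held-out evaluation text goes here ..."
    print(perplexity("some-org/model-f16", sample))  # hypothetical full-precision build
    print(perplexity("some-org/model-q4", sample))   # hypothetical 4-bit build

A small gap between the two numbers is what "negligible perplexity loss" means in practice.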
Theoretically, a leading AI lab could quantize absurdly poorly after the initial release, where they know they're going to have huge usage.
Theoretically, they could be lying even though they said nothing changed.
At that point, I don't think there's anything to talk about. I agree both of those things are theoretically possible. But it would be very unusual: two colossal screwups, then active lying, with many observers not leaking a word.
Why would the CTO/lead engineer admit that they nerfed the model even if they did? It's all closed, so how does admitting it benefit them? I would much rather trust the people using it every day.
Beyond that, to people who interact with the models regularly, the "nerf" issue is pretty obvious. It was pretty clear when a new model rollout caused GPT-4 in ChatGPT to try to stick to the "leadup, answer, explanation" response pattern and also start to get lazy about longer responses.
I use Claude 3 Opus daily and I haven't noticed a change in its outputs. I think it's more likely that there's a discontinuity in the inputs the user is providing to Claude, which is tipping it over a threshold into a response type they find incorrect.
When GPT-4 got lobotomized, you had to work hard to avoid the new behavior; it popped up everywhere. People claiming Claude got lobotomized seem to be cherry-picking examples.
Oh my bad, sorry, I misinterpreted your previous comment as meaning "it was obvious with GPT-4, and therefore if people say the same about Claude 3 it must be equally obvious and true", rather than what you meant, which was half the opposite.