So today is Qwen. Tomorrow a new SOTA model from Google apparently, R2 next week...

zamadatix · 2025-03-24T19:39:21 1742845161

Qwen 3 is coming imminently as well https://github.com/huggingface/transformers/pull/36878 and it feels like Llama 4 should be coming in the next month or so.

That said, none of the recent string of releases has done much yet to "smash a wall"; they've just met the larger proprietary models where they already were. I'm hoping R2 or the like really changes that by showing ChatGPT 3->3.5 or 3.5->4 level generational jumps are still possible beyond the current state of the art, not just beyond current models of a given size.

YetAnotherNick · 2025-03-25T05:03:19 1742878999

> met the larger proprietary models where they already were

This is smashing the wall.

Also, if you just care about breaking absolute numbers: OpenAI released 4.5 a month back, which is SOTA among base models, and is planning to release the full o3 in maybe a month, and DeepSeek released a new V3, which is again SOTA in many aspects.

OsrsNeedsf2P · 2025-03-24T20:45:12 1742849112

> We haven't hit the wall yet.

The models are iterative improvements, but I haven't seen night-and-day differences since GPT-3 and 3.5.

anon373839 · 2025-03-24T21:14:48 1742850888

Yeah. Scaling up pretraining and huge models appears to be done. But I think we're still advancing the frontier in the other direction -- i.e., how much capability and knowledge can we cram into smaller and smaller models?

YetAnotherNick · 2025-03-25T05:05:42 1742879142

Because 3.5 has a new capability, which is following instructions. Right now we are in the 3.5 range in conversational AI and native image generation, both of which feel magical.

Davidzheng · 2025-03-25T01:37:38 1742866658

Tbh such a big jump from current capability would be ASI already

behnamoh · 2025-03-24T20:15:28 1742847328

Google's announcements are mostly vaporware anyway. Btw, where is Gemini Ultra 1? How about Gemini Ultra 2?

karmasimida · 2025-03-24T20:27:41 1742848061

It is already on the LLM arena, right? Codename Nebula? But you are right, they can fuck up their releases royally.

aoeusnth1 · 2025-03-25T04:20:35 1742876435

I guess they don’t do ultras anymore, but where was the announcement for it? What other announcement was vaporware?

intalentive · 2025-03-25T15:39:55 1742917195

Asymptotic improvement will never hit the wall

nwienert · 2025-03-24T21:25:42 1742851542

We've slid into the upper S curve though.

tomdekan · 2025-03-24T19:44:23 1742845463

Any more info on the new Google model?