>I feel the opposite, and pretty much every metric we have shows basically linea...

mountainriver · 2025-06-02T23:18:51 1748906331

Almost every single major benchmark, and yes progress is incremental but it adds up, this has always been the case

attemptone · 2025-06-03T06:35:10 1748932510

We were talking about linear improvements and I have yet to see it

mountainriver · 2025-06-03T17:04:48 1748970288

check the benchmarks or make one of your own

attemptone · 2025-06-03T20:20:04 1748982004

I checked the BlEU-Score and Perplexity of popular models and both have stagnated around 2021. As a disclaimer this was a cursory check and I didn't dive into the details of how individuals scores were evaluated.

mountainriver · 2025-06-04T16:37:03 1749055023

on what benchmarks? pretty much every major one is linear improvement