sandos 89 days ago | on: OpenAI O3-Mini

How many benchmarks for LLMs are there out there?

Is there any evidence of overfitting on benchmarks, or do they truly have hidden parts?