I wish they'd just reveal the CoT (like gemini and deepseek do), it's very helpf...

liamwire · 2025-02-01T04:03:03 1738382583

sama and OpenAI’s CPO Kevin Weil both suggested this is coming soon, as a direct response to DeepSeek, in an AMA a few hours ago: https://www.reddit.com/r/OpenAI/s/EElFfcU8ZO

msp26 · 2025-02-01T10:20:20 1738405220

> a much more helpful and detailed version of this

Notice the deliberate wording. To me this implies we aren't getting the raw CoT.

PeterHolzwarth · 2025-02-01T07:55:18 1738396518

Do you have a direct link to that? My "force .old on everything" plugin is having problems resolving your url (sorry!).

pona-a · 2025-02-01T12:00:18 1738411218

https://old.reddit.com/r/OpenAI/comments/1ieonxv/comment/ma9...

tucnak · 2025-01-31T21:45:13 1738359913

I'm sorry, but it's over for OpenAI. Some have predicted this; including me back in November[1] when I wrote "o1 is a revolution in accounting, not capability" which although tongue-in-cheek, has so far turned out to be correct. I'm only waiting to see what Google, Facebook et al. will accomplish now that R1-Zero result is out the bag. The nerve, the cheek of this hysterical o3-mini release—insisting to hide the COT from the consumer still, is telling us one thing and one thing alone: OpenAI is no longer able to adapt to the ever-changing landscape. Maybe the Chinese haven't beaten them yet, but Google, Facebook et al. absolutely will, & without having to resort to deception.

[1]: https://old.reddit.com/r/LocalLLaMA/comments/1gna0nr/popular...

mediaman · 2025-01-31T22:09:26 1738361366

You don't need to wait for Google. Their Jan 21 checkpoint for their fast reasoning model is available on AIStudio. It shows full reasoning traces. It's very good, much faster than R1, and although they haven't released pricing, based on flash it's going to be quite cheap.

tucnak · 2025-01-31T23:02:43 1738364563

Sure, their 01-21 reasoning model is really good, but there's no pricing for it!

I care mostly about batching in Vertex AI, which is 17-30x times cheaper than competition (whether you use prompt caching or not) while allowing for audio, video, and arbitrary document filetype inputs; unfortunately Gemini 1.5 Pro/Flash have remained the two so-called "stable" options that are available there. I can appreciate Google's experimental models for all I can, but I cannot take them seriously until they allow me to have my sweet, sweet batches.