Normally, I don't think 1000 tokens/s is that much more useful than 50 tokens/s.
However, given that chain-of-thought (CoT) reasoning makes models a lot smarter, I think Cerebras chips will be in huge demand from now on. You can run a lot more CoT passes when inference is 20x faster, e.g. sampling many reasoning paths and voting on the answer, as in the sketch below.
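Something like this is what I have in mind: sample N independent CoT runs and take a majority vote (self-consistency). The endpoint, model name, and the answer-extraction heuristic here are just placeholders, not a real config.

```python
# Minimal sketch of best-of-N / self-consistency CoT sampling.
# base_url, model name, and extract_answer() are placeholders.
from collections import Counter
from openai import OpenAI

client = OpenAI(base_url="https://example-fast-inference/v1", api_key="...")

def extract_answer(text: str) -> str:
    # Naive heuristic: treat the last non-empty line as the final answer.
    return [line for line in text.strip().splitlines() if line.strip()][-1]

def solve_with_cot(question: str, n_samples: int = 16) -> str:
    resp = client.chat.completions.create(
        model="example-model",          # placeholder model name
        messages=[{"role": "user",
                   "content": question + "\nThink step by step, then give a final answer."}],
        n=n_samples,                    # many independent CoT runs
        temperature=0.8,                # diversity across reasoning paths
    )
    answers = [extract_answer(c.message.content) for c in resp.choices]
    # Majority vote over the sampled reasoning paths (self-consistency).
    return Counter(answers).most_common(1)[0][0]
```

With 20x faster tokens/s, the wall-clock cost of going from 1 sample to 16 stays tolerable.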
Also, I assume players in finance, such as hedge funds, would be buying these things in bulk now.
I'm assuming hedge funds are using LLMs to dissect company news and SEC filings the moment they drop, then make trading decisions based on the output. Faster inference would be a huge advantage there.
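Roughly the kind of latency-sensitive loop I'm picturing, purely as a sketch: the endpoint, model name, and the feed stub are placeholders, not a real data source or trading system.

```python
# Hedged sketch: poll for new filings, get a one-word signal, measure latency.
import time
from openai import OpenAI

client = OpenAI(base_url="https://example-fast-inference/v1", api_key="...")

PROMPT = ("You are reading a just-published SEC filing. "
          "Reply with exactly one word: BULLISH, BEARISH, or NEUTRAL.\n\n{text}")

def fetch_new_filings() -> list[tuple[str, str]]:
    # Stub: in practice this would poll EDGAR or a news wire for new documents.
    return []

def classify_filing(text: str) -> str:
    resp = client.chat.completions.create(
        model="example-model",   # placeholder model name
        messages=[{"role": "user", "content": PROMPT.format(text=text[:20000])}],
        temperature=0.0,         # deterministic, single-label output
    )
    return resp.choices[0].message.content.strip()

while True:
    for ticker, text in fetch_new_filings():
        start = time.monotonic()
        signal = classify_filing(text)
        latency = time.monotonic() - start   # this is where 20x faster inference pays off
        print(f"{ticker}: {signal} ({latency:.2f}s)")
    time.sleep(1)
```

The point isn't the trading logic, it's that the model call sits on the critical path, so tokens/s translates directly into how fast you can react.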