Hacker News new | past | comments | ask | show | jobs | submit login

Yeah I'm surprised by all the negativity as well. I'm listening to the post right now (using xtts-v2 finetuned on a voice I like lol). Sounds like these companies are overvalued / over hyped. Maybe they are / some of these companies go the way of myspace, but LLMs are incredibly useful for me.

I'm able to do a so much more using LLMs (Mistral-Large, Qwen2.5 and R1 locally, Claude via API) than without them.

I have to get the IDE setup properly now.




Personally, I've found DeepSeek R1 to be a profoundly good model for thinking through problems across fields.

I had a complex finance situation that I was struggling with, both from a mathematical/taxation perspective and a personal psychological finance hangup. I spent a few good hours talking to it through everything and had a mental breakthrough. To get the same kind of insight, I would have to pay a financial advisor AND a psychologist for several hours.

That all of this was free while someone calls it a "con" seems completely wrong

(I got my CFA cousin to look over the numbers and he agreed with R1's advice, fwiw)


Yeah, I've had similar experiences. I still hesitate if it's a field I don't know too well of course (never trust an LLM), but R1 has been able to solve things I've been stuck on. And watching it's <think></think> process has been insightful. Only issue is that it ties up all my GPUs while I run it.

Hopefully Mistral can copy their technique and give us a 123b reasoning model.


Did you run this locally?




Consider applying for YC's Summer 2025 batch! Applications are open till May 13

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: