They're ALL bullshit and there's a technical reason why.
Your rube goldberg contraption that you put together for your borderline-fradulent pitch deck is NOT an assembly-line nor is it a product anyone's gonna buy. Why?
Because cosine similarity search mathematically sucks a* , large context windows, while better, are nowhere close to being fast and practical ( maybe with a small exception of the generic sounding 1M context summaries you now get from gemini flash 2.0 exp ) . You probably don't have any kind of CI/CD setup, no testing at all zero, no benchmarking of accuracy, you probably can't even get lm_eval installed in the first place so no troubleshooting methodoloy, no formal iteration pipeline, you're not putting out a new model every 2 weeks and iterating upon, and YOU at this point probably can't find your own way to your own fkin toilet seat without Cursor's GPS showing your where it is and then writing a whole factory just to open the toilet seat.
You look at the youtube demos and it's just another investor slop to be sold to other sloppy investors. I even asked on uncle Elons twitter if anyone had a demo of agents doing anything in real life, and after 1/4million views the only thing that even worked AT ALL were spambots and Pliny's agent making a sh*tcoin. https://x.com/nisten/status/1808522547169763448
People cook something at home and immediately get delusional thinking they now have an assembly line that's just going to print money... have you ever actually looked at an industrial pasta-maker machine. Do YOU have the skills to make that? I'm sorry but no ammount of shrooms and microdosed-meth pills is gonna get you that.
Agents do not exist yet, they will sooner or later, but right now they're a concept more along the lines of scammy ledger-backed dbs.
You can always prove me wrong with a real life demonstration of an automated tool doing a complex ammount of steps that you'd normally expect an average-ish worker to do for you on a RELIABLE rate basis. I.E. Doing your taxes like your accountant or 10 year old hopefully does.
> an automated tool doing a complex ammount of steps that you'd normally expect an average-ish worker to do for you on a RELIABLE rate basis. I.E. Doing your taxes like your accountant or 10 year old hopefully does.
Seems to be the definition they're using. Which is a high bar, in my opinion - but it does illustrate the difficulty current systems will have in meeting an exceptionally high bar of quality (human-grade).
Defining it this way and exploring percentage task failure compared to typical (expert?) human doing the same work is valuable insight, in my opinion.
On the other hand, you can define agents as anything that does tool calling, but then it's trivial to create an agent but still non-trivial to meet expectations of the typical consumer because you aren't observing their failure rates.
Your rube goldberg contraption that you put together for your borderline-fradulent pitch deck is NOT an assembly-line nor is it a product anyone's gonna buy. Why?
Because cosine similarity search mathematically sucks a* , large context windows, while better, are nowhere close to being fast and practical ( maybe with a small exception of the generic sounding 1M context summaries you now get from gemini flash 2.0 exp ) . You probably don't have any kind of CI/CD setup, no testing at all zero, no benchmarking of accuracy, you probably can't even get lm_eval installed in the first place so no troubleshooting methodoloy, no formal iteration pipeline, you're not putting out a new model every 2 weeks and iterating upon, and YOU at this point probably can't find your own way to your own fkin toilet seat without Cursor's GPS showing your where it is and then writing a whole factory just to open the toilet seat.
You look at the youtube demos and it's just another investor slop to be sold to other sloppy investors. I even asked on uncle Elons twitter if anyone had a demo of agents doing anything in real life, and after 1/4million views the only thing that even worked AT ALL were spambots and Pliny's agent making a sh*tcoin. https://x.com/nisten/status/1808522547169763448
People cook something at home and immediately get delusional thinking they now have an assembly line that's just going to print money... have you ever actually looked at an industrial pasta-maker machine. Do YOU have the skills to make that? I'm sorry but no ammount of shrooms and microdosed-meth pills is gonna get you that.
Agents do not exist yet, they will sooner or later, but right now they're a concept more along the lines of scammy ledger-backed dbs.
You can always prove me wrong with a real life demonstration of an automated tool doing a complex ammount of steps that you'd normally expect an average-ish worker to do for you on a RELIABLE rate basis. I.E. Doing your taxes like your accountant or 10 year old hopefully does.