Hacker Newsnew | past | comments | ask | show | jobs | submit | PradeetPatel's commentslogin

The proposed industry solution is to use agents to review PRs, as not to slow down the velocity of delivery...

My current workplace is going through a major "realignment" exercise to replace as many testers with agents as humanely possible, which proved to be a challenge when the existing process is not well documented.


The fact that anyone in leadership would ever think this is even remotely possible - given my experience in the general state of requirements / contracts / integrations / support - makes me bleed from my earholes just a little bit.

It's starting to just feel a little like an excuse to call everyone on deck for "a few weeks trying 9-9-6". But even then the lack of traction isn't between the eyeballs and the deployment. You'll still be spinning wheels in that slippery stuff between what a customer is thinking and what the iron they bought is doing.


So you essentially trust the output of the model from beginning to end? Curious to know what type of application you're building where you can safely do that.

Edit: to clarify, I know these models have gotten significantly better. The output is pretty incredible sometimes, but trusting it end to end like that just seems super risky still.


I guarantee you it's nothing quantifiable.

LLMs can't be responsible for deciding what code you use because they have no skin in the game. They don't even have skin.

If you type fast, well then it takes just as long to code it yourself as review it. Plus you actually get flow time when you're coding.

For heaven's sake people have the robot write your unit tests and dashboards, not your production code. Otherwise delete yourself.


"Hey Claude, did Claude do a good job?"

I did an experiment today, where I had a new Claude agent review the work of a former Claude agent - both Opus 4.6 - on a large refactor on a 16k LOC project. I had it address all issues it found, then I cleared context, and repeated. Rinse and repeat. It took 4 iterations before it approached nitpicking. The fact that each agent found new, legitimate problems that the last one had missed was concerning to me. Why can’t it find all of them at once?

You're expecting it to be a person. It's not.

It is more like a wiggly search engine. You give it a (wiggly) query and a (wiggly) corpus, and it returns a (wiggly) output.

If you are looking for a wiggly sort of thing 'MAKE Y WITH NO BUGS' or 'THE BUGS IN Y', it can be kinda useful. But thinking of it as a person because it vaguely communicates like a person will get you into problems because it's not.

You can try to paper over it with some agent harness or whatever, but you are really making a slightly more complex wiggly query that handles some of the deficiency space of the more basic wiggly query: "MAKE Y WITH NO ISSUES -> FIND ISSUES -> FIX ISSUE Z IN Y -> ...".

OK well what is an issue? _You_ are a person (presumably) and can judge whether something is a bug or a nitpick or _something you care about_ or not. Ultimately, this is the grounding that the LLM lacks and you do not. You have an idea about what you care about. What you care about has to be part of the wiggly query, or the wiggly search engine will not return the wiggly output you are looking for.

You cannot phrase a wiggly query referencing unavailable information (well, you can, but it's pointless). The following query is not possible to phrase in a way an LLM can satisfy (and this is the exact answer to your question):

- "Make what I want."

What you want is too complicated, and too hard, and too unknown. Getting what you are looking for reduces to: query for an approximation of what I want, repeating until I decide it no longer surfaces what I want. This depends on an accurate conception of what you want, so only you can do it.

If you remove yourself from the critical path, the output will not be what you want. Expressing what you want precisely enough to ground a wiggly search would just be something like code, and obviates the need for wiggly searching in the first place.


It might not be a bad thing if we have an Internet for humans, and a segmented Internet for AI.

Who's enforcing that rule?

What an incredibly diverse and inclusive UI design. I often find that Indian mythologies tend to be overshadowed, but with the advent of AI generated art and media there's been a resurgence of Indian-centric stories.

Keep up the good work!


The internet being flooded with AI slop masquerading as devotional artwork has been among the most depressing things about GenAI. It has no meaning or intention or devotion behind it, it’s just engagement farming. Nothing of value is added by having Devi with extra fingers on each hand and completely blurred messes for all the affects in her hands. Or pictures of Rama shooting a bansuri out of his bow. It’s just tripe. We could have told the stories with an overlay of open source artwork from Raja Ravi Varma or Gita Press or old Tanjore paintings or Chola bronzes or whatever if we couldn’t afford to hire an artist who knows what items Vishnu is supposed to be holding in each hand.

It’s not a problem just for us Hindus either. I see so much terrible Jesus/angel “artwork” everywhere. It makes me start to wonder if maybe the Wahabbis were onto something with their complete taboo around depictions of God or the prophets.


>Nothing of value is added by having Devi with extra fingers on each hand and completely blurred messes for all the affects in her hands

South Asian religions are in an especially bad position because so many works related to them have never been digitized (and quite frankly, in some cases what's available on the internet is of extremely low quality) [1]. I'd be pretty concerned if someone were to rely on entirely on these models since the probability of hallucinations (or at the very least, erasure of regional/ideological diversity) probably skyrockets because the information was never actually there in the training data to begin with.

[1] I was able to find a few works of Newari Buddhist iconography recently, so it might be changing: https://web.archive.org/web/20240901130203/https://download..... It still has a few mistakes and doesn't compare to what's out there, though.


If they’re never digitized then where do you get the originals?

Some references have been digitized but “Hinduism” is a broad collection of religious traditions with many different stories and folk practices and depictions of various deities and tales. Many of the depictions are considered “valid” only in the specific context of a particular temple or for a specific community and it becomes completely nonsensical once you start randomly jumbling up elements of all the Gods from across all of India over all of time.

Can you elaborate on it? I've only noticed more GitHub features post Microsoft acquisition.

GitHub Actions and Copilot integration are pretty much a staple at my workplace.


I've been using GitHub for over 2 decades now so its not that I don't still love them. I just worry that GitHub will become just another arm of Microsoft's AI strategy. It FEELS like the platform is being reshaped around monetizing AI rather than serving developers but that's just my opinion.


Why would it be satire? I thought that's a pretty stranded Agentic workflows.

My current workplace follows a similar workflow. We have a repository full of agent.md files for different roles and associated personas.

E.g. For project managers, you might have a feature focused one, a delivery driven one, and one that aims to minimise scope/technology creep.


I mean no offence to anyone but whenever new tech progresses rapidly it usually catches most unaware, who tend to ridicule or feel the concepts are sourced from it.


yeah, nfts, metaverse, all great advances

same people pushing this crap


ai is actually useful tho. idk about this level of abstraction but the more basic delegation to one little guy in the terminal gives me a lot of extra time


Maybe that's because you're not using your time well in the first place


bro im using ai swarms, have you even tried them?


bro wanna buy some monkey jpegs?

100% genuine


[flagged]


> Laughing about them instead of creating intergenerational wealth for a few bucks?

it's not creating wealth, it's scamming the gullible

criminality being lucrative is not a new phenomenon


Are you sure that yours would sell for $80K, if you aren't using it to launder money with your criminal associates?


If the price floor is 80k and there are thousands then it means that even if just one was legit it would sell for 80k

Weird Im getting downvoted for just stating facts again


That would be the dream, it has been stated by many AI leaders that AI is the key to UBI, democratising that capability will prevent specific monopolies having a stranglehold on our future.


Is that the one not too far from the statue of Tesla donated by the Serbian government? I vaguely remember visiting it a good few years ago...


Yes, I think so, it's right next to the parking lot. I visited it about 3 years ago, but back then the tunnel was not accessible yet, it is now.

Also, not sure if this is true, but our tour guide told us that Tesla himself actually never set foot on the Canadian side of the falls.


Exactly, you align your business objectives with the vision of your managers and the industry trend.

Right now it appears that most industry leaders have fully committed to AI. It would be foolish (career wise) to speak openly otherwise.


what's next. Companies will just let the customers vibe code what they're asking for


Thanks so much for sharing!

I'm interested to see how MCP and the development in AI will impact the CTF scene in the future.


Where abouts in NZ are you? I'm currently based in Wellington and my Venus fly traps, sundew, and pitcher plants seem to be doing quite well on the windowsill.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: