Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

> Are your agents good at testing other agents? e.g. I want your agent to ask our agent a few questions and complete a few UI interactions with the results.

I'd say this is one of our strong suits I think, specifically the UIs tend to be easy to navigate for browser agents, and the LLM as a judge offers pretty good feedback on chat quality and it can inform later actions. (I'd be remiss not to mention though that a good LLM eval framework like Braintrust is probably the best first line though)

> How do you handle testing onboarding flows?

We can step through most onboarding flows if you start from logged out state & give the context it'll need (i.e. a stripe test card, etc.) That said though, setting up integrations that require multi-page hops is still a pain point in our system and leaves a lot to be desired.

Would love to talk more about your specific case and see if we can help! founders@propolis.tech



Then how do you compare with braintrust? Aren’t they doing same thing for Agents?



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: