
Saw the launch yesterday. Love all of the excitement in the space!

LaVague is all about generating Selenium code to interact with a specific page, one step at a time.

Skyvern is all about taking a simple instruction and converting it to a series of LLM-driven actions. It's meant to be more autonomous ("tell Skyvern what to do")



Isn't that the same thing when you interact with the underlying webpage?


We're quite different from LaVague. LaVague passes the entire HTML DOM to the LLM to help it generate XPaths and valid Selenium code. (https://github.com/lavague-ai/LaVague/blob/main/src/lavague/...)

Try this at your own risk: any reasonably sized website will result in extraordinarily high input token costs.

We spend quite a bit of our time building a layer between the HTML and the LLM call that distills the important pieces of information down to actions the LLM can take, which better balances cost against output quality. We're still not at 100% coverage.
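
To make the idea of a distillation layer concrete, here is a minimal sketch in Python. It is not Skyvern's actual implementation; the names `distill`, `InteractiveElementExtractor`, `INTERACTIVE_TAGS`, and `KEPT_ATTRS` are hypothetical, and the choice of which tags and attributes count as "important" is an assumption made for illustration. It uses only the standard library's `html.parser` to keep interactive elements and drop everything else before anything is sent to an LLM.

```python
from html.parser import HTMLParser

# Assumption: these tags and attributes are the ones worth showing the LLM.
INTERACTIVE_TAGS = {"a", "button", "input", "select", "textarea"}
KEPT_ATTRS = ("id", "name", "type", "href", "placeholder", "aria-label")
VOID_TAGS = {"input"}  # interactive tags that never have a closing tag


class InteractiveElementExtractor(HTMLParser):
    """Collects interactive elements and their visible text."""

    def __init__(self):
        super().__init__()
        self.elements = []
        self._open = None  # element currently collecting text, if any

    def handle_starttag(self, tag, attrs):
        if tag not in INTERACTIVE_TAGS:
            return
        attr_map = dict(attrs)
        element = {
            "tag": tag,
            # Keep only whitelisted attributes that have a non-empty value.
            "attrs": {k: attr_map[k] for k in KEPT_ATTRS if attr_map.get(k)},
            "text": "",
        }
        self.elements.append(element)
        if tag not in VOID_TAGS:
            self._open = element

    def handle_endtag(self, tag):
        if self._open is not None and self._open["tag"] == tag:
            self._open = None

    def handle_data(self, data):
        if self._open is not None:
            self._open["text"] += data


def distill(html):
    """Return a compact, indexed listing of the page's interactive elements."""
    parser = InteractiveElementExtractor()
    parser.feed(html)
    parser.close()
    lines = []
    for index, el in enumerate(parser.elements):
        parts = [el["tag"]] + [f'{k}="{v}"' for k, v in el["attrs"].items()]
        text = " ".join(el["text"].split())  # collapse whitespace
        lines.append(f"[{index}] <{' '.join(parts)}> {text}".strip())
    return "\n".join(lines)
```

The indexed listing is what would go into the prompt instead of the raw DOM, and the model could answer with something like "click element 1". A real layer would also need to handle visibility, iframes, shadow DOM, and custom widgets, which is presumably where the "not at 100% coverage" part comes from.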


It is similar, hence the timing of the plug, probably :)



