Saw the launch yesteday. Love all of the excitement in the space!
LaVague is all about generating selenium code to interact with a specific page, and do it step-by-step
Skyvern is all about taking a simple instruction and converting it to a series of LLM-driven actions. It's meant to be more autonomous ("tell Skyvern what to do")
Try this at your own risk.. any reasonable website would result in extraordinarily high input token costs
We spend quite a bit of our time building a layer between the HTML and the LLM call to distill important pieces of information down to actions the LLM can take.. better weighing cost vs output. We're still not at 100% coverage.
LaVague is all about generating selenium code to interact with a specific page, and do it step-by-step
Skyvern is all about taking a simple instruction and converting it to a series of LLM-driven actions. It's meant to be more autonomous ("tell Skyvern what to do")