More

giordanol · on May 7, 2025

Value isn’t just the editor, it’s the workflow. Letting LLMs plan and act across multi-step flows is a hard problem, and Windsurf figured out a dev-focused version of that. Gains to be made in browser automation once you add structure, retries, and context. Feels like a bet on that pattern becoming default. But yeah as others said, highly doubt that's $3B in hard cash, more likely a roll-up of shares etc.

giordanol · on May 6, 2025

Really cool direction. The embedding-first + agentic verification pipeline resonates, similar pattern worked well for us in the web interaction space.

giordanol · on May 6, 2025

Shift isn't just about competitors gaining ground but about users increasingly bypassing traditional search entirely. Between Reddit, Perplexity, ChatGPT, and direct domain knowledge, more queries are being fragmented across tools that aren't indexed as 'search engines' but functionally serve that role.

giordanol · on May 6, 2025

In my experience the key friction point has been schema stability vs input variance. Had better luck treating mapping as a dynamic planning problem with retries and memory.

constantinum · on May 6, 2025

Just curious: could you please expand on “dynamic planning problem”?

giordanol · on May 7, 2025

Basically treating extraction as an adaptive loop instead of a static function. If first parse fails or looks incomplete, tweak the prompt, inject more context, or switch strategies. Memory helps carry forward partial wins so you don’t start from scratch. We’ve seen the same pattern in agentic web environments. Structured retries, context propagation, and memory turn brittle flows into robust automation, especially with high-variance input and fuzzy schemas.

giordanol · on May 5, 2025

Curious how you're handling multi-step flows or follow-ups, seems like thats where MCP could really shine especially compared to brittle CLI scripts. We've seen similar wins with browser agents once structured actions and context are in place.

giordanol · on May 5, 2025

LLM-based coding only really works when wrapped in structured prompts, constrained outputs, external checks etc. The systems that work well aren’t just 'LLM take the wheel' architecture, they’re carefully engineered pipelines. Most success stories are more about that scaffolding than the model itself.

CivBase · on May 5, 2025

Does anyone provide a good breakdown of how much time/cost goes into the scaffolding vs how much is saved from not writing the code itself?

giordanol · on May 5, 2025

A breakdown would be interesting. I can’t give you hard numbers, but in our case scaffolding was most of the work. Getting the model to act reliably meant building structured abstractions, retries, output validation, context tracking, etc. Once that’s in place you start saving time per task, but there’s a cost up front.

giordanol · on May 5, 2025

Treating docstrings as the spec and asking an LLM to flag mismatches feels promising in theory but personally I'd b wary of overfitting to underspecified docs. Might be useful as a lint-like signal, but hard to see it replacing real tests just yet.

bluGill · on May 5, 2025

if that is the only testing you do I agree. However to test that the code works as the docs say is valuable as well. The code often will do more, but it needs to do at least what the docs say.

giordanol · on May 5, 2025

Agreed. Catching mismatches between doc and implementation is still valuable, just wouldn’t want people to rely on it as a safety net when the docs themselves might be inaccurate/incomplete. As a complement to traditional tests though seems like a solid addition.

giordanol · on May 5, 2025

If you outsource that to a model, you often end up with words but shallow or no understanding. Writing forces you to clarify your ideas. LLMs substitute genuine thinking with surface-level prose, which might sound alright but often lacks depth behind it.

giordanol · on May 2, 2025

Looks really cool. Curious how you're handling action abstraction? We've found that semantically parsing the DOM to extract high-level intents—like "click 'Continue'" instead of 'click div#xyz' helps reduce hallucination and makes agent planning more robust.

giordanol · on May 2, 2025

In trying to live an intellectually rich life, there's a risk of adding too much noise. Chasing more input, more ideas, more learning. Sometimes less really is more. Depth often comes not from adding, but from subtracting. Clear away the noise, and what’s left tends to have 'meaning'. Personally I prefer a deep life to a rich life, but maybe that's just semantics...