Need help?
<- Back

Comments (21)

  • watersb
    Strangely reminiscent of an Electric Monk:The Electric Monk was a labor-saving device, like a dishwasher or a video recorder. Dishwashers washed tedious dishes for you, thus saving you the bother of washing them yourself, video recorders watched tedious television for you, thus saving you the bother of looking at it yourself; Electric Monks believed things for you, thus saving you what was becoming an increasingly onerous task, that of believing all the things the world expected you to believe.-- Douglas Adams, "Dirk Gently's Holistic Detective Agency"
  • docheinestages
    Most writings about the spec-driven development I see start with a product requirements document that is assumed to be valid. But I doubt that's the case. If so, you would've written about it, and probably would've involved agents in the research that goes into it. My gut feeling tells me there's much more emphasis on implementing the feature than on questioning if it's relevant, feasible, and based on valid assumptions.
  • marcus_holmes
    I have a similar skill that assesses project drift (how far the current project state is from the original spec and brief) and looks for artifacts of that - orphaned code or features that are no longer relevant, design decisions that made sense at the time but are now dubious, or parts of the design that no longer fit the current project state.I found it really useful taming the spaghetti that claude tends to generate by itself.
  • m12k
    I've stumbled on the same workflow. Except for one thing: If I just do as OP does, Claude Code will tend to overengineer. For example it'll build complex solutions to super rare race conditions that have trivial fallout. But I've found that all it takes is a "skeptical pass". Here's how it goes: After having a bunch of specialist subagents review the (plan/implementation), after doing the deduplication/synthesis of their findings, the main agent will bucket them into A) Trivial/obvious fix B) there's multiple possible resolutions, but the LLM had a strong lean, so it went with it on its own C) Genuine ambiguity, where it asks me what to do (and presents its lean) and D) Wontfix. Crucially, after doing this, I have it run a "skeptical pass" where it takes a hard look at these findings and see if maybe some of them deserve to be downgraded. Generally, a lot of things make their way into wontfix this way. I find, I don't need to push back against overengineering, I can have the LLM do so itself, and it'll actually do a decent job of it.
  • HappySweeney
    I think its common to develop an adversarial-collaborative approach to getting some semblance of quality out of AI. I personally favour using multiple models for different roles, having a bunch of continuity documentation maintained, and having the plan surface human-verifiable deliverables as soon as feasible. It does involve more attention than most people would tolerate probably.
  • aself101
    This has been my attempt at wrangling the new A.I. assisted development that seems to be overtaking the software engineering profession. I jumped head first into LLM development after observing the trends from the last year and it appears this process might be a viable path forward.
  • dbgrman
    Enjoyed it until the first emdash (was half expecting it to arrive anyway). Sorry.
  • agnitripathi
    [flagged]
  • claudiosilva
    [flagged]
  • yudidnmkating
    [dead]
  • victorkulla
    [flagged]