Harness engineering is the new prompt engineering
Will harness engineering have the same fate as prompt engineering?
Remember how prompt engineering was the talk of town a few years ago? As it turns out, models automated that role before it even took off. Today is all about harness engineering, is that here to stay or will disappear into thin air in a few model iterations?
Lets decompose the harness and opine on what stays versus what is ephemeral
🧠 Memory (what the AI remembers across a session) - saves context, retrieves it later, compacts long histories, recaps when needed. This is a long standing limitation of current models that harnesses are addressing. It is becoming slightly less relevant though as context size increases. That said memory or state will be important components of successful harnesses until the models crack that more natively.
🎼 Orchestration (how the AI plans and retries) - forces the model through plan→execute→reflect cycles, search trees, multi-attempt with diverse strategies. This has already eroded as reasoning models are becoming better at driving their own trace and I think will continue to play a less important role moving on.
⚒️ Skills and tools (what the AI knows how to do) - loads relevant skill packs, surfaces tool descriptions, picks which tool to use when. I think its safe to say that this will become less important over time as models become better using the right tools, the right way and remember how to do so for the future something that skills solve for right now.
🤖 Sub-agents (when the AI delegates to copies of itself) - spawns child agents in isolated context, hands back results to the parent. Not sure if models are explicitly trained to use agents or are treated as another tool but I suspect that one way or another this will be absorbed by the ability of the model to break down problems and use proper isolation.
🪝 Hooks, modes and permissions (the real harness) - runs pre, post finish hooks, planning vs doing and which apps, folders AI has access to. The genuine harness is about the restrictions you put into the model and how you use it.
The agent loop that started thin, grew with harness engineering, will collapse back to thin. while not done: model(tools, memory). The harness that remains is all about restrictions and less so about turbocharging the model. This is not different from an actual harness…
