AI

173 posts
A vintage hourglass with most of its sand fallen to the bottom chamber on an antique wooden desk in a dimly lit study
AI

The "long-running agent" problem nobody is solving

Agents that run for ten minutes mostly work. Agents that run for ten hours mostly don't. The middle is where the actual production work lives, and the patterns we have for it are weaker than the conversation pretends.

Sid Smith Sid Smith 5 min read
Multiple architectural blueprints overlapping on a draftsman's wooden table with a brass compass and ruler under warm lamp light
AI

Agent design patterns: what's actually working

Two years into agents-everywhere, a small set of design patterns has separated from the noise. Worth being explicit about which shapes are actually paying back in production and which are still mostly aspirational.

Sid Smith Sid Smith 5 min read
A small AI chip on a thin stack of US one-dollar bills next to a much larger chip on a thicker stack of twenty-dollar bills
AI

Sonnet 4 makes Opus look expensive

A day after Claude 4 landed, the math is starting to settle. Opus 4 has its place. The honest read is that Sonnet 4 closes enough of the gap that the price-for-marginal-capability case for Opus needs more justification than I've been giving it.

Sid Smith Sid Smith 4 min read
Twin constellation-like glowing star clusters against a deep night sky with faint connecting lines suggesting neural networks
AI

Google I/O 2025: Gemini 2.5 Pro and the agent push

Google's I/O was Gemini 2.5 Pro getting a clean coming-out and the agent story finally being told with a straight face. The model is real. The agent story has more work to do than the keynote suggested.

Sid Smith Sid Smith 5 min read
A vast factory floor with many small autonomous robotic figures moving in patterns and no human supervisors visible
AI

Microsoft Build 2025: agents everywhere, governance nowhere

Build was the agents conference Microsoft has been preparing for. The agents pitch lands. The governance story underneath it is exactly as unfinished as it was a year ago, and the contradiction is starting to show.

Sid Smith Sid Smith 5 min read
An open vintage accountant's ledger with hundred-dollar bills tucked between pages and a brass calculator beside it
Cloud

Cost-modeling AI workloads with FinOps eyes

The per-token price is the easy line. Everything else, the retries, the context overhead, the agentic tool calls, the egress, the GPU reservation underneath the API, is where the actual bill comes from.

Sid Smith Sid Smith 5 min read
A polished bedrock stone slab on a dark surface with three small visible cracks running across it
Cloud

Three things AWS Bedrock still gets wrong

Bedrock has gotten meaningfully better in the last six months. The places it hasn't are still the same places. Worth being explicit about which gaps are likely to close and which look structural.

Sid Smith Sid Smith 5 min read
A dense network of glowing fiber-optic cables converging into a single central junction box
AI

MCP momentum: every major vendor has shipped one

Six months ago MCP was Anthropic's protocol nobody had implemented. Now it's a category every major vendor ships against. The thing nobody is asking is what that does to the protocol itself.

Sid Smith Sid Smith 5 min read