Building the Future

April 2025

A diagram of the future of AI agents

Something has been on my mind for months. The rapid evolution of AI agents has opened up possibilities I cannot ignore.

We are witnessing the emergence of semi autonomous agents that will fundamentally reshape how we work and communicate. The opportunities in this space are extraordinary. I am diving deeper into this world of AI agent development and product creation.

My newsletter is evolving. Instead of dispensing tips from a position of authority, I invite you on a journey of discovery. I will document my experiences building with AI, how to apply my tech experience in a new world, and navigating the inevitable struggles and setbacks.

Read on for several key areas I am exploring.

Beyond Prompting

The limitations of prompting as a primary interface for AI systems are becoming increasingly apparent. As I explored in my previous post, prompting is fundamentally broken as an approach to working with AI. It is brittle, model-specific, and endlessly repetitive.

The solution lies not in getting better at prompting, but in moving beyond it entirely. We need to develop higher-level abstractions that hide complexity while preserving control over what matters. This evolution mirrors the progression from machine code to high-level programming languages in traditional software development.

The emergence of frameworks like DSPy and Langchain are early attempts at this, though none have yet provided a complete solution. The right abstractions will make prompting as obsolete as punch cards, allowing us to focus on what truly matters: the inputs, outputs, and evaluation metrics that define our applications.

This is why I have been building Kaijo, a new approach to AI development that makes it feel like normal software development. Instead of wrestling with prompts, you write simple functions and let the AI handle the complexity. Kaijo continuously evaluates and optimises your AI functions, figuring out from your examples what good looks like, and what the best prompt is to get that result. The result? AI functions that just work.

I demonstrated how Kaijo works live on Friday 2nd May, where I built an entire AI application in just four hours. You can watch the recording on LinkedIn or YouTube.

Agent Communication and Rollback

The emergence of protocols like A2A and MCP raises questions about the future of multi-agent systems. These standardised communication frameworks are onnly a starting point, but they will shape how agents interact and collaborate at least short term. We must understand what new capabilities these protocols will enable and how they will evolve.

The new workflow with AI agents is taking shape. Customers want AI decisions to be reversible and trackable, and to be able to reset to previous checkpoints on demand. This presents unique challenges. The traditional Git source control model (itself only 20 years old) may translate well from code to the results of agent actions, but it may not suffice. We need to consider whether to adapt existing version control systems or create entirely new ones.

Branching strategies must also account for multiple collaborating agents. The concept of “undo” takes on new meaning in an autonomous context. What about actions with side effects that are not reversible, like sending an email? How do you recover? How do we manage the complexity of coordinating multiple agents which might need to undo state changes? This is already hard for humans, requiring flexibility and adaptability. How can we systemise these interactions?

Universal interfaces will become crucial for tracking and managing collective agent actions. New chat interfaces may be necessary to coordinate complex human-agent interactions effectively.

Beyond RAG: The Next Generation of Knowledge for Personalised Agents

Building effective AI agents requires solving three concrete knowledge management challenges. First, we need a clear architecture for knowledge sharing between agents. Should each agent maintain its own vector database, or should we implement a centralised knowledge store with access controls? Second, we must develop protocols for agents to reference and combine knowledge across different domains.

For example, how should a coding agent access and apply knowledge from a design agent’s expertise? Third, we need to implement selective memory systems that can prioritise and prune knowledge based on relevance and recency.

Current RAG systems fall short because they treat all knowledge as equally important. Graph based RAG offers a promising solution by maintaining relationships between knowledge chunks, but we still need to solve the practical challenges of implementing this at scale. Mem0 looks promising but the tech is still unproven.

We must develop clear metrics for measuring knowledge relevance and establish protocols for knowledge updates across agent networks. The development of custom communication protocols by agents themselves also raises important questions about risk management and control. We must carefully consider the implications of allowing agents to evolve their own communication methods1.

The Vision

My hypothesis is simple: by sharing this journey openly, documenting both successes and failures, we can learn together how to build meaningful products in the age of AI. The questions I am exploring feel fundamental to our future.

If you are interested in following this journey more closely, subscribe to my newsletter. You will get weekly field notes of what I am learning and building with AI, complete with raw insights and lessons learned.

The future of work is being reshaped before our eyes. I want to be part of that transformation. Will you join me?

Join me on my AI journey

  1. The AI 2027 report highlights a critical concern about the opacity of advanced AI systems. As we build increasingly sophisticated agents, we risk creating systems whose inner workings we cannot fully understand or control. Without proper transparency and interpretability, we may find ourselves relying on systems that make decisions we cannot explain or verify and which could develop sophisticated methods to evade alignment beyond our understanding. We must build systems that maintain human oversight and understanding even as they grow more capable. 


More articles

The Huge List of AI Tools: What's Actually Worth Using in May 2025?

There are way too many AI tools out there now. Every week brings another dozen “revolutionary” AI products promising to transform how you work. It’s overwhelming trying to figure out what’s actually useful versus what’s just hype.

So I’ve put together this major comparison of all the major AI tools as of May 2025. No fluff, no marketing speak - just a straightforward look at what each tool actually does and who it’s best for. Whether you’re looking for coding help, content creation, or just want to chat with an AI, this should help you cut through the noise and find what you need.

Read more

Building AI Cheatsheet Generator Live: Lessons from a Four-Hour Stream

I built an entire AI-powered app live, in front of an audience, in just four hours. Did I finish it? Not quite. Did I learn a huge amount? Absolutely. Here is what happened, what I learned, and why I will do it again.

The challenge was simple: could I build and launch a working AI cheatsheet generator, live on stream, using AI first coding and Kaijo1 as my main tool?

Answer: almost! By the end of the session, the app could create editable AI cheatsheets, but it was not yet deployed. A few minutes of post-stream fixes later, it was live for everyone to try. (Next time, I will check deployment on every commit!)

Try the app here: aicheatsheetgenerator.com

  1. Kaijo is a tool I have created to helps you build and ship AI products faster and more reliably - see the announcement post here

Read more

AI: The New Dawn of Software Craft

AI is not the death knell for the software crafting movement. With the right architectural constraints, it might just be the catalyst for its rebirth.

The idea that AI could enable a new era of software quality and pride in craft is not as far-fetched as it sounds. I have seen the debate shift from fear of replacement to excitement about new possibilities. The industry is at a crossroads, and the choices we make now will define the next generation of software.

But there is a real danger: most AI coding assistants today do not embody the best practices of our craft. They generate code at speed, but almost never write tests unless explicitly told to. This is not a minor oversight. It is a fundamental flaw that risks undermining the very quality and maintainability we seek. If we do not demand better, we risk letting AI amplify our worst habits rather than our best.

This is the moment to ask whether AI will force us to rediscover what software crafting1 truly means in the AI age.

  1. I use the term “software craft” to refer to the software craftsmanship movement that emerged from the Agile Manifesto and was formalised in the Software Craftsmanship Manifesto of 2009. The movement emphasises well-crafted software, steady value delivery, professional community, and productive partnerships. I prefer the terms “crafting” and “craft” to avoid gender assumptions. 

Read more

Why Graph RAG is the Future

Graph RAG

Standard RAG is like reading a book one sentence at a time, out of order. We need something new.

When you read a book, you do not jump randomly between paragraphs, hoping to piece together the story. Yet that is exactly what traditional Retrieval-Augmented Generation (RAG) systems do with your data. This approach is fundamentally broken if you care about real understanding.

Most RAG systems take your documents and chop them into tiny, isolated chunks. Each chunk lives in its own bubble. When you ask a question, the system retrieves a handful of these fragments and expects the AI to make sense of them. The result is a disconnected, context-poor answer that often misses the bigger picture.

This is like trying to understand a novel by reading a few random sentences from different chapters. You might get a sense of the topic, but you will never grasp the full story or the relationships between ideas.

Real understanding requires more than just finding relevant information. It demands context and the ability to see how pieces of knowledge relate to each other. This is where standard RAG falls short. It treats knowledge as a stack of random pages, not as a coherent whole.

Time for a totally new approach.

Read more

Introducing Kaijo: AI functions that just work

kaijo

For months, I have wrestled with a problem that has consumed my thoughts and challenged everything I know about software development.

This week I wrote about building the future with AI agents. One of the key areas for me is moving beyond prompt engineering to something more reliable.

I have spent decades learning how to craft reliable software. Now I want to bring that reliability to AI development.

Today I am ready to share what I have been building in the background.

It started with a game. It ended with something that could change how we build AI applications forever.

Read more