AI Guardrails: Who's Watching the Watchers (and the Watched)?
- Angira Mitra
- May 8
- 3 min read
Updated: 3 hours ago

Ah, AI guardrails. The topic du jour in every boardroom, especially those nestled in the hallowed, highly regulated halls of financial services. It seems everyone's scrambling to figure out how to keep these newfangled intelligent machines from going rogue, particularly when the penalties for a misstep are… well, let's just say "severe" feels like an understatement. As a wise old friend of ours once sagely remarked, "No AI will be sent to jail for getting it wrong – but a person could be for trusting it." And there, my friends, lies the delicious irony.
The Infallibility Myth: Human vs. Machine Edition
The case for human guardrails has been bandied about with all the gravitas of a parliamentary debate, and yes, the arguments hold water. Humans, after all, possess that elusive quality called "common sense," a trait conspicuously absent from even the most sophisticated algorithms. But here's the rub: while machines have a knack for amplifying inherent biases or, even more delightfully, hallucinating (because who needs reality when you can invent it?), humans aren't exactly paragons of unwavering perfection either. We're prone to stress, fatigue, boredom, and let's face it, just being gloriously, imperfectly human. So, when it comes to monitoring, who exactly is best placed to keep an eye on whom? It's a bit of a chicken-and-egg situation, only with potentially catastrophic financial consequences.
Dialling Up (or Down) the Trust Factor
It's entirely natural to approach any new technology with the cautious enthusiasm of a cat eyeing a cucumber, especially when individuals feel utterly out of control of the process. Welcome to the "land of black boxes," where decisions are made by unseen forces and explanations are, shall we say, minimal. This is precisely why we're rather fond of the idea of "dials" in our agentic design. Want to micromanage every single step? Dial down that automation! Feeling a bit more adventurous and trusting of the digital overlords? Dial it up, buttercup! It’s all about giving you the illusion of control, even if, deep down, we all know the machines are probably just humouring us.
Now, with the advent of multi-agentic design, we're promised more transparency and explainability. Apparently, these "deterministic agents" won't hallucinate – a truly revolutionary concept! They'll gather information from elected sources and, mercifully, won't just invent an answer because saying "I don't know" is apparently beneath them. A refreshing change from some of our human counterparts, perhaps?
The Perils of Human "Safety Measures"
But let's pause for a moment and consider the human element again. Who, pray tell, monitors the actions of the human? History, bless its cynical heart, has shown us that human safety measures haven't exactly been the impenetrable fortresses they were designed to be. Remember that self-driving car with the tragic accident? Turns out the "safety driver" (aka the human guardrail) wasn't paying attention. And air travel, despite its advanced technological marvels designed to support pilots, still sees pilots making mistakes. Because, you know, humans. They're easily distracted by shiny objects, a compelling TikTok feed, or perhaps just the existential dread of a Monday morning.
One of the biggest, most enduring challenges is the human tendency to simply take answers for granted, whether they come from automation, sheer availability, or the delightful phenomenon of cognitive bias (which, for the uninitiated, basically means believing what's easy to access, what you can only see, or, most powerfully, what you already know). Why bother interrogating something when a perfectly packaged answer is sitting right there, ready to be consumed? It's the intellectual equivalent of fast food.
The Symbiotic Solution: Where Machines Are Brilliant and Humans Are… Human
So, where does this leave us? In our humble opinion, guardrails are absolutely necessary. But here's the kicker: you need a delightful cocktail of both human and machine review, working in a truly symbiotic fashion. The machine, bless its silicon heart, is brilliant at pattern recognition. It can spot anomalies faster than a squirrel can bury a nut. The human, on the other hand, excels at that elusive thing called common sense and, more importantly, empathy. Because while an algorithm might optimize for efficiency, it's unlikely to shed a tear over a bad outcome.
The goal here isn't to replace infallible machines with equally infallible humans (a fool's errand, if ever there was one). Instead, it's about crafting a robust socio-technical systemwhere the strengths of both AI and humans are leveraged, and their respective weaknesses are mitigated through careful design, rigorous training, and a continuous commitment to improvement. It's about blending machine precision with human judgment and ethical reasoning to achieve a safer and, dare we say, more beneficial outcome for all.
So, who's monitoring whom? The answer, it seems, is everyone, and everything, all at once.And perhaps, just perhaps, that's precisely how it should be.



Comments