Artificial Intelligence

Enterprise AI Breaks at Audit Time — Abdul Nadeem Mohammed Solved the Retrofit Problem Before It Arrived

He built compliance infrastructure at BlackRock and modernised NYC's platform for 8 million residents, and now explains why governance retrofitted last is always a rebuild.

Written By : Arundhati Kumar

In August 2026, the question will no longer be whether an organisation has an AI strategy; it will be whether that strategy can be proven. According to Grant Thornton's 2026 AI Impact Survey of nearly 1,000 senior business leaders, 78% lack full confidence that their organisation could pass an independent governance audit within 90 days. The systems are being built. The accountability layer is not keeping pace.

The question of what it takes to build AI that is both compliant and fair in production is not abstract for Abdul Nadeem Mohammed; recognised as one of the leading specialists in enterprise AI governance and government-scale platform architecture in the United States, it has been the operational constraint of every system he has built. At BlackRock, that meant designing a compliance engine in which a single misconfigured rule could create a conflict of interest across thousands of employees' trades globally. At the New York City Office of Technology and Innovation, it meant modernising the only data platform connecting nine health and human services agencies, where an access control failure would not trigger a compliance alert; it would reach a family in crisis before anyone noticed something had gone wrong. When his peer-reviewed research on machine learning fairness was accepted at IEEE COINS 2025, it was not a theoretical contribution. It was a documentation of what production had already required him to solve. His receipt of the BrainTech Best Lead Developer Award, an international recognition for measurable technical leadership and engineering excellence, and in April 2026, serving as a jury member for the Cases & Faces International Business Award alongside practitioners from Yandex Technologies and DXC Luxoft. He saw the same gap from the other side: projects with capable models and pipelines that could not survive scrutiny.

Why Governance Built Last Always Costs More

That gap between capable models and pipelines that cannot survive scrutiny has a root cause. Enterprises have accuracy metrics but no fairness audit trails to back them up. Policy documents exist where working audit trails should. That gap is not a documentation problem; it is a sequencing problem, and it compounds over time.

“Here is what I see over and over again: engineers treat compliance and fairness like they are someone else's problem,” Mohammed explains. “Build the system first, add the governance later. That is the assumption. And it fails. Every single time, in production, it fails. Because by the time you go back to retrofit it, the architecture was never built to support it.”

The governance requirements shape what the architecture needs to look like from the start. Retrofitting them onto a system built without them is not a configuration task. It is a rebuild.

The financial sector makes this concrete. At BlackRock, Mohammed built the compliance engine governing employee securities trading inside the Aladdin ecosystem, a platform managing data for institutions overseeing tens of trillions in global assets. He designed the system to automate pre-clearance and post-clearance of employee trade requests, evaluating each against live portfolio data. Transactions that would create conflicts of interest were blocked automatically, without manual review, across thousands of employees globally. He also automated CI/CD pipelines that cut software release cycles by 50% without compromising audit integrity. This was not merely an operational efficiency gain. At an institution managing data at the scale of tens of trillions in assets, a release pipeline is a governance instrument. Slower release cycles mean delayed propagation of updated compliance rules across a global workforce. Mohammed's 50% reduction in cycle time directly accelerated the enforcement of regulatory logic firm-wide, a compliance-critical infrastructure achievement that almost no engineer ever operates at this scale. 

The depth and consequence of this work were recognised internationally: Mohammed received the BrainTech Best Lead Developer Award in 2024, an honour given to engineers whose work demonstrates measurable impact, technical innovation, and leadership excellence at the highest institutional levels. At BlackRock, those conditions were not aspirational; they were the baseline requirement of every decision he made.

The result was faster regulatory responsiveness without any degradation of audit integrity, demonstrating that delivery speed and compliance rigour are not in tension when the architecture is built correctly from the start. Compliance was not added at the end of that process. It was the load-bearing structure built first.

The lesson applies to teams preparing for August 2026. Governance that is designed into a system costs one thing to maintain. Governance that is retrofitted costs multiples of that, in time, in architectural rework, and in regulatory exposure.

When the Failure Is Invisible, the Architecture Is the Only Warning System

The cost of retrofitting is not only financial. In civic infrastructure, it is invisible, and invisibility is worse. A misconfigured access control surface shows incomplete records to a caseworker. No review triggers. The family never knows anything failed.

This failure mode is not hypothetical. HHS Worker Connect is the only system enabling real-time data sharing across nine New York City agencies, child welfare, homeless services, Medicaid, juvenile justice, and five others, serving more than eight million residents. When Mohammed took on the modernization of that platform in January 2025, every architectural decision carried direct consequences. A misconfigured access control rule would not produce a bad metric. It would result in a family receiving slower service, with no awareness that anything had gone wrong. He designed the full-stack architecture from the ground up: role-based access control governing what each agency's caseworkers could retrieve, JWT-based authentication, and LLM-based intelligent search across agency databases.

What Mohammed built at NYC OTI is more than a modernised government platform. The integration of LLM-powered intelligent search across nine disparate agency databases, enabling caseworkers to query fragmented legacy records through a single conversational interface, represents a pioneering application of AI to government civic infrastructure. There is no established playbook for embedding large language model capabilities into legacy multi-agency government systems at this scale. Now, his architecture is a working model for the broader field of civic technology: demonstrating that AI-driven data retrieval can be deployed within the access control, audit logging, and jurisdictional compliance constraints that government systems demand. As cities and federal agencies across the country face the same legacy modernisation imperative, Mohammed's approach at NYC OTI offers a replicable framework for how to do it without sacrificing security, accountability, or regulatory integrity.

“When you are the only person standing between a legacy system and 8 million residents who depend on it, there is no room for shortcuts,” Mohammed says. “Role-based access, secure authentication, and real-time data retrieval, none of that is optional in a government system. It is the entire product.”

To address the fairness problem systematically, Mohammed published a hyperparameter tuning framework at IEEE COINS 2025,  one of the IEEE's flagship international conferences, accepting only 30 to 40 papers from a globally competitive submission pool of thousands, through IEEE, the world's most authoritative professional organisation for engineering and computing. The paper received Best Paper recognition at the conference, a distinction awarded to only a handful of submissions. The framework provides a method for evaluating whether model behaviour differs across population subgroups before deployment, offering a structured pre-deployment test that most pipelines currently skip.

But production experience makes clear that model-level fairness evaluation is only one layer. The pipeline around the model carries equal weight: data quality across each source agency, access control rules governing retrieval, logging that allows reconstruction of every decision, and monitoring infrastructure that catches anomalies before they become patterns. A fairness audit of the model alone, without auditing the pipeline, will not satisfy regulators. More importantly, it will not prevent the failure it is supposed to prevent.

Is Your Integration Barrier an Architecture Problem?

The pipeline problem points to something deeper. Most AI leaders cite legacy integration as their primary barrier, but that too is a symptom, not the cause.

“The integration barrier everyone keeps citing, that is not the problem,” Mohammed notes. “That is a symptom. If your system was not designed with audit trails, if access control was not modular from day one, if your data pipelines were never observable, you cannot fix that by wrapping governance around it later. You have to get the foundation right. That is the only way this works.”

That observation is grounded in direct exposure to how AI projects fail under scrutiny. Serving as a jury member for the Cases & Faces International Business Award an international recognition programme that selects jury members exclusively for outstanding professional distinction, with only the most accomplished industry practitioners considered, in April 2026, evaluating AI and data analytics projects alongside senior professionals from Yandex Technologies and DXC Luxoft, Mohammed found that the projects that struggled to pass scrutiny shared a consistent profile: capable models, insufficient pipelines. The audit gap was not in the AI itself. It was in the architecture surrounding it.

Three architectural decisions determine whether a system can survive regulatory scrutiny. First, observability must be built into the data pipeline from the start, not as a log aggregator added after the fact, but as a first-class component capable of reconstructing what data state produced a given decision. Second, access control must be modular enough to be audited and adjusted at the component level without triggering system-wide changes. Third, the logging infrastructure must capture not just what decisions were made, but what data and model state produced them.

Fully integrated AI organisations report revenue growth far more often than those still piloting. The difference is not model sophistication. It is whether the infrastructure was built to prove the model works.

For engineering teams, the deadline is not abstract. August 2026 is the date. The three conditions above are the test. The organisations that have not yet checked whether their production systems meet their requirements are not facing a documentation problem. They are facing an architecture problem, and the longer that goes unanswered, the more expensive the answer becomes.

SEC Delays Tokenized Stock Proposal Amid Concerns Over Ownership and Regulation

Crypto Prices Today: Bitcoin Rises to $77,290 as Hormuz Tensions Cool, Hyperliquid Surges 37% This Week

CLARITY Act Hits Senate Roadblock as Crypto Bill Faces New Challenges

XRP News Today: XRP Price Reclaims $1.36 as Whale Positioning and Network Growth Support Sentiment

Hong Kong Dollar Stablecoin HKDAP Completes Mainnet Test on Ethereum