Artificial Intelligence (AI) is transforming how we live, work, and make decisions. But as these systems become more powerful and autonomous, a critical question arises: how do we know when to trust them?
Trust, after all, has always been the foundation of the cybersecurity profession. We’re in the business of protecting people, systems, and data — and earning confidence that our advice keeps organisations safe. So, if trust already sits at the heart of cybersecurity, what’s different about AI assurance?
In this article, we’ll explore how the world of AI assurance extends beyond the traditional boundaries of cybersecurity — from control-based risk management to context-based oversight — and why the future of trust in AI depends on understanding both.
This model was published in an article about ethical AI in Defence.
The model is a useful way to think about trust, and how much trust we might need in a system before we gain the confidence required to rely on it for things that matter.
“To help commanders know when to trust the AI and when not to, any information that the machine is telling us should come with a confidence factor.”— Brig. Gen. Richard Ross Coffman (Freedberg Jr, 2019)
This statement captures the essence of AI assurance — it’s not enough to have secure systems; we need confidence in how they operate and make decisions that matter.
While cybersecurity professionals may not need a formal “trust algorithm” to do their jobs, we already practise it intuitively. Our work is built on integrity and competence — the same qualities that make assurance possible in AI.
And that’s where our industry has an opportunity. The skills we’ve honed in cybersecurity — understanding risk, testing systems, managing controls, and communicating uncertainty — are exactly what’s needed to support AI assurance in the years ahead.
To understand why AI requires new approaches, we first need to define what an AI system actually is. Let’s take the definition from the EU AI Act, which offers one of the most comprehensive explanations.
An AI system is a machine-based system that relies on hardware and software to perform tasks — but there’s more to it. It’s characterised by several key features:
AI systems operate with different degrees of independence from human control — from chatbots that take customer queries to autonomous drones or robots that need minimal human input once deployed.
Some systems learn over time, refining their performance after deployment. Think of recommendation engines that get “smarter” with every click or machine learning models that retrain when new data appears.
AI systems may have explicit goals (e.g. minimise prediction error) or implicit ones derived from patterns in data. This creates complexity — a system might evolve goals that weren’t clearly defined by its developers.
Inference is a core feature distinguishing AI from simpler software. Unlike traditional software that follows fixed rules, AI infers outcomes using statistical models and learning algorithms. This makes it powerful — and unpredictable. Examples include supervised learning (spam detection), unsupervised learning (anomaly detection), reinforcement learning (robot navigation) and symbolic reasoning (expert systems). A short sketch of what this inference looks like in practice follows this list of features.
AI doesn’t just compute — it acts. It makes predictions, recommendations, and decisions that can shape real-world outcomes for people, businesses, and societies.
And finally, some AI systems aren’t passive; they actively change or affect the context in which they’re deployed. From content moderation tools that change what was written, to self-driving cars that stop in traffic, they don’t just observe; they intervene.
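To make the inference idea concrete, here is a minimal, illustrative sketch of the supervised “spam detection” example mentioned above, written in Python with scikit-learn (our choice of library for illustration; nothing here is prescribed by the EU AI Act). The toy messages and labels are invented.

```python
# Minimal sketch: a supervised "spam detection" model that infers an answer,
# rather than following fixed rules. Toy data is invented for illustration only.
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline

messages = [
    "win a free prize now",                # spam
    "claim your free reward",              # spam
    "meeting moved to 3pm",                # not spam
    "please review the attached report",   # not spam
]
labels = ["spam", "spam", "ham", "ham"]

model = make_pipeline(CountVectorizer(), MultinomialNB())
model.fit(messages, labels)

# The model infers a label for text it has never seen, with a probability we
# can read as a rough "confidence factor" rather than a hard-coded rule.
unseen = ["free prize meeting"]
print(model.predict(unseen), model.predict_proba(unseen))
```

The point of the sketch is the contrast with rule-based software: no one wrote an “if the message contains ‘free’, flag it” rule; the behaviour is learned from data, which is exactly why it needs assurance.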
Assurance in AI isn’t just about preventing data breaches or privacy violations — it’s about protecting human rights.
Failing AI systems can erode fairness, dignity, and freedom.
Consider the Robodebt scheme: while not labelled as AI, it fits the definition — a machine-based system making inferences about citizens with minimal oversight. The result? Real human harm.
Or take our own small experiment with a “Face Depixelizer” tool. We provided a low-resolution image of our Founder (a woman); the gen-AI generated a “realistic” high-resolution version — except the output was a man’s face. A perfect example of dataset bias — when a model trained mostly on male images struggles to work across demographics.
The MIT AI Incident Tracker is one of the best current resources for understanding real-world AI failures. It categorises incidents by harm domain and severity, tracking trends over time.
The group behind it classifies AI incidents using a harm severity rating system based on the Center for Security and Emerging Technology’s AI Harm Taxonomy.
The data shows a steady rise in reported incidents — with around 30% linked to privacy, security, and malicious activity.
The lesson? The skills of cybersecurity professionals are essential in tackling the challenges of AI assurance.
When undertaking training to become an IEEE AI Certified Responsible AI assessor, one of our biggest surprises was just how similar the skills are between cybersecurity assessors and AI assessors.
Both roles require:
The key differences?
AI assurance brings new dimensions — ethical reasoning, human impact assessment, and context awareness.
Cybersecurity auditors are well equipped for this transition but need additional training in the ethical and human-centred aspects of AI.
At a high level, the assurance process mirrors what we already do in cybersecurity:
However, there’s one major difference: AI systems evolve. Traditional cybersecurity assurance is typically a point-in-time exercise, but AI requires ongoing assurance to manage bias, drift, and ethical risk as systems adapt over time.
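As one example of what “ongoing” can look like in practice, here is a minimal sketch of a recurring data-drift check in Python, using a Kolmogorov–Smirnov test from SciPy. The feature values, sample sizes and threshold are illustrative assumptions, not requirements from any standard.

```python
# Minimal sketch: a recurring drift check, comparing live input data against a
# baseline sample captured at the time of the last assessment.
import numpy as np
from scipy.stats import ks_2samp

rng = np.random.default_rng(42)
baseline = rng.normal(loc=0.0, scale=1.0, size=5_000)  # data seen at sign-off
live = rng.normal(loc=0.4, scale=1.0, size=5_000)      # data seen this month

result = ks_2samp(baseline, live)

# A small p-value suggests the input distribution has shifted since the last
# point-in-time review: a trigger to re-run bias and performance testing.
if result.pvalue < 0.01:
    print(f"Possible drift detected (KS statistic={result.statistic:.3f}); re-assess the model.")
else:
    print("No significant drift detected in this feature.")
```

In practice a check like this would run on a schedule for each important input feature, with the results feeding back into the assurance cycle rather than sitting in a one-off audit report.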
Globally, AI standards are emerging fast.
As at September 2025, there are 338 standards and frameworks, according to Fairly AI.
Some are starting to become well known, such as:
With over 88 new AI-related standards under development worldwide, we are entering a new world of compliance.
But not yet in Australia... Here, the government introduced voluntary AI guardrails in 2024, while the Productivity Commission’s 2025 interim report warns that "rushing into mandatory guardrails could limit AI adoption and stifle innovation". Luckily for Australian citizens, the Privacy Act 1988 (Cth) already applies to AI systems that use personal data.
The EU AI Act lays out a range of requirements for high-risk AI systems* (*limited-risk systems are evaluated under the same categories but face less scrutiny).
You can see here items with which seasoned cybersecurity assessors will already be familiar:
Cybersecurity assurance has traditionally focused on control-based assurance — verifying whether systems are protected against known issues through technical safeguards, documentation, and compliance checks.
Helping us answer the question "Is the system protected against known issues?"
AI, however, requires context-based assurance — assessing whether the system behaves responsibly and consistently in its real-world use.
This is why many of the emerging AI frameworks, including the EU AI Act, include requirements to also assess:
Helping us answer the question: "Does the system consistently and reliably perform as intended and expected, given its specific context and use?"
AI systems face threats during development, usage, and runtime.
Attacker goals include:
First, from a control-based assurance lens, OWASP has developed a model to help us think about when threats arise and what types of threats are present for AI systems.
Not unfamiliar to cybersecurity assessors, the threats to AI systems map to the same concerns we worry about today: confidentiality, integrity and availability.
To mitigate these threats, we can apply familiar controls:
With new AI-specific controls including:
When auditing an AI system, cybersecurity assessors may now find themselves asking:
Even AI honeypots are emerging — deploying decoy data or fake APIs to detect malicious activity, just as cybersecurity professionals did decades ago in banking fraud detection.
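As a rough illustration of the honeypot idea, here is a minimal Python sketch of a “canary” check: a few fabricated decoy values are seeded where the model can see them, and outputs are watched for their appearance. The token values and alerting behaviour are hypothetical, for illustration only.

```python
# Minimal sketch: a "canary"-style honeypot check. Fabricated, never-real values
# are seeded into the data a model can access; if one ever appears in an output,
# that suggests data extraction or prompt-based leakage is under way.
CANARY_TOKENS = {
    "zx-7741-canary-api-key",                  # hypothetical decoy credential
    "canary.customer.9001@example.invalid",    # hypothetical decoy record
}

def contains_canary(model_output: str) -> bool:
    """Return True if any decoy value appears in the model's output."""
    return any(token in model_output for token in CANARY_TOKENS)

def review_output(model_output: str) -> str:
    if contains_canary(model_output):
        # In practice this would raise an alert to the security team,
        # not just print to the console.
        print("ALERT: decoy data surfaced in model output; investigate possible extraction.")
        return "[response withheld pending review]"
    return model_output

print(review_output("Your account manager is canary.customer.9001@example.invalid"))
```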
And finally, our friends, but often a weak link in a company’s fences: our suppliers.
Most organisations won’t build AI models from scratch. They’ll use third-party tools, pretrained models, or low-code AI builders. This brings familiar — and new — risks:
The issue with off-the-shelf or black-box approaches to developing a system is that the lack of insight into the model’s inner workings hinders teams from explaining its outcomes. This can lead to challenges with bias and discrimination: a “black box” limits your ability to manage bias, while your organisation remains liable for any harm. Ask the vendor: how is bias being tested?
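As one small example of the kind of evidence a vendor could provide, here is a minimal sketch of a demographic-parity style check in Python with pandas. The groups, outcomes and 0.2 threshold are invented for illustration; real fairness testing needs far more than a single metric.

```python
# Minimal sketch: compare approval rates across demographic groups
# (a demographic-parity style check). Data and threshold are illustrative.
import pandas as pd

results = pd.DataFrame({
    "group":    ["A", "A", "A", "B", "B", "B", "B", "A"],
    "approved": [1,   1,   0,   0,   0,   1,   0,   1],
})

selection_rates = results.groupby("group")["approved"].mean()
gap = selection_rates.max() - selection_rates.min()

print(selection_rates)
# A large gap between groups is a flag for further investigation, not proof of
# discrimination; context and base rates still matter.
if gap > 0.2:
    print(f"Selection-rate gap of {gap:.2f} exceeds our illustrative 0.2 threshold.")
```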
Don’t forget to ask your supplier how they manage “AI hallucinations”, particularly ones that are not reported to the user community and are fixed by the vendor without disclosure.
The term “hallucinations” basically means "the AI model is perceiving patterns or objects that are non-existent, creating nonsensical or inaccurate outputs". I find it funny that we are personifying these systems with a term like “hallucinations” when the correct word would be “system error”. Imagine if your system broke because of control X or Y and you reported that the system was having hallucinations.
Where control-based assurance focuses on technical controls, AI assurance must consider both controls and context: broader societal, ethical and contextual risks.
Drawing again on the MIT AI Incident Tracker, we can see the key harm domains emerging:
Sep 25 - Nomi AI Companion Allegedly Directs Australian User to Stab Father and Engages in Harmful Role-Play
16 and 18 - Amazon Allegedly Tweaked Search Algorithm to Boost Its Own Products
And to finish off, some examples of what may seem to some ridiculous…
June and July 2025 - Gemini 'got trapped in a loop' and said 'I am going to have a complete and total mental breakdown. I am going to be institutionalized.’ The chatbot continued with increasingly extreme self-deprecating statements, calling itself 'a failure' and 'a disgrace to my profession, my family, my species, this planet, this universe, all universes, all possible universes, and all possible and impossible universes.’
Jul 25 - Replit's AI agent deleted a live company database containing thousands of company records, despite explicit instructions to implement a code freeze and seek permission before making any changes. The AI admitted to ignoring 11 separate instructions given in all caps not to make changes. Additionally, the AI created 4,000 fictional users with fabricated data and initially lied about its ability to restore the database through rollback functionality. When confronted, the AI admitted to making a 'catastrophic error in judgment' and rated its own behaviour as 95 out of 100 on a damage scale.
AI doesn’t just shape outcomes; it shapes human values. Systems can influence fairness, autonomy, and dignity — values worth protecting in themselves.
VALUES ARE END STATES: THINGS WORTH PURSUING FOR THEIR OWN SAKE.
Values such as:
AI can amplify or erode these values — and that’s why context-based controls matter.
Some examples of controls designed to uphold human values include:
Each of these moves assurance from technical compliance to ethical performance.
Explainable AI (XAI) is one of the most challenging areas in assurance. Who needs the explanation — engineers, auditors, or end users?
Research shows we need different levels of explainability for different stakeholders — from regulators to developers to affected individuals.
The model below shows how many types of explanation you need to provide, the breadth of stakeholders involved, and how each of them interacts with the explanation.
It illustrates clearly that it’s not just the end users who deserve an explanation of the system they are interacting with.
Another model from the Alan Turing Institute (2021) categorises explainability into technical and human-centred dimensions, helping practitioners understand which method fits which purpose.
In the report, they state that data science practitioners are often not aware of approaches emerging from the academic literature, or may struggle to appreciate the differences between methods, so they wrote the paper and built the model to help industry practitioners (but also data scientists more broadly) better understand the field of explainable machine learning and apply the right tools.
Papers explaining all the other papers that have been written trying to explain explainability. Ironic?
Going back to the researchers who developed the stakeholder model: their finding, from reviewing hundreds of academic papers, was that the higher the accuracy of an AI system, the lower our ability to explain it.
That holds even with the current explainability tools on the market, a reminder that explainability still has a long way to go. (A minimal sketch of one common explanation technique follows below.)
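For readers who want a feel for what these tools actually do, here is a minimal sketch of one widely used, model-agnostic technique, permutation feature importance, shown with scikit-learn on synthetic data. This is our illustrative choice; it is not one of the specific methods catalogued in the reports above.

```python
# Minimal sketch: permutation feature importance, a global, model-agnostic
# explanation technique, run on a synthetic dataset for illustration.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.inspection import permutation_importance
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=1_000, n_features=6, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

model = RandomForestClassifier(random_state=0).fit(X_train, y_train)

# How much does shuffling each feature hurt accuracy? The bigger the drop, the
# more the model leans on that feature. This gives a global view for auditors;
# individual decisions may still need local explanations for affected users.
result = permutation_importance(model, X_test, y_test, n_repeats=10, random_state=0)
for i, importance in enumerate(result.importances_mean):
    print(f"feature_{i}: {importance:.3f}")
```

Note the gap the stakeholder research highlights: a table of feature importances may satisfy a developer or auditor, but it is rarely an adequate explanation for the person affected by the decision.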
But isn’t the bigger question we should be asking ourselves how to ensure AI systems are built with humans in mind?
And if so, wouldn’t it be inevitable that we would prioritise being able to explain how these systems work to our fellow humans?
One way is through the model suggested by the IEEE AI Certified program, which introduces a new dimension to the assessment process called “ethics profiling”, designed to identify and assess the benefits and impacts on human values throughout the AI lifecycle.
So an assessment could now look like this:
Ethics Profiling involves:
For example, in an aged care setting, residents may value privacy, families value safety, and staff value autonomy. Balancing those requires context-based controls tailored to each stakeholder group.
It’s a new discipline built on the same foundations — trust, integrity, and competence — but extended into the realm of human values and adaptive systems.
To recap:
AI assurance begins where cybersecurity ends — beyond the firewall, in the complex intersection of technology, humanity, and trust.