A tester’s guide to AI guardrails thumbnail

A tester’s guide to AI guardrails

AI systems introduce a new kind of risk. 

They don’t just fail; they produce plausible, unsafe, or misleading outputs while appearing correct. 

Guardrails are used to control these behaviours. But in most systems, they are: 

  •  vaguely defined 
  •  poorly tested 
  •  incorrectly implemented 

In this session, Rahul Parwal introduces a structured approach to learning AI guardrails through an interactive, scenario-based format.

Participants will work through short exercises that reflect real testing challenges:

  • Identifying types of guardrail failures 
  • Determining when a guardrail should trigger 
  • Recognising common attack patterns 
  • Improving weak system prompt rules 
  • Spotting implementation-level issues

By the end, participants will have a practical framework to test AI systems more systematically. 

Learning outcomes
  • Understand the  various categories of AI guardrails 
  • Practice a set of practical techniques to test AI guardrails 
  • A clearer understanding of where set guardrails fail in real systems

Comments

Oleksandr Romanov
Beautiful masterclass. Nice topic and very good delivery. Game for testing knowledge is a great idea. Thanks, Rahul!

Sign in to comment
Explore MoT
Leading with AI - The London Edition image
Fri, 19 Jun
A half-day educational experience to navigate the world of AI
Advanced prompting for testers image
Advanced prompting skills to turn AI into your trusted testing companion.
This Week in Quality image
Debrief the week in Quality via a community radio show hosted by Simon Tomes and members of the community
Subscribe to our newsletter