AI systems introduce a new kind of risk.
They don’t just fail; they produce plausible, unsafe, or misleading outputs while appearing correct.
Guardrails are used to control these behaviours. But in most systems, they are:
- vaguely defined
- poorly tested
- incorrectly implemented
In this session, Rahul Parwal introduces a structured approach to learning AI guardrails through an interactive, scenario-based format.
Participants will work through short exercises that reflect real testing challenges:
- Identifying types of guardrail failures
- Determining when a guardrail should trigger
- Recognising common attack patterns
- Improving weak system prompt rules
- Spotting implementation-level issues
By the end, participants will have a practical framework to test AI systems more systematically.
Learning outcomes
- Understand the various categories of AI guardrails
- Practice a set of practical techniques to test AI guardrails
- A clearer understanding of where set guardrails fail in real systems
Resources
Manage your entire QA lifecycle in one place. Sync Jira, automate scripts, and use AI to accelerate your testing.
Explore MoT
Thu, 1 Oct
A tech conference to help you navigate the ever-shifting landscape of Quality Engineering, AI, Leadership, Product, Accessibility and Security.
Unleash the power of generative AI to boost your software testing and day-to-day tech tasks
Debrief the week in Quality via a community radio show hosted by Simon Tomes and members of the community
Comments