LLM-as-a-Judge

LLM-as-a-Judge image
LLM-as-a-Judge is a method where one large language model (LLM) evaluates the output of another model using defined criteria, such as relevance, factual accuracy, helpfulness, or tone.

It’s used as a scalable alternative to manual review: you give the judge model the prompt, the generated answer, and a rubric (a guide or standard), then it returns a score, label, or comparison verdict. This is especially useful for evaluating open-ended text where exact automatic metrics are weak or unavailable.
Explore MoT
Leading with AI - The London Edition image
Fri, 19 Jun
A half-day educational experience to navigate the world of AI
MoT Software Testing Essentials Certificate image
Boost your career in software testing with the MoT Software Testing Essentials Certificate. Learn essential skills, from basic testing techniques to advanced risk analysis, crafted by industry experts.
This Week in Quality image
Debrief the week in Quality via a community radio show hosted by Simon Tomes and members of the community
Subscribe to our newsletter