LLM-as-a-Judge

LLM-as-a-Judge is a method where one large language model (LLM) evaluates the output of another model using defined criteria, such as relevance, factual accuracy, helpfulness, or tone.

It’s used as a scalable alternative to manual review: you give the judge model the original prompt, the generated answer, and a rubric (a guide or standard to judge against), and it returns a score, a label, or a comparison verdict. This is especially useful for evaluating open-ended text, where exact automatic metrics are weak or unavailable.
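The flow described above (prompt, answer, and rubric in; score out) can be sketched in a few lines of Python. This is a minimal illustration, not a reference implementation: the template wording, the 1 to 5 scale, the JSON reply format, and the names `build_judge_prompt`, `judge`, and `fake_judge_model` are all assumptions made for the example. The judge model is passed in as a plain function so that any real LLM client could be swapped in; here a stub stands in for it.

```python
import json
from typing import Callable

# Hypothetical template; real rubrics are usually more detailed.
JUDGE_TEMPLATE = """You are an impartial evaluator.
Rubric: {rubric}

Prompt given to the model:
{prompt}

Model answer:
{answer}

Reply with JSON only: {{"score": <integer 1-5>, "reason": "<one sentence>"}}"""


def build_judge_prompt(prompt: str, answer: str, rubric: str) -> str:
    """Combine the original prompt, the answer under test, and the rubric."""
    return JUDGE_TEMPLATE.format(rubric=rubric, prompt=prompt, answer=answer)


def judge(prompt: str, answer: str, rubric: str,
          call_model: Callable[[str], str]) -> dict:
    """Ask the judge model for a verdict and validate its reply.

    call_model is an assumed interface: it takes the judge prompt as text
    and returns the judge model's raw text reply.
    """
    raw = call_model(build_judge_prompt(prompt, answer, rubric))
    verdict = json.loads(raw)  # raises an error if the reply is not valid JSON
    score = int(verdict["score"])
    if not 1 <= score <= 5:
        raise ValueError(f"score out of range: {score}")
    return {"score": score, "reason": str(verdict.get("reason", ""))}


def fake_judge_model(judge_prompt: str) -> str:
    """Stub standing in for a real LLM call, so the sketch runs offline."""
    return json.dumps({"score": 4, "reason": "Relevant and accurate, slightly terse."})


result = judge(
    prompt="Explain what a flaky test is.",
    answer="A flaky test passes or fails inconsistently without code changes.",
    rubric="Score factual accuracy and helpfulness from 1 (poor) to 5 (excellent).",
    call_model=fake_judge_model,
)
print(result)
```

Validating the reply before trusting it matters in practice, because a judge model can return malformed or out-of-range output just like the model it is evaluating.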