LLM-as-a-Judge is a method where one large language model (LLM) evaluates the output of another model using defined criteria, such as relevance, factual accuracy, helpfulness, or tone.
It’s used as a scalable alternative to manual review: you give the judge model the prompt, the generated answer, and a rubric (the scoring criteria and scale), and it returns a score, label, or comparison verdict. This is especially useful for evaluating open-ended text, where exact-match or overlap-based automatic metrics are weak or unavailable.
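A minimal sketch of what this looks like in practice, assuming the OpenAI Python SDK, an illustrative rubric, and an arbitrary model name (`gpt-4o-mini`); the judge receives the rubric, the question, and the generated answer, and returns a JSON score object:

```python
import json
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

# Illustrative rubric; real rubrics should be tailored to the task.
RUBRIC = """Rate the ANSWER to the QUESTION on a 1-5 scale for each criterion:
- relevance: does it address the question?
- factual_accuracy: are its claims correct?
- helpfulness: would it actually help the asker?
Return JSON: {"relevance": int, "factual_accuracy": int, "helpfulness": int, "rationale": str}"""

def judge(question: str, answer: str, model: str = "gpt-4o-mini") -> dict:
    """Ask the judge model to score a single answer against the rubric."""
    response = client.chat.completions.create(
        model=model,
        messages=[
            {"role": "system", "content": RUBRIC},
            {"role": "user", "content": f"QUESTION:\n{question}\n\nANSWER:\n{answer}"},
        ],
        response_format={"type": "json_object"},  # ask for parseable JSON
        temperature=0,  # reduce scoring variance across runs
    )
    return json.loads(response.choices[0].message.content)

if __name__ == "__main__":
    scores = judge(
        question="What causes seasons on Earth?",
        answer="Seasons are caused by the tilt of Earth's axis relative to its orbital plane.",
    )
    print(scores)
```

The same pattern extends to pairwise comparison: instead of one answer, the judge is shown two candidate answers and asked to return a verdict such as "A", "B", or "tie".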