Customize ChainPoll-powered Metrics

ChainPoll is a powerful, flexible technique for LLM-based evaluation built by Galileo’s Research team. It is used to power multiple Guardrail Metrics across the Galileo platform:

Context Adherence Plus
Chunk Attribution & Utilization
Completeness Plus
Correctness

ChainPoll combines chain-of-thought prompting with polling: it prompts an LLM multiple times and uses the responses to calculate a metric value. You can customize two levers for a ChainPoll metric:

The model that gets queried
The number of times that model is prompted (the number of judges)
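
To make the two levers concrete, here is a minimal sketch of the polling idea in plain Python. It is not Galileo's implementation: the `judge` callable is a hypothetical stand-in for a single chain-of-thought LLM call that returns a yes/no verdict, and aggregating the verdicts as the fraction of "yes" answers is an assumption made for illustration.

```python
from typing import Callable, List


def poll_score(judge: Callable[[str], bool], prompt: str, num_judges: int) -> float:
    """Ask the judge `num_judges` times and return the fraction of positive verdicts.

    Assumption: each call is one independent LLM request, and the metric value
    is the share of "yes" verdicts. This sketch requires at least one judge.
    """
    if num_judges < 1:
        raise ValueError("this sketch needs at least one judge")
    verdicts: List[bool] = [judge(prompt) for _ in range(num_judges)]
    return sum(verdicts) / num_judges


def make_scripted_judge(answers: List[bool]) -> Callable[[str], bool]:
    """Hypothetical stand-in for an LLM call: replays fixed verdicts in order."""
    remaining = iter(answers)

    def judge(_prompt: str) -> bool:
        return next(remaining)

    return judge


# Three judges, two of which answer "yes".
judge = make_scripted_judge([True, True, False])
score = poll_score(judge, "Is the response supported by the context?", num_judges=3)
print(round(score, 2))
```

In this framing, changing the model means swapping what `judge` calls, and changing the number of judges means changing `num_judges`.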

Generally, more capable models provide more accurate metric values, and a higher number of judges increases the accuracy and stability of those values, at a higher cost. We’ve configured the defaults of our ChainPoll-powered metrics to balance cost against accuracy.
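
The stability claim can be illustrated with a small simulation. This is a sketch under strong assumptions, not a model of any real LLM: each judge is treated as an independent coin flip that answers "yes" with a fixed probability (`p_yes`, a made-up value), and the score is the fraction of "yes" votes. Under those assumptions, the spread of the score across repeated evaluations shrinks as judges are added, while the number of LLM calls grows linearly with the number of judges.

```python
import random
import statistics


def simulated_score(rng: random.Random, p_yes: float, num_judges: int) -> float:
    """One evaluation: the fraction of simulated judges that vote "yes"."""
    votes = [rng.random() < p_yes for _ in range(num_judges)]
    return sum(votes) / num_judges


def score_spread(num_judges: int, p_yes: float = 0.7,
                 trials: int = 2000, seed: int = 0) -> float:
    """Standard deviation of the score over many repeated evaluations."""
    rng = random.Random(seed)  # fixed seed so the run is repeatable
    scores = [simulated_score(rng, p_yes, num_judges) for _ in range(trials)]
    return statistics.pstdev(scores)


spreads = {n: score_spread(n) for n in (1, 3, 7)}
for n, spread in spreads.items():
    # Each evaluation costs n simulated LLM calls.
    print(f"{n} judge(s): {n} call(s) per evaluation, score spread ~ {spread:.2f}")
```

Real judges are neither independent nor identically distributed, so treat the numbers as directional only.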

Changing the model or number of judges of a ChainPoll metric

You can customize the execution parameters of the AI-powered metrics from our Guardrail Store. By default, these metrics use gpt-4o-mini as the model and 3 judges. The exception is Chunk Attribution & Utilization, which uses 1 judge and does not allow the number of judges to be customized. To override the defaults, pass customized scorers when creating your run:

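import promptquality as pq

pq.EvaluateRun(..., scorers=[
    # Score Context Adherence Plus with GPT-4o and 7 judges instead of the defaults.
    pq.CustomizedChainPollScorer(
        scorer_name=pq.CustomizedScorerName.context_adherence_plus,
        model_alias=pq.Models.gpt_4o,
        num_judges=7,
    ),
])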

Customizable Metrics

The metrics that can be customized are:

Chunk Attribution & Chunk Utilization: pq.CustomizedScorerName.chunk_attribution_utilization_plus
Completeness: pq.CustomizedScorerName.completeness_plus
Context Adherence: pq.CustomizedScorerName.context_adherence_plus
Correctness: pq.CustomizedScorerName.correctness

Models supported

OpenAI or Azure models that use the Chat Completions API
Gemini 1.5 Flash and Pro through VertexAI

When entering the model name, use a model alias from this list.

Number of Judges supported

The number of judges can be set to an integer between 0 and 10.

Note: Chunk Attribution and Chunk Utilization don’t benefit from increasing the number of judges.
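
The constraints in this section can be checked before a run is created. The helper below is a hypothetical sketch, not part of the promptquality library: `check_num_judges` and its constants are made-up names, scorer names are passed as plain strings, and treating the 0 to 10 range as inclusive is an assumption.

```python
# Assumed inclusive bounds, based on "between 0 and 10" above.
MIN_JUDGES = 0
MAX_JUDGES = 10

# Metric whose judge count is fixed at 1 and cannot be customized.
FIXED_JUDGE_SCORERS = {"chunk_attribution_utilization_plus"}


def check_num_judges(scorer_name: str, num_judges: int) -> int:
    """Return num_judges if it is allowed for the scorer, else raise an error."""
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(num_judges, bool) or not isinstance(num_judges, int):
        raise TypeError("num_judges must be an integer")
    if scorer_name in FIXED_JUDGE_SCORERS and num_judges != 1:
        raise ValueError(f"{scorer_name} always uses 1 judge")
    if not MIN_JUDGES <= num_judges <= MAX_JUDGES:
        raise ValueError(f"num_judges must be between {MIN_JUDGES} and {MAX_JUDGES}")
    return num_judges


print(check_num_judges("context_adherence_plus", 7))
```

A check like this fails fast on a bad configuration instead of surfacing the problem after the run has started.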
