POST /v1/evaluate/runs
curl --request POST \
  --url https://api.acme.rungalileo.io/v1/evaluate/runs \
  --header 'Content-Type: application/json' \
  --header 'Galileo-API-Key: <api-key>' \
  --data '{
  "project_name": "my-evaluate-project",
  "run_name": "my-evaluate-run",
  "scorers": [
    {
      "name": "correctness"
    },
    {
      "name": "output_pii"
    }
  ],
  "workflows": [
    {
      "created_at_ns": 1739567790708355300,
      "duration_ns": 0,
      "input": "who is a smart LLM?",
      "metadata": {},
      "name": "llm",
      "output": "I am!",
      "type": "llm"
    }
  ]
}'
{
  "message": "<string>",
  "project_id": "<string>",
  "project_name": "<string>",
  "run_id": "<string>",
  "run_name": "<string>",
  "workflows_count": 123,
  "records_count": 123
}
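
For readers who call the endpoint from Python rather than curl, the request above can be sketched with the standard library alone. This is a minimal sketch, not an official client: the helper name `build_run_request` is hypothetical, the URL and headers are copied from the curl example, and the request is only assembled, not sent.

```python
import json
import urllib.request

# Host and path copied from the curl example above.
API_URL = "https://api.acme.rungalileo.io/v1/evaluate/runs"


def build_run_request(api_key: str, payload: dict) -> urllib.request.Request:
    """Assemble (but do not send) the POST request (hypothetical helper)."""
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Galileo-API-Key": api_key,
        },
        method="POST",
    )


# Same payload as the curl example.
payload = {
    "project_name": "my-evaluate-project",
    "run_name": "my-evaluate-run",
    "scorers": [{"name": "correctness"}, {"name": "output_pii"}],
    "workflows": [
        {
            "created_at_ns": 1739567790708355300,
            "duration_ns": 0,
            "input": "who is a smart LLM?",
            "metadata": {},
            "name": "llm",
            "output": "I am!",
            "type": "llm",
        }
    ],
}

request = build_run_request("<api-key>", payload)

# To actually send it (requires network access and a valid API key):
# with urllib.request.urlopen(request) as response:
#     result = json.load(response)
```

Keeping request construction separate from sending makes it possible to inspect the URL, headers, and body before any network call is made.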

WorkflowStep

A workflow step is the atomic unit of logging to Galileo. It represents a single execution of a workflow, such as a chain, an agent, or a RAG execution. A workflow can have multiple steps, each of which can be a different type of node, such as an LLM, Retriever, or Tool.

You can log multiple workflows in a single request. Each workflow step must have the following fields:

  • type: The type of the workflow step, such as llm, retriever, or workflow.
  • input: The input to the workflow step.
  • output: The output of the workflow step.
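
The required fields listed above can be checked on the client before a request is sent. The following is a minimal sketch: `REQUIRED_STEP_FIELDS` and `missing_step_fields` are hypothetical names introduced for illustration, and the check only tests for the presence of each key, not for valid values.

```python
# Field names taken from the list of required workflow step fields.
REQUIRED_STEP_FIELDS = ("type", "input", "output")


def missing_step_fields(step: dict) -> list:
    """Return the required field names that are absent from a step."""
    return [field for field in REQUIRED_STEP_FIELDS if field not in step]


complete = {"type": "llm", "input": "What is the capital of France?", "output": "Paris"}
incomplete = {"type": "llm", "input": "What is the capital of France?"}

print(missing_step_fields(complete))    # []
print(missing_step_fields(incomplete))  # ['output']
```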

Examples

LLM Step

{
  "type": "llm",
  "input": "What is the capital of France?",
  "output": "Paris"
}

Retriever Step

{
  "type": "retriever",
  "input": "What is the capital of France?",
  "output": [{ "content": "Paris is the capital and largest city of France." }]
}

Multi-Step

Workflow steps of type workflow, agent, or chain can contain child steps, listed under steps. A workflow with a retriever step and an LLM step would look like this:

{
  "type": "workflow",
  "input": "What is the capital of France?",
  "output": "Paris",
  "steps": [
    {
      "type": "retriever",
      "input": "What is the capital of France?",
      "output": [{ "content": "Paris is the capital and largest city of France." }]
    },
    {
      "type": "llm",
      "input": "What is the capital of France?",
      "output": "Paris"
    }
  ]
}
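
Because steps can nest, client code that inspects a workflow usually needs a recursive walk. The sketch below assumes, as in the example above, that child steps live under the `steps` key; `iter_steps` is a hypothetical helper, not part of any Galileo library.

```python
from typing import Iterator


def iter_steps(step: dict) -> Iterator[dict]:
    """Yield a step, then each of its nested sub-steps, depth first."""
    yield step
    for child in step.get("steps", []):
        yield from iter_steps(child)


# The multi-step workflow from the example above.
workflow = {
    "type": "workflow",
    "input": "What is the capital of France?",
    "output": "Paris",
    "steps": [
        {
            "type": "retriever",
            "input": "What is the capital of France?",
            "output": [{"content": "Paris is the capital and largest city of France."}],
        },
        {
            "type": "llm",
            "input": "What is the capital of France?",
            "output": "Paris",
        },
    ],
}

step_types = [step["type"] for step in iter_steps(workflow)]
print(step_types)  # ['workflow', 'retriever', 'llm']
```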

Authorizations

  • Galileo-API-Key (string, header, required)

Body

application/json

  • workflows (object[], required): List of workflows to include in the run.
  • scorers (object[]): List of Galileo scorers to enable.
  • registered_scorers (object[]): List of registered scorers to enable.
  • generated_scorers (object[]): List of generated scorers to enable.
  • project_id (string | null): Evaluate Project ID to which the run should be associated.
  • project_name (string | null): Evaluate Project name to which the run should be associated. If the project does not exist, it will be created.
  • run_name (string | null): Name of the run. If no name is provided, a timestamp-based name will be generated.
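
Since only workflows is required, a body can be composed by starting from the workflows list and adding just the optional fields that are set. The sketch below shows one way to do that. `build_run_body` is a hypothetical helper, and leaving unset fields out of the body (rather than sending null) is an assumption about what the API accepts.

```python
# Optional body fields, as listed above.
OPTIONAL_FIELDS = {
    "scorers",
    "registered_scorers",
    "generated_scorers",
    "project_id",
    "project_name",
    "run_name",
}


def build_run_body(workflows: list, **optional) -> dict:
    """Build the JSON body, omitting optional fields that are None."""
    unknown = set(optional) - OPTIONAL_FIELDS
    if unknown:
        raise TypeError(f"unknown body fields: {sorted(unknown)}")
    body = {"workflows": workflows}
    body.update({key: value for key, value in optional.items() if value is not None})
    return body


body = build_run_body(
    [{"type": "llm", "input": "What is the capital of France?", "output": "Paris"}],
    project_name="my-evaluate-project",
    run_name=None,  # omitted, so a timestamp-based name would be generated
    scorers=[{"name": "correctness"}],
)
print(sorted(body))  # ['project_name', 'scorers', 'workflows']
```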

Response

200 (application/json): Successful Response

  • message (string, required)
  • project_id (string, required)
  • project_name (string, required)
  • run_id (string, required)
  • run_name (string, required)
  • workflows_count (integer, required)
  • records_count (integer, required)
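
Because every response field is required, a client can parse the 200 response into a typed record and fail early if a field is missing. The following is a minimal sketch: `RunResponse` and `parse_run_response` are hypothetical names, and the sample values are placeholders invented for illustration, not real API output.

```python
import json
from dataclasses import dataclass, fields


@dataclass(frozen=True)
class RunResponse:
    """Fields of the 200 response, as listed above."""
    message: str
    project_id: str
    project_name: str
    run_id: str
    run_name: str
    workflows_count: int
    records_count: int


def parse_run_response(raw: str) -> RunResponse:
    """Parse a JSON response body, raising ValueError if a field is missing."""
    data = json.loads(raw)
    names = [field.name for field in fields(RunResponse)]
    missing = [name for name in names if name not in data]
    if missing:
        raise ValueError(f"response is missing required fields: {missing}")
    return RunResponse(**{name: data[name] for name in names})


# Placeholder values for illustration only.
sample = json.dumps({
    "message": "ok",
    "project_id": "project-123",
    "project_name": "my-evaluate-project",
    "run_id": "run-456",
    "run_name": "my-evaluate-run",
    "workflows_count": 1,
    "records_count": 1,
})

result = parse_run_response(sample)
print(result.run_name, result.workflows_count)  # my-evaluate-run 1
```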