API Reference
- Getting Started
- Schemas
health
auth
protect
Create Workflows Run
Create a new Evaluate run with workflows.
Use this endpoint to create a new Evaluate run with workflows. The request body should contain the workflows
to be ingested and evaluated.
Additionally, specify the project_id
or project_name
to which the workflows should be ingested. If the project does not exist, it will be created. If the project exists, the workflows will be logged to it. If both project_id
and project_name
are provided, project_id
will take precedence. The run_name
is optional and will be auto-generated (timestamp-based) if not provided.
The body is also expected to include the configuration for the scorers to be used in the evaluation. This configuration will be used to evaluate the workflows and generate the results.
Authorizations
Body
List of workflows to include in the run.
Input to the step.
Timestamp of the step's creation, as nanoseconds since epoch.
Duration of the step in nanoseconds.
Ground truth expected output for the step.
Name of the step.
Output of the step.
Parent node of the current node. For internal use only.
Input to the step.
Timestamp of the step's creation, as nanoseconds since epoch.
Duration of the step in nanoseconds.
Ground truth expected output for the step.
Metadata associated with this step.
Name of the step.
Output of the step.
Parent node of the current node. For internal use only.
Input to the step.
Timestamp of the step's creation, as nanoseconds since epoch.
Duration of the step in nanoseconds.
Ground truth expected output for the step.
Metadata associated with this step.
Name of the step.
Output of the step.
Parent node of the current node. For internal use only.
Input to the step.
Timestamp of the step's creation, as nanoseconds since epoch.
Duration of the step in nanoseconds.
Ground truth expected output for the step.
Metadata associated with this step.
Name of the step.
Output of the step.
Parent node of the current node. For internal use only.
Status code of the step. Used for logging failed/errored steps.
Steps in the workflow.
Type of the step. By default, it is set to workflow.
chain
, chat
, llm
, retriever
, tool
, agent
, workflow
Status code of the step. Used for logging failed/errored steps.
Steps in the workflow.
Input to the step.
Timestamp of the step's creation, as nanoseconds since epoch.
Duration of the step in nanoseconds.
Ground truth expected output for the step.
Metadata associated with this step.
Name of the step.
Output of the step.
Parent node of the current node. For internal use only.
Status code of the step. Used for logging failed/errored steps.
Steps in the workflow.
Type of the step. By default, it is set to workflow.
"workflow"
Type of the step. By default, it is set to workflow.
chain
, chat
, llm
, retriever
, tool
, agent
, workflow
Status code of the step. Used for logging failed/errored steps.
Steps in the workflow.
Input to the step.
Timestamp of the step's creation, as nanoseconds since epoch.
Duration of the step in nanoseconds.
Ground truth expected output for the step.
Metadata associated with this step.
Name of the step.
Output of the step.
Parent node of the current node. For internal use only.
Input to the step.
Timestamp of the step's creation, as nanoseconds since epoch.
Duration of the step in nanoseconds.
Ground truth expected output for the step.
Metadata associated with this step.
Name of the step.
Output of the step.
Parent node of the current node. For internal use only.
Status code of the step. Used for logging failed/errored steps.
Steps in the workflow.
Type of the step. By default, it is set to workflow.
chain
, chat
, llm
, retriever
, tool
, agent
, workflow
Status code of the step. Used for logging failed/errored steps.
Steps in the workflow.
Input to the step.
Timestamp of the step's creation, as nanoseconds since epoch.
Duration of the step in nanoseconds.
Ground truth expected output for the step.
Metadata associated with this step.
Name of the step.
Output of the step.
Parent node of the current node. For internal use only.
Status code of the step. Used for logging failed/errored steps.
Steps in the workflow.
Type of the step. By default, it is set to workflow.
"workflow"
Type of the step. By default, it is set to workflow.
"workflow"
Type of the step. By default, it is set to workflow.
chain
, chat
, llm
, retriever
, tool
, agent
, workflow
Status code of the step. Used for logging failed/errored steps.
Steps in the workflow.
Input to the step.
Timestamp of the step's creation, as nanoseconds since epoch.
Duration of the step in nanoseconds.
Ground truth expected output for the step.
Metadata associated with this step.
Name of the step.
Output of the step.
Parent node of the current node. For internal use only.
Input to the step.
Timestamp of the step's creation, as nanoseconds since epoch.
Duration of the step in nanoseconds.
Ground truth expected output for the step.
Metadata associated with this step.
Name of the step.
Output of the step.
Parent node of the current node. For internal use only.
Input to the step.
Timestamp of the step's creation, as nanoseconds since epoch.
Duration of the step in nanoseconds.
Ground truth expected output for the step.
Metadata associated with this step.
Name of the step.
Output of the step.
Parent node of the current node. For internal use only.
Status code of the step. Used for logging failed/errored steps.
Steps in the workflow.
Type of the step. By default, it is set to workflow.
chain
, chat
, llm
, retriever
, tool
, agent
, workflow
Status code of the step. Used for logging failed/errored steps.
Steps in the workflow.
Input to the step.
Timestamp of the step's creation, as nanoseconds since epoch.
Duration of the step in nanoseconds.
Ground truth expected output for the step.
Metadata associated with this step.
Name of the step.
Output of the step.
Parent node of the current node. For internal use only.
Status code of the step. Used for logging failed/errored steps.
Steps in the workflow.
Type of the step. By default, it is set to workflow.
"workflow"
Type of the step. By default, it is set to workflow.
chain
, chat
, llm
, retriever
, tool
, agent
, workflow
Status code of the step. Used for logging failed/errored steps.
Steps in the workflow.
Input to the step.
Timestamp of the step's creation, as nanoseconds since epoch.
Duration of the step in nanoseconds.
Ground truth expected output for the step.
Metadata associated with this step.
Name of the step.
Output of the step.
Parent node of the current node. For internal use only.
Input to the step.
Timestamp of the step's creation, as nanoseconds since epoch.
Duration of the step in nanoseconds.
Ground truth expected output for the step.
Metadata associated with this step.
Name of the step.
Output of the step.
Parent node of the current node. For internal use only.
Status code of the step. Used for logging failed/errored steps.
Steps in the workflow.
Type of the step. By default, it is set to workflow.
chain
, chat
, llm
, retriever
, tool
, agent
, workflow
Status code of the step. Used for logging failed/errored steps.
Steps in the workflow.
Input to the step.
Timestamp of the step's creation, as nanoseconds since epoch.
Duration of the step in nanoseconds.
Ground truth expected output for the step.
Metadata associated with this step.
Name of the step.
Output of the step.
Parent node of the current node. For internal use only.
Status code of the step. Used for logging failed/errored steps.
Steps in the workflow.
Type of the step. By default, it is set to workflow.
"workflow"
Type of the step. By default, it is set to workflow.
"workflow"
Type of the step. By default, it is set to workflow.
"workflow"
Type of the step. By default, it is set to workflow.
"workflow"
List of generated scorers to enable.
Name of the scorer to enable.
Evaluate Project ID to which the run should be associated.
Evaluate Project name to which the run should be associated. If the project does not exist, it will be created.
List of registered scorers to enable.
Name of the scorer to enable.
Name of the run. If no name is provided, a timestamp-based name will be generated.
List of Galileo scorers to enable.
Alias of the model to use for the scorer.
"agentic_workflow_success"
Number of judges for the scorer.
1 < x < 10
"plus"
Was this page helpful?