Create a new Evaluate Run
Create a new Evaluate run with workflows.
Use this endpoint to create a new Evaluate run with workflows. The request body should contain the workflows to be ingested and evaluated.
Additionally, specify the project_id or project_name of the project into which the workflows should be ingested. If the project does not exist, it will be created. If the project exists, the workflows will be logged to it. If both project_id and project_name are provided, project_id will take precedence. The run_name is optional and will be auto-generated (timestamp-based) if not provided.
The body is also expected to include the configuration for the scorers to be used in the evaluation. The scorers are run against the ingested workflows to generate the results.
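As a rough illustration of the rules above, the following Python sketch assembles a request body. The helper name build_run_request and the top-level key workflows are assumptions made for illustration; project_id, project_name and run_name are the field names used on this page. The scorer configuration is left out because its field names are not given here.

```python
from typing import Any, Optional


def build_run_request(
    workflows: list[dict[str, Any]],
    project_id: Optional[str] = None,
    project_name: Optional[str] = None,
    run_name: Optional[str] = None,
) -> dict[str, Any]:
    """Assemble a request body (hypothetical helper, not part of any SDK)."""
    if project_id is None and project_name is None:
        raise ValueError("Provide project_id or project_name.")

    # "workflows" as the key name is an assumption based on the description above.
    body: dict[str, Any] = {"workflows": workflows}

    # Both may be sent; per the text above, project_id takes precedence server-side.
    if project_id is not None:
        body["project_id"] = project_id
    if project_name is not None:
        body["project_name"] = project_name

    # run_name is optional; when omitted, a timestamp-based name is generated.
    if run_name is not None:
        body["run_name"] = run_name
    return body


example_step = {"type": "llm", "input": "Hello", "output": "Hi there"}
body = build_run_request([example_step], project_name="demo-project")
print(body)
```

Leaving run_name out of the body, rather than sending an empty value, relies on the documented behavior that a missing name is auto-generated.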
WorkflowStep
A workflow step is the atomic unit of logging to Galileo. It represents a single execution of a workflow, such as a chain, an agent, or a RAG execution. A workflow can have multiple steps, each of which can be a different type of node, such as an LLM, Retriever, or Tool.
You can log multiple workflows in a single request. Each workflow step must have the following fields:
type: The type of the workflow.
input: The input to the workflow.
output: The output of the workflow.
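The three required fields can be checked before sending a request. The sketch below is a hypothetical client-side helper, not part of the API. The lowercase type values "llm" and "retriever", and the list-of-strings shape of the retriever output, are assumptions made for illustration.

```python
from typing import Any

# The fields every workflow step must carry, as listed above.
REQUIRED_STEP_FIELDS = ("type", "input", "output")


def missing_step_fields(step: dict[str, Any]) -> list[str]:
    """Return the required field names that are absent from a step."""
    return [name for name in REQUIRED_STEP_FIELDS if name not in step]


# Illustrative steps; the type strings and output shapes are assumptions.
llm_step = {
    "type": "llm",
    "input": "What is the capital of France?",
    "output": "Paris",
}
retriever_step = {
    "type": "retriever",
    "input": "capital of France",
    "output": ["Paris is the capital of France."],
}
incomplete_step = {"type": "llm", "input": "Hello"}

print(missing_step_fields(llm_step))         # []
print(missing_step_fields(retriever_step))   # []
print(missing_step_fields(incomplete_step))  # ['output']
```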
Examples
LLM Step
Retriever Step
Multi-Step
Workflow steps of type workflow, agent or chain can have sub-steps with children. A workflow with a retriever and an LLM step would look like this:
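A minimal Python sketch of such a nested workflow follows. The key name children for the sub-steps is taken from the wording above and is an assumption about the actual schema, as are the lowercase type values "retriever" and "llm" and the shape of the retriever output. The iter_steps helper is hypothetical and only walks the structure to show the nesting.

```python
from typing import Any, Iterator

# Assumption: sub-steps are nested under this key; check the schema before relying on it.
CHILD_KEY = "children"


def iter_steps(step: dict[str, Any]) -> Iterator[dict[str, Any]]:
    """Yield a step and all of its nested sub-steps, depth-first."""
    yield step
    for child in step.get(CHILD_KEY, []):
        yield from iter_steps(child)


workflow = {
    "type": "workflow",
    "input": "What is the capital of France?",
    "output": "Paris",
    CHILD_KEY: [
        {
            "type": "retriever",
            "input": "capital of France",
            "output": ["Paris is the capital of France."],
        },
        {
            "type": "llm",
            "input": "Context: Paris is the capital of France. Question: What is the capital of France?",
            "output": "Paris",
        },
    ],
}

print([s["type"] for s in iter_steps(workflow)])  # ['workflow', 'retriever', 'llm']
```

Each sub-step carries the same type, input and output fields as a top-level step, so the same validation can be applied at every level.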
Authorizations
Query Parameters
Body
List of workflows to include in the run.
List of Galileo scorers to enable.
List of registered scorers to enable.
ID of the Evaluate project with which the run should be associated.
Name of the Evaluate project with which the run should be associated. If the project does not exist, it will be created.
Name of the run. If no name is provided, a timestamp-based name will be generated.
Response