Evaluate
Simple 'Prompt Run'
Run an evaluation over your prompts.
Running a Prompt Sweep
Run an evaluation over a combination of model, params and prompt templates to prompt engineer your prompts.
QA Chatbots
Evaluate and compare 3 RAG-based QA Chatbots with OpenAI
RAGOpenAI
RAGOpenAI
Summarization
Evaluate and compare 5 LLM-based summarization bots
SummarizationOpenAIMistralGemini
SummarizationOpenAIMistralGemini
Langchain Integration
Evaluation of a RAG-based QA Chatbot built with Langchain and ChromaDB
RAGLangchainChromaDB
RAGLangchainChromaDB
Registering a AI-powered custom scorer
Learn how to register a custom GPT scorer.GPT-powered metric
Zero-Shot
Integrate a topic detection model into a Galileo run through a Galileo CustomMetric
Observe
QA Chatbot
Monitor a RAG-based QA Chatbot with OpenAI
RAGOpenAI
RAGOpenAI
Summarization
Monitor a LLM-based summarization bot
SummarizationOpenAI
SummarizationOpenAI