> ## Documentation Index > Fetch the complete documentation index at: https://docs.galileo.ai/llms.txt > Use this file to discover all available pages before exploring further. # Improve LLM-as-a-Judge Metrics with Autotune > Use Autotune to turn feedback into prompt improvements that make LLM-as-a-judge metrics more accurate for your use case. LLM-as-a-judge metrics evaluate LLM application outputs at scale, but may not reflect your team's domain-specific standards out of the box. Whether you're adapting a preset metric to a new domain or refining a custom metric that still isn't accurate enough, the metric prompt often needs tuning to capture your specific evaluation criteria — and doing that manually is time-consuming and hard to scale. Teams typically rewrite prompts, test changes, and repeat that cycle across multiple rounds with no guarantee the result is right. Autotune lets anyone involved in building or reviewing metrics — annotators, product managers, or developers — provide feedback on metric outputs instead of editing prompts directly. Reviewers correct results and explain their reasoning in natural language. Galileo translates that feedback into prompt improvements and shows exactly what changed. ## When to use Autotune Use Autotune to improve metric performance when: * A new custom metric isn't accurate enough for your use case * An existing metric isn't generalizing well to a new domain or use case * An existing metric is producing inconsistent results with low reviewer agreement in production * The current prompt isn't handling domain-specific edge cases reliably * Manual prompt iteration is too time-consuming to scale ## How it works ### See Autotune in action