Your AI tools should know about your Braintrust experiments. They should understand your log schemas, help debug failed evaluations, and answer questions about your model performance.
Today, we're introducing Braintrust's Model Context Protocol (MCP) server. MCP is Anthropic's open standard that lets AI tools securely access external data sources. Our MCP server exposes your Braintrust data to popular AI coding tools, so they can query it directly and help you uncover insights about your app's structure and performance.
Building and evaluating AI applications means constantly jumping between tools like your code editor, the Braintrust UI, various documentation, and more.
Without proper tooling, each context switch breaks your flow. The tools that should be helping you debug and improve your product might have no idea what experiments you're running in Braintrust or how they're performing. Closing that gap yourself means building custom evaluation infrastructure from scratch.
With our MCP integration, your AI tools of choice become aware of your Braintrust projects, experiments, and data. You can now:
Query experiments naturally: "What were the accuracy scores for my recent sentiment analysis experiments?" Get instant answers from your data.
Debug failures in context: "Show me examples where my model failed on edge cases." See specific data points and understand what went wrong.
Get contextual documentation: "How do I create a custom scorer?" Find relevant examples based on your current project.
Compare model performance: "Compare GPT-5 vs. Claude Sonnet 4 performance on my customer support dataset." Run analysis and get explanations automatically.
We support the most popular AI coding tools:
claude mcp add --transport http braintrust https://api.braintrust.dev/mcp
Authentication is handled via OAuth 2.0 with your existing Braintrust account. If you're using SSO, it works with that too. For self-hosted instances, the MCP server runs within your environment, so your data never leaves your infrastructure.
Once connected, your AI assistant gains access to the following tools:
infer_schema
With Braintrust's MCP, your AI assistant finally understands your context. It knows your projects and data without explanations, accesses experiment results directly, and runs performance comparisons automatically.
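For readers curious about what an assistant does with a tool such as infer_schema, here is a minimal sketch of the messages an MCP client sends. MCP is built on JSON-RPC 2.0, and the protocol defines `tools/list` for discovering tools and `tools/call` for invoking one. The helper names below (`jsonrpc_request`, `list_tools_request`, `call_tool_request`) are hypothetical and chosen for illustration, and the empty `arguments` object is a placeholder: the actual input schema for infer_schema is not described here and would come from the server's `tools/list` response. The sketch only builds the payloads; the HTTP transport and OAuth handshake are left to your AI tool.

```python
import itertools
import json

# Monotonic request ids; JSON-RPC only requires each in-flight id to be unique.
_ids = itertools.count(1)


def jsonrpc_request(method, params=None):
    """Build a JSON-RPC 2.0 request object as a plain dict (hypothetical helper)."""
    message = {"jsonrpc": "2.0", "id": next(_ids), "method": method}
    if params is not None:
        message["params"] = params
    return message


def list_tools_request():
    """Ask the server which tools it exposes (MCP method `tools/list`)."""
    return jsonrpc_request("tools/list")


def call_tool_request(name, arguments=None):
    """Invoke one tool by name (MCP method `tools/call`)."""
    return jsonrpc_request(
        "tools/call", {"name": name, "arguments": arguments or {}}
    )


if __name__ == "__main__":
    print(json.dumps(list_tools_request(), indent=2))
    # Placeholder arguments: the real parameters for `infer_schema` are an
    # unknown here and should be read from the `tools/list` response.
    print(json.dumps(call_tool_request("infer_schema"), indent=2))
```

In practice you never write these messages by hand. The assistant picks a tool based on your question, fills in the arguments from the schema the server advertises, and folds the result into its answer.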
Get started with MCP and let your AI assistant join your evaluation workflow.