Question 1

What is bayesian-cage?

Accepted Answer

bayesian-cage is an open-source confidence gate for MCP tool calls. It sits between an agent and its tools, verifies each tool output, and returns PROCEED / FLAG / BLOCK with a calibrated confidence and a per-tool belief state. A BLOCK is returned to the host as an MCP error so a compliant client will not act on an output it cannot verify.

Question 2

Why not just ask the model how confident it is?

Accepted Answer

Because a model's own confidence is overconfident and uncalibrated. On a 55-task execution-graded text-to-SQL benchmark (phi-3 via Ollama, 5-fold, seed=7, 67.3% accuracy), the cage cut calibration error ~4× versus phi-3's self-reported confidence: ECE 0.081 vs 0.325, Brier 0.174 vs 0.322, and it caught a third of phi-3's wrong answers (catch-rate 0.33 vs 0.00, wrong-passed 12 vs 18) with zero correct answers blocked. AUROC stays near chance either way (0.544 vs 0.583) because phi-3's raw confidence isn't discriminative — the cage's win is calibration, not ranking. Correctness is labeled by executing the SQL against a real database, and the benchmark ships with the repo.

Question 3

How do I install it?

Accepted Answer

pipx install bayesian-cage (or pip install bayesian-cage for library use). Point Claude Desktop, Cursor, or any MCP host at the cage; it spawns your real MCP server as a stdio subprocess and gates every tool call. Python 3.10+, pure stdlib, no runtime dependencies. MIT licensed.

Question 4

Is BayesCore open source?

Accepted Answer

Yes. The cage is open source under the MIT license, with calibration evaluations you can reproduce yourself. The project is focused on open research, community contribution, and scaling the verification layer.

A confidence gate
for MCP tool calls.

Sits between the agent and its tools.

Front your MCP servers

Every output is verified

PROCEED · FLAG · BLOCK

Three decisions. One rule. No guessing.

Calibration you can reproduce.

An open roadmap.

Built in the open.

Read the code

Report a miss

Add a verifier

Reproduce the numbers

Not a confidence score.
A posterior probability.

metric	cage	raw phi-3
ECE — calibration (lower better)	0.081	0.325
Brier (lower better)	0.174	0.322
catch-rate (higher better)	33%	0%
acts on wrong outputs (lower better)	12	18
AUROC (higher better)	0.544	0.583

A confidence gatefor MCP tool calls.