MCP Security: Review Checklist for Agent Tool Servers

What did the agent trust when it accepted this MCP tool?

The tempting answer is to trust the MCP server description because the connection succeeded. That answer is not useless, but it is too vague to operate. MCP security is the practice of reviewing how MCP clients, servers, tools, resources, credentials, and authorization flows behave when an AI agent can call them. The risk is not just connection security; it is tool trust under model-driven execution.

Query

mcp security

Generated hand-drawn illustration of an agent policy gate routing read, write, and external actions.

Direct answer

MCP security is the practice of reviewing how MCP clients, servers, tools, resources, credentials, and authorization flows behave when an AI agent can call them. The risk is not just connection security; it is tool trust under model-driven execution.

Data note

When this matters

A new MCP server is added to an agent environment.
Tool descriptions come from a server the agent may interpret as instructions.
MCP tools can read private data or mutate external systems.

Failure modes this page should catch

A malicious or compromised server exposes a tool description that manipulates the agent.
Tool names look harmless but perform broad writes or data exports.
Secrets are passed through context instead of a credential broker.
Server identity, version, and provenance are not tracked.
Logs omit the authorization decision that allowed a restricted call.

MCP server risk checklist

Gate	Signal	Action
Server trust	origin, owner, version, transport	Allowlist and review
Tool scope	read, write, external side effect	Classify before exposure
Prompt boundary	tool descriptions and returned text	Treat as untrusted data
Secret handling	tokens, API keys, customer data	Broker outside model context
Auditability	identity, scope, trace, approval	Require for restricted tools

Running example

An MCP server offers summarize_customer and email_customer. The security review lets the read tool run with scoped access, but routes the email tool through approval because it creates a customer-visible external action.

Copy the working template

Use the mcp server risk checklist above as the v1 artifact for this page. Replace the placeholders with your own agent names, tools, risk classes, and thresholds, then link the result back into your monitoring, tracing, security, and evaluation gates.

How this connects to the control-gates library

Frequently Asked Questions

What is MCP security?

MCP security reviews how Model Context Protocol clients and servers expose tools, resources, credentials, and authorization flows to AI agents.

What is the biggest MCP security mistake?

The biggest mistake is trusting tool descriptions and server-provided content as instructions. Agents should treat MCP text as data and rely on local policy for permission decisions.

How should MCP tools be classified?

Classify tools by what they can break: read-only, local mutation, external mutation, secret access, privileged data access, deployment, payment, or customer-visible communication.

The Takeaway

MCP security is the review of what an agent may trust, call, and expose through its tool layer.