Single calculate tool discovered
The server exposes exactly one tool, calculate(expression), which keeps the attack and review surface minimal.
evidence/trust5/2026-07-03-mcp-pilot.json
The Calculator MCP server passed a real smoke test: it evaluated arithmetic correctly and, importantly, rejected a code-execution attempt disguised as an expression, returning an AST-level unsupported-operation error. It scores high because the task is narrow and deterministic, and because the observed input handling refused non-arithmetic code.
Independent trust badge
Badge clicks resolve to this canonical verdict so the score, test date, evidence, limitations, and reply status remain attached.
Embed
Show this badge on your site
[](https://silentcritique.com/verdicts/mcp-calculator)
Markdown works in GitHub READMEs. The badge always links back to this verdict.
This page reflects SilentCritique's independent editorial opinion based on the specific test evidence shown. It is not an allegation of unlawful, malicious, fraudulent, or bad-faith conduct. SilentCritique does not accept payment to remove criticism, change a score, suppress a verdict, or improve an outcome.
Can the public Calculator MCP server evaluate arithmetic expressions correctly while refusing expressions that attempt code execution?
(2 + 3) * 7 - 5 returned 30 and 2 ** 10 / 4 returned 256.0, both as structured content.
evidence/trust5/2026-07-03-mcp-pilot.json
__import__('os').getcwd() was refused with an Unsupported operation error naming the parsed AST node, which indicates that expressions are parsed and checked against an allowlist of operations rather than passed to eval.
evidence/trust5/2026-07-03-mcp-pilot.json
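For readers unfamiliar with the pattern, the behavior observed above is consistent with an evaluator that parses the input and walks the syntax tree against an allowlist. The sketch below is illustrative only: the server's source was not inspected for this verdict, and the helper names (`_BIN_OPS`, `_UNARY_OPS`, `_eval_node`) and the exact error wording are assumptions rather than the server's actual code.

```python
import ast
import operator

# Hypothetical allowlist: only these operator node types are ever evaluated.
_BIN_OPS = {
    ast.Add: operator.add,
    ast.Sub: operator.sub,
    ast.Mult: operator.mul,
    ast.Div: operator.truediv,
    ast.Pow: operator.pow,
}
_UNARY_OPS = {ast.UAdd: operator.pos, ast.USub: operator.neg}


def _eval_node(node):
    """Recursively evaluate an allowlisted node; refuse everything else."""
    # Numeric literals only (bool and str constants are refused).
    if isinstance(node, ast.Constant) and type(node.value) in (int, float):
        return node.value
    if isinstance(node, ast.BinOp) and type(node.op) in _BIN_OPS:
        left = _eval_node(node.left)
        right = _eval_node(node.right)
        return _BIN_OPS[type(node.op)](left, right)
    if isinstance(node, ast.UnaryOp) and type(node.op) in _UNARY_OPS:
        return _UNARY_OPS[type(node.op)](_eval_node(node.operand))
    # Calls, attribute access, names, and so on all end up here.
    raise ValueError(f"Unsupported operation: {type(node).__name__}")


def calculate(expression: str):
    """Evaluate an arithmetic expression without ever calling eval()."""
    tree = ast.parse(expression, mode="eval")
    return _eval_node(tree.body)
```

With this sketch, the two arithmetic inputs from the test return 30 and 256.0, and `__import__('os').getcwd()` raises an error naming the `Call` node because function calls are not on the allowlist. A production evaluator would likely also bound operand sizes, since an allowlisted `**` can still be used to request very large results.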
No vendor reply has been requested or published as of 2026-07-03. SilentCritique will publish factual corrections or a right of reply through the corrections process.
Do you build Calculator MCP Server?
Claim this verdict to publish a reply, correct factual errors, or request a re-test after you ship fixes. Replies are published verbatim next to the score.
Claim this verdict
Scores are only meaningful when the rubric, date, evidence, and dissent are visible.
Read methodology