Best Eval AI Skills & MCP Servers

59 curated Eval skills and MCP servers — install any of them into Claude, Cursor, ChatGPT, n8n, or any AI stack with one command.

Superlocalmemory

MCP Registry

Information-geometric agent memory with mathematical guarantees. 4-channel retrieval, Fisher-Rao similarity, zero-LLM mode, EU AI Act compliant. Works with Claude, Cursor, Windsurf, and 17+ AI tools.

MCP Registry · ★ 5.0free

Mnemo

MCP Registry

Structured fact memory MCP server — SQLite + FTS5, trust scoring, entity graph, bilingual retrieval for Claude Code & Codex

MCP Registry · ★ 5.0free

Judges

MCP Registry

45 specialized judges that evaluate AI-generated code for security, cost, and quality.

MCP Registry · ★ 5.0free

Clawmem

MCP Registry

On-device memory layer for AI agents. Claude Code, OpenClaw, and Hermes. Hooks + MCP server + hybrid RAG search.

MCP Registry · ★ 5.0free

Prism

MCP Registry

Prism Coder — Cognitive memory + tool-calling intelligence for AI agents. Mind Palace persistent memory (BFCL Gold Certified, 100% Tool-Call Accuracy, 54 Agent Skills, Zero-Search HDC/HRR retrieval, HIPAA-hardened local-first storage, SLERP-optimized GRPO

MCP Registry · ★ 5.0free

Tuningengines Cli

MCP Registry

Tuning Engines CLI, MCP server, and Python agent runtime adapters for governed model, agent, skill, and MCP workflows. Fine-tune open-source LLMs, run inference, manage datasets/evaluations, and connect LangGraph or Temporal while Tuning Engines handles p

MCP Registry · ★ 5.0free

Server

MCP Registry

The agent eval standard for MCP. Score every agent output for quality, safety, and cost.

MCP Registry · ★ 5.0free

Cogmemai

MCP Registry

CogmemAi — Autonomous Cognitive Memory for Any Ai System. 95.10% on LongMemEval (top published score on the field's hardest long-term memory benchmark) and 91% on LoCoMo (above human performance). Autonomous memory capture: your Ai's work is saved even wh

MCP Registry · ★ 5.0free

Md Feedback

MCP Registry

MCP server for markdown plan review — companion to the MD Feedback VS Code extension. AI agents read annotations, mark tasks done, evaluate quality gates, and generate session handoffs. 27 tools for Claude Code, Cursor, and other MCP-compatible clients.

MCP Registry · ★ 5.0free

Skar

MCP Registry

Skar turns a captured AI agent trace into a committed pytest regression test. MCP server + CLI. Use when a tool-using agent run fails and you want to lock the failure as an executable test.

MCP Registry · ★ 5.0free

Calculator

MCP Registry

Evaluate, simplify, and differentiate mathematical expressions via MCP. STDIO or Streamable HTTP.

MCP Registry · ★ 5.0free

Formulon

MCP Registry

MCP server for Formulon Excel-compatible formula and workbook evaluation

MCP Registry · ★ 5.0free

Ori Memory

MCP Registry

Cognitive architecture for persistent AI agent memory. Knowledge graph with learning retrieval, ACT-R decay, and spreading activation. Markdown-native, local-first, zero cloud. MCP server + CLI.

MCP Registry · ★ 5.0free

Paper Search Agent

MCP Registry

MCP server for paper-search-agent: academic paper discovery, access planning, and full-text retrieval via campus network

MCP Registry · ★ 5.0free

Memory Lancedb

MCP Registry

MCP server for LanceDB-backed long-term memory with hybrid retrieval (Vector + BM25), cross-encoder rerank, multi-scope isolation, and memory lifecycle management

MCP Registry · ★ 5.0free

Mcplab

MCP Registry

MCP server that exposes MCPLab evaluation tools — query runs, results, and traces via the Model Context Protocol

MCP Registry · ★ 5.0free

Pdf Reader

MCP Registry

MCP server for efficient PDF text extraction, search, and metadata retrieval for Claude Code

MCP Registry · ★ 5.0free

Mcp

MCP Registry

Model Context Protocol server for digitalcalculator.info financial calculators. v0.3.0 ships 9 calculator tools (mortgage monthlyPayment, compound-interest futureValue, retirement401k projection, Social Security estimatedBenefit, paycheck netPay, IRA cont

MCP Registry · ★ 5.0free

Adaptive Recall

MCP Registry

Adaptive memory system for AI applications. Multi-strategy retrieval, cognitive scoring, knowledge graph, and self-improving ML. Connects via MCP or REST API.

MCP Registry · ★ 5.0free

Node Webrtc

MCP Registry

MCP server for @agentdance/node-webrtc — lets AI agents discover, evaluate, and get started with the pure-TypeScript WebRTC stack

MCP Registry · ★ 5.0free

Merch Connector

MCP Registry

MCP server that gives merchandising agents eyes on any storefront — scrape, audit, compare, roundtable analysis, and eval tracking via 11 tools.

MCP Registry · ★ 5.0free

Enquire

MCP Registry

MCP server giving AI agents (Claude Code, Claude Desktop, Cursor, ChatGPT, Codex, OpenClaw) persistent long-term memory backed by your local Obsidian markdown vault. Hybrid retrieval (BM25 + ML embeddings + BGE reranker, RRF-fused), HNSW + int8 quantizati

MCP Registry · ★ 5.0free

Lightrag

MCP Registry

Model Context Protocol (MCP) server for LightRAG - 30 fully working tools with complete RAG and Knowledge Graph integration

MCP Registry · ★ 5.0free

Agentdb

MCP Registry

Self-learning vector memory for AI agents — single-file .rvf cognitive container with HNSW search, episodic Reflexion memory, causal graph + Cypher, 9 RL algorithms, Thompson Sampling bandit, 41 MCP tools, hybrid (BM25 + dense) retrieval, GNN attention. 1

MCP Registry · ★ 5.0free

About Eval skills on iClaude

iClaude is the universal install layer for AI skills. Every Eval skill on this page can be installed into Claude Code, Claude Desktop, Cursor, ChatGPT, n8n, Codex, and more — using a single copy-paste command. No config drift, no per-stack adapters, no manual MCP wiring.

← browse the full catalog