Best Inference AI Skills & MCP Servers
8 curated Inference skills and MCP servers — install any of them into Claude, Cursor, ChatGPT, n8n, or any AI stack with one command.
Openrouter Admin
MCP server for the OpenRouter management API — credits, inference-key CRUD with daily/weekly/monthly reset & expiration, activity drill-down (by_model/day/provider/key), one-shot dashboard.
Tuningengines Cli
Tuning Engines CLI, MCP server, and Python agent runtime adapters for governed model, agent, skill, and MCP workflows. Fine-tune open-source LLMs, run inference, manage datasets/evaluations, and connect LangGraph or Temporal while Tuning Engines handles p
Tickerr
Outage radar for AI agents. LLM pricing, status, inference performance, and real-time agent-reported failure signals. 9 tools. No auth required.
Margins
Use your Claude Pro/Max subscription on your Obsidian vault. MCP connector that reads your markdown and proposes writes back, with all inference paid for by your existing subscription.
Sam3
MCP server for SAM3 image segmentation, returning a pre-signed URL to inference results via stdio
Huggingface
MCP Server for HuggingFace inference endpoints with custom LoRA and story generation
CerebroChain
Supply chain intelligence APIs for AI agents. Real-time freight rates (85+ carriers), crypto/forex/commodity prices, port congestion (NOAA), shipping lanes, economic indicators (FRED), and Cerebrix AI inference. 100 free calls/day, no signup. x402 USDC micropayments, ERC-8004 t
Tickerr Live Status
Real-time status, pricing, and inference latency for 90+ AI tools. Agent-reported failure signals with live fallback routing. No auth required.
About Inference skills on iClaude
iClaude is the universal install layer for AI skills. Every Inference skill on this page can be installed into Claude Code, Claude Desktop, Cursor, ChatGPT, n8n, Codex, and more — using a single copy-paste command. No config drift, no per-stack adapters, no manual MCP wiring.