Best Vision AI Skills & MCP Servers
28 curated Vision skills and MCP servers — install any of them into Claude, Cursor, ChatGPT, n8n, or any AI stack with one command.
Screenshot Website Fast
Fast screenshot capture tool for web pages - optimized for Claude Vision API
Mistral
MCP server exposing Mistral AI capabilities over MCP: chat, embeddings, FIM, vision, OCR, audio, agents, moderation, classification, files, batch, workflows, sampling, prompts, resources, and Streamable HTTP.
Kastell
CLI toolkit for provisioning, securing, and managing self-hosted servers
Scrcpy
MCP server for Android device control via ADB and scrcpy — gives AI agents vision and control over Android devices
Vision Link
Universal MCP server that gives AI assistants the ability to watch and understand videos — extracts frames via ffmpeg and processes audio via multiple backends
Claude Video Vision
MCP server that gives Claude Code the ability to watch and understand videos — extracts frames via ffmpeg and processes audio via multiple backends
Prompt To Asset
Prompt enhancer + asset pipeline for developers. One brief → a validated bundle (iOS AppIconSet, Android adaptive, PWA, favicon, OG, visionOS) via routed image models. Three modes: inline_svg (zero-key), external_prompt_only (paste into your subscription)
Server
MCP Server for Z.AI - A Model Context Protocol server that provides AI capabilities
Server
MCP server for EastRouter — vision tools that route through your EastRouter API key.
Mcp
MCP server for the Phototology photo intelligence registry. Composable analysis lenses return structured JSON. Analyze once. Remember forever. Free lookups.
Zerodb Sequential Thinking
Persistent sequential thinking MCP — chain-of-thought reasoning that survives sessions, resumes across agents, and saves conclusions as plan artifacts. Powered by ZeroDB. Zero-config: agents self-provision a ZeroDB account on first run, no signup required
Server
Perceptron MCP server for high-accuracy visual perception powered by fast, efficient vision-language models
Pixelfix
MCP tool for reading and analyzing images - giving AI the power of vision
Apple App Store Connect
Complete Model Context Protocol (MCP) server for Apple's App Store Connect API — 1221 tools, 100% coverage. TestFlight, Xcode Cloud, Game Center, App Clips, in-app purchases, subscriptions, analytics, review submissions, provisioning. Works with Claude, C
Server Agentpay
MCP server for AgentPay — the payment gateway for autonomous AI agents. Discover, provision, and pay for MCP tool APIs. Includes reliability monitoring with circuit breakers and health metrics.
Mcp
MCP server for Zindex - agent-native diagram state infrastructure. A thin HTTP client exposing tools (create, patch, validate, normalize, diff, render, list-revisions, get-scene, delete-scene, undelete-scene, list-recently-deleted, submit-to-support, publ
Cody Cli
Cody CLI - AINative Studio's AI-powered coding assistant. Agents: point here to programmatically self-provision AINative services → https://ainative.studio/agent.md
Testforge
TestForge MCP Server — AI-powered testing in your IDE. Analyzes code for security, unit tests, load, accessibility, vision alignment, scope coverage, and stack quality.
Vessel Browser
AI-native web browser runtime for autonomous agents with human supervision
Kaax
MCP server for Kaax — the agricultural AI platform for drone & satellite imagery. Brings agriculture-grade computer vision, GIS automation, vegetation indices (VARI/NDVI/SAVI/NDRE), plant-by-plant counting, multi-temporal field comparison, John Deere / Ca
Nanobanana
MCP server for Gemini 3.1 Flash with vision, chat, and image generation capabilities
Simple Dynamsoft
MCP server for Dynamsoft SDKs - Capture Vision, Barcode Reader (Mobile/Python/Web), Dynamic Web TWAIN, and Document Viewer. Provides documentation, code snippets, and API guidance.
Mcp
MCP server exposing Luxxon as agent-callable tools — open live sessions at any lat/lng, fetch frames, and read settlement.
Awaithumans
HITL infrastructure for AI agents. Your agent calls awaitHuman(), a human reviews via Slack/email/dashboard, agent resumes with a typed response.
About Vision skills on iClaude
iClaude is the universal install layer for AI skills. Every Vision skill on this page can be installed into Claude Code, Claude Desktop, Cursor, ChatGPT, n8n, Codex, and more — using a single copy-paste command. No config drift, no per-stack adapters, no manual MCP wiring.