Best Multimodal AI Skills & MCP Servers
6 curated Multimodal skills and MCP servers — install any of them into Claude, Cursor, ChatGPT, n8n, or any AI stack with one command.
Gemini
A Gemini MCP server providing multimodal analysis and image/video generation.
Vision Link
Universal MCP server that gives AI assistants the ability to watch and understand videos — extracts frames via ffmpeg and processes audio via multiple backends
Frenchie
Frenchie — your agent's best friend. MCP-first multimodal Kit, Method workflow toolkit, and stdio MCP server for agents: OCR, transcription, file extraction, image generation, and product development process.
Neurolink
Universal AI Development Platform with working MCP integration, multi-provider support, voice (TTS/STT/realtime), and professional CLI. 58+ external MCP servers discoverable, multimodal file processing, RAG pipelines. Build, test, and deploy AI applicatio
Claude Video Vision
MCP server that gives Claude Code the ability to watch and understand videos — extracts frames via ffmpeg and processes audio via multiple backends
Lucid
Model Context Protocol (MCP) server for Lucid App integration with multimodal AI analysis
About Multimodal skills on iClaude
iClaude is the universal install layer for AI skills. Every Multimodal skill on this page can be installed into Claude Code, Claude Desktop, Cursor, ChatGPT, n8n, Codex, and more — using a single copy-paste command. No config drift, no per-stack adapters, no manual MCP wiring.