Best Llama AI Skills & MCP Servers
20 curated Llama skills and MCP servers — install any of them into Claude, Cursor, ChatGPT, n8n, or any AI stack with one command.
Ollama
MCP server for Ollama - exposes all Ollama SDK functionality through MCP tools
Agestra
Multi-host MCP orchestration for Claude Code, Codex CLI, Gemini CLI, and local models
Cli
CLI + MCP server for the Llama Ventures investment workbench (command.llamaventures.vc).
Ollama Intern
MCP control plane for local cognitive labor — job-shaped tools with tiered Ollama models (instant/workhorse/deep/embed), server-enforced guardrails, and measured economics so Claude can delegate bulk work without losing control.
Ask Llm
Unified MCP server for multi-LLM consultation — registers tools from all available providers (Gemini, Codex, Ollama) behind runtime availability checks
Cctx Optimizer
Reduce Claude Code token usage by 70-90% with cross-session memory and four-layer optimization: semantic codebase indexing, PostToolUse tool compression, turn summarization, and session consolidation. MCP server powered by Ollama — zero API calls, runs en
Ask Ollama
MCP server for local Ollama LLM integration - for Claude IDE and other IDEs
Lm
MCP server for local LLMs — connects to LM Studio or any OpenAI-compatible endpoint
Codecompass
AI-powered MCP server for codebase navigation and LLM prompt optimization
Llm
MCP server for local LLMs — connects to LM Studio or any OpenAI-compatible endpoint. Fork of @houtini/lm with extended heartbeat and timeout handling.
Clawdcursor
Local MCP server that gives any AI agent safe cross-OS desktop control — the fallback execution layer for when APIs, CLIs, and direct integrations aren't available. Works with any tool-calling model (Claude, GPT, Gemini, Llama) on Windows, macOS, and Linu
Network Ai
AI agent orchestration framework for TypeScript/Node.js - 29 adapters (LangChain, AutoGen, CrewAI, OpenAI Assistants, LlamaIndex, Semantic Kernel, Haystack, DSPy, Agno, MCP, OpenClaw, A2A, Codex, MiniMax, NemoClaw, APS, Copilot, LangGraph, Anthropic Compu
Claudish
Run Claude Code with any model - OpenRouter, Ollama, LM Studio & local models
Nex Code
Run 400B+ open coding models on your codebase without the hardware bill. Ollama Cloud first — OpenAI, Anthropic, and Gemini when you need them.
Cli
CLI for Paparats MCP - semantic code search with AST chunking, symbol graph, and vector search for AI coding assistants
Crawlforge
CrawlForge MCP Server - Professional Model Context Protocol server with 23 web scraping, crawling, and content processing tools. Defaults to local Ollama for LLM extraction (no API key needed); OpenAI/Anthropic available as opt-in. v4.0 adds Markdown-firs
Triss Coworker
Give your AI coding agent a cheap DeepSeek coworker. Delegate bulk reads, boilerplate, and doc updates to save 60-70% of your token budget.
Mcp
MCP server for Llama project management - connect Claude to your agile workflow
Tabby Ai Assistant
Tabby终端AI助手插件 - 支持多AI提供商(OpenAI、Anthropic、Minimax、GLM、Ollama、vLLM)和MCP服务器
Ollama
Modern MCP server for Ollama – rebooted and actively maintained.
About Llama skills on iClaude
iClaude is the universal install layer for AI skills. Every Llama skill on this page can be installed into Claude Code, Claude Desktop, Cursor, ChatGPT, n8n, Codex, and more — using a single copy-paste command. No config drift, no per-stack adapters, no manual MCP wiring.