Tools & Code
142 tools for prompt management, evaluation, and development.
Showing 142 of 142 tools
Promptfoo
Open-source CLI for testing, evaluating, and red-teaming LLM prompts. YAML configs, CI/CD integration, adversarial testing. ~9K+ ⭐
Prompt Management and TestingPromptify
Solve NLP Problems with LLM's & Easily generate different NLP Task prompts for popular generative models like GPT, PaLM, and more with Promptify
Prompt Management and TestingAgenta
Open-source LLM developer platform for prompt management, evaluation, human feedback, and deployment.
Prompt Management and TestingPromptLayer
Version, test, and monitor every prompt and agent with robust evals, tracing, and regression sets.
Prompt Management and TestingHelicone
Production prompt monitoring and optimization platform.
Prompt Management and TestingLangGPT
Framework for structured and meta-prompt design. 10K+ ⭐
Prompt Management and TestingChainForge
Visual toolkit for building, testing, and comparing LLM prompt responses without code.
Prompt Management and TestingLMQL
A query language for LLMs making complex prompt logic programmable.
Prompt Management and TestingPromptotype
Platform for developing, testing, and managing structured LLM prompts.
Prompt Management and TestingPromptPanda
AI-powered prompt management system for streamlining prompt workflows.
Prompt Management and TestingPromptimize AI
Browser extension to automatically improve user prompts for any AI model.
Prompt Management and TestingPROMPTMETHEUS
Web-based "Prompt Engineering IDE" for iteratively creating and running prompts.
Prompt Management and TestingBetter Prompt
Test suite for LLM prompts before pushing to production.
Prompt Management and TestingOpenPrompt
Open-source framework for prompt-learning research.
Prompt Management and TestingPrompt Source
Toolkit for creating, sharing, and using natural language prompts.
Prompt Management and TestingPrompt Engine
NPM utility library for creating and maintaining prompts for LLMs (Microsoft).
Prompt Management and TestingPromptInject
Framework for quantitative analysis of LLM robustness to adversarial prompt attacks.
Prompt Management and TestingLynxPrompt
Self-hostable platform for managing AI IDE config files (.cursorrules, CLAUDE.md, copilot-instructions.md). Web UI, REST API, CLI, and federated blueprint marketplace for 30+ AI coding assistants.
Prompt Management and Testingflompt
Visual AI prompt builder that decomposes prompts into 12 semantic blocks (role, context, constraints, examples, etc.) and compiles them into optimized XML. Browser extension for ChatGPT/Claude/Gemini, and MCP server for Claude Code agents. Free, open-source.
Prompt Management and TestingDeepEval
Open-source evaluation framework covering RAG, agents, and conversations with CI/CD integration. ~7K+ ⭐
LLM Evaluation ToolsRagas
RAG evaluation with knowledge-graph-based test set generation and 30+ metrics. ~8K+ ⭐
LLM Evaluation ToolsLangSmith
LangChain's platform for debugging, testing, evaluating, and monitoring LLM applications.
LLM Evaluation ToolsLangfuse
Open-source LLM observability with tracing, prompt management, and human annotation. ~7K+ ⭐
LLM Evaluation ToolsBraintrust
End-to-end AI evaluation platform, SOC2 Type II certified.
LLM Evaluation ToolsArize AI / Phoenix
Real-time LLM monitoring with drift detection and tracing.
LLM Evaluation ToolsTruLens
Evaluating and explaining LLM apps; tracks hallucinations, relevance, groundedness.
LLM Evaluation ToolsInspectAI
Purpose-built for evaluating agents against benchmarks (UK AISI).
LLM Evaluation ToolsOpik
Evaluate, test, and ship LLM applications across dev and production lifecycles.
LLM Evaluation ToolsEvalView
CLI tool for testing multi-step AI agents with YAML test cases, regression detection, and production monitoring.
LLM Evaluation ToolsLangChain / LangGraph
Most widely adopted LLM app framework; LangGraph adds graph-based multi-step agent workflows. ~100K+ / ~10K+ ⭐
Agent FrameworksCrewAI
Role-playing AI agent orchestration with 700+ integrations. ~44K+ ⭐
Agent FrameworksAutoGen (AG2)
Microsoft's multi-agent conversational framework. ~40K+ ⭐
Agent FrameworksDSPy
Stanford's framework for programming LLMs with automatic prompt/weight optimization. ~22K+ ⭐
Agent FrameworksOpenAI Agents SDK
Official agent framework with function calling, guardrails, and handoffs. ~10K+ ⭐
Agent FrameworksSemantic Kernel
Microsoft's AI framework powering M365 Copilot; C#, Python, Java. ~24K+ ⭐
Agent FrameworksLlamaIndex
Data framework for RAG and agent capabilities. ~40K+ ⭐
Agent FrameworksHaystack
Open-source NLP framework with pipeline architecture for RAG and agents. ~20K+ ⭐
Agent FrameworksAgno (formerly Phidata)
Python agent framework with microsecond instantiation. ~20K+ ⭐
Agent FrameworksSmolagents
Hugging Face's minimalist code-centric agent framework (~1000 LOC). ~15K+ ⭐
Agent FrameworksPydantic AI
Type-safe agent framework using Pydantic for structured validation. ~8K+ ⭐
Agent FrameworksMastra
TypeScript AI agent framework with assistants, RAG, and observability. ~20K+ ⭐
Agent FrameworksGoogle ADK
Agent Development Kit deeply integrated with Gemini and Google Cloud.
Agent FrameworksStrands Agents (AWS)
Model-agnostic framework with deep AWS integrations.
Agent FrameworksLangflow
Node-based visual agent builder with drag-and-drop. ~50K+ ⭐
Agent Frameworksn8n
Workflow automation with AI agent capabilities and 400+ integrations. ~60K+ ⭐
Agent FrameworksDify
All-in-one backend for agentic workflows with tool-using agents and RAG.
Agent FrameworksPraisonAI
Multi-AI Agents framework with 100+ LLM support, MCP integration, and built-in memory.
Agent FrameworksNeurolink
Multi-provider AI agent framework unifying 12+ providers with workflow orchestration.
Agent FrameworksComposio
Connect 100+ tools to AI agents with zero setup.
Agent FrameworksDSPy
Multiple optimizers (MIPROv2, BootstrapFewShot, COPRO) for automatic prompt tuning. ~22K+ ⭐
Prompt Optimization ToolsTextGrad
Automatic differentiation via text (Stanford). ~2K+ ⭐
Prompt Optimization ToolsOPRO
Google DeepMind's optimization by prompting.
Prompt Optimization ToolsGarak (NVIDIA)
LLM vulnerability scanner for hallucination, injection, and jailbreaks — the "nmap for LLMs." ~3K+ ⭐
Red Teaming and Prompt SecurityPyRIT (Microsoft)
Python Risk Identification Tool for automated red-teaming. ~3K+ ⭐
Red Teaming and Prompt SecurityDeepTeam
40+ vulnerabilities, 10+ attack methods, OWASP Top 10 support.
Red Teaming and Prompt SecurityLLM Guard
Security toolkit for LLM I/O validation. ~2K+ ⭐
Red Teaming and Prompt SecurityNeMo Guardrails (NVIDIA)
Programmable guardrails for conversational systems. ~5K+ ⭐
Red Teaming and Prompt SecurityGuardrails AI
Define strict output formats (JSON schemas) to ensure system reliability.
Red Teaming and Prompt SecurityLakera
AI security platform for real-time prompt injection detection.
Red Teaming and Prompt SecurityPurple Llama (Meta)
Open-source LLM safety evaluation including CyberSecEval.
Red Teaming and Prompt SecurityGPTFuzz
Automated jailbreak template generation achieving >90% success rates.
Red Teaming and Prompt SecurityRebuff
Open-source tool for detection and prevention of prompt injection.
Red Teaming and Prompt SecurityAgentSeal
"Open-source scanner that runs 150 attack probes to test AI agents for prompt injection and extraction vulnerabilities."
Red Teaming and Prompt SecurityMCP Specification
The core protocol specification and SDKs. ~15K+ ⭐
MCP (Model Context Protocol)MCP Reference Servers
Official implementations: fetch, filesystem, GitHub, Slack, Postgres.
MCP (Model Context Protocol)FastMCP (Python)
High-level Pythonic framework for building MCP servers. ~5K+ ⭐
MCP (Model Context Protocol)GitHub MCP Server
GitHub's official MCP server for repo, issue, PR, and Actions interaction. ~15K+ ⭐
MCP (Model Context Protocol)Awesome MCP Servers
Curated list of 10,000+ community MCP servers. ~30K+ ⭐
MCP (Model Context Protocol)Context7
MCP server providing version-specific documentation to reduce code hallucination.
MCP (Model Context Protocol)GitMCP
Creates remote MCP servers for any GitHub repo by changing the domain.
MCP (Model Context Protocol)MCP Inspector
Visual testing tool for MCP server development.
MCP (Model Context Protocol)Claude Code
Anthropic's agentic coding CLI; understands full codebases and executes complex multi-step tasks via natural language.
Vibe Coding and AI Coding AssistantsOpenAI Codex CLI
Open-source terminal coding agent from OpenAI; lightweight, local-first, with sandboxed code execution. ~68K+ ⭐
Vibe Coding and AI Coding AssistantsGemini CLI
Google's open-source terminal AI agent with 1M-token context window and Google Search grounding. ~96K+ ⭐
Vibe Coding and AI Coding AssistantsQwen Code
Open-source terminal AI agent optimized for Qwen3-Coder; multi-protocol support (OpenAI/Anthropic/Gemini APIs), 1,000 free requests/day. ~21K+ ⭐
Vibe Coding and AI Coding AssistantsAider
AI pair programming in terminal with deep Git integration; maps entire codebases and auto-commits changes. ~42K+ ⭐
Vibe Coding and AI Coding AssistantsOpenCode
Powerful open-source AI coding agent with beautiful TUI; supports nearly all AI model providers. ~120K+ ⭐
Vibe Coding and AI Coding AssistantsGoose
Extensible open-source AI agent from Block (Square/Cash App); installs, executes, edits, and tests with any LLM. ~29K+ ⭐
Vibe Coding and AI Coding AssistantsCrush
Glamorous agentic coding agent from Charmbracelet with multi-model support, LSP integration, and beautiful terminal UI. ~9K+ ⭐
Vibe Coding and AI Coding AssistantsAmazon Q Developer CLI
Agentic chat experience in terminal from AWS; transitioning to Kiro CLI.
Vibe Coding and AI Coding AssistantsAmp
Sourcegraph's agentic coding tool (Cody successor); works across CLI and IDE.
Vibe Coding and AI Coding AssistantsJunie CLI
JetBrains' LLM-agnostic coding agent CLI (beta 2026); supports all major model providers.
Vibe Coding and AI Coding AssistantsAutohand Code CLI
Self-evolving autonomous terminal coding agent with multi-provider LLM support, 40+ tools, and modular skills system.
Vibe Coding and AI Coding AssistantsName
Description
Vibe Coding and AI Coding Assistants:-----
:-----------
Vibe Coding and AI Coding AssistantsCursor
Leading AI-native code editor (VS Code fork); Composer generates entire apps from natural language, agentic multi-file edits.
Vibe Coding and AI Coding AssistantsWindsurf
AI-powered IDE (VS Code fork) with proprietary Cascade agent and SWE-1.5 model; acquired by Cognition AI.
Vibe Coding and AI Coding AssistantsZed
High-performance editor in Rust with native AI features, Zeta edit prediction, and Agent Client Protocol support. ~77K+ ⭐
Vibe Coding and AI Coding AssistantsTrae
Free AI-powered IDE from ByteDance ("The Real AI Engineer") with Builder Mode; provides free access to Claude, GPT-4o, and DeepSeek.
Vibe Coding and AI Coding AssistantsGoogle Antigravity
Google's agent-first IDE (VS Code fork) with Manager view for orchestrating multiple agents in parallel; powered by Gemini.
Vibe Coding and AI Coding AssistantsKiro
AWS's spec-driven agentic AI IDE (VS Code fork); turns prompts into specs, then working code, docs, and tests.
Vibe Coding and AI Coding AssistantsPearAI
Open-source AI code editor (VS Code fork) with Continue-based chat and completions. ~40K+ ⭐
Vibe Coding and AI Coding AssistantsVoid
Open-source Cursor alternative (VS Code fork); any model or local hosting with change visualization. ~28K+ ⭐
Vibe Coding and AI Coding AssistantsMelty
Open-source chat-first AI code editor with multi-file editing and deep Git integration. ~7K+ ⭐
Vibe Coding and AI Coding AssistantsEmdash
Open-source agentic dev environment (YC W26) for running multiple coding agents in parallel in isolated Git worktrees.
Vibe Coding and AI Coding AssistantsName
Description
Vibe Coding and AI Coding Assistants:-----
:-----------
Vibe Coding and AI Coding AssistantsGitHub Copilot
Most widely adopted AI coding assistant; inline completions, chat, and agentic coding agent across VS Code, JetBrains, Neovim.
Vibe Coding and AI Coding AssistantsCline
Autonomous coding agent in VS Code with human-in-the-loop approvals; file editing, terminal commands, and browser use. ~59K+ ⭐
Vibe Coding and AI Coding AssistantsContinue
Open-source VS Code and JetBrains extension for creating custom, modular AI dev systems; any model. ~32K+ ⭐
Vibe Coding and AI Coding AssistantsCody
Sourcegraph-powered AI assistant that pulls context from local and remote codebases; VS Code, JetBrains, Visual Studio.
Vibe Coding and AI Coding AssistantsCodeium
Free AI coding extension for 40+ IDEs with completions, chat, and search across 70+ languages.
Vibe Coding and AI Coding AssistantsAmazon Q Developer
AWS's AI coding assistant with completions, inline chat, and agent mode; deep AWS integration.
Vibe Coding and AI Coding AssistantsGemini Code Assist
Google's IDE extension powered by Gemini with completions, Next Edit Predictions, and inline diffs; free for individuals.
Vibe Coding and AI Coding AssistantsTabnine
Privacy-focused AI assistant trained on permissive-licensed OSS; supports all major IDEs with on-premises deployment.
Vibe Coding and AI Coding AssistantsAugment Code
Enterprise AI coding assistant with 200K-token Context Engine for deep codebase understanding.
Vibe Coding and AI Coding AssistantsQodo
AI code review and quality platform with multi-agent architecture; test generation, code review, CI/CD enforcement.
Vibe Coding and AI Coding AssistantsCodeGeeX
Open-source multilingual code generation model supporting 20+ languages with VS Code and JetBrains extensions. ~11K+ ⭐
Vibe Coding and AI Coding AssistantsTabby
Self-hosted open-source AI coding assistant (Copilot alternative); runs entirely on your infrastructure. ~25K+ ⭐
Vibe Coding and AI Coding AssistantsName
Description
Vibe Coding and AI Coding Assistants:-----
:-----------
Vibe Coding and AI Coding AssistantsDevin
First fully autonomous cloud-based AI software engineer; plans, codes, tests, and opens PRs independently.
Vibe Coding and AI Coding AssistantsReplit Agent
Cloud-native AI agent that autonomously builds, tests, and deploys full-stack apps in-browser; 50+ languages.
Vibe Coding and AI Coding Assistantsbolt.new
AI-powered web dev agent; prompt, run, edit, and deploy full-stack apps directly in the browser via WebContainers. ~15K+ ⭐
Vibe Coding and AI Coding Assistantsbolt.diy
Community fork of bolt.new with extended features and broader LLM flexibility. ~12K+ ⭐
Vibe Coding and AI Coding AssistantsLovable
Full-stack apps from natural language with built-in Supabase, auth, and one-click deploy; fastest European startup to $20M ARR.
Vibe Coding and AI Coding Assistantsv0
Vercel's AI platform for generating high-quality React/Next.js UI components from natural language.
Vibe Coding and AI Coding AssistantsGitHub Copilot Workspace
Cloud-based coding environment with plan, brainstorm, and repair agents; included with paid Copilot plans.
Vibe Coding and AI Coding AssistantsFirebase Studio
Google's agentic cloud-based development environment.
Vibe Coding and AI Coding AssistantsName
Description
Vibe Coding and AI Coding Assistants:-----
:-----------
Vibe Coding and AI Coding AssistantsOpenHands
Leading open-source platform for cloud coding agents; consistently top on SWE-bench. Formerly OpenDevin. ~69K+ ⭐
Vibe Coding and AI Coding AssistantsSWE-agent
Takes a GitHub issue and automatically fixes it using a custom agent-computer interface. [NeurIPS 2024] ~19K+ ⭐
Vibe Coding and AI Coding AssistantsOpen SWE
LangChain's async cloud-hosted coding agent framework built on LangGraph with Slack/Linear integration. ~8K+ ⭐
Vibe Coding and AI Coding AssistantsDevika
Open-source agentic software engineer; breaks down instructions, researches, and writes code. Devin alternative. ~18K+ ⭐
Vibe Coding and AI Coding AssistantsAutoCodeRover
Autonomous program improvement combining LLMs with fault localization for GitHub issue resolution. ~2.8K+ ⭐
Vibe Coding and AI Coding AssistantsAgentless
Simple three-phase approach (localize → repair → validate) to solving software development problems. ~2K+ ⭐
Vibe Coding and AI Coding AssistantsDevon
Open-source pair programmer SWE agent with code writing, planning, and research; supports Claude, GPT-4, Llama, Ollama. ~3.5K+ ⭐
Vibe Coding and AI Coding AssistantsPrompt Engineering Guide (DAIR.AI)
The definitive open-source guide and resource hub. 3M+ learners. ~55K+ ⭐
Other Notable RepositoriesAwesome ChatGPT Prompts / Prompts.chat
World's largest open-source prompt library. 1000s of prompts for all major models.
Other Notable Repositories12-Factor Agents
Principles for building production-grade LLM-powered software. ~17K+ ⭐
Other Notable RepositoriesNirDiamant/Prompt_Engineering
22 hands-on Jupyter Notebook tutorials. ~3K+ ⭐
Other Notable RepositoriesContext Engineering Repository
First-principles handbook for moving beyond prompt engineering to context design.
Other Notable RepositoriesAI Agent System Prompts Library
Collection of system prompts from production AI coding agents (Claude Code, Gemini CLI, Cline, Aider, Roo Code).
Other Notable RepositoriesAwesome Vibe Coding
Curated list of 245+ tools and resources for building software through natural language prompts.
Other Notable RepositoriesOpenAI Cookbook
Official recipes for prompts, tools, RAG, and evaluations.
Other Notable RepositoriesEmbedchain
Framework to create ChatGPT-like bots over your dataset.
Other Notable RepositoriesThoughtSource
Framework for the science of machine thinking.
Other Notable RepositoriesPromptext
Extracts and formats code context for AI prompts with token counting.
Other Notable RepositoriesPrice Per Token
Compare LLM API pricing across 200+ models.
Other Notable RepositoriesOpenPaw
CLI tool (npx pawmode) that turns Claude Code into a personal assistant by generating system prompts (CLAUDE.md + SOUL.md) with personality, memory, and 38 skill routers.
Other Notable RepositoriesThink Better
Open-source CLI that permanently injects 10 structured decision frameworks (MECE, Issue Trees, Pre-Mortems) and 12 cognitive bias detectors into AI assistant prompts. Go, MIT.
Other Notable Repositories