Repository navigation
prompt-testing
- Website
- Wikipedia
Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration.
Agentic LLM Vulnerability Scanner / AI red teaming kit 🧪
LLM Reasoning and Generation Benchmark. Evaluate LLMs in complex scenarios systematically.
The prompt engineering, prompt management, and prompt evaluation tool for TypeScript, JavaScript, and NodeJS.
LLM Prompt Test helps you test Large Language Models (LLMs) prompts to ensure they consistently meet your expectations.
Community Plugin for Genkit to use Promptfoo
Sample project demonstrates how to use Promptfoo, a test framework for evaluating the output of generative AI models
A collection of prompts that I use on a day-to-day basis for work and leisure.
Sample implementation demonstrating how to use Firebase Genkit with Promptfoo
A dynamic and interactive playground for testing and refining prompts with OpenAI's language models. Includes customizable inputs for prompts, advanced model settings, and live response streaming for seamless experimentation.