Repository navigation

#

prompt-testing

Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration.

TypeScript
6232
17 小时前
msoedov/agentic_security
Python
1296
4 天前

LLM Reasoning and Generation Benchmark. Evaluate LLMs in complex scenarios systematically.

TypeScript
157
15 天前

The prompt engineering, prompt management, and prompt evaluation tool for TypeScript, JavaScript, and NodeJS.

TypeScript
6
7 个月前

LLM Prompt Test helps you test Large Language Models (LLMs) prompts to ensure they consistently meet your expectations.

TypeScript
5
1 年前

Sample project demonstrates how to use Promptfoo, a test framework for evaluating the output of generative AI models

1
7 个月前

A collection of prompts that I use on a day-to-day basis for work and leisure.

1
7 个月前

A dynamic and interactive playground for testing and refining prompts with OpenAI's language models. Includes customizable inputs for prompts, advanced model settings, and live response streaming for seamless experimentation.

HTML
0
3 个月前