Repository navigation

#

prompt-testing

Test your prompts, agents, and RAGs. AI Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration.

TypeScript
8047
3 小时前
msoedov/agentic_security
Python
1617
5 天前

LLM Reasoning and Generation Benchmark. Evaluate LLMs in complex scenarios systematically.

TypeScript
163
3 个月前

Test, compare, and optimize your AI prompts in minutes

JavaScript
7
6 天前

The prompt engineering, prompt management, and prompt evaluation tool for TypeScript, JavaScript, and NodeJS.

TypeScript
6
1 年前

LLM Prompt Test helps you test Large Language Models (LLMs) prompts to ensure they consistently meet your expectations.

TypeScript
5
1 年前

Sample project demonstrates how to use Promptfoo, a test framework for evaluating the output of generative AI models

1
1 年前

A collection of prompts that I use on a day-to-day basis for work and leisure.

1
1 年前

A dynamic and interactive playground for testing and refining prompts with OpenAI's language models. Includes customizable inputs for prompts, advanced model settings, and live response streaming for seamless experimentation.

HTML
0
7 个月前

Quickstart guide for using PromptFoo to evaluate LLM prompts via CLI or Colab.

0
18 天前

A pytest-based framework for testing multi AI agents (mAIa) system. It provides a flexible and extensible platform for creating and running complex multi-agent simulations and capturing the results.

Python
0
13 天前

🐙 Team Agents unifica 82 especialistas en IA para resolver desafíos con chat inteligente, analista de requisitos y subida de documentos. Plataforma futurista y modular.

Python
0
6 天前