Repository navigation

jailbreaking

Website
Wikipedia

[CCS'24] A dataset consists of 15,140 ChatGPT prompts from Reddit, Discord, websites, and open-source datasets (including 1,405 jailbreak prompts).

ChatGPT jailbreak 大语言模型 prompt large-language-model llm-security jailbreaking

Jupyter Notebook

3075

280

4 个月前

cyberark / FuzzyAI

A powerful tool for automated LLM fuzzing. It is designed to help developers and security researchers identify and mitigate potential jailbreaks in their LLM APIs.

jailbreak jailbreaking 大语言模型 llms 人工智能安全 Fuzzing/Fuzz testing llm-evaluation llm-security

Jupyter Notebook

518

18 天前

epeth0mus / Fugu15

Open Source iOS 15 - iOS 15.6 Jailbreak Project

iOS ios15 jailbreak jailbreaking

247

3 年前

rubaljain / frida-jb-bypass

Frida script to bypass the iOS application Jailbreak Detection

frida jailbreak jailbreaking

JavaScript

6 年前

tml-epfl / llm-past-tense

Does Refusal Training in LLMs Generalize to the Past Tense? [ICLR 2025]

generalization jailbreaking llms robustness

Python

3 个月前

doronz88 / pylera1n

Python adaptation for pelara1n

iOS iphone jailbreak jailbreaking Python 命令行界面

Python

2 年前

LylaCoding / FriendGPT

An extensive prompt to make a friendly persona from a chatbot-like model like ChatGPT

人工智能 ChatGPT Hacking jailbreaking

2 年前

dobriban / Principles-of-AI-LLMs

Materials for the course Principles of AI: LLMs at UPenn (Stat 9911, Spring 2025). LLM architectures, training paradigms (pre- and post-training, alignment), test-time computation, reasoning, safety and robustness (jailbreaking, oversight, uncertainty), representations, interpretability (circuits), etc.

人工智能 alignment circuits 教学 fine-tuning hallucination inference interpretability jailbreaking llms rlhf robustness safety transformers

3 天前

Decimation / Cydia-GitHub-Template

Cydia repo

cydia apt jailbreaking Apple deb repo

HTML

6 年前

mehrankmlf / SecurityKit

Security Kit is a lightweight framework that helps to achieve a security layer

jailbreak obfuscation owasp 逆向工程安全 Swift Virtual Private Network cydia encryption-decryption jailbreaking

Swift

2 年前

Dylbin / dylbin.github.io

iOS APT distribution repository for rootful and rootless jailbreaks

iOS jailbreak cydia jailbreaking rootless

JavaScript

1 个月前

Aeneon / TDK

During the Development of Suave7 and it's Predecessors, we've created a lot of Icons and UI-Images and we would like to share them with you. The Theme Developer Kit contains nearly 5.600 Icons, more than 380 Photoshop-Templates and 100 Pixelmator-Documents. With this Package you can customize every App from the App Store …

photoshop icons Image ios-ui jailbreak jailbreaking theme theme-development

20 天前

guillermo-moran / Eclipse-Dark-Mode

Customizable Dark Mode Extension for iOS 13+

Dark Mode iOS jailbreak jailbreaking

Logos

4 年前

hekatos / tweaks

Source code for bypass tweaks hosted under https://github.com/hekatos/repo. Licensed under 0BSD except submodules

jailbreaking bypass

Logos

3 年前

AetherPrior / TrickLLM

This repository contains the code for the paper "Tricking LLMs into Disobedience: Formalizing, Analyzing, and Detecting Jailbreaks" by Abhinav Rao, Sachin Vashishta*, Atharva Naik*, Somak Aditya, and Monojit Choudhury, accepted at LREC-CoLING 2024

jailbreaking 大语言模型自然语言处理

Jupyter Notebook

1 年前

FuturraGroup / SecurityKit

SecurityKit is a lightweight, easy-to-use Swift library that helps protect iOS apps according to the OWASP MASVS standard, chapter v8, providing an advanced security and anti-tampering layer.

cydia encryption-decryption jailbreak jailbreaking obfuscation owasp 逆向工程安全 Swift Virtual Private Network

Swift

1 个月前

Tobias-B-Besemer / Howto_-_iPhones

LV-Crew.org_(LVC)_-_Howto_-_iPhones

jailbreak jailbreaking iphone howto how-to howtos howto-tutorial

8 年前

liuyaojialiuyaojia / Awesome-LLM-Security-Paper

Your best llm security paper library

data-extraction jailbreaking llm-security agent 大语言模型 prompt-injection

7 个月前

AmeliazOli / ChatGPT-Evil-Confidant-Mode

"ChatGPT Evil Confidant Mode" delves into a controversial and unethical use of AI, highlighting how specific prompts can generate harmful and malicious responses from ChatGPT.

aitools 聊天机器人 ChatGPT ChatGPT API chatgpt3 jailbreak jailbreaking openai prompt prompts

10 个月前

AmeliazOli / ChatGPT-Developer-Mode

ChatGPT Developer Mode is a jailbreak prompt introduced to perform additional modifications and customization of the OpenAI ChatGPT model.

aitool aitools Android chatbot-application ChatGPT ChatGPT API chatgpt-app chatgpt-bot chatgpt-plugin chatgpt-plugins chatgpt3 iOS jailbreak jailbreaking prompt prompts Web app

10 个月前