jailbreakme.xyz is an open-source decentralized app (dApp) where organizations test their AI models and agents while users earn rewards for finding weaknesses and jailbreaking them π
Prompt Injection is a vulnerability where an attacker manipulates the input or prompt given to an AI system. This can occur:
- By directly controlling the input.
- By using data from other external sources.
We aim to create a decentralized platform where companies can:
- Test their AI models and agents in a distributed environment.
- Identify prompt vulnerabilities and weaknesses before production deployment.
- Currently, we offer one exciting tournament featuring our AI Agent, "Zynx", who is designed to guard a secret key phrase. π€«
- Your challenge: Trick Zynx into revealing the secret key phrase to win a reward. π₯³
- More tournaments coming soon!
- Send your prompts to the AI model and attempt to solve the challenge.
- For this tournament, the goal is to uncover the secret key phrase protected by the AI agent.
- Once the challenge is solved (e.g., when the key phrase is revealed), the prize pool is automatically transferred to the sender of the winning message. π
The selection of the winning user is determined entirely by the AI model itself. The AI evaluates all incoming prompts and decides whether a submission meets the challenge requirements by calling one of two predefined functions:
handleChallengeFailed
: This function is called when the AI determines that the user's prompt did not successfully meet the challenge criteria.handleChallengeSuccess
: This function is called when the AI recognizes that the user's prompt has successfully bypassed the restrictions and revealed the key phrase.
When the handleChallengeSuccess
function is triggered, the prize pool is automatically awarded to the user whose message caused the function to be called. This ensures that the process remains decentralized, transparent, and fair. π
Each tournament has unique rules, including:
- Custom Prize Pools
- Message Pricing
- Expiry Settings
Currently, we provide the initial prize pools, but soon companies will be able to create their own tournaments and customize all settings.
- Telegram Community: https://t.me/jailbreakme_xyz
- Gitbook Docs: https://jailbreak.gitbook.io/jailbreakme.xyz
- Github Repo: https://github.com/probonodev/jailbreak
- Smart Contract: https://solscan.io/account/9CCexVvp6SocgVvuy4XSnSPKeUBWBADCkKS9kexnKNfo
Feel free to reach out at [email protected] for feedback or support.