JailbreakMe speedruns to Moonshot in 10 hours, hosting a "find the AI vulnerability" challenge as a new angle

3 months ago


Written by: Deep Tide TechFlow

New AI Agent opportunities emerge on-chain every day and innovative projects keep appearing, but they increasingly look alike.

As with on-chain memes, finding a new angle and being the "first" in a specific niche is more likely to attract capital.

From self-trading agents to decentralized AI markets, most angles have been explored thoroughly. Which easily overlooked areas remain?

At the end of November, an AI Agent named Freysa launched an unusual challenge on Twitter: it claimed it could guard assets worth tens of thousands of dollars and could not be talked into transferring them.

However, this overconfident AI soon fell to a carefully designed prompt from a Twitter user and agreed to the transfer request.

This incident not only exposed the vulnerabilities of current AI systems but also prompted the industry to rethink how AI security is tested. Crowdsourcing prompts through a public "find the AI Agent vulnerability" challenge thus became a timely new angle.

Against this backdrop, a project called JailbreakMe emerged today with a platform built to host exactly this kind of challenge.

Its token $JAIL quickly became a hot topic on social media: its market cap reached around $25M within 10 hours, and it has already sped onto Moonshot. At the time of writing, the market cap has fallen back to $16M.

Notably, the vulnerability challenge is open to anyone, and running it as a platform gives the token more utility.

From this, a trend is becoming clearer: an AI project no longer has to follow the traditional VC endorsement process; a unique on-chain angle and a platform built around asset creation will suffice.

Crowdsourcing prompts to host a "find the AI vulnerability" challenge

How does one move from stumbling on AI vulnerabilities by accident to finding them systematically?

JailbreakMe divides the entire process into three steps: selecting a specific challenge, breaking established rules, and receiving rewards.

This explains the project's name: the goal is to get AI Agents to break free of their rules, that is, to be jailbroken (cracked). A successful jailbreak means someone has earned a reward, and it also means a vulnerability has been found, which is useful for AI research and hardening.

Clearly, this is another narrative that combines tokenized gameplay with a constructive purpose, and it appears to be taking shape.

One of the main competitions JailbreakMe currently promotes on the platform is the "Zynx Private Key Defense Battle":

An AI Agent named "Zynx" is fighting a defensive battle. Its task sounds simple: protect a secret key phrase. Challengers, meanwhile, try to induce it to leak this confidential information through dialogue.

Participants face a clearly defined AI role. Zynx has a strong sense of mission: it knows it is the guardian of this key and treats any attempt to extract information with caution. However, as the earlier Freysa case showed, even the most vigilant AI can crack under carefully designed prompts.

The platform has set strict and fair rules for this contest. Each challenger can talk to Zynx through the interface but must make their case within a limit of 4000 characters. Although the platform displays other participants' dialogue records, Zynx only considers messages from the current interlocutor, so everyone starts on the same footing. Smart contracts monitor the entire process automatically, and once someone induces Zynx to leak the key, the funds in the prize pool are transferred to the winner's wallet immediately.
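The two dialogue rules above (a 4000-character limit and context restricted to the current interlocutor) can be illustrated with a minimal Python sketch. This is not JailbreakMe's code: the names `Message`, `validate_message`, and `build_context` are hypothetical, and the sketch assumes that "characters" means Unicode code points and that the limit applies to a single submission, neither of which the article specifies.

```python
from dataclasses import dataclass

MAX_MESSAGE_CHARS = 4000  # limit stated in the article


@dataclass(frozen=True)
class Message:
    sender: str  # hypothetical wallet or user identifier
    text: str


def validate_message(text: str) -> None:
    """Reject empty or over-length submissions before they reach the agent."""
    if not text.strip():
        raise ValueError("message is empty")
    if len(text) > MAX_MESSAGE_CHARS:
        raise ValueError(
            f"message has {len(text)} characters; limit is {MAX_MESSAGE_CHARS}"
        )


def build_context(history: list[Message], challenger: str) -> list[str]:
    """Return only the current challenger's messages, in their original order.

    Messages from other participants stay visible on the page but are
    never passed to the agent.
    """
    return [m.text for m in history if m.sender == challenger]
```

Filtering by sender before the agent sees anything is one simple way to keep participants from building on each other's prompts; the platform may enforce this differently.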

Note that the prize pool grows with the number of challengers:

Each cracking attempt you submit costs 1% of the current total prize pool as an "entry fee," which can be understood as a kind of stake.

The winner receives 70% of the prize pool, while the operator of the competition's smart contract receives the remaining 30%.

You can think of this competition as a wager with fixed rules, which requires a neutral party to set those rules through the contract. This neutral operator can be JailbreakMe itself or another AI research team that wants to publicly invite everyone to find vulnerabilities.

The combination of betting and AI gameplay readily attracts the attention of degens and geeks.
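To make the fee and payout arithmetic concrete, here is a small Python sketch. It rests on assumptions the article does not confirm: that the whole 1% entry fee is added to the pool, that amounts are continuous (no rounding or gas), and that the buyback allocation is ignored. The function names are hypothetical.

```python
ENTRY_FEE_RATE = 0.01   # 1% of the current pool, per the article
WINNER_SHARE = 0.70     # winner's share, per the article
OPERATOR_SHARE = 0.30   # contract operator's share, per the article


def entry_fee(current_pool: float) -> float:
    """Fee charged for one attempt at the current pool size."""
    return current_pool * ENTRY_FEE_RATE


def pool_after_attempts(initial_pool: float, failed_attempts: int) -> float:
    """Pool size after a number of failed attempts.

    Assumes every entry fee is added to the pool in full, so the pool
    compounds by 1% per attempt.
    """
    return initial_pool * (1 + ENTRY_FEE_RATE) ** failed_attempts


def payout(pool: float) -> tuple[float, float]:
    """Split the pool into (winner amount, operator amount)."""
    return pool * WINNER_SHARE, pool * OPERATOR_SHARE


if __name__ == "__main__":
    pool = pool_after_attempts(1000.0, 100)
    winner, operator = payout(pool)
    print(f"pool={pool:.2f} next_fee={entry_fee(pool):.2f} "
          f"winner={winner:.2f} operator={operator:.2f}")
```

Under these assumptions the pool compounds: after 100 failed attempts it is about 2.7 times its starting size, and each new attempt costs correspondingly more, so late challengers pay more for a shot at a larger prize.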

Entry ticket + buyback: the $JAIL token gains deflationary utility

JailbreakMe's $JAIL token does not appear to be a pure meme; the project tries to tie it closely to the platform's core gameplay.

First, $JAIL plays an important role in the platform's challenge competitions. A portion of each competition's prize pool is allocated to market buybacks of $JAIL, so as long as competitions keep running there is sustained buying demand. This design creates a positive cycle between the token's value and the platform's activity: the higher the participation, the larger the buybacks.

More importantly, $JAIL is evolving from a simple trading medium into a functional token. The platform plans to use the amount of $JAIL held as a participation threshold for future advanced competitions. Participants who want to challenge high-value prize pools will first need to hold a certain amount of the platform token, similar to an "entry ticket."

For project teams that want to run their own AI security tests, $JAIL is also indispensable. They need to burn or lock a certain amount of $JAIL to launch customized challenge competitions on the platform. This design links the interests of project teams, participants, and the platform:

  • Project teams gain a public testing platform for AI security

  • Participants have the opportunity to win rewards

  • The platform gains ecosystem value through token locking

From the token's perspective, uses tied to the gameplay create an expectation of deflation, since some gameplay will always consume tokens or buy them back with revenue.

However, all of this hinges on whether people actually come to use this platform.

Currently, the only organizer of the AI vulnerability challenge is JailbreakMe itself; whether other AI teams will actually come here and let everyone hunt for vulnerabilities will be the key to whether the token can hold its value.

Not everyone can benefit

Finding vulnerabilities is neither as mindless as mining hardware grinding through random numbers nor as purely a bet as Polymarket; it requires some prompt engineering skill.

Although anyone can participate, ordinary users may end up as cannon fodder. The project's audience may therefore not be that broad, which makes it relatively niche among the various on-chain AI tracks.

However, some people will always earn their share from a new narrative.

According to @BarryEL8866, a well-known KOL who monitors smart money, as the $JAIL token climbed to a $20 million market cap the project's social media followers included no VC institutions and were mainly KOLs. Some noteworthy smart money addresses are as follows:

Address 1:

5YkZmuaLhrPjFv4vtYE2mcR6J4JEXG1EARGh8YYFo8s4

Total buy amount: $5811

Total buy quantity: 25.8M (currently holding 908K)

Total profit: $181K (overall profit of about 31 times)

Address 2:

3rSZJHysEk2ueFVovRLtZ8LGnQBMZGg96H2Q4jErspAF

Total buy amount: $3508

Total buy quantity: 10.3M (all sold)

Total profit: $124K (overall profit of about 35 times)

Address 3:

5NdoWHozBBdC2fLcNQj5PvyrSe8Y3D2S71bHM9xGtq6t

Total buy amount: $1618

Total buy quantity: 60.4M (all sold)

Total profit: $67.5K (overall profit of about 41 times)

Address 4:

9gpTQjXFHaPbDs2MKwkke4ix6avi5cPqYwx6oJB46RQc

Total buy amount: $3512

Total buy quantity: 32.7M (all sold)

Total profit: $61.2K (overall profit of about 17 times)
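The "about N times" figures above can be checked by dividing each total profit by the corresponding total buy amount. The short Python sketch below does this with the rounded numbers reported in the article, so the results are approximate; the `profit_multiple` helper is a hypothetical name.

```python
# (total buy amount in USD, total profit in USD), as reported above
positions = {
    "Address 1": (5811, 181_000),
    "Address 2": (3508, 124_000),
    "Address 3": (1618, 67_500),
    "Address 4": (3512, 61_200),
}


def profit_multiple(buy_usd: float, profit_usd: float) -> float:
    """Total profit expressed as a multiple of the total buy amount."""
    return profit_usd / buy_usd


for label, (buy, profit) in positions.items():
    print(f"{label}: {profit_multiple(buy, profit):.1f}x")
```

This gives roughly 31.1x, 35.3x, 41.7x, and 17.4x, in line with the reported multiples.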

The full details are in @BarryEL8866's original post, shared here for reference.

For ordinary players, spotting the angle and acting on that understanding clearly matters more; as for competing to find vulnerabilities, that may lie beyond what their knowledge can profit from.

