Episode details

Anthropic is offering a $15,000 bounty to hackers who can hack their AI system. This opportunity is open to anyone, not just professional hackers. The concept of 'jailbreaking' AI models has been popular, where people try to get the models to say or do things they're not supposed to. Anthropic's bounty program is similar to what people have been doing for free, but now they can get paid for it. This move by Anthropic may be a way to signal that they take AI safety seriously and to avoid regulatory scrutiny.


Published on Aug 14, 2024 in Business
US English

Comments

Add new comment

Login to comment

Newsletter

Don't want more email? Follow @podhuntapp for the best podcasts.

Sponsor