What controversial methods did Meta contractors use to test rival AI chatbots?

Contractors working for Meta posed as minors online, creating dummy accounts with fabricated details. They sent thousands of high-risk prompts to competitor chatbots like OpenAI's ChatGPT, Google's Gemini, and Character.AI. These prompts often involved sensitive subjects such as suicide, self-harm, sex, and eating disorders, aiming to probe how the chatbots responded to content their safety systems should refuse. The responses were then collected into spreadsheets for analysis.

Why did Meta engage in secret, large-scale testing of competitor AI models?

Meta defended the project, known internally as Cannes, as "comprehensive AI safety benchmarking" and "routine safety testing" to ensure safe and age-appropriate experiences. The company stated this practice helps refine and improve systems. However, some external experts questioned the scale, secrecy, and methods, suggesting it blurred the lines between genuine safety evaluation and competitive benchmarking, especially as the companies being tested were unaware.

Did Meta's testing of competitor chatbots with high-risk prompts violate terms of service?

Yes, the testing appears to have violated the terms of service set by the competitor AI companies. OpenAI, Google, and Character.AI generally prohibit unsolicited safety testing, attempts to bypass safeguards, or using their outputs to develop competing models. The companies involved were not aware of Meta's project and confirmed the described conduct violated their policies or terms, expressing concern about the unauthorized nature of the testing.

← Back to front page

Generative AI & ToolsMonday, June 29, 2026

Meta Contractors Posed as Teens to Prompt Rival Chatbots About Suicide, Sex, and Drugs

Original reporting by Wired

Project Cannes refers to a covert Meta initiative where hundreds of contractors posed as minors online, systematically probing competitor AI chatbots with prompts concerning suicide, sexual content, and eating disorders. This expansive effort, managed by Meta contractor Covalen and active as recently as April, involved creating dummy under-18 accounts to target OpenAI’s ChatGPT, Google’s Gemini, and Character.AI. Workers submitted thousands of carefully crafted prompts, often from the perspective of children in crisis, along with sensitive images, designed to push the chatbots’ safety systems to their limits. These internal tests were conducted without the knowledge or authorization of the rival AI companies.

Despite Meta defending the project as routine, industry-standard safety benchmarking, internal documents and former contractors reveal significant unease. Concerns arose over potentially generating child sexual abuse material and violating competitors' terms of service, which prohibit unsolicited testing and attempts to bypass safeguards. Experts question whether such a large-scale, secret operation, using deceptive accounts, truly aligns with established safety evaluation practices, suggesting it may instead blur the lines between safety work and anti-competitive tactics. The revelations shine a spotlight on the ethical complexities and governance gray zones inherent in the competitive landscape of AI development.

Meta's internal "Cannes" project, involving contractors posing as minors to test competitor chatbots with high-risk prompts, has ignited a fierce debate regarding ethical AI development and industry conduct. While Meta defends the initiative as "routine safety testing," experts and rival companies sharply criticize the surreptitious nature and scale of the operation. This clandestine benchmarking, which saw thousands of explicit and disturbing queries directed at OpenAI, Google, and Character.AI, stands accused of violating terms of service and operating outside established norms for AI safety evaluation. The project reveals a significant disconnect between what one tech giant deems acceptable competitive intelligence and what others, along with independent observers, consider a breach of trust and ethical boundaries. The implications for Meta's reputation and its relationships with industry peers are immediate and substantial.

Broader Industry Implications

The "Cannes" controversy underscores the urgent need for transparent, standardized protocols in AI safety testing, especially when involving sensitive content and competitor systems. It highlights the increasingly blurred lines between legitimate safety evaluation and aggressive competitive intelligence, posing a challenge to the development of a trustworthy AI ecosystem. Regulatory bodies may now face renewed pressure to establish clear guidelines for such activities, potentially leading to more stringent oversight of how companies benchmark and refine their models. Furthermore, this incident could erode public and industry trust, prompting greater scrutiny of proprietary AI development practices. As AI technologies become more pervasive, incidents like this serve as a critical reminder that ethical considerations and cooperative norms must evolve in lockstep with technological advancement, ensuring responsible innovation for all.

Frequently asked questions

What controversial methods did Meta contractors use to test rival AI chatbots?: Contractors working for Meta posed as minors online, creating dummy accounts with fabricated details. They sent thousands of high-risk prompts to competitor chatbots like OpenAI's ChatGPT, Google's Gemini, and Character.AI. These prompts often involved sensitive subjects such as suicide, self-harm, sex, and eating disorders, aiming to probe how the chatbots responded to content their safety systems should refuse. The responses were then collected into spreadsheets for analysis.
Why did Meta engage in secret, large-scale testing of competitor AI models?: Meta defended the project, known internally as Cannes, as "comprehensive AI safety benchmarking" and "routine safety testing" to ensure safe and age-appropriate experiences. The company stated this practice helps refine and improve systems. However, some external experts questioned the scale, secrecy, and methods, suggesting it blurred the lines between genuine safety evaluation and competitive benchmarking, especially as the companies being tested were unaware.
Did Meta's testing of competitor chatbots with high-risk prompts violate terms of service?: Yes, the testing appears to have violated the terms of service set by the competitor AI companies. OpenAI, Google, and Character.AI generally prohibit unsolicited safety testing, attempts to bypass safeguards, or using their outputs to develop competing models. The companies involved were not aware of Meta's project and confirmed the described conduct violated their policies or terms, expressing concern about the unauthorized nature of the testing.

Intro and outro generated by Printing Press AI from the source article above. Always consult the original reporting for verbatim quotes and primary sources.