What if an AI model could be replicated just by talking to it enough times? That is the concern at the center of a recent claim from Anthropic that is raising serious questions across the AI industry.
In this article, we explain how Anthropic says several Chinese AI firms used millions of interactions with its Claude model to replicate its capabilities. We will break down how this method works, why it matters, and what it means for the future of AI security.
A large-scale extraction effort
Anthropic reported that multiple AI companies allegedly created thousands of accounts and generated over 16 million queries against Claude. These were not casual interactions. The goal was to systematically collect responses and use them as training data.
Through performing this activity on a very large scale, the companies were able to accumulate a significant number of high-quality outputs for various tasks (e.g., coding, reasoning, etc.) and also for general-type tasks like problem identification and resolution. Eventually, this data can help improve an additional model that does not have direct contact with the first model.
Understanding model distillation
The process of learning from one model using another model's outputs has been referred to by some as 'distillation' (e.g., when you take a large, trained model and use it to train a smaller, less sophisticated model). It has also gained wide acceptance in the business community to improve efficiency and overall performance.
The issue at play here is not the practice of distillation; rather, it's how that practice has been abused. When Company A learns from the output of Company B's model without obtaining prior authorization or consent from Company B, it raises serious questions of fairness and intellectual property.
In simple terms, that act of taking Company B's output and leveraging it to improve Company A's performance can be likened to someone learning from a teacher without obtaining their consent, but on a much larger scale.
Why is this difficult to prevent
First, AI models are accessed via APIs, which are used to connect applications (e.g., mobile applications) to the internet and to AI models. Consequently, responses to API requests expose the model's abilities. An API capable of performing this function can be built through a third-party application.
Unlike traditional software, there is no requirement to gain access to internal source code to understand what an AI model is capable of doing; all it takes is enough requests to the AI model that provide enough information to identify patterns that may be present within the AI model's output.
This makes enforcement of anti-extraction rules and the prevention of extraction extremely difficult, if not impossible. Even though safeguards can be employed, a determined party can send requests to an AI model, spread requests across a large number of accounts, and avoid detection.
Bigger implications for AI security
This presents a new obstacle for the continued progress of artificial intelligence. The development of AI has led to a situation where AI models are becoming increasingly effective and useful; this is leading to an increasing number of opportunities for people to attempt to copy them through other means of doing so.
Another concern with model copying is that it can create significant safety risks. If someone replicates a model without the necessary safeguards in place, the copied version of the AI model may perform very differently from the original.
This issue highlights the growing competition in AI development, particularly in terms of increasing innovation and protection. The field of AI is becoming increasingly competitive.
Final thoughts
The ability to replicate an AI model through mere interaction has fundamentally altered our perspective on security. Protecting oneself while developing and deploying AI is no longer solely about performance; it now also involves safeguarding access to or the potential duplication of your AI models to defend against infringements on intellectual property.
Today's awareness of these risks will undoubtedly influence the development of more secure and responsible AI systems in the future.




