← Back to Home
Tech 5 min read

Anthropic Accuses Alibaba of Illicitly Harvesting Claude AI Capabilities

A high-stakes dispute over intellectual property in the AI sector reveals the cutthroat tactics shaping the future of large language models—and the fragile trust underpinning innovation.

a close up of a computer motherboard in the dark
Photo by Vishnu Mohanan on Unsplash

Anthropic, the San Francisco-based artificial intelligence startup, has leveled serious allegations against Chinese tech giant Alibaba, accusing it of systematically extracting proprietary capabilities from its flagship Claude AI model. The dispute, which surfaced in internal documents reviewed by Byte Brief, underscores the intensifying battle over intellectual property in the AI sector, where the line between competitive research and industrial espionage grows increasingly blurred. At stake is not merely the performance of cutting-edge language models but the viability of a business model built on safeguarding proprietary algorithms—a model that has propelled Anthropic into a valuation exceeding $18 billion and positioned it as a formidable rival to OpenAI. If proven, Alibaba’s alleged tactics could set a dangerous precedent, emboldening other firms to exploit vulnerabilities in AI systems under the guise of benign experimentation.

The allegations center on what Anthropic describes as a coordinated effort by Alibaba engineers to reverse-engineer Claude’s underlying architecture through a technique known as 'model extraction.' Unlike conventional software, where source code can be obfuscated or protected by legal barriers, large language models are uniquely vulnerable to extraction because their capabilities can be inferred through carefully crafted input-output pairs. By bombarding Claude’s public API with millions of strategically designed queries, Alibaba is said to have reconstructed enough of the model’s decision-making patterns to replicate its performance in certain domains. This method, while not illegal per se, exploits a gray area in intellectual property law, where the distinction between inspiration and theft hinges on the scale and intent of the activity. Anthropic’s legal team has reportedly compiled evidence suggesting that Alibaba’s queries were not merely exploratory but designed to systematically map Claude’s internal logic, a claim that could force courts to confront whether AI models deserve protections akin to trade secrets.

The timing of the accusations adds a layer of geopolitical tension to an already fraught dispute. Anthropic, which counts Amazon and Google among its investors, has positioned itself as a Western counterweight to China’s rapidly advancing AI sector, emphasizing safety and alignment with democratic values in its public messaging. Alibaba, meanwhile, has been vocal about its ambition to lead the global AI race, with its Qwen model series emerging as a direct competitor to Claude and OpenAI’s GPT-4. The allegations come as U.S. policymakers intensify scrutiny of Chinese tech firms, imposing export controls on advanced semiconductors and scrutinizing investments in AI startups. If Alibaba’s actions are deemed malicious, it could provide ammunition for hawks in Washington who argue that China’s tech sector operates under a different set of ethical and legal standards, justifying further restrictions on cross-border collaboration in AI research.

Industry observers note that model extraction is not a new phenomenon, but its prevalence has surged as the economic value of AI models has skyrocketed. Startups like Anthropic and OpenAI have staked their futures on the assumption that their models’ performance advantages will translate into sustainable revenue streams, whether through enterprise licensing or consumer-facing applications. However, the ease with which these models can be probed and replicated threatens to erode those advantages, creating a 'race to the bottom' where only the largest players—those with the resources to continuously innovate—can survive. The situation is further complicated by the lack of clear legal precedents. While copyright law protects the code underlying AI models, the outputs and capabilities derived from training data remain legally ambiguous. This uncertainty has led some firms to adopt aggressive defensive measures, such as rate-limiting API access or embedding watermarks in model outputs, though these tactics are often circumvented by determined actors.

Anthropic’s response to the alleged extraction has been multifaceted, blending legal posturing with technical countermeasures. Internally, the company has accelerated efforts to harden Claude’s defenses against extraction attacks, including deploying adversarial training techniques designed to detect and neutralize suspicious query patterns. Externally, it has signaled its willingness to pursue litigation, with sources indicating that in-house counsel has been in discussions with the U.S. Department of Justice’s Computer Crime and Intellectual Property Section. The case, if it materializes, could hinge on whether Alibaba’s actions meet the threshold for 'unauthorized access' under the Computer Fraud and Abuse Act, a statute traditionally applied to hacking but increasingly invoked in disputes over data scraping. Anthropic’s legal team is also exploring claims under trade secret law, arguing that Claude’s training methodology and architectural innovations constitute protectable assets, even if the model’s outputs are not copyrightable.

The broader implications of the dispute extend beyond the immediate legal battle, touching on fundamental questions about the nature of innovation in the AI era. Unlike traditional software, where progress is often incremental and traceable, AI models evolve in non-linear ways, with breakthroughs emerging from vast datasets and experimental architectures. This opacity makes it difficult to attribute improvements to any single entity, complicating efforts to enforce intellectual property rights. Critics of Anthropic’s position argue that the company is attempting to monopolize techniques that are, in practice, difficult to contain. They point to the open-source AI movement, which has demonstrated that high-performing models can be built collaboratively without the need for proprietary secrecy. However, proponents of Anthropic’s stance counter that without legal protections, firms will have little incentive to invest in the costly, risky research required to push the frontiers of AI, ultimately stifling progress.

As the AI sector grapples with these challenges, the Anthropic-Alibaba dispute serves as a cautionary tale about the fragility of trust in an industry where collaboration and competition are often indistinguishable. The case also highlights the inadequacy of existing legal frameworks, which were not designed to address the unique vulnerabilities of AI systems. Policymakers in the U.S. and Europe are beginning to recognize the urgency of the issue, with proposals ranging from new IP regimes for AI models to mandatory disclosure requirements for training data. Yet, the pace of regulation lags far behind the speed of technological advancement, leaving firms like Anthropic and Alibaba to navigate a landscape where the rules are written in real time. For now, the outcome of this dispute may hinge less on legal principles than on the balance of power between the two companies—and the willingness of courts to adapt centuries-old doctrines to the realities of the digital age.
K

Kenji Tanaka

Kenji Tanaka is Asia Technology Correspondent, focusing on technology developments across East and Southeast Asia. He covers robotics, manufacturing technology, and regional tech policy. Kenji studied Engineering at University of Tokyo and worked in the tech industry before journalism. His …