Microsoft researchers have developed On-Policy Context Distillation (OPCD), a training method that permanently embeds ...
The AI industry is witnessing a transformative trend: the use of distillation to make AI models smaller and cheaper. This shift, spearheaded by companies like DeepSeek and OpenAI, is reshaping the AI ...
Anthropic accused three Chinese artificial intelligence companies, DeepSeek, Moonshot, and MiniMax, of engaging in coordinated distillation campaigns, alleging they illicitly used Claude to extract some of the model's capabilities ...
The Chinese AI company DeepSeek released a chatbot earlier this year called R1, which drew a huge amount of attention. Most of it focused on the fact that a relatively small and unknown company said ... (The original version of this story appeared in Quanta Magazine.)
Anthropic said it is investing heavily in defences designed to make distillation attacks harder to execute and easier to identify.
LLMs tend to lose prior skills when fine-tuned for new tasks. A new self-distillation approach aims to reduce regression and simplify model management.
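None of the items above spell out what distillation actually computes. The core idea, common to all of these stories, is training a smaller "student" model to match a larger "teacher" model's output distribution rather than raw labels, typically by minimizing a KL divergence between temperature-softened softmax outputs. A minimal sketch of that loss (function names and the example logits are illustrative, not from any of the systems mentioned):

```python
import math

def softmax(logits, temperature=1.0):
    """Temperature-softened softmax over a list of logits."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)  # subtract max for numerical stability
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    """KL(teacher || student) over temperature-softened distributions.

    This is the classic knowledge-distillation objective: the student
    is penalized for diverging from the teacher's full distribution,
    not just its top prediction.
    """
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    return sum(pi * (math.log(pi) - math.log(qi)) for pi, qi in zip(p, q))

# A student that matches the teacher exactly incurs zero loss;
# a student that disagrees incurs a positive loss.
print(distillation_loss([2.0, 1.0, 0.1], [2.0, 1.0, 0.1]))
print(distillation_loss([2.0, 1.0, 0.1], [0.1, 1.0, 2.0]) > 0)
```

The self-distillation variant mentioned above differs mainly in where the teacher comes from (a frozen copy of the model itself rather than a separate larger model), but the matching objective has the same shape.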