Open Source · 2d ago

Amazon engineers are reportedly distilling Anthropic models to cut costs before new token-based pricing kicks in

Amazon engineers are reportedly distilling larger Anthropic models into smaller, more efficient versions to reduce internal operating costs. The effort anticipates a change to Amazon's billing structure next year, when pricing is set to move from a compute-based model to a token-based one that could significantly increase costs. To keep budgets stable, the company is also evaluating alternative model providers such as OpenAI.

Covered by 1 source
