Hardware · Jun 24

OpenAI and Broadcom unveil LLM-optimized inference chip

OpenAI and Broadcom have unveiled Jalapeño, a custom semiconductor specifically engineered to handle the compute-heavy requirements of running large language models. By shifting from general-purpose hardware to specialized silicon, OpenAI aims to improve the performance and scalability of its AI services. The companies expect this hardware to reduce inference costs by 50 percent, marking a strategic move to lower operational expenses and decrease reliance on external chip suppliers for their core infrastructure.

Covered by 11 sources

OpenAI Blog · Jun 24
Bloomberg Technology · Dina Bass · Jun 24
The Decoder · Maximilian Schreiner · Jun 24
Bloomberg Technology · Jun 24
Bloomberg Technology · Jun 24
TechCrunch AI · Theresa Loconsolo · 5d ago
TechCrunch AI · Theresa Loconsolo, Anthony Ha, Kirsten Korosec, Sean O'Kane · 5d ago
TechCrunch AI · Russell Brandom · Jun 24
Hacker News · jamdesk · Jun 24
The New York Times · 6d ago
Business Standard · Jun 24
