Hardware · Jun 24

OpenAI and Broadcom unveil LLM-optimized inference chip

OpenAI and Broadcom have unveiled Jalapeño, a custom chip engineered for the compute-heavy work of running large language models. By shifting from general-purpose hardware to specialized silicon, OpenAI aims to improve the performance and scalability of its AI services. The companies expect the chip to cut inference costs by 50 percent, a strategic move to lower operating expenses and reduce OpenAI's reliance on external chip suppliers for its core infrastructure.

Covered by 11 sources
