OpenAI and Broadcom Unveil Jalapeño: OpenAI's First Custom AI Inference Chip

OpenAI


On June 24, OpenAI and Broadcom jointly announced Jalapeño, OpenAI's first custom ASIC, designed exclusively for LLM inference. The two companies took the chip from initial design to tape-out in nine months, with AI models accelerating parts of the design work. OpenAI claims roughly 50% better cost per token than current-generation GPUs. Prototype deployments are targeted for the end of 2026, with production ramping in 2027–2028. The chip will not be sold to external customers.

Why it matters

Jalapeño is OpenAI's first step toward vertical hardware integration, aimed at reducing its dependence on Nvidia and cutting the per-token cost of serving ChatGPT and API products at scale. The nine-month design cycle, itself enabled in part by AI, signals an acceleration in the hardware development loop. The chip places OpenAI alongside Google (TPUs), Amazon (Trainium), and Microsoft (Maia) in the custom-silicon club.

Importance: 4/5

OpenAI's first custom silicon: a major strategic move with direct implications for inference cost and the supply chain.

Sources