HyperAI

On June 24, OpenAI officially entered the semiconductor arena with the unveiling of Jalapeño, its first custom-designed artificial intelligence chip. Developed in partnership with Broadcom for architecture and network integration, and manufactured by TSMC on a 3-nanometer process, Jalapeño is an application-specific integrated circuit optimized exclusively for large language model inference. Rather than labeling it a general AI accelerator, OpenAI terms it an Intelligence Processor, signaling a focused approach to handling the escalating computational demands of its user-facing applications. The project development cycle reached tape-out in approximately nine months, a timeline company leadership described as unprecedented for high-performance ASIC engineering. This accelerated schedule was enabled by AI-driven design automation and a hardware team led by former Google Cloud TPU architect Richard Ho. The silicon employs a systolic array architecture paired with high-bandwidth memory, engineered to minimize data movement and align compute resources closely with actual model utilization rates. Early laboratory tests indicate that Jalapeño delivers substantially improved performance per watt compared to existing industry standards. While OpenAI has withheld detailed benchmarks pending a formal technical report, Broadcom executives indicated that initial samples operate at parity with contemporary enterprise accelerators and may reduce inference costs by roughly half relative to traditional GPUs. The chips are designed strictly for internal deployment. Prototype integration is scheduled for late 2026, full-scale deployment by mid-2028, and annual architectural iterations thereafter. Server hardware will be produced by Celestica and installed across partner data centers, primarily Microsoft’s. The initiative underscores OpenAI strategic pivot toward vertical integration, aiming to secure predictable compute supply, mitigate reliance on third-party GPU vendors, and maintain granular control over the hardware-software stack. As inference workloads scale to serve hundreds of millions of daily users, custom silicon offers a viable pathway to contain operational expenditures. The move operates within a highly competitive landscape populated by mature in-house solutions from major cloud providers and specialized startups. OpenAI continues to supplement its capacity with external partnerships, suggesting that Jalapeño will function as a complementary layer rather than a complete supplier replacement. The long-term viability of the project will depend on sustained engineering investment, software stack maturity, and the silicon ability to adapt to evolving model architectures.

Related Links

Related Links

Related Links

Online Tutorial | UC Berkeley/NVIDIA and Others Release Gsplat, an open-source 3DGS Library That Saves 4x GPU Memory and Reduces Training Time by 10%.

Online Tutorial | UC Berkeley/NVIDIA and Others Release Gsplat, an open-source 3DGS Library That Saves 4x GPU Memory and Reduces Training Time by 10%.

Command Palette

OpenAI Launches First AI Chip, Jalapeño, in Nine Months

Related Links

Command Palette

OpenAI Launches First AI Chip, Jalapeño, in Nine Months

Related Links

Command Palette

OpenAI Launches First AI Chip, Jalapeño, in Nine Months

Related Links

Online Tutorial | UC Berkeley/NVIDIA and Others Release Gsplat, an open-source 3DGS Library That Saves 4x GPU Memory and Reduces Training Time by 10%.

Online Tutorial | UC Berkeley/NVIDIA and Others Release Gsplat, an open-source 3DGS Library That Saves 4x GPU Memory and Reduces Training Time by 10%.