
IBM has unveiled the z17 mainframe, a system engineered to enhance artificial intelligence capabilities within enterprise environments. Central to this advancement is the IBM Telum II processor, featuring eight high-performance cores operating at 5.5GHz. This processor integrates an AI accelerator core and a coherently attached Data Processing Unit , collectively designed to deliver low-latency, high-throughput in-transaction AI inference, addressing complex transactional demands across industries such as finance, healthcare, and retail.
The z17’s architecture incorporates a 40% increase in cache size compared to its predecessor, with each core supported by a 36MB L2 cache, culminating in a total on-chip cache of 360MB. Additionally, the system offers a virtual level-4 cache of 2.88GB per processor drawer, enhancing data processing efficiency and responsiveness.
Complementing the Telum II processor is the IBM Spyre Accelerator, an external component designed to support large-scale AI workloads, including the deployment of large language models and generative AI applications. Each Spyre Accelerator card is equipped with up to 1TB of memory and 32 compute cores, supporting int8 and fp16 data types. These specifications enable the handling of complex AI models with improved performance and accuracy.
IBM’s infrastructure segment, responsible for 22% of the company’s $62.8 billion revenue in 2024, has experienced declines in recent years. Analysts anticipate that the introduction of the z17 will revitalize this segment. Evercore ISI analyst Amid Daryanani reaffirmed an “outperform” rating for IBM, citing the upcoming mainframe cycle as a contributing factor to growth following two consecutive years of declines.
The z17 aligns with IBM’s hybrid strategy, blending cloud and on-premise solutions to meet the evolving needs of enterprises. The system is scheduled for availability on June 18, 2025, marking a significant milestone in IBM’s efforts to integrate AI more deeply into enterprise computing infrastructure.