AI Infrastructure – Review of Google Cloud Next 2025 Product Launch
As AI models become more complex and resource-intensive, enterprises must modernize their infrastructure to support high-performance, scalable, cost-effective workloads. Core challenges in modernization include integrating multimodal data, enabling autonomous agents, and optimizing the AI stack across diverse environments. Enterprises also aim to deploy AI in hybrid and edge settings, which requires flexibility, low latency, and support for data sovereignty.
At Google Cloud Next 2025, Google announced AI infrastructure upgrades, including Ironwood TPUs for inference at scale, AI Hypercomputer improvements, expanded VM families, and networking re-architected around a multi-shard design. Google also emphasized hybrid and distributed AI deployment, with support for air-gapped environments and on-premises inference using NVIDIA Blackwell systems.
These updates show Google’s intent to deliver an integrated AI stack, combining custom hardware, orchestration tools, and productivity platforms. However, these offerings also raise questions around interoperability with third-party tools, operational complexity, and cost transparency.
In this report, we analyze Google’s AI infrastructure announcements at Google Cloud Next 2025 and assess their alignment with enterprise needs across performance, scalability, and deployment. The report covers key enterprise priorities, Google’s positioning, detailed product reviews, and Everest Group’s perspective on the strengths and gaps of the announced AI infrastructure products, offering a clear view of Google’s AI infrastructure maturity.