Data analysts have spent decades trapped in a cycle of writing exhaustive SQL queries and scouring dozens of tables to extract a single insight. This repetitive, manual process of data retrieval has long been the bottleneck of corporate intelligence. However, the emergence of generative AI is fundamentally altering this workflow, allowing users to query databases using natural language and receive automated summary reports. To power this transition at scale, Snowflake has entered into a massive five-year, $6 billion agreement with Amazon Web Services (AWS) to secure a steady supply of Graviton processors.
The Financial Scale of the Graviton Pivot
Snowflake has been tethered to AWS since its inception in 2012, and while the company now operates across Microsoft Azure and Google Cloud, AWS remains its primary infrastructure backbone. To put the scale of this new $6 billion commitment into perspective, it nearly equals the $7 billion in total revenue Snowflake has generated through the AWS Marketplace since its founding. The financial trajectory of this partnership is accelerating rapidly; Snowflake's projected AWS expenditure for 2025 has reached $2 billion, representing a 100% increase over the previous year.
This surge in spending is driven primarily by Cortex AI, Snowflake's suite of AI building tools. The strategic logic behind Cortex AI is to bring the compute to the data rather than moving massive datasets to a separate AI environment. By allowing AI to read and process data directly where it resides, Snowflake eliminates the latency and cost associated with data movement. This architectural shift requires an immense amount of raw compute power to handle the constant stream of natural language queries and automated reporting tasks that Cortex AI facilitates for its enterprise clients.
At the heart of this agreement is the AWS Graviton, a custom-designed ARM-based CPU. As AI services evolve from the training phase—where massive models are built—to the operational phase—where AI agents perform daily tasks—the demand for CPU resources spikes. While GPUs handle the heavy lifting of deep learning and complex inference, the orchestration, control logic, and general data processing required by AI agents fall to the CPU. By locking in a massive allocation of Graviton chips, Snowflake aims to lower the operational overhead of its AI agent services and ensure the scalability of its infrastructure as more enterprises migrate their data workflows to AI-driven models.
The Shift from GPU Dominance to CPU Efficiency
For the past few years, the AI hardware narrative has been dominated by a singular obsession with Nvidia GPUs. The industry focused almost exclusively on the massive compute power required to train Large Language Models (LLMs). However, a critical inflection point has arrived: the center of gravity is shifting from training to inference and automation. While GPUs are unmatched for parallel matrix multiplication, they are prohibitively expensive and power-hungry for the general-purpose control tasks that AI agents perform. This has created a strategic opening for ARM-based CPUs, which offer a far more economical price-to-performance ratio for inference-heavy workloads.
This trend is not isolated to Snowflake. Meta recently signed a large-scale agreement with AWS to integrate Graviton CPUs into its own AI infrastructure, following a separate $10 billion computing deal with Google Cloud. Cloud service providers are aggressively deploying their own silicon to break the Nvidia monopoly and reduce their dependency on external hardware vendors. Microsoft has followed a similar path, launching its Maia AI chip in January to optimize inference efficiency. The motivation for these hyperscalers is clear: by designing their own chips, they can slash infrastructure costs and pass those savings to customers, creating a competitive moat based on operational efficiency rather than just raw power.
Nvidia is not standing still in the face of this CPU insurgency. To defend its territory, the chip giant recently launched Vera, an AI-specific CPU designed to capture the general-purpose compute market. Jensen Huang, CEO of Nvidia, has signaled that Vera is intended to unlock a new $200 billion market opportunity. The initial demand is already evident, with Vera generating $20 billion in revenue. Nvidia is attempting to evolve from a GPU specialist into a comprehensive hardware provider that can handle every stage of the AI pipeline, from the most complex training runs to the simplest agentic control loops.
For the modern enterprise, the choice of hardware now depends on the nature of the workload. If a company's data is primarily structured and its analysis consists of repetitive SQL-based tasks, ARM-based CPUs are the superior choice over general-purpose GPUs. The architecture Snowflake is employing—physically tightening the link between data storage and the compute environment—removes the bottlenecks that typically plague large-scale AI deployments. Using a GPU for simple data retrieval or summary tasks is an inefficient use of resources; the real efficiency gain comes from bifurcating the hardware based on whether the task is a complex mathematical operation or a logical flow of data.
As AI services move from the development lab into full-scale production, the financial stakes of infrastructure choices have intensified. For companies whose cloud bills exceed 10% of their annual revenue, long-term contracts to secure specific chip allocations are no longer optional—they are a survival strategy. The risk of vendor lock-in to a specific cloud chip ecosystem is now outweighed by the immediate need to improve operating margins through hardware efficiency. The industry is moving toward a hybrid infrastructure model, where Nvidia GPUs and cloud-native CPUs are balanced in a 7:3 or 6:4 ratio to optimize for both performance and cost.
Ultimately, the competition between custom ARM CPUs and specialized AI CPUs like Vera marks the end of the GPU-only era. The metric of success has shifted from absolute peak performance to the cost per unit of processing and power efficiency. In the current landscape, infrastructure efficiency is the primary driver of price competitiveness in the AI market.




