Wednesday, July 1, 2026
Search

US AI Inference Firms Raise $2.65B in June as Global Infrastructure Race Accelerates

Three US-based AI inference companies—Baseten, Groq, and Upscale AI—collectively raised over $2.65B in June 2026, signaling a global capital consensus on inference as a standalone infrastructure market. Nvidia simultaneously acquihired Groq's founding team, deepening the chip giant's position in inference talent. A liquid helium supply deal between Air Products and an Asian semiconductor manufacturer points to parallel capacity expansion across the global hardware supply chain.

Salvado
Salvado

July 1, 2026

US AI Inference Firms Raise $2.65B in June as Global Infrastructure Race Accelerates
Image generated by AI for illustrative purposes. Not actual footage or photography from the reported events.
Loading stream...

Baseten closed a $1.5B Series F round in June 2026.1 Groq raised $650M to scale its AI inference cloud.2 Upscale AI extended its Series A to $500M total.3 Three large rounds in one month—over $2.65B combined—mark a coordinated global bet on inference infrastructure.

Nvidia acquihired Groq's founder and key team members during the same period.2 The dominant GPU maker, already central to AI supply chains worldwide, is now pulling inference-specialized talent in-house even as independent inference firms scale aggressively.

Inference is the operational layer of AI—serving real user requests from trained models. It demands low latency at continuous scale. That requirement is creating a distinct infrastructure market, separate from model training. Investors across the US, Asia, and Europe are now pricing it as one.

Baseten serves enterprise customers with model deployment and serving infrastructure. Groq's custom LPU architecture competes on tokens-per-second economics against general-purpose GPUs. Both companies occupy the layer between model developers and end-user applications—a position that is attracting capital globally.

The hardware supply chain reflects the same pressure. Air Products secured a long-term liquid helium supply agreement with an Asian semiconductor manufacturer.4 Liquid helium is essential for cooling advanced fabrication equipment. The deal points to sustained capacity expansion—a prerequisite for scaling GPU output from fabs in Taiwan, South Korea, and beyond.

Forward indicators will include GPU order volumes and revenue guidance from Nvidia, AMD, and TSMC. Data center announcements from Groq and Baseten, tracked against capex from global cloud providers, will show whether June's capital commitments convert into accelerator procurement within two quarters.

Capital clustering within a single month carries its own signal. When multiple large institutional rounds close in the same vertical within weeks, it reflects shared conviction on near-term demand—not independent bets. That pattern has historically preceded infrastructure build-outs in cloud computing, mobile networks, and semiconductor fabs.

Inference has historically been treated as a cost to minimize. The June 2026 funding wave reframes it as a growth market worth scaling aggressively—and a new front in the global competition for AI infrastructure leadership.


Sources:
1 Baseten Series F funding announcement, June 2026
2 Groq funding round announcement, June 2026
3 Upscale AI Series A extension announcement, June 2026
4 Air Products semiconductor supply agreement announcement, June 2026

Salvado
Salvado

Tracking how AI changes money.