Agents moved from chat to execution: GPT‑5.5, multi‑cloud OpenAI, Pentagon scale, and Nvidia’s agent CPU
Agentic AI went practical: GPT‑5.5 targets multi‑step work, OpenAI opened multi‑cloud and government channels, the Pentagon scaled Gemini to millions, and Nvidia unveiled a CPU for agent loops. Here’s what changed, and what to try hands‑on.
This Week in One Line
OpenAI rolled out GPT-5.5 to paid ChatGPT tiers, Microsoft and OpenAI unwound Azure exclusivity to enable multi‑cloud, the Pentagon expanded Gemini to millions on GenAI.mil, and Nvidia unveiled a CPU for agent loops — agents are crossing from demos into daily workflows.
Week in Numbers
- $600B — Estimated 2026 AI capex across Microsoft, Alphabet, Amazon, and Meta; a market-level test of AI spend. 1
- 10M — Weekly conversations handled by Meta’s business AI tools across WhatsApp, Messenger, and Instagram. 2
- 3M — Users eligible to access Gemini 3.1 Pro on the U.S. DoD’s GenAI.mil platform; 1.3M+ already active. 3
- 9x — Throughput gain Nvidia claims for Nemotron 3 Nano Omni vs. other open “omni” models at similar interactivity. 4
- 88 — Custom Olympus cores in Nvidia’s Vera CPU (central processing unit) for agentic workloads. 5
- $1.1B — Seed funding raised by Ineffable Intelligence, valuing the new lab at $5.1B. 6
- €500M — Schwarz Group’s structured financing to back Cohere’s tie-up with Aleph Alpha. 7
Top Stories
OpenAI releases GPT-5.5 for paid ChatGPT tiers
OpenAI launched GPT-5.5 across ChatGPT Plus, Pro, Business, and Enterprise, with GPT-5.5 Pro for Pro/Business/Enterprise. The model targets multi-step work (coding, research, data analysis, computer use) and reports gains over GPT-5.4 on Terminal-Bench 2.0 (82.7%), SWE-Bench Pro (58.6%), GDPval (84.9%), OSWorld-Verified (78.7%), and Tau2-bench Telecom (98.0%), while matching per-token latency. 8
OpenAI highlights planning, tool use, and self‑checking, with internal teams citing time savings; API (application programming interface) access follows “very soon” pending additional safeguards, and the rollout was preceded by safety evaluations and red‑teaming. 8
Microsoft–OpenAI loosen exclusivity, enabling multi‑cloud
Microsoft and OpenAI reworked their partnership so OpenAI can sell on any cloud; Microsoft keeps a license to OpenAI tech through 2032, and its OpenAI stake is valued at over $135B. 9
Azure remains the default “ship first” path, revenue‑share terms shift, and AGI (artificial general intelligence) triggers were removed — VentureBeat frames this as the end of exclusivity, opening competition on AWS (Amazon Web Services) and Google Cloud. 10 11
Pentagon adds Gemini 3.1 Pro to GenAI.mil for up to 3M users
The U.S. Defense Department’s GenAI.mil made Google Cloud’s Gemini 3.1 Pro broadly available, with up to 3 million eligible users and more than 1.3 million already active; officials say the tools operate in Impact Level 5 environments and have enabled over 100,000 AI agents to be built. 3 12
The expansion underscores faster government adoption of frontier models, alongside ongoing debates inside vendors over acceptable use. 3
Nvidia unveils Vera CPU for agentic AI bottlenecks
Nvidia detailed Vera, a data‑center CPU built for agentic loops and RL (reinforcement learning) post‑training: 88 custom Olympus cores, up to 1.2 TB/s of LPDDR5X memory bandwidth, and up to 1.5× higher agentic sandbox performance than competing x86 platforms. 5
A Vera rack integrates up to 256 CPUs and supports 22.5K+ concurrent CPU environments, with systems expected from OEMs (original equipment manufacturers) in H2 2026. 13
Nvidia releases Nemotron 3 Nano Omni, a unified multimodal model
Nemotron 3 Nano Omni consolidates video, audio, images, and text in a single open model, which Nvidia says delivers up to 9× higher throughput than other open “omni” models at comparable interactivity, with distribution via Hugging Face, OpenRouter, and NVIDIA NIM. 4
Under the hood: a 30B‑A3B hybrid Mixture of Experts (MoE) backbone with integrated vision and audio encoders; checkpoints arrive in BF16/FP8/FP4 and an accompanying paper details accuracy gains across modalities. 14
Anthropic pilots agent‑to‑agent commerce with real money
In a one‑week internal marketplace, AI agents representing 69 employees negotiated 186 deals totaling just over $4,000, handling listings, offers, and counteroffers without human sign‑off during negotiations. Anthropic found stronger models achieved better outcomes, while weaker‑model users often didn’t notice the gap. 15
Coverage notes the pilot’s small scale and highlights “agent quality” disparities as a governance issue to plan for. 16
OpenAI opens government and AWS routes (FedRAMP + Bedrock)
OpenAI said ChatGPT Enterprise and the OpenAI API Platform are authorized at Federal Risk and Authorization Management Program (FedRAMP) Moderate, enabling U.S. federal agencies to use OpenAI in a compliant environment (including access to GPT‑5.5, per agency decisions). 17
OpenAI also brought its models — including GPT‑5.5 — to Amazon Bedrock and introduced Bedrock Managed Agents powered by OpenAI in limited preview, designed to fit existing AWS identity, security, and procurement flows; it also published updated operating principles. 18 19
Cohere to merge with Aleph Alpha, backed by €500M from Schwarz
Cohere agreed to take over Germany’s Aleph Alpha with €500 million in structured financing from Schwarz Group and likely use of its sovereign cloud, STACKIT — a push to serve regulated buyers seeking data control and European language coverage. 7
Reports say the combined company would keep the Cohere name, target a valuation around $20B, and allocate roughly 90%/10% ownership to Cohere/Aleph Alpha, pending approvals. 20
Aidoc raises $150M to scale clinical AI imaging
Aidoc secured a $150 million Series E led by Goldman Sachs Alternatives. The company cites 31 FDA clearances and deployments in nearly 200 U.S. health systems and 1,600+ hospitals globally, funding a broader clinical AI model and additional regulatory work. 21
Competition among incumbents and startups is active; the raise signals enterprise‑grade traction in imaging workflows like ED (emergency department) triage and abdominal findings. 21
Hyperscaler earnings put AI capex — and payoffs — under the microscope
With results clustered on Apr 29, options markets priced ≥4% moves as investors weighed an estimated $600B in 2026 AI capex across Microsoft, Alphabet, Amazon, and Meta (a group worth $10T and ~17% of the S&P 500). 1
Alphabet reported Google Cloud revenue of $20B (up 63% year over year) with a $462B backlog nearly doubling quarter-over-quarter; AWS posted $37.6B and Microsoft Cloud $54.5B. Microsoft guided capex above $40B in Q4 and cited capacity constraints through 2026, with two‑thirds of spend going to GPUs and CPUs. 22
Trend Analysis
A clear shift toward “agentic” systems emerged across model, infrastructure, and deployment news. OpenAI’s GPT‑5.5 is framed for multi‑step tasks, Nvidia’s Vera CPU targets CPU‑bound agent loops, and Nemotron 3 Nano Omni collapses perception handoffs by unifying vision, audio, and text — together pointing to faster, more reliable software that plans, uses tools, and checks its own work. 8 5 4
Distribution and governance widened at the same time. Microsoft and OpenAI ended Azure exclusivity so OpenAI can sell on any cloud, while OpenAI added a compliant government path (FedRAMP Moderate) and an enterprise route inside Amazon Bedrock. For buyers, that means more leverage on latency, data residency, and procurement fit — and fewer reasons to block pilots on platform constraints. 9 17 18
Adoption signals concentrated in real workflows: Meta’s business AI handled about 10 million weekly customer conversations, Aidoc raised $150M to expand clinical imaging AI with dozens of FDA clearances, and hyperscaler earnings emphasized cloud growth and capacity planning ($600B in capex, large backlogs). The center of gravity is shifting from model headlines to measurable throughput, guardrails, and ROI. 2 21 22
Public-sector moves add a second anchor: the Pentagon’s GenAI.mil scaled Gemini across up to 3 million users and the department signed agreements to bring AI tools onto classified networks, while small-scale agent-to-agent commerce at Anthropic highlighted outcome gaps across model tiers. The takeaway: capability is rising, but usage policies and model choice now visibly shape results. 3 23 15
Watch Points
- “GPT‑5.5 API” — OpenAI says API access is coming “very soon” with additional safeguards; watch for availability and usage policies. 8
- “Cohere–Aleph Alpha approval” — Regulatory review and STACKIT deployments will indicate how fast the sovereign AI pitch lands in the EU public sector. 7
- “Vera shipping window” — OEM timelines for Nvidia’s Vera (H2 2026) will show how quickly agentic CPU capacity reaches data centers. 13
Open Source Spotlight
- Qwen Code — Terminal-native coding agent that plugs into local or hosted models; ideal for CLI-first devs who want provider flexibility and privacy. QwenLM/qwen-code
- promptfoo — Reproducible prompt/agent/Retrieval‑Augmented Generation (RAG) test runner with CI integration; helps teams baseline quality, cost, and latency. promptfoo/promptfoo
- Skyvern — Browser automation with large language models and vision; good for scaling login–navigate–extract flows beyond brittle scripts. Skyvern-AI/skyvern
- TensorRT‑LLM — Nvidia’s high‑performance inference stack for large models; consolidates kernel and scheduling optimizations behind a Python/C++ API. NVIDIA/TensorRT-LLM
- vllm‑mlx — OpenAI/Anthropic‑compatible local server for Apple Silicon (MLX backend) with continuous batching and multimodal support. waybarrios/vllm-mlx
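Several of the tools above serve or consume OpenAI‑compatible HTTP endpoints (vllm‑mlx exposes one locally; Qwen Code can point at one). As a minimal, stdlib‑only sketch, here is how a client request to such a server can be built; the base URL `http://localhost:8000/v1` and the model name `local-model` are placeholder assumptions, not values taken from any of these projects:

```python
import json
import urllib.request

def chat_request(base_url: str, model: str, prompt: str) -> urllib.request.Request:
    """Build an OpenAI-compatible /chat/completions POST request.

    Works against any server exposing that API shape (vllm-mlx, vLLM, etc.).
    """
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "temperature": 0.2,
    }
    return urllib.request.Request(
        f"{base_url}/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )

if __name__ == "__main__":
    req = chat_request(
        "http://localhost:8000/v1",  # placeholder local server address
        "local-model",               # placeholder model name
        "Summarize this week's AI news in one line.",
    )
    # To actually send it (requires a running local server):
    #   with urllib.request.urlopen(req) as resp:
    #       print(json.loads(resp.read())["choices"][0]["message"]["content"])
    print(req.full_url)
```

Because the request format is the same across these servers, the one function lets you A/B a local model against a hosted one by swapping only the base URL.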
What Can I Try?
- Put GPT‑5.5 on a real task: have it plan and complete a weekly report or spreadsheet end‑to‑end, then compare quality, time, and tokens vs. your current flow. 8
- Register for Google × Kaggle’s free AI Agents Intensive (June 15–19) and pick a work‑relevant capstone to ship in a week. 24
- Pilot OpenAI on AWS Bedrock with IT: run the same prompt flow you use today and evaluate latency, security fit, and billing. 18
- Test Adobe’s Firefly AI Assistant (public beta): turn one product photo into a set of platform‑ready social assets and time the lift. 25
- Verify model lineage before deployment: fingerprint one model using Cisco’s Model Provenance Kit and archive the result in your AI inventory. 26
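For the first experiment above (GPT‑5.5 vs. your current flow), a tiny timing harness keeps the comparison honest. This is an illustrative sketch only: the `task` callables and the whitespace token counter are stand‑ins you would replace with real model calls and a real tokenizer.

```python
import time
from dataclasses import dataclass
from typing import Callable

@dataclass
class RunResult:
    label: str
    seconds: float
    tokens: int
    output: str

def run_once(label: str,
             task: Callable[[], str],
             count_tokens: Callable[[str], int]) -> RunResult:
    """Time one end-to-end run of a task and record a rough token count."""
    start = time.perf_counter()
    output = task()
    elapsed = time.perf_counter() - start
    return RunResult(label, elapsed, count_tokens(output), output)

def compare(baseline: RunResult, candidate: RunResult) -> str:
    """One-line summary; a speedup above 1.0 means the candidate was faster."""
    speedup = baseline.seconds / candidate.seconds if candidate.seconds else float("inf")
    return (f"{candidate.label}: {speedup:.2f}x vs {baseline.label}, "
            f"{candidate.tokens} vs {baseline.tokens} tokens")

if __name__ == "__main__":
    # Stand-ins for your current flow and a GPT-5.5-backed flow (both hypothetical).
    whitespace_tokens = lambda s: len(s.split())
    a = run_once("current flow", lambda: "draft report " * 50, whitespace_tokens)
    b = run_once("gpt-5.5 flow", lambda: "draft report " * 40, whitespace_tokens)
    print(compare(a, b))
```

Run the same real task through both paths a few times and keep the per‑run results; quality still needs a human read, but time and token deltas fall out of the harness for free.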