OpenAI ships GPT-5.5 to paid ChatGPT users, built to finish multi-step work
GPT-5.5 matches GPT-5.4’s latency while posting higher scores on coding and computer-use benchmarks, and it’s rolling out to Plus, Pro, Business, and Enterprise in ChatGPT and Codex. API access is delayed pending additional safety work, as Nvidia touts a CPU built for agentic AI and investors back agent infrastructure and clinical AI.
One-Line Summary
AI shifts from Q&A to getting work done: OpenAI rolls out GPT-5.5 for multi-step tasks, Nvidia unveils a CPU built for agentic loops, and capital flows to agent infrastructure and clinical AI.
Big Tech
OpenAI releases GPT-5.5 for paid ChatGPT tiers
OpenAI launches GPT-5.5, a new model designed to handle messy, multi-part work like coding, research, data analysis, and computer use across tools, and it rolls out in ChatGPT and Codex for Plus, Pro, Business, and Enterprise users, with GPT-5.5 Pro for Pro, Business, and Enterprise. OpenAI says it improves over GPT-5.4 on benchmarks such as Terminal-Bench 2.0 (82.7%), SWE-Bench Pro (58.6%), GDPval (84.9%), OSWorld-Verified (78.7%), and Tau2-bench Telecom (98.0%), while matching GPT-5.4’s per‑token latency in serving. 1
OpenAI emphasizes that GPT-5.5 can plan, use tools, check its work, and persist through ambiguity, reaching higher-quality outputs with fewer tokens and fewer retries in Codex. Analysis from The Next Web frames the launch as a push to make complex tasks more self-managing for enterprises and notes API access is delayed pending additional safeguards. 2
Safety is a focal point: OpenAI says GPT-5.5 is released with its strongest safeguards to date, including evaluations across safety and preparedness frameworks, internal and external red‑teaming, targeted testing for advanced cybersecurity and biology capabilities, and feedback from nearly 200 early‑access partners. OpenAI says API deployments require different safeguards and will follow "very soon." 1
For everyday work, OpenAI highlights gains in generating documents, spreadsheets, and slides, and better performance on knowledge-work benchmarks; internal teams use Codex weekly across functions from engineering to finance and communications, citing time savings in tasks like report generation and large-scale form review. 1
Industry & Biz
Nvidia’s Vera CPU targets agentic AI bottlenecks
Nvidia details Vera, a data center CPU purpose-built for agentic AI and reinforcement learning loops that demand strong single‑core performance and high memory bandwidth per core; the chip features 88 custom Olympus cores, up to 1.2 TB/s LPDDR5X memory bandwidth, a second‑generation Scalable Coherency Fabric, and uniform low‑latency access across the die. Nvidia reports up to 1.5× higher agentic sandbox performance versus competing x86 platforms under full load. 3
Nvidia also announces a Vera CPU rack integrating up to 256 CPUs with capacity for more than 22.5K concurrent CPU environments, designed to simplify rack‑scale deployment for AI factories and reduce build‑out time. The company says systems will be available from OEM partners in the second half of 2026. 4
The architecture is pitched to speed RL post‑training cycles and agentic inference by cutting CPU‑bound delays in tool use, orchestration, and evaluation jobs—key steps that can otherwise gate accelerator utilization and user‑perceived latency. 3
Parallel Web Systems raises $100M at $2B to power agent web search
Parallel Web Systems, founded by former Twitter CEO Parag Agrawal, raises $100 million in Series B funding led by Sequoia Capital, valuing the company at $2 billion, to expand sales, marketing, and R&D. The startup builds web search infrastructure for AI agents, aiming to let autonomous systems retrieve information with more control and precision than generic search. 5
Use cases include deep research tasks in areas like underwriting, insurance claims, and government contracts; AI legal startup Harvey uses Parallel’s platform, and the company says over 100,000 developers have built on its infrastructure. Parallel previously raised a $100 million Series A in November 2025, bringing total funding to $230 million. 5
Investors point to momentum in “long‑horizon” agents that run in the background, keep context longer, and use the web as a shared capability—an area Parallel targets with enterprise buyers. 5
Aidoc raises $150M to scale clinical AI imaging
Clinical AI provider Aidoc raises $150 million in Series E funding led by Goldman Sachs Alternatives’ growth equity arm, with General Catalyst, SoftBank Investment Advisors, and NVentures participating; the company has secured 31 FDA clearances and says its tools are deployed in nearly 200 U.S. health systems and over 1,600 hospitals globally. 6
Aidoc’s AI flags findings on CT scans and X‑rays for workflows like emergency triage and abdominal abnormality detection; proceeds will support regulatory processes and development of a large, comprehensive model. Axios notes Aidoc has raised $520 million to date. 6
Competition is active across incumbents and startups, with GE HealthCare completing a $2.3 billion Intelerad acquisition and Siemens Healthineers launching radiology services; Aidoc says it is building an enterprise platform approach alongside its foundation-model strategy. 6
Community Pulse
Hacker News (1576↑) — Mixed reactions: enthusiasm for capability tempered by cost concerns and the need for human review when context exceeds current windows. 7
"I'm in a large enterprise context--you have to use human reviewers if you don't want to end up like Github's status page. So much context exists outside of the code that the bots are either not provided or are far too large of contexts for current windows." — Hacker News 7
"Got invited to try this, but it was too expensive. I gave it two tasks that I would expect Codex 5.3 xhigh to take $1-2 of tokens on. It used $20 on each, and one was on medium with the other on xhigh!" — Hacker News 7
What This Means for You
If you use ChatGPT for work, GPT-5.5’s pitch is practical: hand it a multi‑step task and let the model plan, use tools, and check work with less hand‑holding, while keeping response speed similar to GPT‑5.4. That can reduce time on reports, spreadsheet models, or code changes—especially inside Codex—though API builders need to wait for access. 1
Enterprise teams should balance capability with governance and cost: community feedback flags high token use on some tasks, while OpenAI says 5.5 often completes Codex tasks with fewer tokens. Treat early rollouts as controlled pilots with human review for context gaps, particularly where decisions span multiple systems or policies. 7
On infrastructure, Nvidia’s Vera underscores that agentic AI isn’t just about GPUs. CPU-bound steps—tool calls, data parsing, orchestration, and evaluation—can bottleneck user experience and training loops; vendors and IT teams may need to evaluate CPU choices and rack‑level designs as agent workloads scale. 3
For healthcare and other regulated fields, Aidoc’s raise and clearances signal that enterprise AI can move from point features to platform deployments—provided teams invest in approvals, monitoring, and measurable outcomes like triage speed and length‑of‑stay improvements. Procurement and compliance teams should align early with clinical and data leaders. 6
Action Items
- Try GPT-5.5 on a real weekly task: In ChatGPT, pick a complex report or spreadsheet you do often and ask GPT-5.5 to plan and complete it end-to-end, then compare the output and time saved against your current process.
- Run a cost/latency A–B test: Solve the same Codex task with GPT-5.4 and GPT-5.5, record tokens and time, and decide where 5.5’s gains justify switching for your team.
- Map human-in-the-loop checkpoints: For tasks touching policy, finance, or code, define what a reviewer must verify before adoption, reflecting the community’s context-window caution.
- Ask your vendor about CPU bottlenecks: If you use agent tools, request data on CPU-bound steps (tool calls, orchestration) and how they plan to address them as workloads grow.
- Healthcare teams: shortlist one imaging workflow: If relevant to your role, pick a narrow CT/X-ray use case (e.g., ED triage) and draft a 2–3 step pilot plan including metrics and escalation rules.
Comments (0)