Agents go live: Microsoft in-house, Meta deploys, Google goes local
Agents moved from chat to action as Microsoft launched its own model+agent stack, Meta embedded a business bot in WhatsApp/Instagram and a creator aide in Facebook, and Google sized Gemma 4 12B to run locally — with big checks still flowing to AI.
This Week in One Line
Microsoft unveiled its first in‑house reasoning model and Scout agent; Meta shipped a business agent across WhatsApp/Instagram and a Facebook creator assistant; Google released Gemma 4 12B for 16GB laptops; DeepSeek lined up $7.4B — AI is shifting from chat to hands‑on agents and local devices.
Week in Numbers
- 35B — Parameter count for Microsoft’s MAI‑Thinking‑1, a new in‑house reasoning model. 1
- 1M+ — Businesses that already used Meta’s earlier chatbots as Meta launches its Business Agent. 2
- 5M — Weekly active users for OpenAI’s Codex work tool after new job‑specific plugins. 3
- 16GB — Memory target to run Google’s Gemma 4 12B model locally on many laptops. 4
- $7.4B — Funding Chinese startup DeepSeek is set to raise in its first round. 5
- $400M — New capital raised by robotics startup Generalist AI at a $2B valuation. 6
- 3 km — Forecast resolution WindBorne says its WeatherMesh 6 delivers in Europe and the U.S. 7
Top Stories
Microsoft unveils MAI‑Thinking‑1 and Scout agent
Microsoft introduced MAI‑Thinking‑1, a new in‑house reasoning model it says matches leading peers on software engineering tasks, alongside six other models spanning image, voice, transcription, and coding. Axios reports MAI‑Thinking‑1 uses 35 billion active parameters and was trained from scratch on licensed data; The Verge frames the set as Microsoft moving more AI in‑house. For non‑specialists, the signal is lower costs and tighter integration inside Microsoft 365 and GitHub workflows. 8 1
Microsoft also previewed Scout, a personal agent for Outlook and Teams, and positioned the models for efficiency via Azure and the Foundry platform’s private preview. The practical impact: enterprises get a clearer path to govern agents that can act in productivity tools, potentially at lower token costs. 9
Meta launches AI Business Agent across WhatsApp and Instagram
Meta released an AI Business Agent that can book appointments, close sales, and answer FAQs inside WhatsApp and Messenger, with Instagram support, announced at the Conversations event in London on Jun 3. Meta says more than 1 million businesses already used earlier chatbot versions, with the new agent rolling out globally, free at first and with paid plans to follow. For teams, this moves customer support and lead‑handling into the channels customers already use — but permissioning and human escalation matter given the risks of action‑taking agents. 2
Google’s Gemma 4 12B brings local multimodal AI to 16GB laptops
Google added a 12‑billion‑parameter Gemma 4 model sized to run fully locally on many 16GB RAM/VRAM laptops, reducing reliance on cloud inference for everyday tasks. Weights are available for download and the design emphasizes faster generation and a streamlined path for audio and vision inputs; Ars Technica notes downloads are just under 18GB and tools like LM Studio can help people try it. For privacy‑sensitive work and cost control, this makes an on‑device assistant more practical. 4
OpenAI expands Codex with job‑specific plugins for office work
OpenAI rolled out six role‑focused plugins for Codex — spanning data analytics, creative, sales, product design, equity investing, and investment banking — aiming to help non‑engineers get useful outputs out of the box. The company reports more than 5 million weekly active users, up 6× since the desktop app launch in February, with knowledge workers growing quickly as a share. For readers, this points to ready‑made templates rather than prompt crafting from scratch. 3
Nvidia debuts Cosmos 3 for ‘physical AI’
Nvidia announced Cosmos 3 as an “open world” foundation model that can understand and generate text, images, video, ambient sound, and actions, with a two‑tower Mixture‑of‑Transformers design and availability via build.nvidia.com and Hugging Face. Nvidia claims it can shrink training/evaluation cycles for physical AI from months to days, with Cosmos 3 Super and Nano available now and Edge coming for real‑time use. 10
Supporting tools include open‑source physical skills and an Agent Toolkit to automate data generation, simulation, training, and evaluation pipelines across robotics and autonomous systems. For non‑robotics teams, the near‑term utility is synthetic video and task simulation for QA and training content. 11
Anthropic confidentially files for a U.S. IPO
Anthropic, maker of the Claude assistant, submitted a confidential initial public offering (IPO) filing on Jun 1, positioning it as an early test of public‑market appetite for frontier AI labs. Going public could surface details on revenue mix, compute costs, and margins that procurement and finance teams can use to evaluate vendor risk and pricing assumptions. 12
DeepSeek lines up roughly $7.4B in its first funding round
Reuters reports Chinese AI startup DeepSeek is set to raise about 50 billion yuan (roughly $7.4 billion), potentially valuing the company between $52–$59 billion. The investor list spans major Chinese tech and industrial players, signaling a push to build a more self‑sufficient national AI stack across models and data‑center infrastructure. 5
Meta rolls out a Facebook AI creator assistant
Meta introduced a conversational assistant for Facebook creators in the U.S., Canada, and India that answers planning questions like “When should I post?” and suggests ideas based on trends. Meta also expanded AI translations for Reels to more languages, aiming to boost engagement and reduce reliance on third‑party tools for analytics and brainstorming. 13
Robotics startup Generalist AI raises $400M at a $2B valuation
Generalist AI, backed by Nvidia and Bezos Expeditions, raised $400 million in a round led by Radical Ventures, valuing it at $2 billion including the new money. The raise underscores investor interest in moving robots beyond narrow, scripted motions and into more adaptable real‑world tasks — a cue for operators to evaluate uptime, edge‑case recovery, and task success rates, not just demos. 6
Snowflake and Anthropic expand governed AI deployments
Snowflake and Anthropic highlighted momentum at Snowflake Summit, positioning Claude inside Snowflake Cortex AI so enterprises can run AI on governed data with security, observability, and scale. Named customers include Basis, Block, Carvana, eSentire, Indeed, and Notion; the pitch is reduced data movement by bringing models to where data already lives. 14
Trend Analysis
Enterprise AI edged from chat into action. Meta’s Business Agent can book and sell inside WhatsApp and Instagram, Microsoft’s Scout points toward autonomous meeting prep in Outlook and Teams, and OpenAI’s Codex added role‑based plugins so non‑engineers can ship work without hand‑rolled prompts. The shared theme is agents operating inside the tools people already use — raising the bar on permissioning and audit while promising time saved on routine steps. 2 1 3
At the same time, AI moved closer to the device and the physical world. Google’s Gemma 4 12B targets 16GB laptops for fully local multimodal work, Nvidia’s Cosmos 3 packages world models and skills for robots and AVs, and chipmakers are pitching edge portfolios so tasks can run offline for latency, privacy, and cost reasons. For knowledge workers, this implies a hybrid pattern: quick, private steps on‑device and heavier lifts in the cloud. 4 10 15
Capital and vendor dynamics added urgency to unit economics and backup plans. Anthropic’s confidential filing points to a forthcoming look at revenue, compute costs, and margins, DeepSeek’s planned $7.4B round shows continued mega‑funding, and a $400M raise for Generalist AI signals investor focus on real‑world automation. Against that backdrop, Meta’s delayed model release is a reminder to avoid single‑vendor dependencies. 12 5 6 16
Watch Points
- “Gross margin” in Anthropic’s S‑1 — If you see this metric next week, it’s the key line investors use to judge whether AI services cover compute costs with room to spare. 17
- “Agent permissions” — Any mention of payments, refunds, or escalations ties back to Meta’s action‑taking Business Agent and the need for strict guardrails. 2
- “16GB local multimodal” — References to Gemma 4 12B or similar indicate growing on‑device capability; expect memory/quantization trade‑offs to dominate early trials. 4
Open Source Spotlight
- InsForge — One‑stop backend for agentic coding: DB, auth, storage, compute, hosting, and an AI gateway with an SDK; helpful if you want an agent to ship full‑stack apps end to end. InsForge/InsForge
- Vercel AI SDK — Provider‑agnostic TypeScript software development kit (SDK) to build AI apps and agents across major UI frameworks with a unified application programming interface (API). vercel/ai
- Lance — Open lakehouse format with fast random access, built‑in vector search, and versioning; good for multimodal datasets and retrieval pipelines. lance-format/lance
- YOLO26 (paper) — Real‑time, NMS‑free unified vision pipeline for detection/segmentation/pose; useful for builders needing simple deployment. arxiv:2606.03748
- Code2LoRA (paper) — Generates per‑repo LoRA adapters so code models stay in sync with fast‑changing repositories without long prompts. arxiv:2606.06492
What Can I Try?
- Pilot Meta’s Business Agent on a single WhatsApp/Instagram flow (FAQs or lead qualification) and add human escalation before enabling bookings or payments. 2
- Run a local trial of Gemma 4 12B on a 16GB laptop via LM Studio; test a short text+image or 30–60s media task and note latency and quality. 4
- Add one OpenAI Codex plugin (e.g., data analytics or product design) to a weekly task and compare outcome time vs. your usual workflow. 3
- Watch Nvidia’s Cosmos 3 demos and assess whether synthetic video or action simulation could accelerate your QA or training content. 10
Comments (0)