Daily aggregation of provider blogs, curated articles, trending models, and status incidents.
Browse the full archive →NVIDIA Research Unlocks Advanced Grasping, Smarter Autonomous Driving and Agent Training at Scale
NVIDIA research addresses robotic generalization and autonomous driving safety through improved gripper control and agent training methods at scale.
NVIDIA Enables the Next Era Of Physical AI Research With Agent Skills For Autonomous Vehicles, Robotics And Vision AI
NVIDIA unveils physical AI agent skills for autonomous vehicles, robotics, and vision AI, enabling developers to accelerate research by providing full workflow tools beyond model development.
5 ways Google Search can level up your thrift and vintage shopping
A colorful, grain-textured collage of various items scattered across a light blue background. The items include a green bucket hat, a wooden clothes hanger, a blue-and-red striped collar, blue sunglasses, denim shorts, a yellow price tag, a red high-heele
Direct Preference Optimization Beyond Chatbots
Research on Direct Preference Optimization techniques applicable to model training beyond conversational AI.
Adding MCP Tools to Reachy Mini
Hugging Face adds MCP (Model Context Protocol) tool support to Reachy Mini robotic platform.
Industrial Software Leaders Build Secure, Autonomous AI Engineers With NVIDIA NemoClaw
NVIDIA NemoClaw enables industrial software vendors to build autonomous AI engineers that optimize engineering workflows from CAD through post-processing.
NVIDIA Partners With Microsoft on Unified Stack for Agentic AI Deployment, From Windows Devices to Cloud to Local
NVIDIA and Microsoft partner on unified agentic AI stack spanning Windows, cloud, and local deployment with optimized hardware, runtimes, and long-context reasoning models.
Build Personal AI Agents on Windows PCs with New Tools from Microsoft and NVIDIA
Microsoft and NVIDIA provide new tools for developers to build personal AI agents running on Windows PCs.
Deploy Self-Evolving Agents for Faster, More Secure Research with a Hermes Agent and NVIDIA NemoClaw
NVIDIA enables self-evolving agents using Hermes Agent and NemoClaw for research data synthesis and acceleration with improved security.
Holo3.1: Fast & Local Computer Use Agents
Holo3.1 enables fast, local execution of computer use agents without external dependencies, allowing developers to build autonomous task-completion systems that run on-device.
Why Financial Institutions Are Converging on Transaction Foundation Models to Build Their Own Intelligence
Financial institutions have spent years building AI: fraud models, credit models, recommendation engines and risk systems. While this sprawl of task-specific models has been effective, it’s also constrained by siloed systems. Siloed systems prevent institutions from developing a unified understanding of consumers’ financial behavior. As enterprise datasets keep growing, so does the gap between what […]
NVIDIA Jetson Brings Agentic AI to the Physical World
NVIDIA announces JetPack 7.2 and NemoClaw support on Jetson platform, enabling agentic AI deployment on edge devices with improved performance on Jetson Orin.
Deploy Agentic-Ready AI at the Edge with Memory Efficiency in NVIDIA JetPack 7.2
NVIDIA JetPack 7.2 brings memory-efficient agentic AI deployment to edge devices for physical world applications.
Run Local AI Agents with Faster Models and Multi-Node Clustering on NVIDIA DGX Spark
NVIDIA DGX Spark enables local AI agent deployment with optimized models, large context windows, and multi-node clustering for concurrent long-running tasks.
How we used Gemini to build Google I/O 2026
A collage of I/O-related images, including the Antigravity Coffee Co. pop-up, a colorful jellyfish and a still from the Timmy TPU video. The word AI repeats three times on the left of the image, and there are colorful icons, including a sparkle, as well.
Introducing Mellum2: A 12B Mixture-of-Experts Model by JetBrains
JetBrains released Mellum2, a 12B parameter mixture-of-experts model available on Hugging Face.
Beyond LLMs: Why Scalable Enterprise AI Adoption Depends on Agent Logic
Article discussing enterprise AI adoption strategies beyond large language models, focusing on agent-based approaches.
NVIDIA AI Cloud Ecosystem Expands Worldwide to Meet Global AI Compute Demand
The NVIDIA AI Cloud ecosystem is accelerating the global buildout of AI factory infrastructure. Partners are expanding capacity to meet growing demand from enterprises, startups, nations, AI labs and developers scaling agentic AI applications. NVIDIA AI Clouds are a growing ecosystem of purpose-built clouds serving the exploding token demand behind today’s most popular AI applications. […]
NVIDIA Factory Operations Blueprint Gives Factories a New AI Brain
NVIDIA announces Factory Operations Blueprint, an AI system connecting machine signals and operational data into unified decision layer for plant-wide manufacturing intelligence.
Taiwan’s Industry Titans Turbocharge World’s AI Infrastructure Buildout With NVIDIA
Taiwan is home to more than 500 NVIDIA ecosystem partners. More than 1 million NVIDIA MGX rack components for NVIDIA Vera Rubin infrastructure come together in Taiwan, from across 25 factory sites. As Vera Rubin ramps into full production to power agentic AI factories worldwide, that ecosystem spans the full supply chain — from key […]
How to Post-Train Autonomous Vehicle Models in Closed-Loop with NVIDIA Alpamayo
NVIDIA Alpamayo enables closed-loop post-training of autonomous vehicle vision-language-action models, bridging training and deployment.
How Cosmos 3 Helps Physical AI Think Before It Acts
NVIDIA Cosmos 3 helps physical AI systems reason about the world before taking action.
Welcome NVIDIA Cosmos 3: The First Open Omni-model for Physical AI Reasoning and Action
NVIDIA released Cosmos 3, an open omni-model for physical AI reasoning and action tasks.
Develop Physical AI Reasoning, World, and Action Models with NVIDIA Cosmos 3
NVIDIA Cosmos 3 provides tools for developing physical AI reasoning, world models, and action models for robots and autonomous systems.
Advancing AI Infrastructure for Agentic AI with NVIDIA DOCA In-Silicon Security
NVIDIA DOCA in-silicon security advances AI factory infrastructure for agentic AI deployment with improved security for autonomous agents.
NVIDIA Vera CPU Sets a New Standard for Agentic Workloads in AI Factories
NVIDIA Vera CPU is optimized for agentic AI workloads in AI factories, setting new performance standards for long-context token generation.
NVIDIA DSX OS Delivers Open, Modular Software for Operating AI Factories at Scale
NVIDIA DSX OS provides open modular software for operating AI factories at scale, enabling token generation and agent deployment.
May Release Spotlight
OpenRouter released speech APIs, model fusion, and 20 new models including Gemini 3.5 Flash and Claude Opus 4.8 with enterprise workspace controls.
DynoSim: Simulating the Pareto Frontier
DynoSim addresses LLM serving optimization challenges by simulating performance trade-offs across model backends, tensor parallelism, and prefill/decode configurations.
Take our I/O 2026 quiz, vibe coded in Google AI Studio.
We used Google AI Studio to vibe code a quiz about our top I/O 2026 announcements.
9 demos of Gemini Omni and Gemini 3.5 in action
Gemini Omni & Gemini 3.5 hero
Check out real-life AI prototypes from the Futures Lab.
University of Waterloo students develop AI prototypes like sign language tutors to reshape the future of education and work.
Guardrails: Protect your Agents, Data, and Costs
OpenRouter released configurable security and governance tools including budget enforcement, data retention controls, model restrictions, and injection defense.
Profiling in PyTorch (Part 1): A Beginner's Guide to torch.profiler
Tutorial on PyTorch profiling using torch.profiler for performance analysis and optimization.
Catch up on 12 major I/O 2026 moments
The colorful I/O logo against a black background, surrounded by stills from the I/O keynote
OpenRouter Raises $113M Series B
OpenRouter has raised a $113M Series B led by CapitalG, with participation from NVentures, ServiceNow Ventures, MongoDB Ventures, Snowflake Ventures, Databricks Ventures, AMP PBC, and Pace Capital, alongside existing investors Andreessen Horowitz and Menlo Ventures.
Reachy Mini goes fully local
Reachy Mini robot platform now supports fully local AI inference capabilities.
Shipping a Trillion Parameters With a Hub Bucket: Delta Weight Sync in TRL
Delta Weight Sync in TRL enables efficient distributed training of trillion-parameter models using Hub storage.
Harness, Scaffold, and the AI Agent Terms Worth Getting Right
Article discussing terminology and conceptual frameworks for AI agents and scaffolding approaches.
Catch up on the Dialogues stage at Google I/O 2026.
Alphabet CEO Sundar Pichai in conversation on the I/O 2026 Dialogues stage
We’re announcing new community investments in Missouri.
We’re helping build the state’s next-generation workforce and investing in energy programs.
100 things we announced at I/O 2026
Image with the words "Ready, Set, I/O" and a colorful Gemini logo
A new experiment brings better group meetings to Google Beam
Google Beam adds experimental group meeting features for hybrid video conferencing.
A Robot is Sprinting Towards You: Do You Want it Running on Claude or Grok?
OpenRouter evaluates 11 LLMs including Claude and Grok across robotic tasks, demonstrating significant performance differences in model selection for physical AI.
Human-in-the-Loop Tools for the Agent SDK
OpenRouter Agent SDK adds human-in-the-loop tool type enabling auto-resolution of routine decisions with human input for high-stakes choices.
Consistent Web Search and Fetch Across Every Model
OpenRouter enables web search and page fetching capabilities for all tool-calling models across multiple search and fetch engine providers.
GPT-5.5 Price Increase: What It Actually Costs
GPT-5.5 token prices doubled on OpenRouter; actual usage cost impact is reduced due to the model's decreased verbosity.
New Audio APIs for Speech and Transcription
OpenRouter launches text-to-speech and audio transcription APIs supporting multiple providers under a unified interface.
Response Caching: Zero Cost for Identical Requests
Response Caching feature on OpenRouter caches identical requests for zero-cost retrieval with significantly faster response times.
April Release Spotlight
OpenRouter April release includes video generation, workspaces, Agent SDK, reranker models, and frontier model launches.