Hermes Agent takes #1 on OpenRouter; local AI gets faster with Lemonade

Nous Research’s Hermes Agent ships the “Tenacity” release and now leads OpenRouter by daily tokens. Lemonade’s latest update adds vLLM ROCm to speed up on-device large language models.

Find in this article

Reading Mode

One-Line Summary

Open-source agents are moving into daily workflows: Hermes Agent tops OpenRouter usage with a self-improving loop, while Lemonade improves on-device LLM performance for local apps.

New Tools

Hermes Agent v0.13 “Tenacity” climbs to #1 on OpenRouter

Hermes Agent is an open-source AI assistant that executes tasks and then learns from its own performance to create reusable skills; the project ships under an MIT license and posts the v0.13.0 “Tenacity” release dated May 7, 2026. ¹

According to Marktechpost’s read of OpenRouter’s daily app and agent rankings, Hermes holds the #1 spot with 224 billion daily tokens versus OpenClaw’s 186 billion, reflecting strong active use. The contest also reflects two philosophies: Hermes focuses on depth via a do–learn–improve loop, while OpenClaw optimizes for breadth through a central gateway across many channels. ²

The “Tenacity” release brings reliability and control features including a Kanban multi-agent task board with heartbeat and recovery, a /goal command to keep the agent locked on an objective, Checkpoints v2, auto-resume after restarts, and Google Chat as the 20th supported platform. It also ships multiple security hardening measures, including eight P0 fixes such as default redaction and stricter role allowlists. ²

For teams already on OpenClaw, Hermes offers low-friction onboarding: it detects an existing ~/.openclaw folder to import settings, memories, and skills, and provides a “hermes claw migrate” command with dry-run previews; some teams also run both frameworks in parallel via an agent communication protocol. ²

Lemonade v10.4.0 adds vLLM ROCm for faster local LLMs

Lemonade is a tool that helps you discover and run local AI apps by serving optimized large language models on your own GPUs and NPUs. ³

The v10.4.0 release integrates vLLM ROCm as an experimental backend for AMD Strix Halo and Strix Point devices on Linux and overhauls its documentation site. ³

Because Lemonade is designed to run models on your machine, this update targets better on-device performance for supported hardware while keeping workloads local, consistent with the project’s goal of serving optimized LLMs from user-owned GPUs and NPUs. ³

What This Means for You

Hermes’s rise suggests teams can get compounding value from agents that learn: reusable skills and layered memory reduce repeated setup for recurring tasks, and the tool is seeing high daily use on OpenRouter (224 billion daily tokens). Start where processes are stable and measured, like weekly status collation or content refreshes. ²

The “Tenacity” feature set points to practical control for non-developers: Kanban to visualize tasks, /goal to keep the agent on target across turns, and checkpointing for safe progress. That makes it easier to pilot one or two real workflows without rebuilding each week. ²

Security deserves first-class attention. Recent CVE clusters around agent gateways show that misconfigurations and exposed instances can become attack surfaces; run pilots in a test workspace, scope connector permissions narrowly, and keep audit trails. ²

If your content or research can stay on-device, Lemonade’s local serving model provides a path to try offline or hybrid setups on supported hardware, aligning with its mission to run optimized LLMs on your own GPUs and NPUs. ³

Action Items

Pilot Hermes with one repetitive workflow: Follow the project’s Quickstart to launch the local dashboard, connect a single channel you already use, then try the /goal command and Kanban board on a weekly report or status-collection task.
Draft a “skill” from a routine: Write 5–7 bullet steps for a task you repeat, feed them to Hermes as a template, and see if the agent saves it as a reusable skill you can invoke next time.
Preview an OpenClaw migration: If you currently use OpenClaw, run “hermes claw migrate --dry-run” to see what config, memories, and keys it would import before you decide to switch.
Test local LLMs with Lemonade: Install the latest Lemonade release, load a small open model, and time a PDF summary or email draft offline to compare speed and responsiveness to your cloud setup.
Create a sandbox for agent pilots: Spin up a separate workspace or test account, limit connector scopes (e.g., read-only where possible), and document exactly what the agent can access before inviting teammates.

Sources 3

[1] Github NousResearch/hermes-agent: The agent that grows with you [2] Marktechpost OpenClaw vs Hermes Agent: Why Nous Research’s Self-Improving Agent Now Leads OpenRouter’s Global Rankings [3] Github lemonade-sdk/lemonade: Lemonade helps users discover and run local AI apps by serving optimized LLMs right from their own GPUs and NPUs

Helpful?

0to1log Weekly

Latest AI News