Nous Portal

Loading...

Loading...

Models

Embeddings

Tool Pricing

Nous Products

Hermes Agent

Hermes Agent

An intelligent AI assistant by Nous Research. Helpful, knowledgeable, and direct — for coding, research, creative work, and more.

Visit Hermes Agent →

Nous Chat

Nous Chat

Chat with state-of-the-art open models from Nous Research. Free tier available with your account.

Visit Nous Chat →

Hermes 4

The Hermes 4 series of models are not recommended for use in Hermes Agent.
For Hermes Agent, configure an agentic model.

Hermes 4 is a frontier, hybrid-mode reasoning model. It extends Hermes 3 with stronger math and science reasoning, better instruction following and schema-adherent outputs, and more nuanced roleplay and writing.

Hybrid thinking models require the following system prompt to activate reasoning mode:

You are a deep thinking AI, you may use extremely long chains of thought to deeply consider the problem and deliberate with yourself via systematic reasoning processes to help come to a correct solution prior to answering. You should enclose your thoughts and internal monologue inside <think> </think> tags, and then provide your solution or response to the problem.

Hermes-4-70B

Hermes-4-70B

Available to all API users

Was: $0.70/1M tokens

$0.05/1M prompt tokens

$0.20/1M completion tokens

Try for free on Nous Chat.

128k tokens ctx

Thinking:

hybrid

This incarnation of Hermes 4 balances scale and size. It handles complex reasoning tasks, while staying fast and cost effective. A versatile choice for many use cases.

This model is based on Llama-3.1-70B.

Hermes-4-405B

Hermes-4-405B

Available to all API users

Was: $1.50/1M tokens

$0.09/1M prompt tokens

$0.37/1M completion tokens

Try for free on Nous Chat.

128k tokens ctx

Thinking:

hybrid

This is the largest model in the Hermes 4 family, and it is the fullest expression of our design, focused on advanced reasoning and creative depth rather than optimizing inference speed or cost.

This model is based on Llama-3.1-405B.

Unsupported Models

Hermes-4.3-36B

Hermes-4.3-36B

Currently unavailable via the API

You can use this model locally, but if you're using our hosted API we recommend migrating to Hermes-4-70B.

N/A

128k tokens ctx

Thinking:

hybrid

Hermes 4.3 is optimised for local deployment. You can dowload it from Hugging Face for local use.

Models

Embeddings

Tool Pricing

Nous Products

Hermes Agent

Hermes Agent

An intelligent AI assistant by Nous Research. Helpful, knowledgeable, and direct — for coding, research, creative work, and more.

Visit Hermes Agent →

Nous Chat

Nous Chat

Chat with state-of-the-art open models from Nous Research. Free tier available with your account.

Visit Nous Chat →

Hermes 4

The Hermes 4 series of models are not recommended for use in Hermes Agent.
For Hermes Agent, configure an agentic model.

Hermes 4 is a frontier, hybrid-mode reasoning model. It extends Hermes 3 with stronger math and science reasoning, better instruction following and schema-adherent outputs, and more nuanced roleplay and writing.

Hybrid thinking models require the following system prompt to activate reasoning mode:

You are a deep thinking AI, you may use extremely long chains of thought to deeply consider the problem and deliberate with yourself via systematic reasoning processes to help come to a correct solution prior to answering. You should enclose your thoughts and internal monologue inside <think> </think> tags, and then provide your solution or response to the problem.

Hermes-4-70B

Hermes-4-70B

Available to all API users

Was: $0.70/1M tokens

$0.05/1M prompt tokens

$0.20/1M completion tokens

Try for free on Nous Chat.

128k tokens ctx

Thinking:

hybrid

This incarnation of Hermes 4 balances scale and size. It handles complex reasoning tasks, while staying fast and cost effective. A versatile choice for many use cases.

This model is based on Llama-3.1-70B.

Hermes-4-405B

Hermes-4-405B

Available to all API users

Was: $1.50/1M tokens

$0.09/1M prompt tokens

$0.37/1M completion tokens

Try for free on Nous Chat.

128k tokens ctx

Thinking:

hybrid

This is the largest model in the Hermes 4 family, and it is the fullest expression of our design, focused on advanced reasoning and creative depth rather than optimizing inference speed or cost.

This model is based on Llama-3.1-405B.

Unsupported Models

Hermes-4.3-36B

Hermes-4.3-36B

Currently unavailable via the API

You can use this model locally, but if you're using our hosted API we recommend migrating to Hermes-4-70B.

N/A

128k tokens ctx

Thinking:

hybrid

Hermes 4.3 is optimised for local deployment. You can dowload it from Hugging Face for local use.

browser-use

Browser Use session minutes

$0.0011 / minute

Browser Use proxy bandwidth MB

$4.20 / GB

fal

FAL billable unit - clarity upscaler

$0.0315 / billable unit

FAL billable unit - Flux 2 Klein 9b

$0.0116 / billable unit

FAL billable unit - flux 2 pro

$0.0315 / billable unit

FAL billable unit - gpt-image-2 medium quality

$1.05 / billable unit

FAL billable unit - gpt-image-2 medium quality

$1.05 / billable unit

FAL billable unit - ideogram v3

$0.0315 / billable unit

FAL billable unit - Nano Banana Pro

$0.1575 / billable unit

FAL billable unit - qwen image

$0.021 / billable unit

FAL billable unit - recraft v4

$0.2625 / billable unit

FAL billable unit - Z Image Turbo

$0.0053 / billable unit

fal.queue.alibaba.happy-horse.image-to-video

$0.147 / billable unit

fal.queue.alibaba.happy-horse.text-to-video

$0.147 / billable unit

fal.queue.bytedance.seedance-2\.0.image-to-video

$0.0147 / billable unit

fal.queue.bytedance.seedance-2\.0.text-to-video

$0.0147 / billable unit

fal.queue.fal-ai.gpt-image-1\.5

$1.05 / billable unit

fal.queue.fal-ai.kling-video.v3.4k.image-to-video

$0.441 / billable unit

fal.queue.fal-ai.kling-video.v3.4k.text-to-video

$0.441 / billable unit

fal.queue.fal-ai.ltx-2\.3-22b.image-to-video

$0.0017 / billable unit

fal.queue.fal-ai.ltx-2\.3-22b.text-to-video

$0.0017 / billable unit

fal.queue.fal-ai.pixverse.v6.image-to-video

$0.0053 / billable unit

fal.queue.fal-ai.pixverse.v6.text-to-video

$0.0053 / billable unit

fal.queue.fal-ai.veo3\.1

$0.42 / billable unit

fal.queue.fal-ai.veo3\.1.image-to-video

$0.42 / billable unit

firecrawl

Firecrawl credits

$0.0005 / credit

krea

krea.image.krea-2-large.base

$0.06 / generation

krea.image.krea-2-large.style-references

$0.065 / generation

krea.image.krea-2-medium-turbo.base

$0.015 / generation

krea.image.krea-2-medium-turbo.style-references

$0.0175 / generation

krea.image.krea-2-medium.base

$0.03 / generation

krea.image.krea-2-medium.style-references

$0.035 / generation

modal

Modal sandbox CPU seconds

$0.0495 / CPU-hour

Modal sandbox memory GiB seconds

$0.0084 / GiB-hour

openai-audio

GPT-4o mini transcribe input tokens

$1.31 / 1M input tokens

Hermes Agent GPT-4o transcribe input tokens

$2.62 / 1M input tokens

OpenAI audio speech input tokens

$0.63 / 1M input tokens

GPT-4o mini transcribe output tokens

$5.25 / 1M output tokens

GPT-4o transcribe output tokens

$10.50 / 1M output tokens

OpenAI audio speech output tokens

$12.60 / 1M output tokens

Whisper transcription audio seconds

$0.0063 / minute of audio

browser-use

Browser Use session minutes

$0.0011 / minute

Browser Use proxy bandwidth MB

$4.20 / GB

fal

FAL billable unit - clarity upscaler

$0.0315 / billable unit

FAL billable unit - Flux 2 Klein 9b

$0.0116 / billable unit

FAL billable unit - flux 2 pro

$0.0315 / billable unit

FAL billable unit - gpt-image-2 medium quality

$1.05 / billable unit

FAL billable unit - gpt-image-2 medium quality

$1.05 / billable unit

FAL billable unit - ideogram v3

$0.0315 / billable unit

FAL billable unit - Nano Banana Pro

$0.1575 / billable unit

FAL billable unit - qwen image

$0.021 / billable unit

FAL billable unit - recraft v4

$0.2625 / billable unit

FAL billable unit - Z Image Turbo

$0.0053 / billable unit

fal.queue.alibaba.happy-horse.image-to-video

$0.147 / billable unit

fal.queue.alibaba.happy-horse.text-to-video

$0.147 / billable unit

fal.queue.bytedance.seedance-2\.0.image-to-video

$0.0147 / billable unit

fal.queue.bytedance.seedance-2\.0.text-to-video

$0.0147 / billable unit

fal.queue.fal-ai.gpt-image-1\.5

$1.05 / billable unit

fal.queue.fal-ai.kling-video.v3.4k.image-to-video

$0.441 / billable unit

fal.queue.fal-ai.kling-video.v3.4k.text-to-video

$0.441 / billable unit

fal.queue.fal-ai.ltx-2\.3-22b.image-to-video

$0.0017 / billable unit

fal.queue.fal-ai.ltx-2\.3-22b.text-to-video

$0.0017 / billable unit

fal.queue.fal-ai.pixverse.v6.image-to-video

$0.0053 / billable unit

fal.queue.fal-ai.pixverse.v6.text-to-video

$0.0053 / billable unit

fal.queue.fal-ai.veo3\.1

$0.42 / billable unit

fal.queue.fal-ai.veo3\.1.image-to-video

$0.42 / billable unit

firecrawl

Firecrawl credits

$0.0005 / credit

krea

krea.image.krea-2-large.base

$0.06 / generation

krea.image.krea-2-large.style-references

$0.065 / generation

krea.image.krea-2-medium-turbo.base

$0.015 / generation

krea.image.krea-2-medium-turbo.style-references

$0.0175 / generation

krea.image.krea-2-medium.base

$0.03 / generation

krea.image.krea-2-medium.style-references

$0.035 / generation

modal

Modal sandbox CPU seconds

$0.0495 / CPU-hour

Modal sandbox memory GiB seconds

$0.0084 / GiB-hour

openai-audio

GPT-4o mini transcribe input tokens

$1.31 / 1M input tokens

Hermes Agent GPT-4o transcribe input tokens

$2.62 / 1M input tokens

OpenAI audio speech input tokens

$0.63 / 1M input tokens

GPT-4o mini transcribe output tokens

$5.25 / 1M output tokens

GPT-4o transcribe output tokens

$10.50 / 1M output tokens

OpenAI audio speech output tokens

$12.60 / 1M output tokens

Whisper transcription audio seconds

$0.0063 / minute of audio