LogoTop AI Hubs

Cogito V2 Preview Llama 109B

Llama4
Multimodal
Paid

An instruction-tuned, hybrid-reasoning Mixture-of-Experts model built on Llama-4-Scout-17B-16E. Cogito v2 can answer directly or engage an extended “thinking” phase, with alignment guided by Iterated Distillation & Amplification (IDA). It targets coding, STEM, instruction following, and general helpfulness, with stronger multilingual, tool-calling, and reasoning performance than size-equivalent baselines. The model supports long-context use (up to 10M tokens) and standard Transformers workflows. Users can control the reasoning behaviour with the `reasoning` `enabled` boolean. [Learn more in our docs](https://openrouter.ai/docs/use-cases/reasoning-tokens#enable-reasoning-with-default-config)

Parameters

109B

Context Window

32,767

tokens

Input Price

$0.18

per 1M tokens

Output Price

$0.59

per 1M tokens

Capabilities

Model capabilities and supported modalities

Performance

Reasoning

Excellent reasoning capabilities with strong logical analysis

Math

-

Coding

Capable of generating functional code with good practices

Knowledge

-

Modalities

Input Modalities

image,text

Output Modalities

text

LLM Price Calculator

Calculate the cost of using this model

$0.000270
$0.001770
Input Cost:$0.000270
Output Cost:$0.001770
Total Cost:$0.002040
Estimated usage: 4,500 tokens

Monthly Cost Estimator

Based on different usage levels

Light Usage
$0.0077
~10 requests
Moderate Usage
$0.0770
~100 requests
Heavy Usage
$0.7700
~1000 requests
Enterprise
$7.7000
~10,000 requests
Note: Estimates based on current token count settings per request.
Last Updated: 1970/01/21