LogoTop AI Hubs

Llama Nemotron Ultra

by NVIDIA

Overview

License:
Open
Context Window:128k
Intelligence Index:20.00
Omniscience Index:−46

Pricing

Blended Price:$0.90/1M
Input Price:$0.60/1M
Output Price:$1.80/1M

Performance

Median Speed:36.00 tokens/s
First Token:55.71 s
Total Response:69.46 s
Reasoning Time:54.99 s

Visualizations

Benchmark Scores

Performance across different benchmarks

Capability Radar

Model capabilities across domains

Speed Distribution

Tokens per second across percentiles

Latency Distribution

Response time across percentiles