OpenGVLab: InternVL3 78B
Other
Multimodal
Paid
The InternVL3 series is an advanced multimodal large language model (MLLM). Compared to InternVL 2.5, InternVL3 demonstrates stronger multimodal perception and reasoning capabilities. In addition, InternVL3 is benchmarked against the Qwen2.5 Chat models, whose pre-trained base models serve as the initialization for its language component. Benefiting from Native Multimodal Pre-Training, the InternVL3 series surpasses the Qwen2.5 series in overall text performance.
Parameters
78B
Context Window
32,768
tokens
Input Price
$0.03
per 1M tokens
Output Price
$0.13
per 1M tokens
Capabilities
Model capabilities and supported modalities
Performance
Reasoning
Excellent reasoning capabilities with strong logical analysis
Math
-
Coding
-
Knowledge
-
Modalities
Input Modalities
image,text
Output Modalities
text
LLM Price Calculator
Calculate the cost of using this model
$0.000045
$0.000390
Input Cost:$0.000045
Output Cost:$0.000390
Total Cost:$0.000435
Estimated usage: 4,500 tokens
Monthly Cost Estimator
Based on different usage levels
Light Usage
$0.0016
~10 requests
Moderate Usage
$0.0160
~100 requests
Heavy Usage
$0.1600
~1000 requests
Enterprise
$1.6000
~10,000 requests
Note: Estimates based on current token count settings per request.
Last Updated: 1970/01/21