Qwen: Qwen3 VL 32B Instruct
Qwen
Multimodal
Paid
Qwen3-VL-32B-Instruct is a large-scale multimodal vision-language model designed for high-precision understanding and reasoning across text, images, and video. With 32 billion parameters, it combines deep visual perception with advanced text...
Parameters
32B
Context Window
131,072 tokens
Input Price
$0.104 per 1M tokens
Output Price
$0.416 per 1M tokens
Capabilities
Model capabilities and supported modalities
Performance
Reasoning
Excellent reasoning capabilities with strong logical analysis
Math
-
Coding
-
Knowledge
-
Modalities
Input Modalities
text, image
Output Modalities
text
LLM Price Calculator
Calculate the cost of using this model
Input Cost: $0.000156
Output Cost: $0.001248
Total Cost: $0.001404
Estimated usage: 4,500 tokens
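The calculator figures above can be reproduced from the listed per-1M-token prices. The page does not state how the 4,500 tokens are split, so the sketch below assumes 1,500 input tokens and 3,000 output tokens, a split inferred from the displayed costs. The function name `request_cost` is hypothetical and used only for illustration.

```python
from decimal import Decimal

# Listed prices in USD per 1M tokens (taken from the page).
INPUT_PRICE_PER_M = Decimal("0.104")
OUTPUT_PRICE_PER_M = Decimal("0.416")
TOKENS_PER_M = Decimal(1_000_000)


def request_cost(input_tokens: int, output_tokens: int) -> dict:
    """Return the input, output, and total cost in USD for one request."""
    input_cost = Decimal(input_tokens) * INPUT_PRICE_PER_M / TOKENS_PER_M
    output_cost = Decimal(output_tokens) * OUTPUT_PRICE_PER_M / TOKENS_PER_M
    return {
        "input": input_cost,
        "output": output_cost,
        "total": input_cost + output_cost,
    }


# Assumed split (inferred, not stated on the page): 1,500 input + 3,000 output.
costs = request_cost(1500, 3000)
print(f"Input Cost:  ${costs['input']:.6f}")
print(f"Output Cost: ${costs['output']:.6f}")
print(f"Total Cost:  ${costs['total']:.6f}")
```

Decimal arithmetic is used here so that very small per-request amounts are not distorted by binary floating-point rounding.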
Monthly Cost Estimator
Based on different usage levels
Light Usage
$0.0052
~10 requests
Moderate Usage
$0.0520
~100 requests
Heavy Usage
$0.5200
~1000 requests
Enterprise
$5.2000
~10,000 requests
Note: Estimates based on current token count settings per request.
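The tier figures scale linearly with request count, which implies a fixed cost per request. The sketch below derives that rate from the listed tiers; it comes out to $0.00052, which differs from the $0.001404 total in the calculator, so the estimator appears to use different token settings. One split that would reproduce the rate is 1,000 input and 1,000 output tokens per request, but that is an assumption, since other splits give the same figure. The helper name `per_request_costs` is hypothetical.

```python
from decimal import Decimal

# Tier figures as listed on the page: (label, requests, estimated cost in USD).
TIERS = [
    ("Light", 10, Decimal("0.0052")),
    ("Moderate", 100, Decimal("0.0520")),
    ("Heavy", 1000, Decimal("0.5200")),
    ("Enterprise", 10000, Decimal("5.2000")),
]


def per_request_costs(tiers):
    """Divide each tier's estimated cost by its request count."""
    return {label: cost / requests for label, requests, cost in tiers}


rates = per_request_costs(TIERS)
for label, rate in rates.items():
    print(f"{label}: ${rate:.5f} per request")

# Hypothetical token split (an assumption, not stated on the page):
# 1,000 input + 1,000 output tokens at the listed per-1M-token prices.
assumed_rate = (
    Decimal(1000) * Decimal("0.104") + Decimal(1000) * Decimal("0.416")
) / Decimal(1_000_000)
print(f"Assumed split: ${assumed_rate:.5f} per request")
```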
Last Updated: 2026/04/11
