Qwen3.5-9B is a multimodal foundation model from the Qwen3.5 family, designed to deliver strong reasoning, coding, and visual understanding in an efficient 9B-parameter architecture. It uses a unified vision-language design with early fusion of multimodal tokens, allowing the model to process and reason across text and images within the same context.
by Qwen | 256K context | $0.05/M input tokens | $0.15/M output tokens
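The per-million-token prices above can be turned into a per-request estimate with simple arithmetic. The sketch below is only an illustration: the helper name `estimate_cost_usd` and the example token counts are hypothetical, and it assumes the listed prices apply uniformly to every token, which individual providers may not guarantee.

```python
# Prices copied from the listing above, in USD per one million tokens.
INPUT_USD_PER_M = 0.05
OUTPUT_USD_PER_M = 0.15
TOKENS_PER_M = 1_000_000


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request (hypothetical helper, not a provider API)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = input_tokens / TOKENS_PER_M * INPUT_USD_PER_M
    output_cost = output_tokens / TOKENS_PER_M * OUTPUT_USD_PER_M
    return input_cost + output_cost


if __name__ == "__main__":
    # Example: a 200,000-token prompt with a 10,000-token reply.
    print(f"${estimate_cost_usd(200_000, 10_000):.4f}")
```

With these assumed token counts, the input side contributes about $0.0100 and the output side about $0.0015, so the request comes to roughly $0.0115.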
Endpoints
Available providers for this model, with details on pricing, context limits, and real-time health metrics.