Qwen-Image-Edit is the editing version of the 20B Qwen-Image model. It offers precise, model-native editing, including bilingual text modification and both high-level semantic and low-level appearance changes.
Qwen3-VL is the new flagship vision-language model from the Qwen team, excelling at visual agent tasks, long-video understanding, and spatial reasoning with a native 256K context window.
Qwen3-Next is a new family of models from the Qwen team, featuring a novel architecture that activates just 3B of its 80B parameters. This delivers performance comparable to much larger models with a >10x speedup, especially on long-context tasks.
Qwen3-ASR is a new high-accuracy speech recognition model. It supports 11 languages, excels at transcribing songs with background music, and features a unique contextual biasing system that accepts any text format to improve accuracy on specific terms.
Qwen3 is the newest family of open-weight LLMs (0.6B to 235B MoE) from Alibaba. Features switchable "Thinking Mode" for reasoning vs. speed. Strong performance on code/math. Multilingual.