qwen

Qwen/Qwen3-VL-2B-Instruct

Fine-tunable
text-to-text, image-to-text, video-to-text
Advanced image editing through natural language, specializing in precise text modification while preserving fonts and styles in bilingual content.
Input
Input image
Image to use as reference. Must be jpeg, png, gif, or webp.
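
As a rough illustration of the format constraint above, a client could check a file's leading bytes before uploading it. This is only a sketch: the function names `sniff_image_format` and `is_accepted_image` are hypothetical, and how the page itself validates uploads is not stated here.

```python
from typing import Optional


def sniff_image_format(data: bytes) -> Optional[str]:
    """Guess the image format from leading magic bytes.

    Returns 'jpeg', 'png', 'gif', or 'webp', or None if none match.
    (Hypothetical helper; not part of any documented API for this page.)
    """
    if data.startswith(b"\xff\xd8\xff"):
        return "jpeg"
    if data.startswith(b"\x89PNG\r\n\x1a\n"):
        return "png"
    if data.startswith((b"GIF87a", b"GIF89a")):
        return "gif"
    # WebP is a RIFF container: 'RIFF', a 4-byte size, then 'WEBP'.
    if data[:4] == b"RIFF" and data[8:12] == b"WEBP":
        return "webp"
    return None


def is_accepted_image(data: bytes) -> bool:
    """True if the bytes look like one of the four accepted formats."""
    return sniff_image_format(data) is not None
```

Checking magic bytes rather than the file extension catches renamed files, though it does not prove the rest of the file decodes correctly.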
History
Generated images and videos will be saved to a dataset for later reference.