Qwen Chat can now directly read and process the content of any web page when you simply paste a link into the chat.
Replies
Best
Hunter
📌
Hi everyone!
Qwen-VL is seriously impressive, especially with its multi-modal capabilities from the Qwen team and it's focused on visual understanding!
What's interesting about this:
🖼️ Image Question Answering: Describes content, classifies and labels elements like people, places, animals with incredible accuracy.
🧮 Mathematical Problem Solving: Solves math problems directly from images - perfect for education and training applications. This is a major differentiator.
📹 Video Understanding: Analyzes video content, locates specific events, gets timestamps, generates summaries of key segments.
📍 Object Localization: Locates objects and returns precise coordinates of bounding boxes or centroids. Strong performance on spatial tasks.
📄 Document Parsing: Parses image-based documents into QwenVL HTML format while preserving position information of elements like images and tables.
🔤 Multi-language OCR: Recognizes text and formulas in 11+ languages including Chinese, English, Japanese, Korean, Arabic, Vietnamese, French, German, Italian, Spanish, Russian.
I really love Qwen, it's truly a reliable partner, and its product iterations are leading the world! Especially with the addition of Video Understanding this time, it such a blessing for us who work with videos. Thank you, this is amazing! @QWQ-Max
Replies
AdFox (formerly GoodsFox)
Your names seem to be emoji 😂
YouMind
I really love Qwen, it's truly a reliable partner, and its product iterations are leading the world! Especially with the addition of Video Understanding this time, it such a blessing for us who work with videos. Thank you, this is amazing! @QWQ-Max
Triforce Todos
The math from pictures is super cool. Kids could just take a photo of a problem and learn step by step. Big help for schools.