ReadWeb.ai: Dịch trang web đa ngôn ngữ tức thì miễn phí và xem song ngữ cho mọi người

nội dung

A visual multimodal version of the large model series, Qwen (abbr. Tongyi Qianwen), proposed by Alibaba Cloud. Qwen-VL accepts image, text, and bounding box as inputs, outputs text and bounding box.

liên kết

https://fal.ai/models

Tóm tắt

Alibaba Cloud has introduced Qwen-VL, a visual multimodal version of the large model series. Qwen-VL can process inputs such as images, text, and bounding boxes, and provides outputs in the form of text and bounding boxes.