英伟达dam实测

简要记录英伟达 DAM 模型的本地测试结果,以及输入图片后的描述效果。

AI

英伟达最新发布的Describe Anything: Detailed Localized Image and Video Captioning(DAM),展示文案上效果非常好,下载了一下模型进行测试,效果确实不错

输入图片(要求描述整个图片)

输出结果

The sky is a clear, vibrant blue with a few scattered, fluffy white clouds. The clouds are primarily concentrated towards the left side of the image, with one larger cloud near the top left corner and smaller clouds dispersed around it. The right side of the sky is mostly clear with a few smaller clouds.
# ai # python # vision # captioning