cyberneticlibrary

Analyze video and image with Qwen Omni

give-claude-eyespluginsetup L23,035
davepoon/buildwithclaude
What it does

Analyze videos and images via Qwen Omni API

Best for

Developers who want to add native video/image understanding to Claude Code workflows without calling external vision APIs directly (Qwen abstracts the complexity).

Inputs
  • · video file (MP4, WebM, etc) or image file (PNG, JPG, etc)
  • · task (classify, extract motion, describe content, etc)
Outputs
  • · classification labels
  • · motion data (if applicable)
  • · text description of video/image content
  • · frame-level annotations (if applicable)
Requires
  • · Qwen Omni API (Alibaba)
Preconditions
  • · Qwen Omni API key configured
  • · video/image file accessible locally or via URL
  • · file format supported by Qwen
Failure modes
  • · API key invalid or quota exceeded — requests rejected
  • · video too long or high resolution — upload/processing timeouts
  • · unsupported format — API returns error
  • · motion extraction on static image — returns null
Trust signals
  • · Qwen Omni (multimodal provider)
  • · video + image support