cyberneticlibrary

Analyze and classify video and image content

qwen-visionskillsetup L13,035
davepoon/buildwithclaude
What it does

Analyze video and image content with temporal understanding

Best for

Understanding motion, transitions, and temporal sequences in video when frame-by-frame analysis is insufficient.

Inputs
  • · Video or image file path
  • · Analysis prompt
Outputs
  • · Video/image analysis text
  • · Optional JSON structured output
Requires
  • · Qwen Omni API (dashscope)
Preconditions

DASHSCOPE_API_KEY set; dashscope Python package installed; video/image file accessible

Failure modes
  • · API key missing → clear error with setup instruction
  • · File format unsupported → rejects with supported format list
  • · File too large → suggests lowering FPS or trimming
Trust signals
  • · Temporal attention model (sees motion, not just frames)
  • · FPS parameter for cost control
  • · Multi-turn context support