Analyze and classify video and image content
qwen-visionskillsetup L1★3,035
davepoon/buildwithclaude ↗What it does
Analyze video and image content with temporal understanding
Best for
Understanding motion, transitions, and temporal sequences in video when frame-by-frame analysis is insufficient.
Inputs
- · Video or image file path
- · Analysis prompt
Outputs
- · Video/image analysis text
- · Optional JSON structured output
Requires
- · Qwen Omni API (dashscope)
Preconditions
DASHSCOPE_API_KEY set; dashscope Python package installed; video/image file accessible
Failure modes
- · API key missing → clear error with setup instruction
- · File format unsupported → rejects with supported format list
- · File too large → suggests lowering FPS or trimming
Trust signals
- · Temporal attention model (sees motion, not just frames)
- · FPS parameter for cost control
- · Multi-turn context support