AI Video Editor Pipeline with Vision LLM Models
-
Updated
Apr 11, 2026 - Python
AI Video Editor Pipeline with Vision LLM Models
Claude Code skill — 给无视觉能力的 LLM(DeepSeek/o1/o3)外挂看图能力。Vision proxy for text-only LLMs, supports OpenAI/Anthropic/DashScope/Qwen3-VL.
Alt'Ollama: The app for generating alt-text for image/s using LLMs with image processing support.
Add a description, image, and links to the llm-vision topic page so that developers can more easily learn about it.
To associate your repository with the llm-vision topic, visit your repo's landing page and select "manage topics."