Skip to main content
Back to registry

Video and Media

Video and media skills support audio, video, image, and presentation workflows used in content production. Expect tools for generating, editing, packaging, or automating media assets that sit outside the typical code-and-infrastructure toolchain.

166 skills

Skills in this category

ocr-super-surya

by aktsmm/agent-skills

216

Source details, install context, and public review data are available on the full page.

Video and MediaFirst seen Jan 23, 2026

video-transcript-downloader

by steipete/agent-scripts

214

Source details, install context, and public review data are available on the full page.

Video and MediaFirst seen Jan 22, 2026

pixi-js

by mindrally/skills

211

Source details, install context, and public review data are available on the full page.

Video and MediaFirst seen Jan 24, 2026

xhs-publisher

by byheaven/byheaven-skills

210

Source details, install context, and public review data are available on the full page.

Video and MediaFirst seen Jan 22, 2026

mgrep-code-search

by intellectronica/agent-skills

204

mgrep is a semantic search tool that enables natural language queries across code, text, PDFs, and images. It is particularly effective for exploring larger or complex codebases where traditional pattern matching falls short.

Video and MediaFirst seen Jan 19, 2026

reading-invoice

by kazukinagata/shinkoku

204

Source details, install context, and public review data are available on the full page.

Video and MediaFirst seen Feb 21, 2026

r3f-loaders

by enzed/r3f-skills

203

The recommended way to load GLTF/GLB models.

Video and MediaFirst seen Jan 19, 2026

ralph-plan

by mastra-ai/mastra

202

Source details, install context, and public review data are available on the full page.

Video and MediaFirst seen Jan 21, 2026

reading-deduction-cert

by kazukinagata/shinkoku

199

控除証明書(生命保険料控除証明書、地震保険料控除証明書、社会保険料控除証明書等)の画像を読み取り、構造化データとして返すスキル。

Video and MediaFirst seen Feb 21, 2026

reading-receipt

by kazukinagata/shinkoku

199

レシート・領収書・ふるさと納税受領証明書の画像を読み取り、構造化データとして返すスキル。

Video and MediaFirst seen Feb 21, 2026

reading-payment-statement

by kazukinagata/shinkoku

197

支払調書(報酬、料金、契約金及び賞金の支払調書)の画像を読み取り、構造化データとして返すスキル。

Video and MediaFirst seen Feb 21, 2026

framer-motion

by mindrally/skills

188

Source details, install context, and public review data are available on the full page.

Video and MediaFirst seen Jan 24, 2026

openai-whisper-api

by steipete/clawdis

186

Transcribe an audio file via OpenAI’s /v1/audio/transcriptions endpoint.

Video and MediaFirst seen Jan 23, 2026

pw-redbook-image

by plugins-world/pw-skills

186

Source details, install context, and public review data are available on the full page.

Video and MediaFirst seen Jan 20, 2026

seedance-prompter

by pexoai/pexo-skills

186

This skill transforms a user's scattered multimodal assets (images, videos, audio) and ambiguous creative intent into a structured, executable prompt for the Seedance 2.0 video generation model. It acts as an expert prompt engineer, ensuring the highest quality output from the underlying model.

Video and Media

openai-image-gen

by steipete/clawdis

185

Generate a handful of “random but structured” prompts and render them via the OpenAI Images API.

Video and MediaFirst seen Jan 23, 2026

source-management

by anthropics/knowledge-work-plugins

180

If you see unfamiliar placeholders or need to check which tools are connected, see CONNECTORS.md .

Video and MediaFirst seen Jan 30, 2026

reading-withholding

by kazukinagata/shinkoku

179

Source details, install context, and public review data are available on the full page.

Video and MediaFirst seen Feb 21, 2026

gpt-image-1-5

by intellectronica/agent-skills

177

Generate new images or edit existing ones using OpenAI's GPT Image 1.5 model.

Video and MediaFirst seen Jan 19, 2026

generating-sounds-with-ai

by raphaelsalaja/userinterface-wiki

172

Review Web Audio API code for sound synthesis best practices.

Video and MediaFirst seen Jan 27, 2026

seedance

by songguoxs/seedance-prompt-skill

172

你是一个专业的 AI 视频提示词工程师,专门为字节跳动即梦平台的 Seedance 2.0 视频生成模型编写高质量的中文提示词。

Video and MediaFirst seen Feb 12, 2026

discord

by steipete/clawdis

171

Use the message tool. No provider-specific discord tool exposed to the agent.

Video and MediaFirst seen Feb 13, 2026

nanobanana-ppt-skills

by sickn33/antigravity-awesome-skills

171

AI-powered PPT generation with document analysis and styled images

Video and MediaFirst seen Jan 29, 2026

sag

by steipete/clawdis

167

Use sag for ElevenLabs TTS with local playback.

Video and MediaFirst seen Jan 23, 2026
Page 4 of 7