Toolinggeneric

ai-video-generation

Generate AI videos with Google Veo, Seedance 2.0, HappyHorse, Wan, Grok and 40+ models via inference.sh CLI. Models: Veo 3.1, Veo 3, Seedance 2.0, HappyHorse 1.0, Wan 2.5, Grok Imagine Video, OmniHuman, Fabric, HunyuanVideo. Capabilities: text-to-video, image-to-video, reference-to-video, video editing, lipsync, avatar animation, video upscaling, foley sound. Use for: social media videos, marketing content, explainer videos, product demos, AI avatars. Triggers: video generation, ai video, text to video, image to video, veo, animate image, video from image, ai animation, video generator, generate video, t2v, i2v, ai video maker, create video with ai, runway alternative, pika alternative, sora alternative, kling alternative, seedance, happyhorse

inference-skills/skillsinference.sh
View source

Install

npx skills add https://github.com/inference-skills/skills --skill ai-video-generation

Use with your agent

ClaudeCursorOpenAIGemini

Install the ai-video-generation skill, then use it as build context. Run: npx skills add https://github.com/inference-skills/skills --skill ai-video-generation. Then read the installed skill.md and follow its guidance to build or refactor my project.

AI Video Generation

Generate videos with 40+ AI models via inference.sh CLI.

AI Video Generation

Quick Start

Requires inference.sh CLI (belt). Install instructions

belt login

# Generate a video with Veo
belt app run google/veo-3-1-fast --input '{"prompt": "drone shot flying over a forest"}'

Available Models

Text-to-Video

ModelApp IDBest For
Veo 3.1 Fastgoogle/veo-3-1-fastFast, with optional audio
Veo 3.1google/veo-3-1Best quality, frame interpolation
Veo 3google/veo-3High quality with audio
Veo 3 Fastgoogle/veo-3-fastFast with audio
Veo 2google/veo-2Realistic videos
P-Videopruna/p-videoFast, economical, with audio support
WAN-T2Vpruna/wan-t2vEconomical 480p/720p
Grok Videoxai/grok-imagine-videoxAI, configurable duration
Seedance 2 T2Vfalai/seedance-2-t2vText-to-video with sync audio
Seedance 2 R2Vfalai/seedance-2-r2vReference images/videos/audio to video
HappyHorse T2Valibaba/happyhorse-1-0-t2vPhysically realistic, up to 15s

Image-to-Video

ModelApp IDBest For
Wan 2.5falai/wan-2-5Animate any image
Wan 2.5 I2Vfalai/wan-2-5-i2vHigh quality i2v
WAN-I2Vpruna/wan-i2vEconomical 480p/720p
P-Videopruna/p-videoFast i2v with audio
Seedance 2 I2Vfalai/seedance-2-i2vAnimate images with sync audio
HappyHorse I2Valibaba/happyhorse-1-0-i2vAnimate images, up to 1080P/15s
HappyHorse R2Valibaba/happyhorse-1-0-r2vCharacter-preserving from references

Avatar / Lipsync

ModelApp IDBest For
OmniHuman 1.5bytedance/omnihuman-1-5Multi-character
OmniHuman 1.0bytedance/omnihuman-1-0Single character
Fabric 1.0falai/fabric-1-0Image talks with lipsync
PixVerse Lipsyncfalai/pixverse-lipsyncRealistic lipsync

Video Editing

ModelApp IDBest For
HappyHorse Editalibaba/happyhorse-1-0-video-editNatural language video editing

Utilities

ToolApp IDDescription
HunyuanVideo Foleyinfsh/hunyuanvideo-foleyAdd sound effects to video
Topaz Upscalerfalai/topaz-video-upscalerUpscale video quality
Media Mergerinfsh/media-mergerMerge videos with transitions

Browse All Video Apps

belt app list --category video

Examples

Text-to-Video with Veo

belt app run google/veo-3-1-fast --input '{
  "prompt": "A timelapse of a flower blooming in a garden"
}'

Grok Video

belt app run xai/grok-imagine-video --input '{
  "prompt": "Waves crashing on a beach at sunset",
  "duration": 5
}'

Image-to-Video with Wan 2.5

belt app run falai/wan-2-5 --input '{
  "image_url": "https://your-image.jpg"
}'

AI Avatar / Talking Head

belt app run bytedance/omnihuman-1-5 --input '{
  "image_url": "https://portrait.jpg",
  "audio_url": "https://speech.mp3"
}'

Fabric Lipsync

belt app run falai/fabric-1-0 --input '{
  "image_url": "https://face.jpg",
  "audio_url": "https://audio.mp3"
}'

Seedance 2.0 Text-to-Video with Audio

belt app run falai/seedance-2-t2v --input '{
  "prompt": "a jazz band performing in a dimly lit club",
  "generate_audio": true,
  "duration": 10
}'

Seedance 2.0 Reference-to-Video

belt app run falai/seedance-2-r2v --input '{
  "prompt": "A person who looks like @Image1 walking through a garden",
  "images": ["https://portrait.jpg"],
  "generate_audio": true
}'

HappyHorse Text-to-Video

belt app run alibaba/happyhorse-1-0-t2v --input '{
  "prompt": "a golden retriever running through autumn leaves, slow motion",
  "duration": 10,
  "resolution": "1080P"
}'

HappyHorse Video Editing

belt app run alibaba/happyhorse-1-0-video-edit --input '{
  "video": "https://your-video.mp4",
  "prompt": "change the background to a snowy mountain landscape"
}'

PixVerse Lipsync

belt app run falai/pixverse-lipsync --input '{
  "image_url": "https://portrait.jpg",
  "audio_url": "https://speech.mp3"
}'

Video Upscaling

belt app run falai/topaz-video-upscaler --input '{"video_url": "https://..."}'

Add Sound Effects (Foley)

belt app run infsh/hunyuanvideo-foley --input '{
  "video_url": "https://silent-video.mp4",
  "prompt": "footsteps on gravel, birds chirping"
}'

Merge Videos

belt app run infsh/media-merger --input '{
  "videos": ["https://clip1.mp4", "https://clip2.mp4"],
  "transition": "fade"
}'

Related Skills

# Full platform skill (all 250+ apps)
npx skills add inference-sh/skills@infsh-cli

# Pruna P-Video (fast & economical)
npx skills add inference-sh/skills@p-video

# Google Veo specific
npx skills add inference-sh/skills@google-veo

# Seedance 2.0
npx skills add inference-sh/skills@seedance

# HappyHorse 1.0
npx skills add inference-sh/skills@happyhorse

# AI avatars & lipsync
npx skills add inference-sh/skills@ai-avatar-video

# Text-to-speech (for video narration)
npx skills add inference-sh/skills@text-to-speech

# Image generation (for image-to-video)
npx skills add inference-sh/skills@ai-image-generation

# Twitter (post videos)
npx skills add inference-sh/skills@twitter-automation

Browse all apps: belt app list

Documentation