Docsgeneric

happyhorse

Generate and edit videos with Alibaba HappyHorse 1.0 models via inference.sh CLI. Models: HappyHorse T2V, I2V, R2V, Video Edit. Capabilities: text-to-video, image-to-video, reference-to-video, video editing with natural language, character preservation, 720P/1080P, up to 15 seconds. Use for: physically realistic video, video editing, character-consistent content, product demos, social media. Triggers: happyhorse, happy horse, alibaba video, happyhorse 1.0, dashscope video, alibaba happyhorse, video editing ai, ai video editor

inference-skills/skillsinference.sh

View source

Install

npx skills add https://github.com/inference-skills/skills --skill happyhorse

Use with your agent

Install the happyhorse skill, then use it as build context. Run: npx skills add https://github.com/inference-skills/skills --skill happyhorse. Then read the installed skill.md and follow its guidance to build or refactor my project.

HappyHorse 1.0 Video Generation

Generate and edit physically realistic videos with Alibaba's HappyHorse 1.0 models via inference.sh CLI.

Quick Start

Requires inference.sh CLI (belt). Install instructions

belt login

belt app run alibaba/happyhorse-1-0-t2v --input '{"prompt": "a horse galloping across a sunlit meadow"}'

HappyHorse Models

Model	App ID	Best For
T2V	`alibaba/happyhorse-1-0-t2v`	Text-to-video, physically realistic motion
I2V	`alibaba/happyhorse-1-0-i2v`	Animate a single image
R2V	`alibaba/happyhorse-1-0-r2v`	Preserve characters from up to 9 reference images
Video Edit	`alibaba/happyhorse-1-0-video-edit`	Edit existing videos with natural language

All models support 720P/1080P resolution, up to 15 seconds duration.

Examples

Text-to-Video

belt app run alibaba/happyhorse-1-0-t2v --input '{
  "prompt": "a golden retriever running through autumn leaves in a park, slow motion",
  "duration": 10,
  "resolution": "1080P",
  "ratio": "16:9"
}'

Image-to-Video

Animate a still image:

belt app run alibaba/happyhorse-1-0-i2v --input '{
  "first_frame": "https://your-image.jpg",
  "prompt": "gentle camera zoom, clouds moving in the sky",
  "duration": 8,
  "resolution": "720P"
}'

Reference-to-Video (Character Preservation)

Generate videos that preserve characters from reference images (up to 9):

belt app run alibaba/happyhorse-1-0-r2v --input '{
  "prompt": "a woman walking through a busy market street",
  "reference_images": ["https://portrait.jpg"],
  "duration": 10,
  "resolution": "720P"
}'

Multi-Character Reference

belt app run alibaba/happyhorse-1-0-r2v --input '{
  "prompt": "two friends sitting at a cafe having coffee",
  "reference_images": ["https://person1.jpg", "https://person2.jpg"],
  "ratio": "16:9"
}'

Video Editing

Edit existing videos using natural language instructions:

belt app run alibaba/happyhorse-1-0-video-edit --input '{
  "video": "https://your-video.mp4",
  "prompt": "change the background to a snowy mountain landscape"
}'

Video Editing with Reference Images

belt app run alibaba/happyhorse-1-0-video-edit --input '{
  "video": "https://your-video.mp4",
  "prompt": "replace the person with the character from the reference image",
  "reference_images": ["https://character.jpg"]
}'

Video Editing with Audio Control

belt app run alibaba/happyhorse-1-0-video-edit --input '{
  "video": "https://your-video.mp4",
  "prompt": "make the scene look like a rainy day",
  "audio_setting": "generate"
}'

Pricing

Resolution	Price
720P	$0.14 per second
1080P	$0.24 per second

Video Edit is billed on input + output duration.

Parameters (T2V)

Parameter	Type	Default	Description
`prompt`	string	required	Text description of the video
`duration`	integer	5	Duration in seconds (3–15)
`resolution`	enum	720P	720P or 1080P
`ratio`	enum	16:9	16:9, 9:16, 1:1, 4:3, 3:4, 21:9
`seed`	integer	random	Reproducible generation
`watermark`	boolean	false	Add HappyHorse watermark

Parameters (I2V)

Parameter	Type	Default	Description
`first_frame`	file	required	First frame image (JPEG, PNG, WebP)
`prompt`	string	-	Optional text description
`duration`	integer	5	Duration in seconds (3–15)
`resolution`	enum	720P	720P or 1080P
`seed`	integer	random	Reproducible generation

Parameters (R2V)

Parameter	Type	Default	Description
`prompt`	string	required	Text description of the scene
`reference_images`	array	required	Up to 9 character reference images
`duration`	integer	5	Duration in seconds (3–15)
`resolution`	enum	720P	720P or 1080P
`ratio`	enum	16:9	16:9, 9:16, 1:1, 4:3, 3:4, 21:9
`seed`	integer	random	Reproducible generation

Parameters (Video Edit)

Parameter	Type	Default	Description
`video`	file	required	Video to edit (MP4/MOV, H.264)
`prompt`	string	required	Editing instruction
`reference_images`	array	-	Up to 5 reference images
`audio_setting`	enum	auto	auto, generate, or keep_original
`resolution`	enum	720P	720P or 1080P
`seed`	integer	random	Reproducible generation

Search HappyHorse Apps

belt app list --search "happyhorse"

Related Skills

# Full platform skill (all 250+ apps)
npx skills add inference-sh/skills@infsh-cli

# All video generation models
npx skills add inference-sh/skills@ai-video-generation

# Seedance 2.0
npx skills add inference-sh/skills@seedance

# Google Veo
npx skills add inference-sh/skills@google-veo

# Image generation (for image-to-video)
npx skills add inference-sh/skills@ai-image-generation

Browse all video apps: belt app list --category video

Documentation

Running Apps - How to run apps via CLI
Streaming Results - Real-time progress updates
Content Pipeline Example - Building media workflows