short-video-production

Use when creating video content for Xiaohongshu, filming vlogs or tutorials, editing short-form vertical video, or optimizing video for mobile viewing

INSTALLATION
npx skills add https://github.com/vivy-yi/xiaohongshu-skills --skill short-video-production
Run in your project or agent environment. Adjust flags if your CLI version differs.

SKILL.md

$27

Use when:

  • Creating vertical video content for Xiaohongshu (primary format)
  • Filming vlogs, tutorials, reviews, or lifestyle content
  • Editing short-form vertical video (under 60 seconds)
  • Optimizing video for mobile-first consumption
  • Repurposing longer content into short clips
  • Creating product showcases or demonstrations
  • Building personal brand through video storytelling
  • Producing behind-the-scenes or day-in-the-life content
  • Making educational or how-to video content

Do NOT use when:

  • Creating static image posts (use photo-editing or graphic-layout instead)
  • Editing feature-length video or documentary content (use professional video editing software)
  • Production requires cinematic quality beyond smartphone capability
  • Content is fundamentally audio-first and doesn't benefit from visuals (podcast-style content)
  • Creating live streaming content (different production considerations)
  • Platform requires horizontal format (certain cinematic content for other platforms)

Core Pattern

Before (unoptimized, boring video):

❌ "3+ minutes long, viewers scroll away"

❌ "Horizontal format, black bars on mobile"

❌ "Poor audio, can't hear clearly"

❌ "No clear structure, rambles"

❌ "Boring 15-second intro, lost viewers"

❌ "Weak or missing call-to-action"

❌ "Raw footage, no editing"

After (engaging, optimized short video):

✅ "15-60 seconds, respects viewer time"

✅ "Vertical 9:16 format, mobile-native"

✅ "Clear audio, professional quality"

✅ "Hook-value-CTA structure, purposeful"

✅ "Captivating first 3 seconds, immediate hook"

✅ "Clear CTA, tells viewer what to do"

✅ "Polished editing, dynamic pacing"

6 Essential Short Video Elements:

Element

Purpose

Impact

Priority

Hook (0-3s)

Grab attention immediately

Critical

#1 - determines 80% of success

Value Delivery (3-45s)

Deliver on hook's promise

Critical

#2 - must earn continued watching

Pacing

Maintain engagement through speed

High

#3 - dynamic, fast cuts

Visual Quality

Professional, credible appearance

High

#4 - lighting, framing, stability

Audio Quality

Clear, pleasant sound experience

High

#5 - poor audio = video failure

Call-to-Action

Guide viewer to next step

Medium-High

#6 - engagement and conversion

Quick Reference

Video Type Comparison for Xiaohongshu:

Video Type

Optimal Length

Structure

Production

Engagement

Best For

Tutorial (教程)

45-60s

Hook → Problem → Steps → Result

Medium

Very High

Educational value

Review (测评)

30-45s

Hook → First impressions → Features → Verdict

Medium

High

Product content

Vlog (日常)

30-60s

Scene → Action → Reaction → Reflection

Low-Medium

Medium

Authenticity, lifestyle

Tips List (干货)

15-30s

Hook → Tip 1, 2, 3 → CTA

Low

Very High

Quick value delivery

Before-After (对比)

15-30s

Before → Process → After reveal

Low-Medium

Very High

Transformations

Storytelling (故事)

45-60s

Hook → Conflict → Journey → Resolution

High

High

Emotional connection

Challenge (挑战)

30-45s

Intro → Attempt → Result → Reaction

Medium

High

Entertainment, relatability

Quick Tip (秒懂)

7-15s

Hook → One tip → CTA

Very Low

High

Ultra-brief value

Video Format Specifications:

Specification

Recommended

Acceptable

Avoid

Aspect Ratio

9:16 (vertical)

1:1 (square)

16:9 (horizontal)

Resolution

1080×1920 (Full HD)

720×1280 (HD)

Below 720p

Frame Rate

30fps

24fps, 60fps

Below 24fps

File Size

Under 500MB

Under 1GB

Over 1GB

Length

15-60s

Up to 3 min (rare)

Over 3 min

Format

MP4, MOV

-

AVI, MKV (unsupported)

Equipment by Production Quality:

Quality Level

Camera

Audio

Lighting

Stabilization

Budget

Basic

Smartphone (built-in)

Phone mic

Natural light

Handheld careful

¥0

Improved

Smartphone + app

Lavalier mic (¥50)

Ring light (¥100)

Mini tripod (¥50)

¥200

Advanced

Smartphone + lens kit

Wireless mic (¥300)

Softbox + ring light

Gimbal stabilizer (¥500)

¥1,000

Professional

Mirrorless camera

Shotgun mic + recorder

3-point lighting

Professional rig

¥10,000+

Implementation

Step 1: Plan Video Structure and Concept

Great videos start with intentional planning, not random filming.

Define Your Video Type (from Quick Reference table above):

  • Tutorial: Teaching a skill or process
  • Review: Evaluating a product or service
  • Vlog: Documenting daily life or experience
  • Tips List: Delivering multiple quick tips
  • Before-After: Showing transformation
  • Storytelling: Narrative arc with emotion
  • Challenge: Attempting something difficult
  • Quick Tip: Ultra-brief single-value delivery

Structure Formula by Length:

15-Second Videos (ultra-fast):

  • 0-2s: Hook (grab attention immediately)
  • 2-13s: Single value point (one tip, one reveal, one moment)
  • 13-15s: CTA (follow, like, save)

30-Second Videos (standard):

  • 0-3s: Hook (setup, promise, curiosity)
  • 3-25s: Main content (2-3 key points, examples)
  • 25-30s: CTA + preview (what's next, why follow)

60-Second Videos (maximum):

  • 0-3s: Hook (strongest opener)
  • 3-50s: Main content (3-5 points, demonstration, examples)
  • 50-55s: Summary or key takeaway
  • 55-60s: CTA (specific action)

Pre-Filming Checklist:

  • Video type and structure defined
  • Hook concept finalized (first 3 seconds scripted)
  • Key points outlined (3-5 max)
  • Location and lighting scouted
  • Props and materials ready
  • Battery charged, storage available
  • Practice run-through (optional but recommended)

Step 2: Film High-Quality Vertical Footage

Good footage makes editing easy; bad footage can't be fully fixed.

Camera Settings for Smartphone:

Setting

Recommended

Why

Resolution

4K (downscale to 1080p in edit)

Future-proofing, editing flexibility

Frame Rate

30fps

Standard for social media, smooth motion

Grid

On (rule of thirds)

Composition guide

HDR

Off (can cause editing issues)

Consistent exposure

Stabilization

On (but use gimbal when possible)

Reduce shake

Framing and Composition:

Vertical Framing Rules:

  • Subject placement: Center or rule-of-thirds intersection
  • Headroom: Leave 10-15% space above head (not too much, not too little)
  • Eye level: Camera at subject's eye level (most flattering)
  • Close-ups: Chest up for talking head, closer for detail shots
  • Variety: Mix close-ups, medium shots, and wide shots for dynamic editing

Lighting Fundamentals:

Lighting Type

Quality

Best For

Setup

Natural (window)

Soft, flattering

Talking head, beauty

Face near window, not direct sun

Ring light

Even, circular catchlights

Product, tutorials

Ring light centered on face/product

Softbox

Professional, diffused

Any content

Softbox 45° angle to subject

Natural (outdoor)

Varied by weather

Vlog, lifestyle

Golden hour (sunrise/sunset) best

Lighting Setup Tips:

  • Avoid: Backlighting (subject becomes silhouette)
  • Avoid: Harsh overhead light (unflattering shadows under eyes)
  • Preferred: Light source in front of subject (illuminates face)
  • Test: Record 5-second test, check lighting before full shoot

Audio Recording Best Practices:

Microphone Selection:

  • Built-in phone mic: Acceptable for quiet indoor settings, worst choice
  • Lavalier (clip-on): Best for voice clarity, consistent distance (¥50-300)
  • Wireless mic: Freedom of movement, professional quality (¥300-800)
  • Shotgun mic: Directional, outdoor use, requires operator (¥500+)

Audio Setup:

  • Distance: 15-30cm from mic to mouth (too close = distortion, too far = echo)
  • Environment: Quietest room available, close windows, turn off fans/AC
  • Monitoring: Use headphones to monitor audio live (catch issues immediately)
  • Test record: Always do 10-second test, check audio quality

Stabilization Techniques:

Method

Effectiveness

Cost

Best For

Handheld careful

Low

Free

Stationary shots

Two-handed grip

Low-Medium

Free

Brief shots

Elbow lock (body as tripod)

Medium

Free

Medium-length shots

Mini tripod

Medium

¥30-50

Stationary, tutorials

Gimbal stabilizer

Very High

¥400-800

Moving shots, vlogs

Fixed tripod

Very High

¥100-300

Talking head, tutorials

Step 3: Edit for Maximum Engagement

Editing transforms raw footage into compelling content.

Editing Workflow (using JianYing/CapCut/VietCut):

1. Import and Organize (5 minutes):

  • Import footage to editing app
  • Create bins/folders for organized clips
  • Watch all footage, mark best takes

2. Rough Cut (10-15 minutes):

  • Arrange clips in planned structure order
  • Trim to approximate target length (20-30% over target)
  • Remove mistakes, pauses, "ums" and "uhs"
  • Focus on narrative flow, not perfection yet

3. Fine Cut (15-20 minutes):

  • Tighten pacing: remove unnecessary frames
  • Target exact length (15, 30, or 60 seconds)
  • Ensure each shot earns its place (advances story or value)
  • Cut on action (movement masks cuts)

4. Add Visual Elements (10-15 minutes):

  • Text overlays: Key points, product names, tips
  • Captions: Auto-generate, manually edit for accuracy
  • Transitions: Keep subtle (0.3-0.5s), avoid cheesy effects
  • Speed changes: Speed up boring parts (1.5-2x), slow down reveals (0.5-0.75x)

5. Audio Enhancement (5-10 minutes):

  • Voiceover: Record clear narration if needed
  • Music: Add background music (10-20% volume, not competing with voice)
  • Sound effects: Subtle whooshes, pops for emphasis (use sparingly)
  • Volume normalization: Ensure consistent volume throughout

6. Export and Quality Check (5 minutes):

  • Export settings: 1080p, 30fps, high quality
  • Format: MP4 for Xiaohongshu compatibility
  • File size: Under 500MB for fast upload
  • Test playback: Watch on phone before publishing

Editing Best Practices:

Technique

When to Use

Effect

Jump cuts

Remove pauses, maintain energy

Dynamic, fast-paced

B-roll overlay

Cover cuts, add visual interest

Professional, engaging

Text animation

Emphasize key points

Attention-grabbing

Speed ramping

Build anticipation to reveal

Dramatic effect

Zoom in/out

Focus on detail, show context

Dynamic movement

Step 4: Optimize the First 3 Seconds (Critical)

The hook determines 80% of your video's success. Invest disproportionate time here.

Hook Formula Types:

Formula

Example

Why It Works

Curiosity gap

"这个产品真的让我惊呆了..."

Creates mystery, must watch to learn

Specific promise

"5个学生党省钱技巧"

Clear value, sets expectations

Problem-solution

"别再犯这个错误了"

Identifies pain, promises solution

Counter-intuitive

" unpopular opinion: [contrarian take]"

Challenges assumptions, provokes

Immediate result

[Show before-after instantly]

Visual proof, instant gratification

Question hook

"你知道为什么...吗?"

Engages curiosity, interactive

Bold statement

"这是我用过最好的..."

Strong opinion, controversial

Visual Hook Techniques:

1. Face-to-Camera:

  • Direct eye contact (creates connection)
  • Genuine expression (smile, surprise, concern)
  • Close-up framing (chest up)

2. Dynamic Movement:

  • Quick camera move (zoom, pan, reveal)
  • Subject movement (walk toward camera, gesture)
  • Transition effect (whip pan, zoom transition)

3. Text Teaser:

  • Bold text overlay: "Wait for the end!"
  • Split screen: "Before" vs "After"
  • Countdown: "3 things that will change your..."

4. Action Start:

  • Start mid-action (not "hi everyone, welcome back")
  • Immediate demonstration (show, don't tell)
  • In media res (in the middle of action)

Hook Testing:

  • Record 3 different hook variations
  • Show to friends/audience: "Which makes you want to keep watching?"
  • Use analytics: Which hook has highest 3-second retention?

Step 5: Add Clear Call-to-Action

Don't leave viewers guessing—tell them exactly what to do.

CTA Types and Examples:

CTA Type

Example

Best For

Follow

"关注我看更多 [topic] 内容"

Building audience

Like/Save

"点赞收藏,下次容易找到"

Algorithm boost, utility

Comment

"评论区告诉我你的看法"

Engagement boost

Share

"转发给需要的朋友"

Viral potential

Link in bio

"链接在主页"

Traffic/Conversion

Challenge

"试试这个,然后告诉我结果"

Community, UGC

Series CTA

"这是Part 1, Part 2明天见"

Retention, series viewership

CTA Placement Strategies:

1. End CTA (standard):

  • Last 3-5 seconds of video
  • Clear, specific, singular action (not 3 CTAs at once)
  • Example: "关注我,明天分享Part 2"

2. Mid-roll CTA (for longer videos):

  • At 50% mark (e.g., 25 seconds into 50-second video)
  • Tease remaining content: "最重要的技巧在最后..."
  • Returns viewer attention for second half

3. Text CTA (always visible):

  • Persistent text overlay during video
  • Example: "关注看更多" in corner throughout
  • Reinforces verbal CTA

4. Comment CTA (engagement):

  • Ask specific question: "你更喜欢A还是B? 评论区告诉我"
  • Pin your response to top comment
  • Builds community and algorithm signals

Step 6: Optimize for Xiaohongshu Platform

Platform-specific optimization maximizes reach and engagement.

Xiaohongshu Algorithm Signals:

Signal

Weight

How to Optimize

Completion rate

Very High

Keep videos 15-60s, engaging throughout

Watch time

Very High

Deliver value early, don't bury lead

Engagement (likes, comments, saves, shares)

High

Include CTAs, ask questions

Re-watches

Medium

Create loops, make worth rewatching

Session time

Medium

Create binge-worthy content series

Platform-Specific Best Practices:

Title and Description:

  • Title: 10-20 characters, compelling, descriptive
  • First line: Most important, visible in feed
  • Hashtags: 3-5 relevant tags (not spammy)
  • Keywords: Include searchable terms in description

Thumbnail/Cover (first frame matters):

  • High contrast: Bright, eye-catching
  • Text overlay: Tease content (not give away)
  • Face: People respond to human faces
  • Branding: Consistent style across videos

Posting Timing:

  • Best times: 7-10 PM (high engagement hours)
  • Best days: Weekends (Saturday, Sunday) for lifestyle
  • Consistency: Post same time/days to build habit
  • Frequency: 3-7 videos per week for growth

Cross-Promotion:

  • WeChat Moments: Share video link to personal network
  • Weibo: Post teaser with link back to Xiaohongshu
  • Douyin: Repost vertical videos (cross-platform friendly)

Step 7: Test, Analyze, and Iterate

Continuous improvement through data-driven optimization.

Key Metrics to Track:

Metric

How to Measure

What It Indicates

Views

Total impressions

Reach + algorithm boost

Completion Rate

Finished / Started × 100%

Content engagement quality

3-Second Retention

% who watch past 3 seconds

Hook effectiveness

Engagement Rate

(likes + comments + saves) / views

Content resonance

Share Rate

Shares / views

Viral potential

New Followers

Followers gained from video

Audience growth

A/B Testing Framework:

Test Variables:

  • Hooks: Same content, different opening
  • Length: 15s vs 30s vs 60s for same topic
  • CTA: "Follow" vs "Comment" vs "Save"
  • Thumbnail: Face vs text vs product
  • Posting Time: Morning vs evening vs weekend

Testing Process:

  • Identify variable to test
  • Create two versions (all else equal)
  • Post at different times (to avoid self-competition)
  • Compare metrics after 48 hours
  • Scale winning approach

Analytics Review Routine (weekly):

  • Top 3 performing videos: What made them successful?
  • Bottom 3 performing videos: What went wrong?
  • Hook performance: Which hooks had highest 3-second retention?
  • CTA performance: Which CTAs drove most engagement?
  • Trend analysis: What topics, formats, hooks consistently work?

Common Mistakes

Mistake

Why It's Wrong

Fix

Too long (3+ minutes)

Low completion rate, algorithm punishes

Cut to 15-60 seconds maximum

Weak or missing hook

80% of viewers scroll past 3 seconds

Invest time in first 3 seconds, test multiple hooks

Poor audio quality

Viewers abandon regardless of visual quality

Use external mic, record in quiet space

No captions

60% watch without sound (mobile/public)

Auto-generate, manually edit for accuracy

Horizontal video

Black bars, poor mobile experience

Always film vertical 9:16

Slow pacing

Viewer boredom, scrolling

Fast cuts, dynamic editing, remove pauses

Buried lead (value at end)

Viewers never see payoff

Deliver value early, hook promised content

Weak or missing CTA

Lost engagement/growth opportunity

Clear, specific call-to-action

Over-editing (too many effects)

Tacky, distracting, unprofessional

Subtle edits serve content, not show off

Inconsistent posting

Algorithm doesn't know what to surface

Post 3-7x per week consistently

Real-World Impact

Case Study 1: Beauty Creator's Hook Optimization

Creator: Makeup tutorial creator, 3K followers

Problem: Good content but low views (200-500 average), high drop-off

Issues Identified:

  • Generic hooks: "Hi everyone, today I'll show you..." (boring)
  • Length: Often 2-3 minutes (too long)
  • No captions (inaccessible without sound)

Solution Implemented:

  • Hook overhaul: Tested 10 hook formulas, found "Wait until you see the transformation..." most effective
  • Length reduction: Cut all videos to 30-45 seconds
  • Captions: Added auto-captions, manually edited
  • CTA consistency: Always ended with "关注我,明天分享更多妆容技巧"

Results (60 days):

  • Average views: 200-500 → 3,000-8,000 (15-40x increase)
  • Completion rate: 25% → 68% (2.7x improvement)
  • 3-second retention: 40% → 85% (hook optimization)
  • Follower growth: +2,100 (70% growth) in 2 months
  • Brand inquiries: 3 brand partnership offers (from improved engagement metrics)

Key Learning: Hook is 80% of success. Test relentlessly, optimize obsessively.

Case Study 2: Product Reviewer's Format Strategy

Creator: Tech product reviewer, 8K followers

Challenge: Time-consuming production, inconsistent quality

Strategy:

  • Standardized format: 30-second structure (5s hook → 20s review → 5s verdict)
  • Batch production: Filmed 5 products in one session (2 hours)
  • Template editing: Created editing template in JianYing (text overlays, transitions)
  • Single CTA: Always "关注我看更多真实测评"

Results (90 days):

  • Production efficiency: 1 video per day (vs. 2-3 per week before)
  • Quality consistency: All videos met professional standard (template effect)
  • Average views: 5K → 15K (3x increase from consistency)
  • Completion rate: 72% (consistent length set expectations)
  • Monetization: Launched tech review course, 180 sales (¥18,000 revenue) from consistent audience growth

Key Learning: Templates + consistency = scale. Systematized approach enabled daily posting without burnout.

Case Study 3: Lifestyle Vlogger's Pacing Optimization

Creator: Daily vlog and lifestyle content, 5K followers

Problem: Engagement plateau despite consistent posting, comments: "Videos feel slow"

Diagnosis: Pacing issues, too much dead air, weak hooks

Solution:

  • Pacing audit: Removed all pauses, "ums," and dead time
  • Speed changes: Sped up boring parts 1.5x, kept reveals at 1x for impact
  • Hook variety: Rotated through 5 hook types (curiosity, promise, problem, contrarian, question)
  • Visual variety: Added B-roll, text overlays, zoom cuts for dynamic feel
  • Length optimization: Cut vlogs from 60-90s to 30-45s

Results (4 months):

  • Average watch time: 12s → 28s (2.3x increase)
  • Completion rate: 20% → 55% (2.75x improvement)
  • Engagement rate: 2% → 6.5% (3.25x increase)
  • Follower growth: +3,800 (76% growth) from improved completion
  • Brand partnerships: 4 brands approached (improved metrics = attractive)

Key Learning: Pacing is everything. Every second must earn continued watching. Ruthless editing beats bloated content.

Related Skills

REQUIRED:

  • jiaying-tool (or capcut/vietcut): JianYing/CapCut editing techniques and workflows
  • script-writing: Planning video content and scripting hooks
  • audio-processing: Professional audio recording and enhancement
  • cover-design: Creating compelling video thumbnails/covers

RECOMMENDED:

  • storytelling: Narrative structure and emotional engagement
  • vlog-creation: Vlog-specific filming and editing strategies
  • content-planning: Strategic video content calendar and topic planning
  • lighting-photography: Professional lighting setup and techniques
  • performance-on-camera: On-camera presence and delivery skills
  • video-seo: Optimizing titles, descriptions, and tags for search

NEXT STEPS:

  • Audit your last 5 videos: What are your completion rates and 3-second retention?
  • Test 3 hook formulas on next video: Use analytics to identify winner
  • Invest in audio: Buy lavalier mic (¥100-300) or find quiet recording space
  • Create template: Standardize your video structure for consistency
  • Post consistently: Aim for 3-7 videos per week for 60 days, measure growth

Short video production is a skill of ruthless editing and audience empathy. Every second must earn the viewer's continued attention—you're competing with endless content, and attention is the scarcest currency. The first 3 seconds determine everything: invest disproportionate time here, test relentlessly, and optimize obsessively. Great hooks + tight pacing + clear value + strong CTAs = short videos that build audiences, drive growth, and establish your presence on Xiaohongshu's mobile-first platform.

BrowserAct

Let your agent run on any real-world website

Bypass CAPTCHA & anti-bot for free. Start local, scale to cloud.

Explore BrowserAct Skills →

Stop writing automation&scrapers

Install the CLI. Run your first Skill in 30 seconds. Scale when you're ready.

Start free
free · no credit card