vidmonto
Tutorial

Understanding the 8 Visual Adjustment cards

What each card actually controls under the hood — Edit Style, Hook, Color Look, Captions, Music, Audio Balance, Transitions, and Voice Over.

M
Marcus LeeEngineering · Vidmonto
Apr 15, 2026 · 6 min read

"8 cards · Visual Adjustment"

"8 cards · Visual Adjustment"

Eight cards. Three primary options each, plus more in the popover. Every Vidmonto video — Text, Image, or Video — passes through these eight decisions. This article explains what each card actually controls so you can mix them confidently.

Edit Style — pacing and structure

Drives the Agent 2 rhythm plan. Picks between Balanced (default), Fast (rapid cuts every 1.5-2s), Cinematic (longer holds, more breathing room), and four context-specific styles in More: Vlog Daily, Travel, Tutorial, Music Video, Documentary.

Hook — what kind of opening

Tells Agent 2 how to shape scene 0. Bold opens with the strongest shot; Question turns the first caption into a rhetorical question; Build-up deliberately holds the strongest shot back. The rest (Action First, Reveal, Stat, Contrarian) trade visual punch against curiosity.

Color Look — global grade

Applies a 3D LUT (.cube) to the entire timeline. Cinematic is a teal-orange split-tone. Mono kills color entirely. Vintage adds a yellow-green wash with lifted blacks. Vibrant Pop and High-contrast push saturation and contrast hard for social-first content.

Captions — burn-in style

Auto uses Whisper transcription with a clean default style. Word-by-word karaoke is best for hook-driven shorts. Big Text is sans-serif full-width captions that work on muted social feeds. Branded uses a fixed color palette aligned with Vidmonto brand.

Music + Audio Balance — soundtrack and mix

Music selects a mood from the curated Pixabay/Freesound library (Mellow, Upbeat, Cinematic, Lo-fi, Hip-hop, Acoustic, Ambient). Audio Balance controls relative levels: Voice First (BGM ducked), Music First (voice attenuated), Balanced (the default), and the two extreme mute settings.

Transitions — between-clip joins

Cuts (default, no transition) is almost always right for short-form. Smooth and Cinematic fade gently. Whip Pan, Zoom, Glitch, and Wipe are big statements — use them sparingly on hook/payoff beats, not throughout.

Voice Over — narration

None (default), or pick a Google TTS voice (Male/Female + 6 specific voices in More). The voice script is auto-generated from your prompt by Agent 0-A; you can hand-edit it in a later iteration.

💡 None of the 8 cards is destructive. Tweaking any combination produces a new Tweaked output next to your baseline — you never lose the first result.
Open workspace
Tags:tutorialvisual-adjustment8-cards
M
Marcus LeeEngineering · Vidmonto

Marcus builds the Agent pipeline and the rhythm director.