Understanding the 8 Visual Adjustment cards
What each card actually controls under the hood — Edit Style, Hook, Color Look, Captions, Music, Audio Balance, Transitions, and Voice Over.
"8 cards · Visual Adjustment"
"8 cards · Visual Adjustment"
Eight cards. Three primary options each, plus more in the popover. Every Vidmonto video — Text, Image, or Video — passes through these eight decisions. This article explains what each card actually controls so you can mix them confidently.
Edit Style — pacing and structure
Drives the Agent 2 rhythm plan. Picks between Balanced (default), Fast (rapid cuts every 1.5-2s), Cinematic (longer holds, more breathing room), and four context-specific styles in More: Vlog Daily, Travel, Tutorial, Music Video, Documentary.
Hook — what kind of opening
Tells Agent 2 how to shape scene 0. Bold opens with the strongest shot; Question turns the first caption into a rhetorical question; Build-up deliberately holds the strongest shot back. The rest (Action First, Reveal, Stat, Contrarian) trade visual punch against curiosity.
Color Look — global grade
Applies a 3D LUT (.cube) to the entire timeline. Cinematic is a teal-orange split-tone. Mono kills color entirely. Vintage adds a yellow-green wash with lifted blacks. Vibrant Pop and High-contrast push saturation and contrast hard for social-first content.
Captions — burn-in style
Auto uses Whisper transcription with a clean default style. Word-by-word karaoke is best for hook-driven shorts. Big Text is sans-serif full-width captions that work on muted social feeds. Branded uses a fixed color palette aligned with Vidmonto brand.
Music + Audio Balance — soundtrack and mix
Music selects a mood from the curated Pixabay/Freesound library (Mellow, Upbeat, Cinematic, Lo-fi, Hip-hop, Acoustic, Ambient). Audio Balance controls relative levels: Voice First (BGM ducked), Music First (voice attenuated), Balanced (the default), and the two extreme mute settings.
Transitions — between-clip joins
Cuts (default, no transition) is almost always right for short-form. Smooth and Cinematic fade gently. Whip Pan, Zoom, Glitch, and Wipe are big statements — use them sparingly on hook/payoff beats, not throughout.
Voice Over — narration
None (default), or pick a Google TTS voice (Male/Female + 6 specific voices in More). The voice script is auto-generated from your prompt by Agent 0-A; you can hand-edit it in a later iteration.
Marcus builds the Agent pipeline and the rhythm director.