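7. Getting Started with Google Veo 3: From Concepts to Your First Video

1 Google Veo 3 Basics

1.1 How AI Video Generation Works, and the Role of the Prompt

Text-to-video concept: You describe a scene in text (the prompt), and the AI reads it and synthesizes a new video pixel by pixel. It does not cut and splice existing footage. It generates footage that did not exist before, based on what it learned from its training data.

Prompt example: Entering the text below into the input box produces an 8-second video of an angler fishing on a lake at dawn.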

A weathered old angler drifts alone in a narrow rowboat on a fog-veiled lake at first light, pale gold sunbeams cutting through the haze, a quiet and wistful mood.

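Technology timeline: Starting from still-image generation in 2022, the field reached the stage of generating video and audio (dialogue, music) together with Google Veo 3 in 2025.

A prompt is the "scene direction" you give the AI.

Veo 3 generates not only the video but also sound that fits the scene.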

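1.2 Veo 3's Core Strengths and What You Can Direct

Realistic physics: It renders motion that follows physical laws, such as waves, flames, smoke, and fluttering fabric.
Built-in audio generation: Put an [Audio: ...] tag in the prompt to specify dialogue, ambient sound, and background music.
Camera control: It understands film terms (dolly shot, bird's-eye view, and so on) and changes the framing accordingly.
Style change: It changes the visual style according to genre keywords such as cinematic, animation, or documentary.

Camera angle: Close-up, Wide shot, Low angle, High angle.

Camera movement: Pan, Tilt, Dolly, Tracking, Handheld.

Lighting: Golden hour, Moonlight, Neon, Backlit.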

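1.3 Choosing a Model Tier (Fast / Lite / High Quality)

Tier differences: Models fall into three tiers by speed, cost, and quality.
- Fast: Fastest and cheapest. Use it to test the direction of an idea.
- Lite: Mid-range speed and cost. Use it to quickly verify composition and detail.
- High quality: Slowest and most expensive. Use it for final output that needs precise dialogue and lip sync.
Check audio support: Audio support differs by tier and version and changes often. In general, the lower-cost tiers (Lite, for example) may be silent, while the standard and high-quality tiers include audio. Before generating, check each model's audio-support indicator and estimated credits directly on the Flow screen.
Three-stage workflow: To save credits, follow the order below, moving from low cost to high quality.
- Test: Check the prompt direction with the Fast model.
- Verify: Check composition and detail with the Lite model.
- Finish: Generate the final video with the high-quality model. If you need audio, confirm on screen which tier supports it and select that tier.

Starting with the high-quality model burns through credits quickly.

Try several times with Fast, then feed the prompt behind the best result into the high-quality model.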
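
The credit saving from the staged order can be sketched as simple arithmetic. This is a minimal Python sketch, not a pricing reference: the per-generation prices in CREDIT_COST are placeholder numbers invented for illustration, and the function names are hypothetical. Read the real credit costs from the Flow screen before generating.

```python
# Placeholder prices per generation, invented for this sketch.
# Real credit costs vary by model and plan; read them from the Flow screen.
CREDIT_COST = {"fast": 10, "lite": 20, "quality": 100}


def staged_cost(fast_tries: int, lite_tries: int, final_tries: int = 1) -> int:
    """Credits spent when drafting on Fast, verifying on Lite, finishing on high quality."""
    return (
        fast_tries * CREDIT_COST["fast"]
        + lite_tries * CREDIT_COST["lite"]
        + final_tries * CREDIT_COST["quality"]
    )


def direct_cost(total_tries: int) -> int:
    """Credits spent when every attempt runs on the high-quality model."""
    return total_tries * CREDIT_COST["quality"]


if __name__ == "__main__":
    # The same eight attempts under both strategies.
    print("staged:", staged_cost(fast_tries=5, lite_tries=2, final_tries=1))
    print("direct:", direct_cost(total_tries=8))
```

Whatever the real prices are, the pattern holds as long as the high-quality tier costs more per attempt than Fast and Lite: moving the trial-and-error onto the cheaper tiers reduces the total.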

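1.4 Content Types You Can Make with Veo 3

AI short film: Stitch 8-second clips together to tell a story.
Virtual character vlog: Depict the daily life of a character who does not exist.
Ad creative: Produce showcase videos for products such as perfume or cars.
POV (first-person view): Produce immersive video that looks as if the viewer is seeing the scene directly.

Rendering text captions directly inside the video is still weak. Add captions in an external editing app.

To keep a character consistent, use a reference image or repeat a specific description of the character's appearance.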

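1.5 Application: Make It with Your Own Subject

Reapply the directable elements that Veo understands (subject, camera, lighting, style), covered in this unit, to a short ad showcase cut of your own subject.

A {product} {placement/motion} on {surface}, {camera move}, {lighting keyword}, {style keyword}, {mood} mood, 8-second product showcase.

{product}: The thing to show. Leave out the leading article, because the template already starts with "A". Examples: glass perfume bottle, matte black smartwatch.
{placement/motion}: How the product sits or rotates. Examples: slowly rotating, standing still as light sweeps across it.
{surface}: The floor or space the product rests on. Examples: a wet black stone slab, a soft beige fabric.
{camera move}: A camera keyword from this unit. Examples: slow dolly-in, orbiting tracking shot.
{lighting keyword}: The kind of light. Examples: dramatic backlit rim light, soft golden hour glow.
{style keyword}: The genre look. Examples: cinematic luxury commercial, clean minimalist studio.
{mood}: The impression to convey. Examples: premium and elegant, sleek and modern.

A glass perfume bottle slowly rotating on a wet black stone slab, slow dolly-in, dramatic backlit rim light, cinematic luxury commercial, premium and elegant mood, 8-second product showcase.

Swap only {product} (perfume → watch → cosmetics) and you can reuse the same lighting and camera setup for a series of ad cuts.

If you leave both {camera move} and {lighting keyword} empty, you get a flat cut that looks like a still. Fill at least one of the two with a film term.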
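
If you write many variations, filling the template by hand gets error-prone. The sketch below shows one way to assemble the showcase prompt in Python and to reject a prompt that has neither a camera move nor a lighting keyword. The function name, parameter names, and the SHARED dictionary are hypothetical names chosen for this example. It only builds text to paste into Flow and does not call any Google API.

```python
def build_showcase_prompt(
    product: str,
    motion: str,
    surface: str,
    camera: str = "",
    lighting: str = "",
    style: str = "",
    mood: str = "",
) -> str:
    """Fill the product-showcase template, skipping any optional slot left empty."""
    # Rule from this section: at least one of camera or lighting must be filled.
    if not camera.strip() and not lighting.strip():
        raise ValueError("fill at least one of camera or lighting with a film term")
    parts = [
        f"A {product} {motion} on {surface}",
        camera,
        lighting,
        style,
        f"{mood} mood" if mood else "",
        "8-second product showcase",
    ]
    return ", ".join(p.strip() for p in parts if p.strip()) + "."


# Shared camera and lighting setup reused across a series; only the product changes.
SHARED = dict(
    motion="slowly rotating",
    surface="a wet black stone slab",
    camera="slow dolly-in",
    lighting="dramatic backlit rim light",
    style="cinematic luxury commercial",
    mood="premium and elegant",
)

if __name__ == "__main__":
    for product in ("glass perfume bottle", "matte black smartwatch"):
        print(build_showcase_prompt(product=product, **SHARED))
```

With the perfume values, the function reproduces the filled-in example prompt in this section word for word.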

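2 Accessing Google Flow and First Hands-On Practice

2.1 Accessing Google Flow and Setting Up

Visit the website: In the Chrome address bar, enter https://labs.google/flow and press Enter.
Sign in: Click [Sign in] at the top right and sign in with your Google account.
Start the free trial: Follow the on-screen instructions to start a subscription or a free trial.
- Click [Start free trial] → register payment details (no charge if you cancel within the trial period) → enter the dashboard.
Create a project: Click [New Project] in the left sidebar → enter a project name (for example, First_Test) → confirm that an empty workspace appears.

Set a cancellation reminder 1 to 2 days before the free trial ends to avoid automatic billing.

Service access may be restricted in some regions. In that case, use Gemini (gemini.google.com) as an alternative.

2.2 Interface Components

Top bar: Check the remaining Credits. The number decreases in real time when you generate a video.
Prompt input box: Click the Describe your scene... area at the bottom of the screen.
Model dropdown: Click the model name to the left of the input box to see the Fast, Lite, and high-quality tiers.
Aspect ratio: Locate the 16:9 (landscape), 9:16 (portrait), and 1:1 (square) buttons.
Generate button: Locate the blue [Generate] button on the right.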

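2.3 First Video Generation Exercise (Beach Sunset)

Model setting: Click the model dropdown to the left of the input box and select the cheapest Fast-family model.
Aspect ratio setting: Click the 16:9 button.

Prompt input: Copy the English text below and paste it into the input box.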

Soft waves rolling onto a sandy shore at sunset, warm orange and pink sky, gentle slow motion, wide cinematic framing, a calm and tranquil feel, photorealistic.

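Run generation: Click [Generate] or press Ctrl+Enter.
Play and check: When generation finishes (about 30 to 60 seconds), click the video that appears to play it. Check whether the waves and the sunset are rendered properly.
Download: Click the [Download] icon at the bottom right of the video to save the MP4 file to your computer.

If you are not happy with the result, click [Regenerate] to generate again with the same prompt.

To revise the wording, click [Edit Prompt].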

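2.4 Controlled-Variable Experiment (Changing the Time of Day)

Edit the prompt: In the existing text, change at sunset to at dawn, and warm orange and pink sky to soft purple and blue sky. Leave everything else untouched.
Regenerate: Click [Generate].
Compare: Check the difference in color between the sunset version and the dawn version.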
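
The point of the experiment is that only the time-of-day phrases change while the rest of the prompt stays fixed. The Python sketch below makes that explicit: it applies the two substitutions to the beach prompt from 2.3 and lists which comma-separated segments differ. The names make_variant and changed_segments are hypothetical helpers written for this example. They only prepare text and do not talk to Flow.

```python
BASE_PROMPT = (
    "Soft waves rolling onto a sandy shore at sunset, warm orange and pink sky, "
    "gentle slow motion, wide cinematic framing, a calm and tranquil feel, photorealistic."
)

# The two phrases that carry the time of day; everything else stays fixed.
DAWN_SWAPS = {
    "at sunset": "at dawn",
    "warm orange and pink sky": "soft purple and blue sky",
}


def make_variant(base: str, swaps: dict) -> str:
    """Replace each listed phrase once, failing loudly if a phrase is absent."""
    variant = base
    for old, new in swaps.items():
        if old not in variant:
            raise ValueError(f"phrase not found in prompt: {old!r}")
        variant = variant.replace(old, new, 1)
    return variant


def changed_segments(a: str, b: str) -> list:
    """Return the comma-separated segments that differ between two prompts."""
    return [(x, y) for x, y in zip(a.split(", "), b.split(", ")) if x != y]


if __name__ == "__main__":
    dawn = make_variant(BASE_PROMPT, DAWN_SWAPS)
    print(dawn)
    for before, after in changed_segments(BASE_PROMPT, dawn):
        print(f"- {before}  ->  {after}")
```

If the list of changed segments is longer than you intended, more than one variable moved, and the comparison between the two videos no longer isolates the time of day.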

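2.5 Mobile Workflow (Flow + CapCut)

Mobile access: Open https://labs.google/flow in your smartphone browser.
Generate and save: Generate a video the same way as in the exercise above, then press and hold the on-screen [Download] button to save it to the device gallery.
Launch CapCut: Open the CapCut app and tap [새 프로젝트] (New project).
Import the clip: Select the saved Veo video and place it on the timeline.
Ratio and music: Select 9:16 in the [비율] (Ratio) menu and add background music from the [오디오] (Audio) menu.
Export: Tap the arrow icon at the top right to save the video or post it to social media.

If typing English on mobile is difficult, jot the scene down in Korean, run it through a translation tool, and paste the result.

For short-form video, it is better to generate in 9:16 in Flow from the start.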

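2.6 Application: Make It with Your Own Subject

Reapply what you practiced in this unit (generating a video in Flow from one line of text, then changing a single variable and comparing) to a moving city scene of your own.

{place/city description} at {time of day}, {light/color}, {moving element}, {camera motion}, a {mood} feel, {visual style}, {aspect ratio} aspect ratio.

{place/city description}: One background shot that contains motion. Examples: A busy night market alley, A rooftop view over a glowing city.
{time of day}: The time that sets the light. Examples: night, rainy evening, early morning.
{light/color}: The tone of the frame. Examples: neon reflections on wet pavement, cool blue twilight.
{moving element}: Motion that brings the scene to life. Examples: people walking past, cars streaking by with light trails.
{camera motion}: The camera move. Examples: slow tracking shot, static wide shot.
{mood}: The emotional tone. Examples: lively and energetic, quiet and moody.
{visual style}: The visual idiom. Examples: cinematic photorealistic, cyberpunk style.
{aspect ratio}: The output ratio. Examples: 16:9, 9:16, 1:1.

A busy night market alley at night, neon reflections on wet pavement, people walking past with steam rising from food stalls, slow tracking shot, a lively and energetic feel, cinematic photorealistic, 9:16 aspect ratio.

After generating once with Fast, change only the {moving element} slot from people walking past to light rain starting to fall, and repeat the same controlled-variable experiment (2.4).

If you leave {moving element} empty, the result looks flat, like a still frame. Veo output looks more like video the more motion direction you give it, so always fill this slot.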

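3 Google Flow in Practice: From an Image to a Talking Video

The previous unit produced an 8-second video from text alone. This unit goes one step further. You first create a character image, then use that image as the starting point to generate a lip-synced video of the character speaking Korean, and finally use Extend to lengthen the scene. The prompt is written in seven blocks: [ANCHOR], [ACTION], [CAMERA], [LIGHTING], [AUDIO], [STYLE], and [NEGATIVE]. Splitting it into blocks makes it easy to keep the character's appearance fixed while changing only the action, dialogue, and lighting.

This unit is based on working notes as of 2026.05.18.

On 2026.05.19 the omi model was added and parts of the screen (UI) layout changed. If a button is not where this handout says, find the menu with the same function on screen and continue.

3.1 Generating and Refining a Character Image

Generate the image: Create a character image with Flow's image generation feature. Below is the shared link to the prompt that was used.
- Prompt: <https://labs.google/fx/tools/flow/shared/image/46c70f76-214e-4ac0-8364-18316ffb0c25>
- First generated image: <https://flow-content.google/image/ff414606-539f-476b-82e1-1154e13cf0a7>
Remove unwanted elements: Erase the stray elements left in the background and on the collar. Areas that are far apart cannot be selected in one pass, so erase them in two passes.
Enter an edit prompt: Select the area to erase, then type 지워 ("erase") as the prompt.
- Edited image: <https://labs.google/fx/api/trpc/media.getMediaUrlRedirect?name=b89366a1-dfd1-43fa-9d24-fcbf21caae90>

3.2 Loading the Image into a Scene

Open scene creation: Click the + button at the top right of Flow, then click [장면 만들기] (Create scene).
Select the image: Select the character image you generated and edited above. This image becomes the reference for the first frame of the video.

3.3 Writing the Main Prompt (Seven-Block Structure)

What each block does: Fill in the seven blocks below in order. [ANCHOR] is the character's appearance, [ACTION] is the movement, [CAMERA] is the framing, [LIGHTING] is the lighting, [AUDIO] is the dialogue and voice, [STYLE] is the visual style, and [NEGATIVE] lists what must not appear.

Enter the main prompt: Paste the text below into the input box. It produces a video of a person speaking a Korean line in a clear broadcast tone.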

[ANCHOR]
A Korean woman in her late 20s, long dark brown hair tied in a high ponytail with the tail falling over her right shoulder, a few loose strands framing her face. Clean glowing skin, natural dewy makeup, soft pink lip tint, defined slightly arched eyebrows, warm brown eyes, a gentle closed-lip smile. She wears a black tailored blazer over a plain white inner top, with a thin silver ring on her index finger. Soft pastel pink seamless studio background.

[ACTION]
She starts with her right hand resting lightly near her chin, then slowly lowers her hand as she begins speaking. A soft confident smile spreads across her face, she gives one small nod mid-sentence, and looks straight into the camera with a calm, friendly gaze.

[CAMERA]
Medium close-up from chest level, eye-level static shot, 85mm portrait lens look, very shallow depth of field, subject perfectly centered with slight space above her head.

[LIGHTING]
Soft large key light from the front, gentle fill bouncing back from the pink background, warm 4800K, beauty-portrait style, no harsh shadows, subtle catchlight in both eyes.

[AUDIO]
She speaks in clear, warm Korean with a polished broadcasting tone:
"안녕하세요, 오늘은 영상 편집 앱 측컷을 소개할게요."
Lip movement precisely synced to Korean phonemes. Soft studio room tone, no background music, natural breath between phrases.

[STYLE]
Korean beauty-brand commercial aesthetic, clean and polished, soft pastel color palette, photorealistic 4K, natural skin texture preserved, 16:9 aspect ratio.

[NEGATIVE]
no English speech, no subtitles, no text overlays, no multiple people, no hair color change, no clothing change, no harsh shadows, no plastic skin, no distorted lip sync, no robotic voice, no logo watermarks, no extra fingers.

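Leave [ANCHOR] and [NEGATIVE] as they are and change only [ACTION], [LIGHTING], and [AUDIO] to quickly produce a video of the same person in a different tone.

Even though the dialogue is written in quotation marks, adding no subtitles to [NEGATIVE] helps keep captions from being burned into the frame.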
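
Keeping the seven blocks as separate pieces of data, and joining them only at the end, makes it harder to forget a block or to paste them in the wrong order. The sketch below is one possible way to do that in Python. BLOCK_ORDER follows the order used in this handout, while render_prompt and the abbreviated block texts are hypothetical and exist only for illustration. The output is plain text to paste into the Flow input box.

```python
BLOCK_ORDER = ("ANCHOR", "ACTION", "CAMERA", "LIGHTING", "AUDIO", "STYLE", "NEGATIVE")


def render_prompt(blocks: dict) -> str:
    """Join the seven blocks in the fixed order, refusing incomplete or unknown input."""
    missing = [name for name in BLOCK_ORDER if not blocks.get(name, "").strip()]
    if missing:
        raise ValueError(f"empty or missing blocks: {missing}")
    unknown = [name for name in blocks if name not in BLOCK_ORDER]
    if unknown:
        raise ValueError(f"unknown blocks: {unknown}")
    return "\n\n".join(f"[{name}]\n{blocks[name].strip()}" for name in BLOCK_ORDER)


# Abbreviated block texts, shortened from the main prompt for readability.
EXAMPLE_BLOCKS = {
    "ANCHOR": "A Korean woman in her late 20s, black tailored blazer, pastel pink studio background.",
    "ACTION": "She lowers her hand as she begins speaking and looks into the camera.",
    "CAMERA": "Medium close-up, eye-level static shot, 85mm portrait lens look.",
    "LIGHTING": "Soft large key light from the front, warm 4800K, no harsh shadows.",
    "AUDIO": 'She speaks in clear, warm Korean:\n"안녕하세요, 오늘은 영상 편집 앱 측컷을 소개할게요."',
    "STYLE": "Korean beauty-brand commercial aesthetic, photorealistic 4K, 16:9 aspect ratio.",
    "NEGATIVE": "no English speech, no subtitles, no text overlays, no extra fingers.",
}

if __name__ == "__main__":
    print(render_prompt(EXAMPLE_BLOCKS))
```

The dictionary can be written in any order. The function always emits the blocks in the handout's order and raises an error if one of the seven is empty.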

3.4 Tone Variation Exercise

Variation 1 (bright tutorial tone): In the main prompt, replace only [ACTION], [LIGHTING], and [AUDIO] with the text below.

[ACTION]
She holds her right hand up at shoulder level with palm facing the camera in a gentle greeting wave, then brings both hands together at chest level as she finishes speaking. Bright open smile showing slight teeth, one cheerful nod.

[LIGHTING]
Bright high-key lighting from upper front and both sides, 5000K, slightly warmer than neutral, vibrant but soft, creator-vlog feel.

[AUDIO]
She speaks in upbeat, friendly Korean:
"여러분 안녕하세요! 측컷으로 영상 편집 시작해 볼까요?"
Mouth movements tightly synced to Korean syllables. Quiet studio ambience, no music.

Variation 2 (calm review tone): Replace the same three blocks with the text below.

[ACTION]
She keeps her hands gently clasped at chest level throughout, slight head tilt to the right at the start, returns to center as she speaks. Composed thoughtful expression, soft closed-lip smile only at the end.

[LIGHTING]
Even soft front light, 5200K neutral, slight directional shadow on the left side of her face for depth, premium-magazine portrait feel.

[AUDIO]
She speaks in calm, measured Korean with a clear reviewer tone:
"측컷은 무료로 쓸 수 있는 영상 편집 도구예요."
Lip movement precisely synced to Korean syllables. Subtle ambient room tone, no music.

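Compare: Check that the person stays the same even though the action, lighting, and dialogue tone changed. The person holds because [ANCHOR] pins down the appearance.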
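
The rule for tone variations (swap [ACTION], [LIGHTING], and [AUDIO], and leave the other four blocks alone) can be expressed as a small guard. The following is a minimal sketch with hypothetical names (SWAPPABLE, swap_tone_blocks) and abbreviated block texts. It refuses any override that touches a fixed block, so [ANCHOR] cannot be changed by accident.

```python
SWAPPABLE = frozenset({"ACTION", "LIGHTING", "AUDIO"})


def swap_tone_blocks(main: dict, overrides: dict) -> dict:
    """Return a copy of the main prompt with only the tone blocks replaced."""
    locked = sorted(set(overrides) - SWAPPABLE)
    if locked:
        raise ValueError(f"these blocks stay fixed across variants: {locked}")
    return {**main, **overrides}


# Abbreviated stand-ins for the main prompt blocks.
MAIN = {
    "ANCHOR": "A Korean woman in her late 20s, black tailored blazer, pastel pink background.",
    "ACTION": "She lowers her hand as she begins speaking.",
    "CAMERA": "Medium close-up, eye-level static shot.",
    "LIGHTING": "Soft large key light from the front, warm 4800K.",
    "AUDIO": 'Polished broadcasting tone: "안녕하세요, 오늘은 영상 편집 앱 측컷을 소개할게요."',
    "STYLE": "Korean beauty-brand commercial aesthetic, 16:9 aspect ratio.",
    "NEGATIVE": "no English speech, no subtitles, no clothing change.",
}

# Variation 1 (bright tutorial tone), abbreviated.
TUTORIAL = {
    "ACTION": "She waves at shoulder level, then brings both hands together.",
    "LIGHTING": "Bright high-key lighting, 5000K, creator-vlog feel.",
    "AUDIO": 'Upbeat, friendly tone: "여러분 안녕하세요! 측컷으로 영상 편집 시작해 볼까요?"',
}

if __name__ == "__main__":
    variant = swap_tone_blocks(MAIN, TUTORIAL)
    unchanged = [name for name in MAIN if variant[name] == MAIN[name]]
    print("unchanged blocks:", unchanged)
```

Because the function returns a new dictionary, the main prompt stays intact and can serve as the base for the next variation.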

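3.5 Connecting Scenes with Extend

How Extend works: With Flow's Extend feature, the last frame of the previous video automatically becomes the first frame of the next one. [ANCHOR] therefore does not need a long description of the appearance. State briefly that it is the same person, the same outfit, and the same background, then concentrate on [ACTION] and [AUDIO].

Writing the follow-on cut: This 8-second cut picks up from the end state of the previous video (hand lowered, looking at the camera). The scene explains the concept of a timeline.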

[ANCHOR]
The same Korean woman from the previous shot, identical hairstyle, makeup, black tailored blazer over white inner top, soft pastel pink studio background. Continuous scene.

[ACTION]
She raises both hands to chest level, palms facing the camera, then slowly moves both hands horizontally from her left side to her right side as if drawing an invisible long bar in the air. She finishes with her right hand extended to the side and looks confidently at the camera with a small smile.

[CAMERA]
Medium close-up, eye-level static shot, 85mm portrait lens look, shallow depth of field. Same framing as the previous shot to keep visual continuity.

[LIGHTING]
Identical to the previous shot. Soft large key light from the front, gentle pink fill from the background, warm 4800K, no harsh shadows.

[AUDIO]
She speaks in clear, friendly Korean with a polished broadcasting tone:
"화면 아래 이 긴 막대가 바로 타임라인이에요."
Lip movement precisely synced to Korean phonemes. Soft studio room tone, no background music.

[STYLE]
Korean beauty-brand commercial aesthetic, photorealistic 4K, natural skin texture preserved, 16:9 aspect ratio. Visual continuity with the previous clip.

[NEGATIVE]
no English speech, no subtitles, no text overlays, no UI graphics, no multiple people, no hair color change, no clothing change, no scene cut, no camera movement, no distorted lip sync, no extra fingers.

Adding more continuous cuts by changing only action and dialogue: Generate the following cuts in the same flow, replacing only [ACTION] and [AUDIO].

[ACTION]
She points her right index finger to her left side at chest level, then slowly traces a horizontal line to her right side, finger extended throughout, finishing with a small tap gesture at the end. Eyes follow her finger then return to the camera.

[AUDIO]
She speaks in clear, instructive Korean:
"왼쪽에서 오른쪽으로 시간이 흘러가요."
Lip movement precisely synced to Korean syllables. Soft studio room tone, no music.

[ACTION]
She holds her left palm out flat horizontally at chest level, then with her right hand makes a soft placing motion as if dropping rectangular blocks onto her left palm, twice. Finishes looking at the camera with an open smile.

[AUDIO]
She speaks in warm, encouraging Korean:
"여기에 영상 클립을 순서대로 올려놓으면 돼요."
Mouth movements tightly synced to Korean syllables. Quiet studio ambience, no music.

In extended cuts, keep [CAMERA] and [LIGHTING] identical to the previous cut so the scene does not visibly jump.

Add no scene cut, no camera movement to [NEGATIVE] to enforce continuity.
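
When you copy the [NEGATIVE] block from the main prompt into an extended cut, it is easy to forget the two continuity terms. A small helper can append them only when they are missing. This is a sketch under the assumption that the block is a comma-separated list ending in a period, as in this handout. The names CONTINUITY_TERMS and enforce_continuity are hypothetical.

```python
CONTINUITY_TERMS = ("no scene cut", "no camera movement")


def enforce_continuity(negative: str) -> str:
    """Append any missing continuity term to a comma-separated [NEGATIVE] block."""
    # Drop the trailing period, then split into individual terms.
    terms = [t.strip() for t in negative.rstrip(". ").split(",") if t.strip()]
    for term in CONTINUITY_TERMS:
        if term not in terms:
            terms.append(term)
    return ", ".join(terms) + "."


if __name__ == "__main__":
    main_negative = "no English speech, no subtitles, no clothing change, no extra fingers."
    print(enforce_continuity(main_negative))
```

Running the helper on a block that already contains both terms returns the block unchanged, so it is safe to apply to every extended cut.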

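3.6 Checking the Finished Video

The finished video, made by joining the main prompt and the extended cuts above, is available at the link below.

Finished video: <https://drive.google.com/file/d/1nocxgkm3t9AB2NTP_UX6nH98hIdavp8i/view?usp=sharing>

Korean lip sync is more accurate with short, clear sentences. Limit each cut to one or two sentences.

The consistency of a video series depends on two blocks: the [ANCHOR] that fixes the person, outfit, and background, and the [NEGATIVE] that blocks breaks in continuity.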
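
The guideline of one or two sentences per cut can be checked before you spend credits. The sketch below counts sentences by splitting on sentence-ending punctuation, which is a rough heuristic rather than real Korean sentence segmentation. MAX_SENTENCES, count_sentences, and check_dialogue are hypothetical names for this example.

```python
import re

MAX_SENTENCES = 2  # guideline from this handout: one or two sentences per cut


def count_sentences(line: str) -> int:
    """Count sentences by splitting on runs of '.', '!' or '?' (a rough heuristic)."""
    return len([s for s in re.split(r"[.!?]+", line) if s.strip()])


def check_dialogue(line: str) -> int:
    """Return the sentence count, or raise if the line is empty or too long for one cut."""
    n = count_sentences(line)
    if n == 0:
        raise ValueError("dialogue is empty")
    if n > MAX_SENTENCES:
        raise ValueError(f"{n} sentences in one cut; split it across extended cuts")
    return n


if __name__ == "__main__":
    for line in (
        "안녕하세요, 오늘은 영상 편집 앱 측컷을 소개할게요.",
        "여러분 안녕하세요! 측컷으로 영상 편집 시작해 볼까요?",
    ):
        print(check_dialogue(line), line)
```

A line that fails the check is a candidate for splitting across extended cuts, as in 3.5, where each cut carries a single sentence.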

3.7 Application: Make It with Your Own Subject

Reapply the seven-block lip-sync structure from this unit to your own character and your own dialogue. Fill in only the {curly-brace} slots in each block to complete one talking-video cut.

[ANCHOR]
{appearance: age, gender, hair, skin, expression, outfit, props}. {background}.

[ACTION]
{starting pose} then {hand gestures and expression changes while speaking}, looks at the camera with a {gaze/expression} gaze.

[CAMERA]
{shot size} from {angle}, {lens} lens look, {depth of field}, subject centered.

[LIGHTING]
{key light direction and intensity}, {color temperature}K, {lighting style} feel, no harsh shadows, catchlight in both eyes.

[AUDIO]
{She/He} speaks in {tone} Korean:
"{one or two sentences of Korean dialogue}"
Lip movement precisely synced to Korean phonemes. {ambient sound}, no background music.

[STYLE]
{overall style keywords}, photorealistic 4K, natural skin texture preserved, {aspect ratio} aspect ratio.

[NEGATIVE]
no English speech, no subtitles, no text overlays, no multiple people, no hair color change, no clothing change, no distorted lip sync, no robotic voice, no extra fingers.

{appearance}: The core of [ANCHOR]. Be specific about hair, skin, outfit, and props. Example: A Korean man in his 30s, short black hair, round glasses, gray knit sweater.
{background}: The space behind the person. Example: A warm wooden home-office background, softly blurred.
{starting pose}, {hand gestures and expression changes}: [ACTION]. Example: He sits with hands folded on the desk, gestures one hand outward as he explains.
{gaze/expression}: The closing look. Examples: calm and trustworthy, friendly and bright.
{shot size}, {angle}, {lens}, {depth of field}: [CAMERA]. Example: Medium close-up, eye-level, 50mm, shallow depth of field.
{key light direction and intensity}, {color temperature}, {lighting style}: [LIGHTING]. Example: Soft key light from the left, 5200, natural documentary.
{She/He}: Match the pronoun to the person described in [ANCHOR].
{tone}, {Korean dialogue}, {ambient sound}: [AUDIO]. Example: calm, sincere, "오늘은 제가 쓰는 공부 앱을 소개할게요.", Quiet room tone.
{overall style keywords}, {aspect ratio}: [STYLE]. Example: Warm lifestyle documentary aesthetic, 9:16.

[ANCHOR]
A Korean man in his 30s, short black hair, round glasses, a gray knit sweater, a thin leather watch on his left wrist. A warm wooden home-office background with shelves, softly blurred.

[ACTION]
He sits with both hands folded on the desk, then gestures one hand outward as he begins explaining, gives a small nod mid-sentence, looks at the camera with a calm and trustworthy gaze.

[CAMERA]
Medium close-up from chest level, eye-level static shot, 50mm lens look, shallow depth of field, subject centered.

[LIGHTING]
Soft key light from the upper left, gentle fill from the right, 5200K, natural documentary feel, no harsh shadows, catchlight in both eyes.

[AUDIO]
He speaks in calm, sincere Korean:
"오늘은 제가 매일 쓰는 공부 앱을 소개할게요."
Lip movement precisely synced to Korean phonemes. Quiet room tone with faint keyboard ambience, no background music.

[STYLE]
Warm lifestyle documentary aesthetic, photorealistic 4K, natural skin texture preserved, 9:16 aspect ratio.

[NEGATIVE]
no English speech, no subtitles, no text overlays, no multiple people, no hair color change, no clothing change, no distorted lip sync, no robotic voice, no extra fingers.

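When you make a series with the same person, keep [ANCHOR] and [NEGATIVE] fixed and change only [ACTION] and [AUDIO], reusing them for tone variations and extended cuts as in 3.4 and 3.5.

If the dialogue in [AUDIO] is long, the lip sync drifts. Limit each cut to one or two sentences, and do not delete no distorted lip sync from [NEGATIVE].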