Voice acting Study Guide
Study Guide
📖 Core Concepts
Voice Acting – Performing a character or delivering information using only the voice; no on‑screen appearance required.
Voice Actor / Voice Artist – The performer; also called voice‑over talent, dubbing artist, etc.
Character Voices – Voices created for animated or non‑visual characters (e.g., video‑game avatars, radio dispatchers).
Narration – Spoken commentary that conveys a story; may be a distinct character voice or a neutral, author‑crafted tone.
Commercial Voice‑Over – Voice work for advertising (TV, radio, online); often edited for a clean, persuasive sound.
De‑breathing – Editing technique that removes audible breaths to avoid listener distraction.
Dub Localization – Re‑recording dialogue in a new language replacing the original audio track.
Voice‑over Translation – Adding a new language track while keeping the original audio audible in the background (common in documentaries).
Automated Dialogue Replacement (ADR) / Looping – Post‑production re‑recording of dialogue to improve quality, fix script changes, or change an accent.
Automated Announcements – Short, pre‑recorded fragments that a computer assembles and plays on demand (e.g., elevator chimes).
AI‑Generated / Modified Voices – Text‑to‑speech or voice‑synthesis software that creates or alters human‑like speech; increasingly used in games and tech products.
Voice‑Actor Recruitment – In the U.S., agencies specialize in matching actors to film, TV, or commercial projects.
Post‑Production Editing – Timing adjustments, noise removal, and consistency checks performed after recording.
---
📌 Must Remember
Voice acting spans animation, video games, commercials, audiobooks, documentaries, rides, theater, puppetry, and more.
Non‑fictional voice acting = pre‑recorded announcements in public spaces (stores, elevators, transport).
De‑breathing originated in commercial voice‑over but is now a standard clean‑up step across most voice work.
AI‑generated voices raise job‑security and deep‑fake concerns among professionals.
ADR is also called looping; its goals are clarity, timing, diction, or accent correction.
Voice‑over translation ≠ dub localization – the former retains the original background audio.
Recruitment is typically agency‑driven in the U.S.; agencies handle casting, contracts, and rates.
Post‑production editing ensures the final product sounds consistent across platforms (TV, web, mobile).
---
🔄 Key Processes
Casting & Recruitment
Agency receives project brief → auditions voice talent → selects actor.
Script Preparation
Script is broken into lines, marked for tone, pacing, and any required de‑breathing notes.
Recording Session
Actor records in a sound‑proof booth → multiple takes for each line.
Post‑Production Editing
Remove unwanted noises (mouth clicks, breaths).
Apply de‑breathing where needed.
Align timing to picture or animation cues.
ADR / Looping (if needed)
Identify problematic dialogue after filming.
Actor watches the scene, re‑records matching lip‑sync and emotion.
Engineer blends new audio with existing track.
Final Mix & Delivery
Mix voice with music/sfx → master for target platform → deliver assets.
---
🔍 Key Comparisons
Dub Localization vs. Voice‑over Translation
Dub: Original language track replaced completely.
Voice‑over Translation: Original audio remains audible beneath the new language track.
Character Voice vs. Narration
Character: Distinct personality, often matches a visual or imagined character.
Narration: Neutral or authorial tone; may be “in‑character” but primarily conveys story information.
Commercial Voice‑Over vs. Non‑Commercial (e.g., Audiobook)
Commercial: Persuasive, concise, heavily edited (de‑breathing, tight pacing).
Non‑Commercial: Emphasis on storytelling, may retain natural breaths for authenticity.
Human Voice Actor vs. AI‑Generated Voice
Human: Creative control, subtle emotion, legal/ethical rights.
AI: Scalable, cost‑effective, but prone to deep‑fake concerns and lacks genuine nuance.
---
⚠️ Common Misunderstandings
“Voice‑over translation replaces the original audio.” – It does not; the original stays in the background.
All narration is required. – Narration is optional in many media (games, films, TV).
AI‑generated voices are always low‑quality. – Modern TTS can sound human‑like, but still lacks true performance nuance.
De‑breathing is done during recording. – It is a post‑production edit, not a vocal technique.
ADR is only for fixing bad audio. – It also serves to change accents, add lines, or improve timing.
---
🧠 Mental Models / Intuition
“Voice Acting = Audio Layer.”
Imagine the final product as a stack: script → performance → edit (de‑breathing, noise‑remove) → mix. Each layer must line up perfectly.
AI Voice = Synthetic Instrument.
Treat AI‑generated speech like a digital instrument: you can program notes (words) but you still need a skilled composer (human director) for musicality.
ADR = “Audio Patch.”
Think of ADR as a patch applied after filming—only the sections that don’t fit are replaced, leaving the rest intact.
---
🚩 Exceptions & Edge Cases
AI Voices can be legitimate for accessibility (e.g., screen‑readers) or low‑budget projects, despite industry pushback.
Automated Announcements may be assembled from tiny fragments, allowing dynamic phrase construction (e.g., “The train to [destination] is now boarding”).
Narration may be required for interactive media (e.g., tutorial sections in games) even when the main story is visual.
De‑breathing is sometimes intentionally left in for artistic effect (e.g., intimate audiobook narration).
---
📍 When to Use Which
Character Voice → When a distinct, personified persona is needed (animation, games).
Narration → When the story must be explained or guided without visual cues (novels, documentaries).
Commercial Voice‑Over → For advertising messages that demand brevity and persuasion; always apply de‑breathing.
Dub Localization → For foreign‑language releases where the audience expects a fully localized audio track (feature films).
Voice‑over Translation → For documentaries or news where preserving the original speaker’s tone adds credibility.
ADR → When on‑set audio is unusable, a line changes, or an accent must be altered.
AI‑Generated Voice → When scale, speed, or cost outweigh the need for nuanced performance, and legal/ethical clearance is secured.
---
👀 Patterns to Recognize
“Fragment + Computer” → Indicates automated announcements (short recorded bits assembled on the fly).
“Post‑recording re‑capture” → Signals an ADR/looping scenario.
“Background audio remains audible” → Points to voice‑over translation, not full dub.
“Removal of breaths” → Typical of commercial voice‑over editing.
“AI‑generated” + “deep‑fake concerns” → Relates to ethical debates surrounding synthetic voices.
---
🗂️ Exam Traps
Distractor: “Dub localization keeps the original language audible.” – Wrong; dub replaces it.
Distractor: “All narration is mandatory in video games.” – Incorrect; many games use ambient sound or text instead.
Distractor: “De‑breathing is a vocal warm‑up technique.” – Misleading; it’s an editing step, not a performance skill.
Distractor: “AI‑generated voices are illegal for commercial use.” – Not true; legality depends on licensing and consent, not the technology itself.
Distractor: “ADR is only used for fixing microphone hiss.” – Oversimplified; ADR also handles script changes, accent adjustments, and timing fixes.
---
or
Or, immediately create your own study flashcards:
Upload a PDF.
Master Study Materials.
Master Study Materials.
Start learning in seconds
Drop your PDFs here or
or