FAQs | Swell

Audio Production Glossary

At Swell, every deliverable goes through a documented QC pass before transmission. Our clients receive files that are ready to air—not files that still need to be checked by the agency or network.
Elad Marish
Swell Founder

Audio Production Glossary for Advertising

By Elad Marish, Founder & ECD, Swell Music + Sound · Updated May 8, 2026

This glossary covers the essential audio production terms you'll encounter working with music and sound for advertising. Definitions are written for brand marketers, agency producers, and creative directors — not audio engineers.


Audio post production

Audio post production is the process of creating, editing, and finalizing all sound elements for a video or broadcast deliverable after picture editing is complete. For advertising, a full audio post production pass typically includes dialogue editing, ADR, sound design, foley, music editing, final mix to picture, and delivery in broadcast-compliant formats.

The term "post" distinguishes this work from production audio — sound recorded on set during filming. Post production audio takes the raw materials from the shoot (dialogue recordings, production sound) and transforms them into a polished, broadcast-ready audio track.

At Swell, audio post production for advertising has been our core service since 2009. The full chain — from OMF to final mix to broadcast delivery — lives under one roof with one point of contact. Learn more about our audio post service.

Sound design

Sound design is the creation and placement of audio elements — effects, atmospheres, textures, and sonic moments — that build the world of a spot beyond what was captured on set. Sound design transforms raw footage from a location into a believable, emotionally resonant sonic environment.

In advertising, sound design serves both functional and expressive purposes. Functionally, it fills gaps where production audio is unusable. Expressively, it heightens the reality of the spot — making a product sound crispier, a car sound more powerful, or a brand moment feel more cinematic than the raw footage would suggest.

Sound design is distinct from music (composed elements) and dialogue (spoken word). In a finished mix, these three layers — music, dialogue, and effects (including sound design and foley) — are balanced by the mixing engineer. See Swell's sound design work.

Foley

Foley is the practice of recording custom, synchronized sound effects in a studio environment to replace or augment sounds that were not captured adequately on set. Named after Universal Studios sound editor Jack Foley, who pioneered the technique in the 1920s, foley covers the physical sounds of a scene: footsteps, clothing movement, prop handling, and any tactile interaction that needs to match the picture precisely.

Foley is performed by a foley artist who watches the picture on a monitor and records sounds in real time, matching timing and physical characteristics to the on-screen action. The recorded foley is then edited, processed, and mixed into the final sound design.

In advertising, foley is particularly valuable for product-centric spots — food, beverage, consumer goods, automotive — where the product sound is integral to the brand message. A soda being poured. A chip being bitten. A car door closing with authority. These sounds, when done right, make the product feel real. When done wrong or omitted, viewers register a vague sense that something is missing, even if they can't name it.

Swell records foley at Airship Labs Studios in Richmond, CA, typically as a half-day or full-day session. Read more about foley in advertising.

ADR (Automated Dialogue Replacement)

ADR, or Automated Dialogue Replacement, is the process of re-recording dialogue in a studio environment to replace on-set recordings that are technically unusable or require revision. Despite the word "automated," there is nothing automatic about it — ADR is one of the most technically and performatively demanding skills in audio post production.

On-set dialogue is frequently compromised by environmental noise (wind, HVAC, traffic), equipment interference, or performance issues that only become apparent in the editing room. ADR provides a clean, controlled alternative.

The ADR process: the actor or voice talent watches their original performance on a monitor while listening to a looped "beep sync" countdown, then speaks the line in time with their on-screen lip movement. Multiple takes are recorded until the timing, performance, and emotional quality match. The new recording is then processed to match the acoustic environment of the original location.

In advertising, ADR is most commonly used for script revisions after the shoot (changing a line without a reshoot), location audio compromised by environmental issues, and performance improvements where the timing was good but the emotional quality wasn't right.

Swell records ADR at Airship Labs in Richmond, CA, and remotely via Source-Connect. Full ADR guide for advertising.

Final mix to picture

A final mix to picture is the process of balancing, processing, and delivering all audio elements — dialogue, music, and effects — at their finished levels in sync with the locked video, ready for broadcast or distribution.

The final mix is the last creative and technical stage of audio post production. It integrates all the elements built during post — edited dialogue, ADR, sound design, foley, and music — into a single, cohesive audio track that meets broadcast loudness specifications and platform delivery requirements.

A professional final mix is not just about volume levels. It involves frequency management (EQ), dynamic control (compression and limiting), spatial positioning (panning and depth), and careful attention to how music, dialogue, and effects interact throughout the spot.

Delivery standards that Swell mixes to: ATSC A/85 (US broadcast), EBU R128 (European broadcast), YouTube, Meta, and Spotify platform specs. See Swell's mixing service.

OMF / AAF

OMF (Open Media Framework) and AAF (Advanced Authoring Format) are file formats used to transfer an edited video project's audio tracks from an editing application to an audio post production studio.

When a video editor completes a picture-locked cut in Avid, Adobe Premiere, or Final Cut Pro, they export an OMF or AAF file that contains the audio tracks, edit decisions, timing information, and reference to the media files. The audio post engineer imports this file into their digital audio workstation (Pro Tools, Logic, etc.) to begin audio post work.

A clean, well-organized OMF or AAF is the single most important thing a video editor can provide to their audio post partner. A messy session — mislabeled tracks, missing media, mixed sample rates — adds time and cost to every subsequent step of the audio post process.

At Swell, we prefer AAF over OMF for most projects, as AAF supports a wider range of metadata and is more broadly compatible. We provide a handoff guide to our clients specifying exactly how to export from each major editing application. See the full audio post checklist including handoff instructions.

Stems

Stems are separate audio files of the individual components of a final mix — typically delivered as a music stem, a dialogue stem, and a sound effects stem — that allow editors and distributors to modify or repurpose the audio without returning to the full mix session.

For example, a final mix might be delivered as: a full stereo mix, a music-only stem, a dialogue-only stem, and an effects-only stem. With these four files, an agency can create a version of the spot for a market where the dialogue needs to be overdubbed in another language (using the music and effects stems without the dialogue) without requiring a full remix.

Stems are particularly valuable for: international versioning, creating cutdowns (15-second and 6-second versions from a 30-second master), adjusting music or dialogue levels for specific platforms, and future campaign updates that reuse audio elements.

All Swell custom music packages include stem delivery as standard. For audio post projects, stem mixes are included in the deliverables package. See the full deliverables checklist.

Music supervision

Music supervision for advertising is the process of finding, evaluating, budgeting for, and securing the rights to music for a campaign — managing the creative, logistical, and legal dimensions of music placement from brief to delivery.

A music supervisor's job begins before a note is heard. They help the creative team articulate what the music needs to do — translating vague references ("upbeat and modern") into a workable brief with specific emotional, tonal, and structural parameters. They then search the available market — library catalogs, independent artists, major-label catalogs — for tracks that fit the brief and the budget.

Once a track is selected, the supervisor manages sync clearance: negotiating and securing both the sync license (composition rights from the publisher) and the master use license (recording rights from the label or artist). Both sides must reach 100% clearance before the track can legally appear in a final deliverable.

Music supervision for advertising requires fluency in both creative taste and licensing law. The best supervisors know what a track is likely to cost before they pitch it — saving clients from falling in love with music that's out of reach.

Swell provides music supervision as part of our full audio post service, and as a standalone service for clients who have their post handled elsewhere. See Swell's music supervision service.

Sync license

A sync license (synchronization license) grants permission to use a piece of music in timed relation to visual content — in an advertisement, film, television program, or any other video media.

The sync license covers the underlying composition — the melody and lyrics as written by the songwriter. It is controlled by the music publisher (or the songwriter directly, if self-published) and must be negotiated and cleared independently of the master recording.

Sync license fees are not fixed. They are negotiated based on: the media use (broadcast, digital, cinema, in-store), the territory (US only, North America, worldwide), the duration of the license, the exclusivity requirements, and the profile of the track and its songwriter.

A sync license alone does not authorize use of a specific recording. For that, a separate master use license is required. Both must be cleared before a track can legally appear in a final deliverable. Full music licensing reference.

Master use license

A master use license grants permission to use a specific recorded performance of a song in a video or broadcast deliverable. It is distinct from the sync license, which covers the underlying composition.

The master use license is controlled by whoever owns the specific recording — typically a record label for commercially released music, or the artist directly for independent releases. For a well-known commercially released track, the master use license is usually the more expensive and more difficult clearance to obtain.

When both a sync license and a master use license are required (as they are for all commercially released recordings), each is negotiated separately with different parties at potentially very different prices. If either side declines, the track cannot be used regardless of the status of the other license.

Custom music composed by Swell eliminates the master use issue entirely — there is no separate master to clear, because the recording is created for the client and rights are assigned directly. Full licensing reference.

Music licensing

Music licensing is the process of obtaining legal permission to use copyrighted music in a specific context — in advertising, this means securing the rights to use a track in a commercial, branded content video, or other media where music accompanies a visual.

For advertising use, music licensing almost always requires two separate licenses: the sync license (from the publisher, for the composition) and the master use license (from the label or artist, for the specific recording). Both must be cleared before the track appears in a deliverable.

Music licensing fees are calculated based on four primary variables: platform (broadcast, digital, in-store), territory (US only through worldwide), term (12 months through perpetual), and exclusivity (non-exclusive through full exclusivity). Each variable materially affects the final fee. See the complete music licensing cost reference.

Custom music / original score

Custom music (also called an original score or bespoke music) is music composed specifically for a brand, campaign, or piece of content — written to fit the brief, the edit, and the brand's sonic identity rather than adapted from an existing track.

Custom music for advertising is commissioned through a music studio or composer. The client receives an original composition that no other brand can use. Depending on the agreement, the client may own the master recording and the publishing outright (a work-for-hire), or license the music for specific uses.

Custom music eliminates the two-license clearance process required for commercially released music. There is no publisher to negotiate with and no label to clear — the rights are assigned directly through the composition agreement.

Swell's custom music rate card: custom library track $5,000; original score with 1 demo $10,000; original score with 5 demos $15,000; original score with 8 demos $18,000. All packages include ideation library pulls, stem delivery, and up to 5 revision rounds. Full custom music service details.

Library music / stock music

Library music (also called stock music or production music) is pre-composed, pre-recorded audio available for licensing through a catalog or platform, typically with standardized terms and pricing.

Library music platforms — including Artlist, Musicbed, Epidemic Sound, Marmoset, and Top Shelf Music (Swell's own platform) — offer catalogs ranging from thousands to millions of tracks. Licensing is typically faster and less expensive than sync licensing commercially released music or commissioning custom compositions.

Important caveats: most library subscription plans do not cover paid advertising media without an upgraded license tier — confirm your specific plan covers commercial use before delivery. Library music is non-exclusive in most cases, meaning competitors can license the same track. Culture-first platforms like Top Shelf Music curate specifically to reduce the risk of overexposure and competitive collision.

Custom music vs. library music: full comparison.

Sonic branding

Sonic branding is the strategic, systematic use of sound to build and reinforce brand identity — the audio equivalent of a visual identity system. A complete sonic brand encompasses audio logos, brand music guidelines, a sonic palette (a defined set of musical characteristics that govern how the brand sounds), and the deployment rules for how those elements appear across every touchpoint.

Sonic branding is not the same as a jingle. A jingle is a single piece of campaign content. A sonic brand is the system within which jingles, background music, product sounds, app tones, hold music, and event audio all operate coherently.

Well-known sonic brand examples: the Intel chime (5 notes, immediately recognizable without visual context), the Netflix "ta-dum" (a product sound engineered to create anticipation), McDonald's "ba da ba ba baa" (a melody that escaped its advertising context and became cultural shorthand).

Swell has built sonic branding systems for United Healthcare (in partnership with Turner Duckworth and Leo Burnett), Postmates, the Golden State Warriors, and other brands. Full sonic branding service details.

Audio mnemonic / sonic logo

An audio mnemonic (also called a sonic logo or audio logo) is a short, distinctive musical or sonic signature — typically 2–5 seconds — that serves as the acoustic equivalent of a visual logo, creating immediate brand recognition.

An audio mnemonic is the most distilled expression of a sonic brand. It must be memorable enough to trigger brand recognition in isolation — without visual context — and distinct enough to be ownable by one brand.

Designing an effective audio mnemonic requires the same discipline as designing a visual logo: understanding what the brand represents, what emotional associations it needs to trigger, and what sonic characteristics distinguish it from competitors. Unlike a jingle, an audio mnemonic is not typically melodic in the traditional sense — it's a sonic mark, engineered for recognition rather than sing-along appeal.

Swell designs audio mnemonics as part of full sonic branding engagements. The United Healthcare mnemonic, developed with Turner Duckworth and Leo Burnett, is one of our most significant sonic brand projects. Sonic branding service.

Kill fee

A kill fee is a payment made to a composer, musician, or studio when commissioned creative work is ultimately not used — compensating them for the time and effort invested in work that was cancelled or replaced after it was completed.

Kill fees are a standard professional protection in the music composition industry. They arise when a project is cancelled after work has begun, when a campaign direction changes and the commissioned music is no longer needed, or when multiple composers are commissioned and only one track is selected.

Kill fees in professional composition agreements typically range from 25% to 50% of the total contracted fee. The exact percentage depends on how far into the project the work progressed at the time of cancellation. Agencies and brands that commission multiple composers simultaneously (for competing demos) should budget kill fees for the compositions not selected.

Kill fees are distinct from demo fees (which are upfront investments in the creative exploration process) and from buyout agreements (which transfer rights to the client). Full kill fee guide.

Demo fee

A demo fee is an upfront payment to a composer or music studio for creating speculative compositions — demos — for consideration before a project is awarded or a final track is selected.

Demo fees compensate composers for the creative and production time required to develop original music to a brief before a client has committed to using any specific track. They are distinct from the full project fee, which covers the complete composition, recording, and delivery.

At Swell, demo costs are built into our tiered score packages rather than charged separately — the package price reflects how many compositional directions you want to explore (1, 5, or 8 demos). This provides cost transparency upfront: you know the total investment before work begins, regardless of which demo direction is ultimately selected.

Demo fees are an appropriate and professional investment in creative exploration. Expecting custom music composition without compensating for the demo phase is equivalent to expecting a design firm to present multiple visual identity concepts for free. Custom music packages and pricing.

Source-Connect

Source-Connect is a professional audio routing and session-sharing technology that enables real-time, broadcast-quality remote recording and mixing sessions between a studio and any location worldwide.

Before Source-Connect became the industry standard for remote audio production, physical presence in the studio was required for live sessions — voiceover recording, ADR, and supervised mixing. Source-Connect eliminates that requirement while maintaining the audio quality and real-time interactivity of an in-studio session.

For advertising clients, Source-Connect means that geographic proximity to a studio is no longer a selection criterion. A brand team in New York can supervise a final mix session at Swell in Richmond, CA in real time — hearing the same audio the engineer hears, providing feedback in the moment, and approving deliverables without travel.

All Swell sessions are available via Source-Connect as a standard option. We work regularly with agency teams in Los Angeles, New York, Chicago, and internationally. Audio post production service.

Loudness normalization / LKFS

Loudness normalization is the process of adjusting a finished audio mix to meet specific loudness targets required by broadcast networks, streaming platforms, and digital distribution channels. LKFS (Loudness K-weighted, relative to Full Scale) is the measurement unit used to specify these targets.

Before loudness standards were established, advertisers would intentionally master their spots louder than program content — creating the "loud TV commercial" phenomenon audiences found disruptive. Regulatory standards (CALM Act in the US, EBU R128 in Europe) now require broadcasters to normalize content to consistent loudness levels.

Key loudness targets for advertising delivery:

  • US broadcast (ATSC A/85): –24 LKFS integrated loudness
  • European broadcast (EBU R128): –23 LUFS integrated loudness
  • Streaming (YouTube, Spotify, Apple Music): –14 LUFS integrated loudness
  • Meta/Instagram: –14 LUFS

A professional final mix includes loudness normalization to the correct target for every platform the spot will appear on. A spot mastered for broadcast and uploaded to YouTube without re-mastering will be automatically turned down by the platform's normalization algorithm, potentially affecting the perceived impact of the music and sound design.

Swell delivers to all major platform specs as standard. See the full deliverables checklist.

M&E mix (Music and Effects mix)

An M&E mix (Music and Effects mix) is a version of a finished spot's audio that contains all elements except the dialogue — music, sound design, foley, and any other non-voice audio — delivered as a separate file alongside the full mix.

M&E mixes exist to facilitate international versioning. When a spot is adapted for a market where the dialogue needs to be dubbed into another language, the dubbing studio works from the M&E — replacing only the dialogue while retaining the original music and sound design intact.

Without an M&E, international versioning requires either re-creating the full audio post (expensive, time-consuming, and prone to quality inconsistency) or delivering a version where the dubbed dialogue is mixed on top of the original full mix (audibly inferior).

The M&E mix is the most commonly forgotten deliverable in advertising audio post. It costs almost nothing to produce alongside the final mix — and it costs significantly more to produce after the fact when it's suddenly needed for an international release. Swell builds M&E mixes into our standard deliverables for any project with potential international distribution. Full deliverables checklist.

Dolby Atmos (advertising context)

Dolby Atmos is an object-based spatial audio format that places sounds in a three-dimensional acoustic space rather than assigning them to fixed speaker channels, creating an immersive listening experience that adapts to any playback system from cinema to headphones.

In advertising, Dolby Atmos is no longer exclusively a theatrical format. It is now delivered natively on Apple Music, Amazon Music, and supported by streaming platforms; it renders on AirPods and compatible headphones as binaural spatial audio; and it is increasingly required for CTV (connected TV) and premium streaming ad formats.

For advertising audio, the practical impact of Atmos is most pronounced in close-listening environments — AirPods, high-quality headphones, Atmos-enabled home theater systems. A spot mixed in Atmos can create a physical sense of presence and dimension that stereo cannot replicate: a crowd surrounding you, a product materializing in space, an intimate vocal that feels closer than any stereo mix could achieve.

Swell delivers Dolby Atmos mixes as a standard option for campaigns with streaming, CTV, or premium digital distribution requirements. Atmos is mixed in conjunction with the standard stereo deliverable, not as a replacement. Audio post production service.

Broadcast compliance / QC

Broadcast compliance refers to the technical and format requirements that a finished audio (and video) deliverable must meet to be accepted for transmission by a broadcast network, streaming platform, or distribution channel. QC (quality control) is the process of verifying that a deliverable meets those requirements before submission.

For audio, broadcast compliance typically covers: loudness levels (LKFS targets), sample rate and bit depth, file format, channel configuration (stereo, 5.1, Atmos), and the absence of technical artifacts (clipping, dropouts, sync errors, level anomalies).

A spot that fails technical compliance at the network or platform level will be rejected and returned for correction — a delay that can miss air dates and incur rush fees. A professional audio post studio performs QC as a standard final step before any deliverable leaves the building.

Your Story Our Sound
© Swell. All rights reserved.