· By the Baltimore AV Company production team
Hybrid town halls that don't embarrass anyone: audio first
Remote attendees forgive a mediocre camera and never forgive bad audio. Why hybrid sound fails, what mix-minus means, and what a real hybrid kit includes.
Here is the rule that governs every hybrid town hall: remote attendees will forgive a mediocre camera angle and will never forgive bad audio. Video quality is a preference; audio quality is comprehension. When the remote half of your company can’t hear the CEO clearly, they haven’t attended a worse version of the meeting — they haven’t attended the meeting. So the hybrid budget, attention, and engineering should start with sound, and this guide explains what that means in practice.
Why remote audio fails
The classic hybrid failure starts with a reasonable-sounding idea: the room already has microphones, so point a laptop at the stage and let the meeting platform handle it. Here’s what actually happens.
The laptop hears the room, not the program. A laptop microphone — or a conference puck on a table — captures the PA’s sound after it has bounced around a ballroom: reverberant, distant, drowned in chair squeaks and coughing. The in-room audience hears a tuned system; the remote audience hears a recording of a room. The fix is structural, not incremental: the remote feed must come directly from the audio console, where every microphone arrives clean before the room’s acoustics touch it.
Echo arrives the moment anyone remote speaks. Hybrid means audio flows both directions, and that’s where DIY setups collapse. The remote presenter’s voice plays through the room’s PA, the room’s microphones pick it up, and it returns to the presenter a half-second later. Hearing your own voice on delay is neurologically brutal — presenters stumble, stop, apologize, and stumble again.
The cure is mix-minus: each remote participant receives the full program mix minus their own audio. It’s a routing discipline on the console, invisible when done right and unmistakable when skipped. If you learn one term from this article, it’s this one — asking a vendor “how are you handling mix-minus?” instantly reveals whether they’ve engineered hybrid before.
Q&A gets forgotten. The town hall’s whole point is the exchange, and then a remote employee asks a question nobody in the room can hear — or an in-room question never reaches the stream. Every voice that’s part of the program needs a path through the console: audience microphones in the room, and the platform’s audio returned properly for remote questions.
Levels drift with nobody watching. Presenters trade microphones, walk away from podiums, and play videos mastered at wildly different volumes. In the room, an engineer rides those changes. If nobody owns the remote mix, remote listeners get the unridden version — whisper, blast, whisper.
Mult feeds: the same principle, more destinations
The device that solves “many listeners need clean audio” is the mult — a distribution box that takes the console’s program feed and hands out clean, isolated copies. At press-facing events, it’s how every journalist and broadcast crew gets direct audio without a pile of recorders taped to the podium. At a town hall, the same discipline feeds the stream, the recording, the overflow room, and the assistive listening system — each from the console, each independent, so a problem in one feed can’t contaminate the others. One clean source, many destinations, no daisy chains: it’s the architecture behind every professional multi-audience event.
What a proper hybrid kit includes
When we build a hybrid town hall, the kit behind it looks like this:
- A digital console with the channel count for both audiences — every room microphone and playback source, plus the remote return paths, with separate mixes for the room and the feed. The remote mix is its own product: compressed and balanced for laptop speakers and earbuds, not just a copy of the room’s.
- Mix-minus returns for every remote presenter, tested with each of them before show day — their tech check is part of your show now.
- A produced video path — real cameras on the speakers and slides at native resolution into the platform, so the remote experience is a program, not surveillance footage. (Camera choices are covered in our video work.)
- Wired network with confirmed upload bandwidth, specified with the venue’s IT during the advance and tested during setup — plus bonded-cellular backup when the stakes or the venue’s infrastructure demand it. Hotel Wi-Fi is not a transmission plan.
- A dedicated stream engineer. During a live program, the room engineer’s attention belongs to the room. The remote audience needs its own human — watching levels, watching the platform, answering “is the stream okay?” with knowledge instead of hope. One-operator hybrid fails exactly when both audiences need attention at once, which is always.
- Redundant recording, so the town hall that went perfectly exists somewhere other than the platform’s cloud processing queue.
The room audience is part of the hybrid system too
Hybrid engineering runs both directions, and the in-room half has its own failure modes worth naming.
When a remote presenter speaks, the room needs to see and hear them well — a face at real size on a real screen, voice through the tuned PA, not a laptop on a stool doing both jobs badly. The difference in how seriously the room takes a remote speaker when they’re produced properly is visible from the back row. Delay matters here too: platform audio arrives a beat behind, and an engineer managing that gap keeps remote-to-room exchanges from stepping on each other.
And when leadership takes questions across both audiences, the moderation is part of the system. The pattern that works: one person owns the queue across both rooms, remote questions come through a vetted channel (the platform’s Q&A tool or a producer relaying), and the presenter repeats or paraphrases questions so both audiences always know what’s being answered. It’s a facilitation discipline, but it lives or dies on the audio routing underneath it.
A word on platforms
Planners often start with “should this be Zoom, Teams, or a webcast platform?” — but that’s an IT and audience question more than an AV one, and honestly, any mainstream platform can carry a well-engineered feed. The right sequence: your IT team picks what your company can access and secure, your communications team decides how interactive the remote experience should be (full participants versus view-only viewers), and your AV team engineers into whatever that answer is. What we care about technically is the same on every platform — direct console audio in, mix-minus out, produced video, tested bandwidth. A great platform can’t fix a laptop microphone, and a mediocre platform can’t break a properly built feed.
The planner’s checklist
Compressed to one screen — the questions to settle in your advance (fold them into the 30-day run-up):
- Does the remote feed take audio directly from the console? (Only correct answer: yes.)
- How is mix-minus handled for remote presenters?
- Who — by name — is mixing for the remote audience during the program?
- How do remote questions reach the room, and room questions reach the stream?
- What’s the bandwidth plan, and what backs it up?
- When are remote presenters tech-checked?
A hybrid town hall done right costs more than pointing a laptop at a stage, and it should — it’s two audiences, engineered honestly. If half your company is remote, they’re not an overflow room; they’re the other half of the event. Build for them like it, starting with the audio. If you want that built by people who’ve done it, tell us about your town hall.