Voice Activity Detector
The Voice Activity Detector (VAD) is an audio processing component that automatically detects the presence of speech within an audio signal. By distinguishing speech from background noise in real-time, the VAD provides logic that can be used to drive intelligent audio workflows such as microphone gating, automixing, and voice-driven control functions including camera switching. The VAD is designed for seamless integration into Q-SYS designs, providing reliable speech detection for a wide range of conferencing, presentation, and voice-activated applications.
Note: Voice Activity Detection (VAD) uses CAT2 processing resources for detection. VAD instances share these resources with other CAT2 features, including AEC (and NM-T1), as well as Notch Feedback Controller. VAD resource usage is reported under CAT2 in Check Design Compile Results.
Input
Input - Audio Input Pin
Connect this pin to the audio signal you want the Voice Activity Detector to analyze. This input provides the source signal used for speech detection and drives all related control outputs, including Voice, Confidence, and Threshold behavior.
The input signal is monitored continuously while the design is running.
Voice Output
Voice Output - Audio Output Pin
This audio output is available only when Voice Output is enabled in the component properties. When enabled, it provides a version of the input signal that is gated based on detected speech-audio passes when speech is present and is suppressed when speech is absent, allowing the VAD to function as a simple voice-controlled gate. When Voice Output is disabled, this audio pin is not shown; however, the VAD continues to analyze the input signal and update its control outputs for use with external processing or logic components.
These examples demonstrate common use cases for the Voice Activity Detector.
In this setup, the Voice Activity Detector acts as a filter, ensuring the Automatic Camera Preset Recall responds only to spoken audio and ignores constant or incidental background noise like HVAC systems or room sounds.
In this setup, the Voice Activity Detector is used as a lightweight way to infer room presence and even log the last time someone was active (speaking), without relying on cameras or additional sensing systems.
Voice Activity Detector Properties
Voice Output
Enables or disables the VAD's gated audio output.
-
Yes - The component exposes a Voice Output audio pin that passes the input signal only while speech is detected. Non‑speech audio and background noise are suppressed.
-
No - The component does not provide a gated audio output. The VAD continues to analyze the input signal and update its control outputs, allowing speech detection to be used for external gates, automixers, control logic, or camera control without altering the audio signal path.
Graphic Properties
Label
User-defined text that appears next to the component on the Schematic. Changing the label does not affect the component name or system behavior.
Position
Specifies the X and Y coordinates of the component on the Schematic. The values update automatically when you move the component, and can be edited to place the component at an exact location.
Fill
Sets the background color of the component block on the Schematic. This is for visual organization only and does not affect system operation.
Script Access
Code Name
Displays the currently assigned name for control access. You can use the auto-assigned name or customize it. Q-SYS will automatically check all Code Names in the design to ensure name is unique.
Script Access
Defines whether the component will be accessible by script and/or externally, or not at all. Choices include All, External, None (default), and Script.
- None (default) - Not accessible by any script, plugin, or by Q-SYS Remote Control Protocol (QRC).
- Script - Can be accessed by scripts, such as Text Controller, Block Controllers, and plugins only.
- External - Can only be accessed by 3rd party controls systems using component commands from the Q-SYS Remote Control Protocol (QRC).
- All - No restrictions, can be accessed by 3rd party control systems via Q-SYS Remote Control Protocol (QRC), or script objects or plugin objects.
Tip: Use Script Programmer Mode to quickly view the Script Access setting directly on the component in the design schematic without the need to disconnect from the Q-SYS Core processor.
Voice
A logic output that indicates whether speech is currently detected. The output is active (true/on) when the Confidence value meets or exceeds the Threshold, and inactive (false/off) otherwise. Updates in real-time based on the input audio signal.
Confidence
A real-time value generated by the Voice Activity Detector that represents the estimated likelihood that speech is present in the audio signal. Higher values indicate greater certainty of speech detection. This is read-only and may not be adjusted.
Threshold
Sets the minimum Confidence level required for speech to be considered detected, adjustable from 0–100%. When Confidence meets or exceeds the Threshold, Voice becomes active; when it falls below the Threshold, Voice becomes inactive. Increasing the Threshold helps reduce false triggers from background noise, while decreasing it allows detection of quieter speech, enabling adjustment for different room conditions.
Bypass
Disables voice output signal processing. Available only when Voice Output is enabled in the component properties. When enabled, audio passes through unaffected by voice detection, while voice detection continues running in the background. Control outputs (Voice and Confidence) remain functional.
| Control ID |
Pin Name |
Value |
String |
Position |
Pins Available |
|---|---|---|---|---|---|
|
voice |
0 1 |
No Speech Detected Speech Detected |
0 (Off) 1 (On) |
Output |
|
|
confidence |
Confidence |
0 - 100 |
Speech Detection Confidence (%) |
0 - 100 |
Output |
|
threshold |
Threshold |
0 - 100 |
Minimum Confidence Required to Detect Speech (%) |
0 - 100 |
Input / Output |
|
bypass |
Bypass |
0 1 |
Voice-Based Gating Enabled Voice-Based Gating Bypassed (audio passes through) |
0 (Off) 1 (On) |
Input / Output |
