We have tested for you the top 6 VST plugins for voiceover and dialogue cleaning in 2023 as well as 3 free tools that could be used for this purpose.
What is A Voiceover?
A voiceover is a narration of information over a video but without the speaker appearing on screen. Even if you don’t know what it is, you have undoubtedly heard it in films, TV series, documentaries, sports, commercials, and breaking news.
Voice has a particular feature not found in any other living being on Earth; it is human. But to make that voice enjoyable, clear, present, and alive, we need to record it with the right tool.
Over the years, vocal processing has evolved, and new techniques appear to improve the sound of the voices. Many recording and broadcasting engineers are listening carefully and continuously improving those recorded vocals, pursuing perfection.
So, let’s check out some of those tools and how we can use them.
Top 6 VST Plugins For Voice-Over & Dialogue Cleaning 2023
1. Waves Clarity Vx Pro (Noise Reduction For Voice)
More Info & Price (Trial Available)
Reduce any kind of noise in your voice tracks with easy and responsive controls.
Waves indeed have a good collection of sound-crafting tools in their catalog, and almost certainly, all of them can assist you in any way or another. Clarity Vx Pro is one of those tools, being the perfect companion for reducing any kinds of noises and imperfections in your speech or vocals.
- Neural networks
A neural network is a learning machine capable of storing information with the help of dedicated algorithms in order to make decisions quickly. In this particular case, Clarity Vx Pro uses neural networks to analyze each aspect of your waveform, providing instant corrections to your vocal track. It can automatically detect noises and other unwanted variations in your waveform and correct them on the go.
- Main control knob
The main control knob is the biggest one on your screen, and it will be imperative to set the adequate range in which ambiance will be separated from the voice. For example, if you need to remove said ambiance from your speech, the value you set on this knob will be the threshold that the processing mix will work on.
- Dedicated faders
There are some “Band Process” faders assigned to each particular frequency band. They are important controls that determine how much of your signal is being affected by the neural network, according to each frequency band. The intention is to reduce any processing per band, not add more. Be aware that the more it’s being used, the more CPU power will be required.
- Analysis mode
If your vocal track requires some special stereo treatment, this option might be useful to you. The “Analysis Mode” is introduced after you decide whether you want your tracks in a single or double-channel distribution. The single mode sums up both left and right channels before analysis, and the double mode separates them very clearly – either by different ambiance or the voice quality itself.
- Neural network selector
If introducing a neural network sounds too good to be true, Waves decided to go even further: you can choose between three distinct neural networks that differ in values, parameters such as bias and ratios, and statistics.Neural NetworkVoice FocusBroad 1Isolates primary and secondary voices from the ambiance, preserving the main voices.Broad 1 HFSimilar to the previous option, but with an extended high-frequency focus.Broad 2Isolates primary voice from secondary voices and ambiance, focusing on the main speaker or singer only.
- Neural networks
This plugin is available for macOS 10.14.6, 10.15.7,11.6.2, and 12.3 (64-bit only) and Windows 10 or higher (64-bit only). It runs in VST, VST3, AAX, and AU plugin formats. It’s compatible with M1 MACs only on Rosetta.
For more information on system requirements and authorization instructions, check out the Clarity Vx Pro User Manual.
Vocal processing is not an easy task, and removing noises or unwanted elements from a recorded track can be very difficult to achieve. Glady, we have plugins like Waves Clarity Vx Pro, which makes heavy use of neural networks and algorithms to get the job done.
You can isolate the noises, enhance the voice’s qualities, and even separate multiple voices with this useful technology, depending solely on the processing power of your CPU.
2. UnitedPlugins Voxessor (Vocal Channel Strip)
Voxessor is a plugin that instantly improves your vocal recording or broadcasting. It covers all the aspects needed to treat a voice: noise gate, compressor, de-esser, and EQ, all in a reduced set of commands.
It is beneficial for online activities while recording because it works with incredible speed. It provides an “automatic volume“ feature and two one-knob dynamics processors (compressor and gate). The plugin detects if the incoming signal has to be processed or not. Based on that, it can temporarily turn on the “sleep mode” and save processing resources.
The main selector lets you EQ the human voice appropriately, depending on the source. It accepts sources from low male voices to highest girl ones. You can also analyze the recorded voice – using the “Intelligent Matching” feature – and let the plugin EQ the source automatically.
It also comes with a “Tunable De-Esser” that allows you to remove excessive sibilance without losing brilliance.
Finally, it has a 64-bit audio quality engine, a “smart bypass,” and a 3D user interface.
- Perfect and Consistent Voice-over
This plugin contains all the four tools you need to get a consistent recording. Compression and noise gating to set the vocal in the center of the field. De-esser to avoid distractions in the listener with a voice analyzer and automatic EQ for an ideal voice. And simple metering to quickly monitor all the parameters mentioned before.
- “Smart Dynamics” and Intelligent Processing
This plugin comes with a unique processor of character voice (man, boy, woman, girl) that adapts the recording or broadcasting voice to correct any defect and enhance any goodness. It also has an auto-leveling feature that makes the audio signal consistently solid. To help this, its high resolution of 64 bits takes this plugin to a premium level.
- Fully Portable
Apart from its powerful internal engine, it is fully portable. Once you purchase the full license, you can install the plugin on any other computer. You only need to copy your “license file” along with the copy. This way, you can use it comfortably in your home studio and any place using your notebook.
- Compact and Easy-to-use Interface
Anyone could think that all the tools this plugin provides would make your life unpleasant. However, the automatic leveling, the intelligent character voice, and the audio processor are only one-knob tweaking. Change one thing here, modify this other thing there, and you are ready to record or broadcast. Very simple, very quick, very useful, very powerful.
United Plugins’ Voxessor is available for Windows 8 or higher 32-bit or 64-bit and macOS 10.7 or higher 64-bit only. It comes in VST2/3, AU, and AAX formats.
United Plugins’ Voxessor has an exquisite design and provides all the features needed to obtain an outstanding vocal recording. All leveling capabilities (auto-level, compressor, noise gate) are present and include a fully tunable de-esser. It supports matching EQ with a vocal analyzer and a character voice modeling.
All these make this plugin a handy tool for every voice. Simple, slight, and powerful.
3. Acon Digital Extract:Dialogue (Noise Reduction)
Extract: Dialogue is another class of plugins for the handling of voices. Extract:Dialogue plugin separates dialogue from common types of noise – generally background noise – such as rustle, wind, hum, traffic, plosives, and mouth clicks and smacks.
The use of this plugin is to remove any noise from the voiceover surgically. The algorithm can work in real-time and uses deep learning technology. Despite its complex technology, “Extract: Dialogue” has a highly intuitive design. You simply add this plugin to your dialogue bus or track, and magic appears.
Noise reduction is fully automatic; however, the plugin interface offers three simple controls to adjust the “sensitivity of the noise detection,” the “amount of noise attenuation,” and the “frequency focus.”
This way, you can adjust the sensitivity independently in up to three frequency bands.
The attenuation control is convenient when you don’t want to remove the background noise but reduce it. This control is helpful in cases where you need environmental noise, for example, news reports or sports transmission.
- Top-Notch Technology
Based on deep learning, thousands of high-quality voice recordings trained its extraction algorithm. It also incorporated an extensive database of familiar noises. The comprehensive training also enabled artificial intelligence processing to distinguish dialogue from noise without user interaction automatically.
- Easy to Use with Total Control
Although the detection and extraction of noises are automatic, the plugin gives you the tools to adjust the necessary reduction conveniently. And if you cannot perceive the noise or don’t know what is being extracted, a “solo noise” mode is available. You can listen to the noise and be secure that the voice is untouched.
- Warranty of Good Results
A large number of factory presets help to customize the work. And it supports internal sampling processing up to 96Khz.
Acon Digital’s Extract:Dialogue is available for Windows 7 or higher (32-bit or 64-bit) and macOS 10.9 or higher (32-bit or 64-bit) (please note that macOS 10.15 and higher does not run 32-bit programs any longer). It comes in VST2/3, AU, and AAX formats.
Probably the user interface of Acon Digital’s Extract:Dialogue is not better looking; its internal processing can make the difference. The “solo noise” feature is helpful. And you cannot find the deep-learning technology included in this plugin anywhere near this price range.
4. WA Production Vocal Cleaner (Multi-Purpose Cleaner)
Vocal Cleaner is a plugin that takes a dirty vocal recording and converts it to a shining finished production. You, as a producer, can forget the details of parameters to focus on what is necessary, the voice.
As you can imagine, this plugin was conceived with the mind put on a musical production. The idea behind this plugin is to prepare the initial recording to a posterior polish of compression, saturation, and reverb. But since the preparation tasks are the same, voiceover tasks take advantage of this plugin.
The main three stages are the faders for the de-noise, de-esser, and gate processing. Once you establish the thresholds, a set of intelligent algorithms optimize the output with different tones: bright, dark, or natural.
Finally, you need to determine how much presence you need, and voila, your voice is ready to shine.
- 3 Processing Stages
This plugin reduces a complex process into a series of three clearly defined steps; easy to follow and monitor. The De-noise, the de-esser, and the gate appear in front of the screen with a simple fader you need to push down to get a better voice. Select the desired tone and adjust the presence. Three steps are all you need.
- Intelligent Algorithms
Although we can talk about tone, presence, brightness, or natural sound, we know that a set of intelligent processors adjust the EQ, the compression, and the harmonics behind the scenes. Two filters (low-pass and high-pass) are responsible for the reduction of mouth plosives and smacks.
- CPU and User Friendly
This plugin is not only elegant to the user but also not CPU intensive. The new version has a window fully resizable. Additionally, it comes with a set of factory presets ready to use for voice.
WA Production Vocal Cleaner is available for Windows 8 or higher (32-bit or 64-bit) and macOS 10.13 or higher (32-bit or 64-bit) (please note that macOS 10.15 and higher does not run 32-bit programs any longer). It comes in VST2/3, AU, and AAX formats.
WA Production Vocal Cleaner is one of those plugins I feel comfortable working with because only the necessary controls and metrics are there. All the things you need are there. That, combined with intelligent processors and state-of-the-art algorithms, is a warranty of good results.
5. Neverdie AUDIO Speachy (Multi-Purpose Tool / Channel Strip)
A complete VST plugin for the human voice is called Speaky. Built with consideration for voiceovers, podcasting, content makers, and live streaming.
A complete VST plugin for the human voice is called Speaky. Carefully crafted for live streaming, podcasting, content producers, and voiceovers alike, it eliminates the majority of unnecessary parameters for quick and simple setup, even for those with little to no experience with sound engineering.
It has a full signal chain already set up in the right sequence without requiring any gain matching from the user. Many settings are pre-set for most circumstances and produce fantastic outcomes in their default forms.
- Interface Settings
Due to its dual-threshold construction, the one-knob compressor can tighten the sound and regulate strong peaks. Pleasant frequencies in the human voice can be enhanced by light harmonic saturation. The plugin has crucial capabilities for reducing and getting rid of often bothersome noises, sounds, and frequencies.
Built-in noise reduction immediately eliminates most background noise with a delay of under 10ms. A limiter guards the output volume and graphically alerts the user to any potential signal distortion. Pre/post-equalization and stereo widening are two examples of advanced options that are, by default, concealed. It’s perfect for live scenarios because it has little CPU use and zeroes internal latency in real-time mode.
For voiceovers and narration that seem natural, use the TRANSPARENT style. For strong, faster-paced speech with a wide dynamic range, switch to the AGGRESSIVE style. This is especially useful while speaking live since it helps to avoid sudden high loudness spikes.
Drag the knob up and down to adjust the degree of compression, but as long as the input volume is kept in the sweet area, the default setting is pretty much perfect. Drag the buttons up and down to enhance your voice, then pay close attention to the difference. Extreme settings may cause additional problems, such as loud bass or unpleasant upper frequencies, for which you may need to make module-level adjustments elsewhere.
- De-Noise and De-Plosive
Static, hum, and fan noise are common background noises that noise cancellation blocks. The noise reduction level is adjusted using the knob. The filtering function is altered using the smaller “style” button. Although it may not be sufficient to manage large noise sources, it is quite mild and non-destructive to your voice in its default mode.
The noise canceling becomes more forceful when the knob is turned up, but the music quality may start to suffer. Accidental air burst strength is decreased by de-plosive. For optimal effects, use it with strong compression and a lo-cut filter. The module’s sensitivity may be changed using the button. Find a spot where the LED activity catches the plosives but not your voice by carefully zooming in and observing the LED activity.
This plugin is available for macOS 10.10 or higher (64-bit only) and Windows 7 or higher (64-bit only). It runs in standalone mode and VST2, AAX, and AU plugin formats.
When used in a DAW or with VSTHost and VR connection, Speachy is an amazing tool for real-time/post-voice editing since it lets you use the top-notch noise gate without changing the sound of your original voice. The internal compressor has high power but is quiet, clear, and transparent.
The seductive low-end function provides your voice FM station’s bass, while the mid- and high-voice air settings provide signal quality and presence.
6. FabFilter Pro-G (Noise Gate)
If we are talking about “FabFilter Sofware Instruments,” we are playing in the major leagues. FabFilter is renowned because of its high-quality products, and FabFilter Pro-G is not an exception.
FabFilter Pro-G is one of the best gate/expander plugins in the market. Its algorithms are perfectly tuned, its interface design is professional, and its control over the sidechain is complete.
This plugin features not only expander/gate processors but also includes upward expansion and ducking. It has an expert mode with customizable sidechain options and linear-phase oversampling. Hence it will not add any undesired artifacts.
The implementation of sidechaining and ducking are beneficial features for radio broadcasting.
Pro-G allows mono, stereo, and mid/side processing with a zero-latency operation mode. And of course, it provides switches for A/B comparison and “undo/redo” modifications.
Pro-G provides many more features, such as Smart Parameter Interpolation or MIDI Learn, but that is out of the scope of voiceover handling.
- Sidechain Routings and Ultra-flexible Operation
FabFilter Pro-G delivers an excellent improvement of the recorded voice. Further, it provides very flexible routing, either in standard or expert mode. You can switch from mono to stereo, introduce the sidechain signal and work with oversampling.
- Ducking Mode
Radio broadcasting (and other activities as well) uses “ducking” to increase and decrease music (or other audio signals) while the radio show host starts or stops talking. You can also use this feature in voiceover when you are narrating over videos. However, I must warn you that an extreme amount of ducking can be annoying to the audience.
- Precise and Clear Metrics
Because of its multiple parameters, a clear visualization is necessary. We need to check which knobs are active, which processor is working, and how the audio signal is affected. FabFilter outstands the metering and combines it all in a unique, precise, and clear visualization panel.
- Help on Screen
As with every FabFilter plugin, Pro-G lets you enable on-screen help when you hover your mouse pointer over any plugin’s parameter. This way, you precisely know what you are doing and how that action would affect the output. This on-screen help not only avoids mistakes but also helps you learn more quickly how the plugin works.
FabFilter Pro-Gis available for Windows Vista or higher (32-bit or 64-bit) and macOS 10.10 or higher (32-bit or 64-bit) (please note that macOS 10.15 and higher does not run 32-bit programs any longer). It comes in VST2/3, AU, and AAX formats.
FabFilter Pro-G is one of the best plugins for processing audio dynamics. It provides all the tools you need to obtain the best audio signal. However, it is too much if you only need a plugin for voiceover. It exceeds all expectations, and your vocals will be great.
Still, if you don’t work with other sources (musical instruments, effects, mixed sources), I suggest opting for cheaper plugins for better value.
The 3 Best Free Voice Over Plugins 2023
1. Sleepy-Time DSP Lisp
Sleepy-Time DSP is a company that releases freeware VST plugins. Sadly, it has recently announced that it will not develop additional software or audio plugins.
Fortunately, you can still find their plugins on several archive sites of audio plugins. Sleepy-Time DSP plugins are beautifully designed and optimized for fast performance. Lisp, in particular, is a high-precision de-esser plugin. It uses an advanced transient detection algorithm to reduce sibilance.
- Transient Detection Algorithm
Not many de-esser plugins use this technology. So this is a good alternative when other plugins are not efficient in removing particular kinds of sibilance.
- Different stereo-field modes
It also presents a feature that makes it valuable. It works on all stereo modes: stereo, mono (left and right), and mid/side. This last mode is not available on some other plugins.
Sleepy-Time’s DSP Lisp is available for Windows 7 or higher (32-bit or 64-bit). It comes in VST2/3 formats.
Unfortunately, we will need to look around archived sites to find Sleepy-Time’s DSP Lisp, but if we can find it, it will be worth it.
2. Bertom Denoiser
Bertom Denoiser is a whole terrain noise reduction plugin.
It removes noise from music, post-production, voiceover, and dialogue. Further, it features a simple user interface; user-friendly and optimized for fast cleaning up. You can use it during a live broadcast because of its zero-latency processing.
Denoiser focuses on setting filters (high-pass and low-pass) and five different frequency bands. You count with five sliders to control every band. Once the noise frequencies are located, you can control the amount of reduction with another fader.
Despite its simple interface, this freeware plugin is an advanced noise production tool. It provides outstanding results while being easy to use and very light on CPU consumption.
- Automatic Noise Analysis
Denoiser’s noise reduction algorithm doesn’t need to learn a static noise profile. Instead, it tracks how the noise part of the audio signal is evolving in each band and removes it. It works similar to a dynamic equalizer: there will be no noise reduction if there is no apparent phase shift.
One thing to keep in mind when using Denoiser is that it works best for moderate noise reduction. You do not need to push it too hard. You’ll find a significant noise reduction with the threshold set about halfway down. If you try it more, it might introduce audible noise reduction artifacts. Use your ears for better results.
Bertom Denoiser is available for Windows 8 or higher, Linux, and macOS 10.9 or higher, both 32-bit or 64-bit. It comes in VST3, AU, and AAX formats.
Bertom Denoiser released this plugin recently. It uses state-of-the-art reduction techniques and is powerful and efficient. I will not surprise if this free plugin becomes a licensed product soon.
3. Cockos ReaGate
Cockos ReaGate is one of the most configurable noise gates in the market.
It not only comes with the standard parameters, like attack, hold-value, release, and reduction percentage, it also adds parameters for lookahead of the signal, hysteresis control, and the main detector input.
You can also control it remotely using MIDI CC messages if you use an external MIDI surface control.
Although it does not have intelligent algorithms that can make your life easier, it features every parameter you can use to customize this noise gate plugin. You will not find, for example, sidechain filters or hysteresis control on any free plugin.
- Updated audio engine
You can think that this is an old plugin, but it is not. Its interface indeed looks old-fashioned. But Reaper continuously updates its plugins to fix and improve every little detail.
Cockos ReaGate is available for Windows 98 or higher (32-bit or 64-bit). It comes in VST 2 format only.
Cockos ReaGate plugin is where you can customize the process of noise gating. You will control this gate. If you are familiar with noise gates and their parameters, this plugin will expand your capabilities.
1. Waves WNS Noise Suppression (Noise Reduction 2)
WNS Noise Suppressor is another excellent product developed by Waves. Waves products are a synonym of good quality.
WNS Noise Suppressor is a plugin to reduce noise from any vocal recording. As with every plugin from the family, WNS performs nicely and smoothly. It works in real-time with zero latency. You can use it with any vocal source, but it is ideal for dialog, narration, voiceovers, and broadcasting.
Its design is very similar to its little brother, Waves NS1 Noise Suppressor, but the power of WNS is superior. NS1 comes with only one fader: the threshold, the minimum volume before noise reduction. WNS, though, includes a set of frequency filters.
Audio engineers can identify the noise frequencies and isolate the noise from those frequencies exclusively using said filters. Similarly, while NS1 employs an attenuation bar to visualize the reduced noise, WNS shows the reduction in a spectral display, emphasizing the frequencies affected by the plugin.
- Hardware Power in a Small Software
Every audio professional knows that hardware boxes are unbeatable at the time of processing audio. But Waves WNS Noise Suppressor performs equal to or even better than hardware processors. For sure, it is more affordable.
- Zero Artifacts
With this plugin, you don’t need to worry about glitches or audible artifacts. You will have zero latency, zero artifacts, and zero errors. So, you can focus on your work and let the plugin will do the rest.
- Exclusive “Suggest” Feature
Apart from its six adjustable bands, large graphic display, and dynamic range selection (broad or narrow bands), it features a cutting-edge algorithm called “Suggest.” WNS analyzes the signal noise and its relationship with the active signal and automatically adjusts all the plugin parameters. Later it suggests the engineer the most probable frequencies where the noise is present.
Waves WNS Noise Suppressor is available for Windows 10 or higher (64-bit only) and macOS 10.13 or higher (64-bit only). It comes in VST2/3, AU, and AAX formats.
Waves WNS Noise Suppressor will not let you down. It is robust and well-designed. It has automatic processing and customizable parameters. It comes in broad (NS1) and multiple narrow bands (WNS). It is intuitive, and you can trust it.
We have tested six plugins for different processing of voiceovers. Some are purpose-specific, and some are more general.
The specific ones are oriented to reduce or remove complicated issues naturally present in the human voice, like sibilance, plosives, and smacks. WA Production’s Vocal Cleaner and Waves WNS Noise Suppressor belong to this class. These plugins are focused on the cleaning of a dirty voiceover.
Another class of purpose-specific tasks is FabFilter Pro-G, which focuses on the audio signal’s consistency and dynamics.
Acon Digital Extract: Dialogue or United Plugins’ Voxessor mixes both classes, de-noising plus dynamics. These plugins help current voiceover activities, like audiobooks narration, and other social network audio activities.
You can also purchase iZotope RX for professional tasks. You can do anything you need, and if you do not need a host, iZotope provides a standalone application.
Summing up, for voiceovers, I recommend United Plugins’ Voxessor. I like the tools it provides; it is fast and convenient for recording and broadcasting. I also like WA Production Vocal Cleaner because it mixes dynamics and noise reduction in real-time.
For specific de-noising tasks, WNS Suppressor is a must. And if you will work on vocal post-production or in forensic audio restoration, iZotope RX is unbeatable.
Shaurya Bhatia, is an Indian Music Producer, Composer, Rapper & Performer, who goes by the stage name MC SNUB, and is also 1/2 of the Indian pop music duo, called “babyface”. A certified Audio Engineer & Music Producer, and a practicing musician & rapper for more than 6 years, Shaurya has worked on projects of various genres and has also been a teaching faculty at Spin Gurus DJ Academy.