ElevenLabs, a pioneer in AI voice technology known for its voice cloning, text-to-speech, and speech-to-speech models, has launched a new tool: the AI Voice Isolator.
Now available on the ElevenLabs platform, this innovative offering enables creators to effortlessly remove unwanted ambient noises from various types of content, including films, podcasts, and YouTube videos.
How Does the AI Voice Isolator Work?
Background noise can significantly compromise the quality of content recordings. Creators often face challenges with sounds like chatter, wind, or nearby traffic, which can obscure the speaker's voice. While some use microphones equipped with ambient noise cancellation, this option may not always be accessible, particularly for early-stage creators.
Enter the AI Voice Isolator by ElevenLabs. This tool functions in the post-production phase, allowing users to upload their content. The advanced models analyze the file, detect, and remove background noise, ultimately extracting clean dialogue. ElevenLabs claims that the AI Voice Isolator achieves a sound quality comparable to studio recordings. A demo by the company’s head of design, Ammaar Reshi, showcased its effectiveness by removing the distracting noise of a leaf blower, resulting in crystal-clear speech.
Real-World Testing
To evaluate the AI Voice Isolator’s capabilities, we conducted three tests. First, we recorded three sentences, each interrupted by various background noises. The tool successfully processed the audio in seconds, eliminating disturbances from door openings, table banging, clapping, and household movements. The only sounds it struggled to filter were wall banging and finger snapping.
According to Sam Sklar, ElevenLabs’ growth lead, the current version of the tool does not support music vocals; however, users might achieve success with some tracks.
Future Improvements
While the Voice Isolator’s ability to handle irregular background noise differentiates it from other tools focused on flat noises, there is still room for enhancement. ElevenLabs aims to improve its performance continually.
The company has not disclosed much about the models behind the tool or if uploaded recordings will be used for training. However, users can opt out of personal data usage for training through a link in its privacy policy.
Currently, the Voice Isolator is exclusively available on the ElevenLabs platform, with plans to open API access in the upcoming weeks, although the exact timeline is unspecified. Users can access the tool for free, albeit with certain limitations.
"The Voice Isolator model costs 1,000 characters per minute of audio. We offer a free plan that includes 10,000 characters per month, allowing for the processing of 10 minutes of audio for free," Sklar explained. For those seeking to remove background noise from larger audio files, paid plans start at $5 per month.