The Rise of Generative AI
Generative AI has simplified the creation of images, texts, and audio clips to the extent that they are often indistinguishable from human-made content. At ElevenLabs, we believe in the transformative potential of these technologies and their ability to unlock new frontiers of creativity and accessibility.
At the same time, we also recognize that to fully harness the benefits of these technologies, we must prioritize the establishment of robust infrastructures that ensure their safe and responsible use. We are committed to taking meaningful steps to implement the necessary measures and foster educational initiatives needed to protect users and promote ethical usage of generative AI.
AI Speech Classifier: A Step Towards Transparency
Today, we are thrilled to introduce our authentication tool: the AI Speech Classifier. This first-of-its-kind verification mechanism lets you upload any audio sample to identify if it contains ElevenLabs AI-generated audio.
The AI Speech Classifier is a critical step forward in our mission to develop efficient tracking for AI-generated media. With today’s launch, we seek to further reinforce our commitment to transparency in the generative media space.
Mechanism and Known Limitations
The audio generated by our system has certain specific, detectable characteristics. When you upload an audio sample to the AI Speech Classifier, our algorithm will scan for these characteristics. It can then confirm whether the content was indeed generated by our platform, currently maintaining >99% accuracy if the input was unmodified. If it underwent Codec or reverb transformations, our Classifier maintains over 90% accuracy. This figure drops the more the content has been post-processed. If additional audio tracks have been added, this will also affect the result.
We are continuously analyzing and improving our model to be able to detect other audio transformations and we expect these outcomes to improve.
Test the first iteration of our Classifier yourself:
Initial Launch and Call for Input
We are releasing our tool publicly today, as we recognize the importance of the wider public being able to detect AI-generated content. We are also eager to collaborate with interested partners on deeper API integration and enhancing the value of this tool.
In future iterations, we plan to broaden the tool’s detection capabilities to include audio generated by different platforms. We are inviting fellow AI companies to collaborate with us on helping to establish a comprehensive method for detecting all AI audio content. If you're interested in either partnership or integration, please connect with us.
A Proactive Stand against Malicious Use of AI
As creators of AI technologies, we see it as our responsibility to foster education, promote safe use, and ensure transparency in the generative audio space. We want to make sure that these technologies are not only universally accessible, but also secure. With the launch of the AI Speech Classifier, we seek to provide software to supplement our wider educational efforts in the space, like our guide on the safe and legal use of Voice Cloning.
Our goal at ElevenLabs is to produce safe tools that can create remarkable content. We believe that our status as an organization gives us the ability to build and enforce the safeguards which are often lacking in open source models. With today’s launch we also aim to empower businesses and institutions to leverage our research and technology to bolster their respective safeguards.