Mati Staniszewski is the co-founder and CEO of ElevenLabs, an AI audio startup valued at 11 billion dollars that specializes in developing natural-sounding speech synthesis software. Prior to founding ElevenLabs in 2022, he worked as a Deployment Strategist at Palantir Technologies, where he managed large-scale implementation projects across public and private sectors. Under his leadership, ElevenLabs has become the leading company in voice AI, enabling audio to be accessible across languages and voices while capturing the humanness of speech through realistic emotional inflection.
In early days you try to replicate it exactly like you would replicate it with the human body… you would try to stitch in phonemes effectively different sounds of how we speak humans and then try to concatenate them together.
— Mati Staniszewski
Now we effectively do similar like neural nets in in other domains so you predict the next sound based on on of course the context of the previous sounds.
— Mati Staniszewski
When you actually try to vocalize something when you create that voice model you turn text into audio you need the text you also need the voice reference of how you want it to to to be spoken.
— Mati Staniszewski
When you actually try to vocalize something when you create that voice model you turn text into audio you need the text you also need the voice reference of how you want it to to to be spoken.
— Mati Staniszewski
The model will deduce them themselves the same for other set of parameters that are not hardcoded whether it’s the enthusiasm whether it’s the subness etcetera.
— Mati Staniszewski
When you are predicting the context you need to understand yes how that sentence will get constructed and especially if it’s more of a streaming real time use case and like a voice agent setting you need both parts to to work across.
— Mati Staniszewski
In any model you need you need architecture you need compute you need data.
— Mati Staniszewski
In like the nutshell describe eleven labs is a research and product deployment company we build foundational audio and voice models and then build a platform for businesses to transform how they communicate with their customers with their employees.
— Mati Staniszewski
It’s one thing you know with saas where you get these like vertical specific providers but I would imagine one of the biggest risks for you guys in being intermediated is if there’s you know like in this example a closed captioning service that is on a two versions old version of 11 and hasn’t upgraded that’s a problem because you want people to be using the latest and greatest model that you’ve developed.
— Mati Staniszewski
I agree with the premise that we are ten years behind in the lived experience of people day to day… there is definitely a piece of like the like we… I think the technology in many of those cases already there’s a deployment gap.
— Mati Staniszewski
I think this year it should be in the automotive side too or the some of the applications that we’ve seen we’ll start seeing kind of great voice models in cars this year.
— Mati Staniszewski
Long before we had Amazon or Facebook marketplace, or thousands of other online retailers, we…
This Week in Crypto Law The opinion editorial below was written by Alex Forehand and…
Ripple's CEO also praised the recent collaboration between the SEC and the CFTC. The…
OpenAI’s $852 billion valuation is facing skepticism from some of its own investors as the…
A crypto scam posing as the official Ledger Live hardware wallet app passed Apple’s App…
Adobe patches a critical PDF flaw exploited for months, allowing attackers to bypass sandbox protections…