Introduction to Speech Synthesis with ElevenLabs
ElevenLabs is an advanced speech synthesis platform that transforms text into realistic speech. Whether you want to enrich your videos with voice-overs for ads or tutorials, create podcast content, produce multilingual dubbing, animate educational and e-learning modules, or turn books and stories into captivating audiobooks, ElevenLabs offers a wide range of possibilities. This guide will walk you through generating your first AI voice and exploring all its features. For more information on pricing and features, check out our comprehensive ElevenLabs overview.
🚀 Stay ahead of AI
Useful tips and news, zero spam.
Step 1: Create an Account on ElevenLabs
To get started, visit elevenlabs.io and create an account by providing your email address or using your Google account.
Creating an account gives you access to all of ElevenLabs’ features, including preset voices, advanced customization options, and the ability to create custom voices. To learn about pricing and choose the plan that suits you best, check out our comprehensive ElevenLabs overview.
Step 2: Understanding the User Interface
Upon arriving at the ElevenLabs homepage, you will see several speech synthesis options at the top of the screen, including Text to Speech, Speech to Speech, Dubbing, Text to SFX, and Voice Cloning. Each option corresponds to a specific type of vocal AI functionality. In this tutorial, we will focus solely on Text to Speech (converting text to speech).
First, change the language – here set to French. You will notice use-case examples such as Tell a Story, Present a Podcast, and Create a Voice-Over for Your Videos, which demonstrate the possible applications of speech synthesis. You can try them out, then modify the text and the speaking character to familiarize yourself with the tool.

To access the full features and the creation interface, click on the Go to App button at the bottom right. This will take you to the main interface, where you can explore in detail the various options and customization settings for voice generation.
Step 3: Generate Your First AI Voice from Text
After clicking on Go to App, you are by default in the Text to Speech section, where you have more options for generating voices from text.
- Enter Your Text: In the text field, type the content you want to convert into audio.
- Select the Speech Synthesis Model: Before generating your audio, choose the appropriate model based on your needs by clicking on Settings > Model (or via the Advanced tab at the top right). Here are the two main recent models you can use:
- Multilingual v2: An advanced model offering great stability and support for 29 languages. Ideal for creating content such as voice-overs, audiobooks, and post-production.
- Turbo v2.5: A model optimized for low latency, supporting 32 languages. Perfect for real-time applications like voice assistants.
- Tip: When generating voices in French, start with the Multilingual v2 model. You can then adjust according to the specific needs of your project.
- Select a Voice: Choose a voice from the available options. ElevenLabs offers default voices, but you can also create custom voices via VoiceLab. Feel free to test different voices; one of the most popular on social media – and my personal favorite – is the Adam voice.
- Optimize the Settings: Adjust the Stability and Clarity/Similarity Enhancement parameters to refine the vocal output according to your preferences:
- Stability: Controls the consistency of the voice’s tone. Higher stability means a more constant voice, though it may reduce expressive variability.
- Clarity/Similarity Enhancement: Defines the clarity and precision of the voice, ensuring a closer match to the source voice when using a custom voice.

- Use Prompting to Refine the Vocal Output: Include specific instructions in your text to add pauses, convey emotions, and adjust the speaking pace:
- Add Pauses: Use the syntax
<break time="1.5s" />in your text to create natural pauses. - Convey Emotions: Indicate if the voice should be enthusiastic, calm, formal, etc. (e.g., … “he said confusedly”)
- Speaking Pace: Specify if the speech should be fast, slow, or moderate. (e.g., he repeated softly “…”)
- Add Pauses: Use the syntax
- Generate the Audio: Click on Generate to create the audio. Once generated, you can listen to it directly or download it.
Examples of Prompts with Tone Instructions
- Pause: “Welcome to our new app!
We are excited to offer you an exceptional experience.” - Emotion: “Thank you for choosing our services. Your satisfaction is our top priority.” he said amicably.
- Pace: “Here’s how to use our tool in a few simple steps.” he said calmly.
Additionally, ElevenLabs supports the use of the International Phonetic Alphabet (IPA) to specify the pronunciation of certain words, which is particularly useful for proper names or technical terms.
Step 4: Preview and Download Your Audio Files
After generation, listen to and download your audio file:
- Preview: Use the preview function to check the output before downloading.
- Download in MP3 or WAV Format: Download your audio files in MP3 or WAV for maximum compatibility with various platforms.
Step 5: Customize Voices with VoiceLab
To create unique voices or clone a voice, use the VoiceLab tool:
- Access VoiceLab: In the left dashboard, click on Voices.
- Add a New Voice: In the My Voices tab, click on Add a new Voice and name your new voice.
- Adjust Initial Settings: Select the gender, age, and accent to configure the voice.
- Train the Voice with Samples: Upload varied and clear audio samples to train the voice. The more varied and high-quality the samples, the better the result.
- Save and Test the Voice: Save the voice and test it in the Text to Speech section.
You can also explore voices already created by the ElevenLabs community in the Library tab.

Step 6: Leverage the Multilingual Mode
ElevenLabs offers multilingual support with its Multilingual v2 and Turbo v2.5 models. These models are specifically designed to handle multiple languages, which is particularly useful for projects requiring voices in different languages.
If you have an English-only project, the Turbo v2.5 model is optimized for that language. To switch between languages, use the same Text to Speech interface, select the Turbo v2.5 model by clicking on Settings (or via the Advanced tab at the top right), and enter your text in the desired language.
Your AI Voice is Generated!
Congratulations, you have created your first AI voice with ElevenLabs! This guide has helped you learn the basics of creating voices with ElevenLabs. You can now use an AI voice to enhance your videos with voice-overs for ads or tutorials, create podcast content, produce multilingual dubbing, animate educational and e-learning modules, or turn books and stories into captivating audiobooks. Continue to explore and refine your scripts and settings to make the most of this powerful tool.
For more information on other artificial intelligence tools, visit our AI Tools section. If you have any questions or tips to share, leave a comment below!
FAQ for Beginners on ElevenLabs
1. Is ElevenLabs Suitable for Beginners?
Yes, ElevenLabs is designed to be accessible for beginners. Its intuitive and well-organized interface makes it easy to get started, even for those without prior technical knowledge. Additionally, clear documentation and practical guides are available to help new users create synthetic voices and utilize the various features.
2. Can ElevenLabs Be Used for Commercial Purposes?
Yes, ElevenLabs offers subscription plans that allow for the commercial use of the generated voices. For this, it is important to subscribe to a plan that includes commercial usage rights, as free or basic options may have restrictions. Be sure to review the terms of use and licenses associated with each plan to ensure that your project is covered.
3. Do you have to pay to use ElevenLabs?
Yes, ElevenLabs offers paid subscription plans with various options, each providing specific features such as access to premium voices, unlimited audio generations, and multilingual support. While some features can be tested for free, advanced options and commercial usage rights require a paid subscription. Check the elevenlabs.io website to choose the plan that best meets your needs.
4. What Are the Other Features of ElevenLabs (Besides Text to Speech)?
Voice Transformation (Speech-to-Speech)
ElevenLabs provides the ability to transform one voice into another while preserving the rhythm and intonation of the original voice. This feature is ideal for multilingual dubbing or for altering the tone of a voice without changing the content.
Dubbing and Video Translation
The tool allows you to replace the original voice in a video with another voice, with the option to translate the content into different languages. This facilitates the localization of audiovisual content for international audiences.
Sound Effects Generation
With the “Text to Sound Effects” tool, you can create custom sound effects from text descriptions—ideal for enhancing your audio and video projects.
Voice Isolation
With “Voice Isolator,” you can extract a clear voice from any audio recording by removing unwanted background noise, which is particularly useful for post-processing podcasts, interviews, or films.
API and Integration
For developers, ElevenLabs offers a robust API that allows the integration of its speech synthesis and transformation capabilities into various applications, such as voice assistants, video games, or embedded systems.
Useful Links
- Official ElevenLabs Website: elevenlabs.io
- ElevenLabs Documentation: help.elevenlabs.io
- Our Article on ElevenLabs: Comprehensive ElevenLabs Overview
- More AI Tutorials: AI Tutorials





