💬 Text to Speech Converter - FREE & No Limits

Kevin Stratvert
9 Jun 202112:16

TLDRKevin introduces viewers to the concept of text-to-speech (TTS) conversion, demonstrating how to utilize Windows 10's built-in OneNote app and the Immersive Reader feature to convert text into speech. He also guides through using the Voice Recorder and Sound Settings to capture system audio, and explores Audacity for advanced recording and exporting options. Finally, he recommends Balabolka, a freeware app offering direct audio export and access to a variety of TTS services and voices, providing a hands-on tutorial on creating engaging audio content without the need for professional voice recording equipment.

Takeaways

  • 😀 The video is about converting text into speech, known as TTS (Text to Speech).
  • 🎤 The presenter, Kevin, explores if a computer can generate a better voice than his for certain applications.
  • 📚 The first method demonstrated uses OneNote for Windows 10 to convert text to speech.
  • 🔊 To record the computer's speech, Windows 10's Voice Recorder app can be used after enabling 'Stereo Mix' in Sound Settings.
  • 👂 The Immersive Reader feature in OneNote allows for text to be read aloud with adjustable voice speed and options between male or female voices.
  • 🔧 A setting change is required to record system audio using the Voice Recorder, which involves enabling 'Stereo Mix' as an input device.
  • ✂️ The Voice Recorder app allows for recording, trimming, and saving the computer-generated speech.
  • 🎵 Audacity, a free audio recording and editing software, is introduced as an alternative to record system sound with more control.
  • 📈 Balabolka is highlighted as a freeware app that can convert text to speech and export it directly as an audio file in various formats.
  • 👥 Balabolka also offers the ability to use online TTS services for a wider range of voices, including those from IBM Watson, Google, and Baidu.
  • 📝 The video concludes with a prompt for viewers to comment on which voice sounded more robotic and an invitation to subscribe for more content.

Q & A

  • What is the purpose of the text-to-speech (TTS) technology discussed in the video?

    -The purpose of TTS technology is to convert written text into spoken words, which can be used for various purposes such as having a computer read a bedtime story, adding a voiceover to a video, or for accessibility reasons.

  • Why did Kevin decide to explore text-to-speech technology?

    -Kevin decided to explore TTS technology because he received feedback on his YouTube channel that his voice sounded a bit robotic, and he was curious if the computer could do a better job.

  • What is the Kevin Cookie Company and what is its relation to the video?

    -The Kevin Cookie Company is mentioned in the video as an example of a business that could benefit from TTS technology. It is used to demonstrate how text can be converted into speech for a commercial.

  • How can one use the OneNote app to convert text into speech?

    -To convert text into speech using OneNote, you need to open the app, use the Immersive Reader feature, and click on the play icon to have the computer read the text aloud.

  • What is the Voice Recorder app and how is it used in the process?

    -The Voice Recorder app is a pre-installed application in Windows 10 that can be used to record audio. In the context of TTS, it is used to record the computer-generated speech from OneNote.

  • Why is the Stereo Mix option important for recording system sound in Windows 10?

    -The Stereo Mix option is important because it allows the system to record the audio output, which is necessary for capturing the speech generated by TTS applications like OneNote.

  • How can one edit the recorded speech using the Voice Recorder app?

    -The Voice Recorder app allows you to trim the beginning and end of the recording to remove any unwanted parts before saving the final audio file.

  • What is Audacity and how does it offer more control over audio recording?

    -Audacity is a free, open-source audio recording and editing software that provides more control over audio recording, allowing users to record system sound and export it in various formats like MP3 or WAV.

  • What is Balabolka and how does it differ from the other methods discussed?

    -Balabolka is a freeware text-to-speech app that allows users to upload a document, convert it into speech, and directly export it as an audio file in formats like WAV or MP3, without the need to record the speech.

  • How can one access more voices in Balabolka using online TTS services?

    -In Balabolka, by going to the 'Tools' menu and selecting 'Use online TTS services', users can access a variety of voices from different online TTS providers like Google, Baidu, and IBM Watson.

  • What is the main advantage of using Balabolka over the built-in Windows 10 tools?

    -The main advantage of using Balabolka is the ability to directly export the TTS as an audio file without the need for recording, and the option to access a wider range of voices from online TTS services.

Outlines

00:00

📚 Introduction to Text-to-Speech (TTS)

Kevin introduces the concept of Text-to-Speech (TTS) and explores its potential uses, such as having a computer read a bedtime story or adding a voiceover to a video. He addresses feedback about his robotic-sounding voice on his YouTube channel and sets out to test if computer-generated speech could be more appealing. He then proceeds to demonstrate how to use the built-in features of Windows 10 to convert text into speech, starting with OneNote for Windows 10.

05:04

🎙️ Using OneNote and Voice Recorder for TTS

Kevin explains how to use OneNote's Immersive Reader feature to have the computer read text aloud and how to adjust voice settings for speed and gender. He then guides viewers through enabling the 'Stereo Mix' in Sound Settings to record system audio, which is necessary for capturing the TTS output. Following this, he introduces the Voice Recorder app, showing how to record the computer's speech, trim the recording, and save it in m4a format, and mentions the limitation regarding file format flexibility.

10:10

🔊 Advanced TTS with Audacity and Balabolka

Kevin recommends Audacity, a free audio recording and editing software, for advanced TTS users. He details the process of recording system sound with Audacity using Windows WASAPI and Loopback, and exporting the recording in various formats like MP3 and WAV. Lastly, he introduces Balabolka, a freeware app that allows users to upload documents, convert them to speech, and download the audio files directly in formats such as WAV or MP3. He also highlights Balabolka's ability to access a wide range of online TTS services and voices, including those from Google, Baidu, and IBM Watson, providing a convenient alternative to recording the TTS output.

🌐 Exploring Online TTS Services with Balabolka

In the final part of the script, Kevin demonstrates how to use Balabolka to access and utilize various online TTS services. He shows how to paste text into Balabolka, select a service like IBM Watson, choose from a range of voices including different accents and languages, and directly export the synthesized speech as an audio file. This feature of Balabolka provides a user-friendly way to obtain TTS without the need for recording, offering flexibility and convenience for users looking for diverse voice options.

Mindmap

Keywords

💡TTS

TTS stands for 'Text to Speech', which is a technology that converts written text into audible speech. In the video, TTS is the central theme as the host, Kevin, demonstrates how to use various tools to convert text into speech. This is particularly useful for creating audio content for videos, reading documents aloud, or for accessibility purposes. For instance, Kevin uses TTS to generate a voiceover for a commercial of the 'Kevin Cookie Company'.

💡OneNote

OneNote is a digital note-taking application developed by Microsoft. In the script, OneNote for Windows 10 is used to demonstrate the text-to-speech functionality through its 'Immersive Reader' feature. This feature allows users to have the text read aloud, which is useful for proofreading or for individuals with visual impairments. Kevin shows how to activate and use this feature to listen to the text from the 'Kevin Cookie Company' commercial.

💡Voice Settings

Voice Settings refer to the options available for customizing the voice used by text-to-speech applications. In the video, Kevin explains how to adjust voice settings in OneNote, such as changing the speed and selecting between a male or female voice. These settings allow users to tailor the listening experience to their preferences, enhancing the naturalness and enjoyment of the spoken content.

💡Voice Recorder

Voice Recorder is an application that comes pre-installed with Windows 10 and is used for recording audio. In the context of the video, Kevin guides viewers on how to use Voice Recorder to capture the text being read aloud by OneNote. This is useful for creating audio files that can be used in other applications, such as video editing or for later listening.

💡Stereo Mix

Stereo Mix is a feature in Windows Sound Settings that allows users to record system audio. In the video, Kevin explains the process of enabling Stereo Mix to record the computer's audio output, which is essential when using Voice Recorder to capture the speech generated by OneNote's text-to-speech feature.

💡Audacity

Audacity is a free, open-source audio recording and editing software. In the script, Kevin mentions Audacity as an alternative to Voice Recorder for capturing system sound. Audacity offers more advanced features and flexibility, such as the ability to export recordings in different file formats, which makes it a popular choice among content creators.

💡Balabolka

Balabolka is a freeware text-to-speech application that can convert text into speech and save it as an audio file. Unlike the other methods demonstrated in the video, Balabolka allows users to export the generated speech directly as a WAV or MP3 file without the need for recording. Kevin highlights Balabolka as a convenient option for those who want a simple and direct way to create audio files from text.

💡Online TTS Services

Online TTS Services refer to web-based platforms that provide text-to-speech capabilities. In the video, Kevin shows how to use Balabolka to access various online TTS services, such as Google, Baidu, and IBM Watson. These services offer a wide range of voices and languages, allowing users to choose the most suitable one for their needs, as demonstrated when Kevin selects a UK voice for the 'Kevin Cookie Company' commercial.

💡Immersive Reader

Immersive Reader is a feature within OneNote that enhances the reading experience by providing options to focus on the text, such as adjusting spacing, changing themes, and modifying grammar settings. In the video, Kevin uses Immersive Reader to read the text from the 'Kevin Cookie Company' and then demonstrates how to activate the text-to-speech functionality.

💡Kevin Cookie Company

The 'Kevin Cookie Company' is a fictional company mentioned in the video script as an example for demonstrating the text-to-speech process. The company is portrayed as making delicious cookies, and its commercial text is used to illustrate how the various text-to-speech tools and applications work in converting text into speech.

Highlights

Kevin introduces the concept of Text-to-Speech (TTS) and its various applications.

He discusses the feedback he received about his voice sounding robotic and explores if a computer can do better.

Kevin demonstrates how to use OneNote for Windows 10 to convert text into speech.

He explains how to access the Immersive Reader feature in OneNote for a better reading experience.

Adjustable voice speed and voice options are available within the OneNote text-to-speech feature.

Kevin guides on how to record computer-generated speech using the Voice Recorder app in Windows 10.

He details the process of enabling the Stereo Mix option for system sound recording.

The Voice Recorder app allows for easy recording and editing of computer speech.

Kevin mentions the limitation of Voice Recorder saving files in m4a format and introduces Audacity as an alternative.

Audacity offers more control and flexibility, including the ability to export in various formats like MP3 and WAV.

He provides a step-by-step guide on recording system sound using Audacity.

Balabolka, a freeware app, is introduced as a tool for direct text-to-speech conversion and audio export.

Balabolka allows users to choose from different voices and adjust speech parameters like rate, pitch, and volume.

The app supports exporting audio files in WAV or MP3 format directly from the text.

Kevin shows how to access additional online TTS services within Balabolka for even more voice options.

He demonstrates selecting an IBM Watson voice for a British accent in a hypothetical commercial narration.

Balabolka is praised for its ease of use and the ability to export high-quality audio without the need for recording.

Kevin concludes by asking viewers to comment on which voice sounded more robotic and encourages subscription for more content.