If you've searched "how to use text to speech," you've probably already noticed the problem: there's no single answer that works for everyone. The steps on Windows are completely different from iPhone. What works in a browser won't work in Word. And most guides cover only one tool without explaining when it's actually the right choice.
This guide gives you the full picture. You'll learn how to enable and use TTS on every major platform, and more importantly, how to match the right tool to what you're actually trying to do.
Text to speech (TTS) converts written text into spoken audio. The technology has been around for decades, but recent improvements in AI voice synthesis have made modern TTS outputs sound natural — far removed from the robotic voices of earlier systems.
People use TTS in four broad scenarios:
Accessibility: For users with dyslexia, low vision, or reading fatigue, TTS makes written content consumable without visual strain.
Multitasking: Listening to articles, emails, or documents while commuting, exercising, or doing other tasks.
Proofreading: Hearing your own writing read aloud catches errors that eyes skip over.
Content creation: Adding voiceover to videos, presentations, or social content without recording a human speaker.
Which scenario fits you determines which tool you should actually use — and that's the part most guides skip.
Windows has two distinct TTS modes that are often confused: Voice Typing (speech-to-text, for dictating) and Narrator/Read Aloud (text-to-speech, for listening).
Open any webpage in Edge.
Click the Read Aloud button in the toolbar (or press Ctrl+Shift+U).
Use the controls to adjust reading speed and voice.
This works on web pages, PDFs opened in Edge, and even some documents.
Open your document.
Go to Review > Read Aloud (or use the button in the toolbar).
The feature reads the full document or selected text.
Note: Read Aloud in Word requires a Microsoft 365 subscription. If you're on a standalone Office version, this feature may not appear.
Press Win+Ctrl+Enter to toggle Narrator on or off.
Narrator reads everything on screen — menus, buttons, and content.
Configure voice settings in Settings > Accessibility > Narrator.
Narrator is designed for accessibility workflows, not casual listening. For most users who just want to hear a webpage or document read aloud, Edge Read Aloud or Word Read Aloud is the better choice.
MacOS offers two separate tools: Speak Selection (reads selected text) and Voice Control (a full dictation and accessibility mode).
Open System Settings > Accessibility > Spoken Content.
Toggle on Speak Selection.
Select any text in any app, then press Option+Esc to hear it read.
You can also enable Speak Screen, which reads everything visible — useful for reading articles or ebooks hands-free.
In the same Spoken Content settings panel, you can:
Choose from dozens of system voices (including high-quality AI voices like Siri voices)
Set the default speaking rate
Enable on-screen word highlighting while text is read
For continuous long-form listening — multiple articles saved over time — the built-in Mac TTS is best paired with a dedicated reading app that manages your content queue. AI Listen is one option that handles web articles, PDFs, and documents with adjustable playback speed and background audio support.

iPhone's TTS is found inside Accessibility settings rather than under a general "text to speech" toggle, which is why many users don't find it.
Go to Settings > Accessibility > Spoken Content.
Toggle on Speak Selection.
Select text in any app — a "Speak" button appears in the context menu.
Tap Speak to hear the text read aloud.
In the same Spoken Content menu, turn on Speak Screen. Then swipe down with two fingers from the top of the screen to read the entire current page.
In Spoken Content settings, you can choose a different Siri or third-party voice and set the default reading rate. Voices labeled "Enhanced" sound noticeably more natural.
Android's TTS is handled through the Text-to-Speech Output settings, which controls the voice engine used across apps.
Go to Settings > General Management > Language > Text-to-Speech.
Make sure a voice engine is installed (Google Text-to-Speech is the default).
Individual apps — like Google Assistant, Maps, or Play Books — use this engine to speak text.
For dictation (speaking text into your phone), the microphone in your keyboard (usually Gboard) is the primary input method. Tap the microphone icon in the keyboard to start speaking.
This is the section most guides skip, but it's where most confusion happens.
What you're trying to do | Best tool |
|---|---|
Listen to web articles while multitasking | AI Listen or iPhone Speak Screen |
Read aloud a Word document | Word Read Aloud (M365) |
Have a webpage narrated in your browser | Edge Read Aloud or Safari Reader |
Full accessibility mode (read entire screen) | Windows Narrator or Mac VoiceOver |
Add voiceover to a video | ElevenLabs, Murf, or CapCut's TTS feature |
Proofread your own writing | Mac Speak Selection (select text, Option+Esc) |
Dictate text into any app | Windows Win+H, iPhone keyboard mic, Mac Dictation |
The key distinction: listening to content and dictating text are both called "text to speech" colloquially, but they use completely different system features.

A few practical habits that make TTS significantly more useful:
Match speed to content type. For dense technical content, 1.1–1.3x is comfortable. For familiar material you're reviewing, 1.5–2x saves time without losing comprehension.
Use highlighting when it's available. Apps that highlight each word as it's read help your brain track the content, especially for longer pieces. Mac Spoken Content supports this natively.
Build a listening queue. One of the limitations of built-in OS TTS is that it's session-based — close the app and you lose your place. If you consume a lot of long-form content, a dedicated app that saves articles and remembers your position makes TTS practical for a daily workflow.
Pair with focus time. TTS while walking or doing light physical activity significantly improves long-term retention compared to passive reading for many people — it forces you to stay focused rather than skim.
The right way to use text to speech depends entirely on what you're reading and on which device. For quick, in-app use, the built-in tools on Windows, Mac, iPhone, and Android all work well — no setup required. For a consistent, cross-session listening workflow with saved articles and longer content, a dedicated app fills the gap that built-in TTS leaves open.
Start with the platform option that fits your most common scenario. Once you've made TTS part of your routine, you'll likely find yourself reaching for it in more places than you expected.





