Tutorials
Tips & Tricks
TTS
Windows Speech to Text: Built-In Voice Typing Guide for Windows 10 & 11 (Free)
Windows 10 and 11 both include free built-in speech to text — but most users don't know the difference between Voice Typing, Speech Recognition, and Voice Access. This guide explains each, shows you how to start in under a minute, and covers when a third-party tool makes more sense.
Julian Sterling
Julian Sterling
AI Content Strategist
June 7, 2026
9 min read
windows-speech-to-text
In This Article
What Is Windows Speech to Text? (Voice Typing vs Speech Recognition vs Voice Access)
How to Enable Voice Typing on Windows 10 & 11
Windows Key + H: The Shortcut Every User Needs
Voice Typing vs Windows Speech Recognition: Which to Use?
How to Improve Accuracy: Voice Training & Mic Setup
Windows STT vs Third-Party Tools (Whisper, Dragon, Google)
Conclusion

Windows includes free built-in speech to text — and most people who would benefit from it don't know it's there. The confusion usually comes from not knowing which of the three available tools to use: Voice Typing, Windows Speech Recognition, or Voice Access each do different things, and Microsoft has never explained the differences clearly in one place.

This guide does that. It covers how to start dictating in under a minute, what each tool handles, how to improve accuracy, and where third-party apps make a meaningful difference.

What Is Windows Speech to Text? (Voice Typing vs Speech Recognition vs Voice Access)

Windows offers three distinct speech input tools, each with a different purpose:

Voice Typing (Win+H): The primary dictation tool introduced in Windows 10 and significantly improved in Windows 11. Converts speech to text in real time in any app that accepts keyboard input — Word, Notepad, Outlook, browsers, chat apps. This is what most users need for day-to-day dictation.

Windows Speech Recognition (WSR): The older system, available since Windows Vista. WSR handles both dictation and full voice control of Windows — you can open programs, click buttons, navigate menus, and fill out forms entirely by voice. More powerful than Voice Typing for accessibility use cases, but requires voice training for best accuracy.

Voice Access: The newest addition, available in Windows 11 22H2 and later. A modernized take on full voice control, designed with cleaner command syntax and better integration with Windows 11 UI elements. Gradually replacing the role WSR used to fill.

For typing text into documents and apps, Voice Typing is the right starting point — it's the simplest to use and doesn't require setup or training.

How to Enable Voice Typing on Windows 10 & 11

Windows 11 (recommended path):

  1. Click Start > Settings > Accessibility > Speech

  2. Toggle Windows Speech Recognition on if you want the full control option

  3. For Voice Typing, no pre-setup is needed — just press Win+H

Windows 10:

  1. Click Start > Settings > Time & Language > Speech

  2. Follow the setup prompts to confirm your microphone is recognized

  3. Press Win+H in any app to launch Voice Typing

Microphone setup (important): Voice Typing accuracy is directly tied to microphone quality and positioning. Before your first session, check that your correct microphone is set as the default in Settings > System > Sound > Input. A headset or dedicated USB microphone outperforms laptop built-in mics for dictation.

Quick Tip: Press Windows key + H from any text field to launch Voice Typing instantly — no settings menu required. A small toolbar appears at the top of your screen. Click the microphone icon or say 'Start listening' to begin. This shortcut works in Word, Notepad, browsers, email clients, and most apps that accept keyboard input.

Windows Key + H: The Shortcut Every User Needs

Win+H is the fastest path to speech to text on Windows — no menus, no searching, just the keyboard shortcut in any text field.

What happens when you press Win+H:

  • A small floating toolbar appears near the top of the screen

  • Click the microphone icon (or say "Start listening")

  • Speak normally — text appears in whatever text field has focus

  • Say "Stop listening" or click the mic again to pause

Useful voice commands while dictating:

  • "Delete that" — removes the last phrase spoken

  • "Undo that" — reverses the last action

  • "New line" — inserts a line break

  • "Period" / "Comma" / "Question mark" — inserts punctuation

  • "Stop listening" — pauses dictation

Voice Typing vs Windows Speech Recognition: Which to Use?

Use Case

Best Tool

Typing text into Word, email, Notepad

Voice Typing (Win+H)

Dictating while keeping hands free from keyboard

Voice Typing (Win+H)

Controlling Windows by voice (opening apps, clicking)

Voice Access (Win 11)

Full hands-free PC operation including legacy apps

Windows Speech Recognition

Accessibility — operating computer without hands

Voice Access or WSR

Voice Typing is faster to start, requires no training, and is sufficient for the majority of dictation use cases. Its main limitation: it doesn't control Windows itself — you can't open programs, click buttons, or navigate menus with it.

Windows Speech Recognition is more powerful but slower to learn. The command vocabulary is more complex, and it benefits from the built-in voice training session (Control Panel > Ease of Access > Speech Recognition > Train your computer to better understand you).

Voice Access is the modern alternative to WSR for Windows 11 users, with a more intuitive command structure. Enable it via Settings > Accessibility > Voice Access.

How to Improve Accuracy: Voice Training & Mic Setup

Voice Typing accuracy out of the box is solid for standard English in a quiet environment. These steps push it further:

Microphone positioning: Hold the mic or position the headset so it's 2–4 inches from your mouth. Laptop microphones placed on a desk introduce significant room noise and distance-related accuracy drops.

Quiet environment: Background noise — TV, open windows, air conditioning — reduces accuracy measurably. For regular dictation, a relatively quiet space produces meaningfully better results than a noisy one.

Speech Recognition training (WSR): If you're using Windows Speech Recognition, run the voice training wizard (Control Panel > Ease of Access > Speech Recognition > Improve voice recognition). Reading the training prompts for 15–20 minutes calibrates the engine to your voice profile.

Technical vocabulary: Neither Voice Typing nor WSR supports custom vocabulary additions. For specialized terms (legal, medical, technical), say the words slowly and clearly on first use — the engine often learns from corrections. Dragon Professional remains the only Windows option with a true custom vocabulary feature.

Punctuation by voice: Speak punctuation explicitly ("period", "comma") if auto-punctuation misses. Windows 11's Voice Typing has improved auto-punctuation, but explicit commands are more reliable for formal documents.

Windows STT vs Third-Party Tools (Whisper, Dragon, Google)

Tool

Accuracy

Custom Vocabulary

Cost

Best For

Windows Voice Typing

Good

No

Free

Everyday dictation

Windows Speech Recognition

Good

Basic

Free

Full PC voice control

Dragon Professional

Excellent

Yes

~$500 one-time

Legal, medical, power users

OpenAI Whisper (via app)

Excellent

No

Free (open-source)

Offline, privacy-sensitive use

Google Docs Voice Typing

Good

No

Free

Typing in Google Docs

Otter.ai

Good

No

Freemium

Meeting transcription

When the built-in tool is enough: For everyday notes, emails, messages, and document drafting in English, Windows Voice Typing handles the workload. The accuracy gap between it and paid tools has narrowed in Windows 11.

When to upgrade: Dragon Professional is the only meaningful upgrade for users who dictate 3+ hours daily, need custom vocabulary for specialized terminology, or require near-perfect accuracy for formal documents. The price is steep but the accuracy on technical content is noticeably better.

Whisper as a free alternative: OpenAI's Whisper model is available as open-source software and through several Windows desktop apps. It runs locally (no cloud upload), handles accented English better than most cloud-based engines, and is free. The trade-off: no live dictation — it transcribes audio files after recording, not in real time.

Conclusion

Windows speech to text doesn't require any downloads or subscriptions — press Win+H, speak, and it works. For most users, Voice Typing covers everyday dictation needs across all major apps. If you need hands-free control of Windows itself, Voice Access (Windows 11) or Windows Speech Recognition provides that.

Third-party tools only make sense at the margins: Dragon for professional high-volume users, Whisper for privacy-conscious users who want local processing, and Otter.ai for meeting transcription that needs speaker identification and summaries.

If you're also looking for a text-to-speech tool for the reverse workflow — having Windows read text back to you — AI Listen handles that on iOS and complements a Windows dictation workflow when you're switching between devices.

ai-listen-app
Ready to Transform Your Study Sessions?
Join 50,000+ students using AI Listen to study smarter. Free forever plan available.

Frequently Asked Questions
How do I use speech to text on Windows 10 and 11?
Press Windows key + H in any text field to open the Voice Typing toolbar. Click the microphone button and start speaking. Voice Typing works in most apps including Word, Notepad, Outlook, and Chrome. No setup required beyond enabling your microphone.
What is the difference between Voice Typing and Windows Speech Recognition?
Voice Typing (Win+H) is the newer tool, introduced in Windows 10 and improved in Windows 11. It's optimized for dictating text into apps. Windows Speech Recognition is the older system that also handles voice commands to control Windows — opening programs, clicking buttons, and navigating menus. For typing only, use Voice Typing. For full hands-free PC control, use Speech Recognition or the newer Voice Access.
Is Windows speech to text free?
Yes — both Voice Typing and Windows Speech Recognition are included in every Windows 10 and Windows 11 installation at no cost. No subscription or download required.
Why is Windows Voice Typing not working?
Check that your microphone is set as the default recording device in Settings > System > Sound. Also verify microphone permissions are enabled for the application you're using in Settings > Privacy > Microphone. Restarting the Voice Typing toolbar (Win+H, close, reopen) resolves most temporary issues.
What is the most accurate speech to text tool for Windows?
For everyday dictation, Windows Voice Typing with a good microphone is accurate enough for most users. For professional use — legal documents, medical transcription, or high-volume dictation — Dragon Professional is the most accurate commercial option available for Windows. For a free, high-accuracy alternative, OpenAI's Whisper (available through third-party desktop apps) performs comparably on standard English.

Tutorials
Tips & Tricks
TTS
Share this article:
copy

Popular Articles

Continue exploring text to speech and productivity tips
AI Story Generator: What It Is, How It Works, and Why It Matters
TTS
AI Story Generator: What It Is, How It Works, and Why It Matters
AI story generators turn prompts into structured drafts for fiction, marketing, and education. In this guide, we cover how AI story generators work, their core features, benefits, limitations, and how to choose the right AI Story Generator.
How to Listen to AO3 Stories Offline on iPhone
AI Listen
How to Listen to AO3 Stories Offline on iPhone
Want AO3 offline listening on iPhone? Follow this step-by-step guide to import from the web, switch chapters smoothly, and listen on the subway or in Airplane Mode.
How to Read with ADHD: Practical Strategies That Actually Help You Finish Pages
AI Listen
How to Read with ADHD: Practical Strategies That Actually Help You Finish Pages
Struggling with ADHD reading drift? This guide shows fast environment fixes, chunked reading, recall questions, and a review step that improves retention.
Top Alternatives to Character AI: Best Tools for Chat, Roleplay, and Storytelling
TTS
Top Alternatives to Character AI: Best Tools for Chat, Roleplay, and Storytelling
In this guide, we explore the top alternatives to Character AI, including tools for roleplay, storytelling, and AI chat. You’ll find quick picks, detailed comparisons, and tips to choose the right platform based on your needs.
What Happens When Two AI Voice Assistants Talk to Each Other?
TTS
What Happens When Two AI Voice Assistants Talk to Each Other?
This guide explains what emerges, why it happens, real applications, and how to review conversations effectively.
What is an AI sound generator
AI Listen
What is an AI sound generator
AI sound generators create audio from text or prompts, from voiceovers to sound effects. They are widely used across content, apps, and media. In this article, you will learn how they work, where they fit, and how to choose one. Let’s dive in.