Audio Descriptions and Text-to-Speech for Wayfinding

Text and audio descriptions of routes and facilities are essential accessibility features. Many disabled visitors benefit from having information available in audio format, whether through screen readers, text-to-speech tools, or pre-recorded audio.

Why Audio Matters

Audio descriptions serve multiple accessibility needs:

Blind and low-vision visitors: Primary information format when navigating unfamiliar spaces
Dyslexic visitors: Audio reinforces written information and reduces cognitive load
Visitors with cognitive disabilities: Spoken instructions can be easier to process than text
Visitors with literacy challenges: Audio provides equal access to information
All visitors: Hands-free information access while navigating

Types of Audio Support

1. Browser-Native Text-to-Speech

Modern browsers include built-in text-to-speech capabilities that work without additional software.

Edge Reading Mode

Microsoft Edge offers Immersive Reader with natural-sounding voices:

Activate via the book icon in the address bar or Settings menu
Adjustable reading speed and voice selection
Highlights current word being read
Works on most web pages without special coding

Safari Reading Mode

Safari on macOS and iOS includes text-to-speech:

Activate Reader View (Safari → View → Show Reader)
Use Text to Speech (Edit → Speech → Start Speaking)
Clean presentation without distractions

Chrome and Firefox Extensions

Both browsers support extensions like:

Read Aloud: A Text to Speech Voice Reader
Natural Reader
SpeakIt!

Recommendation: Ensure your accessibility pages work well with these native tools by using semantic HTML and clear structure.

2. Web Speech API

The Web Speech API allows websites to add read-aloud functionality directly into pages.

Benefits

Works across modern browsers
No external dependencies or downloads required
Respects user’s system voice settings
Lightweight implementation

Implementation Considerations

Requires JavaScript to be enabled
Voice quality varies by operating system
Network connection may be needed for cloud-based voices
Not a replacement for screen reader compatibility

3. Pre-Recorded Audio

Professional audio recordings provide the highest quality but require maintenance.

When to Use Pre-Recorded Audio

Critical safety information
Complex navigation instructions
Information that changes infrequently
Multilingual support

Maintenance Requirements

Must be updated when information changes
Requires accessible audio player controls
Should include synchronized text transcript
File size and bandwidth considerations

Implementing Text-to-Speech on Your Website

Minimal Implementation

At minimum, ensure your access guide:

Uses semantic HTML with proper headings, lists, and landmarks
Avoids content in images - use alt text and provide text equivalents
Is readable by browser extensions - avoid complex layouts that break reader modes
Provides clear navigation - use skip links and table of contents

Enhanced Implementation

Add read-aloud functionality to your page:

<button id="read-aloud" aria-label="Read this section aloud">🔊 Read Aloud</button>

// Basic Web Speech API implementation
function readAloud(text) {
  if ('speechSynthesis' in window) {
    const utterance = new SpeechSynthesisUtterance(text);
    utterance.lang = 'en-US';
    utterance.rate = 0.9; // Slightly slower for clarity
    window.speechSynthesis.speak(utterance);
  } else {
    // Show accessible notification instead of alert
    const message = document.createElement('div');
    message.setAttribute('role', 'status');
    message.setAttribute('aria-live', 'polite');
    message.textContent = 'Text-to-speech is not supported in your browser. Try Microsoft Edge Reading Mode or Safari Reading Mode.';
    document.body.appendChild(message);
  }
}

Best Practices

Content Structure

Short paragraphs: Easier to listen to in segments
Clear headings: Allow users to skip to relevant sections
Bulleted lists: Break information into digestible chunks
Avoid abbreviations: Write “accessible toilet” not “acc. toilet”

Voice-Friendly Writing

Use simple language: Avoid jargon and complex sentences
Spell out acronyms on first use: “Americans with Disabilities Act (ADA)”
Provide directional cues: “Turn right” not “Go east”
Use distances and times: “50 meters” or “about 2 minutes walk”

Controls and User Experience

Pause/Resume buttons: Allow users to control playback
Reading speed control: Let users adjust to their preference
Visual highlighting: Show what’s currently being read
Keyboard accessible: All controls must work without a mouse

Audio Descriptions for Routes

Describe routes to common destinations in clear, sequential steps:

Example: Accessible Toilet Route

Text/Audio Description:

“From the main entrance, continue straight for 15 meters. You will pass the information desk on your left. At the corridor intersection, turn right. The accessible toilet is the second door on your left, marked with the International Symbol of Access. The door opens automatically when you press the large button at waist height.”

Format Guidelines

For each route description:

Starting point: Name the clear reference point
Distance: Provide approximate distances in meters/feet
Landmarks: Reference distinctive features along the route
Turns: Specify left/right with reference to direction of travel
Destination markers: Describe what visitors will see/feel at destination
Door operation: Explain how to open/activate doors

Combine audio with other formats:

Tactile maps: Physical maps at entrance with raised routes
Large print signs: Visual reinforcement of audio directions
Staff assistance: Backup option when technology fails
QR codes: Link to detailed audio navigation

Open-Source Text-to-Speech Libraries

Recommended Approach: Use the Web Speech API (documented above) as your primary implementation. It’s browser-native, requires no external dependencies, respects user privacy, and works without additional licensing.

If you need features beyond the Web Speech API, consider these alternatives:

ResponsiveVoice.js

Simple JavaScript API
Multiple voices and languages
Free for non-commercial use
Commercial license required for business websites
Note: Sends content to external servers, raising privacy concerns

Other Libraries

Most modern implementations should use the Web Speech API (built into browsers) rather than third-party libraries for text-to-speech functionality.

Considerations

Licensing: Verify license permits your use case
Maintenance: Check if library is actively maintained
Accessibility: Test with screen readers to avoid conflicts
Performance: Consider impact on page load time
Privacy: Understand if content is sent to external services (major concern for visitor information)

Testing Your Audio Implementation

Before publishing:

Test with screen readers: Ensure TTS doesn’t conflict with JAWS, NVDA, VoiceOver
Test browser compatibility: Chrome, Firefox, Safari, Edge
Test on mobile devices: iOS Safari, Android Chrome
Test keyboard controls: All features work without mouse
Test with actual users: Get feedback from blind and low-vision visitors
Verify focus management: Focus doesn’t get lost during playback
Check ARIA attributes: Proper labels and live regions for dynamic content

Operational Implications

Maintenance

Review audio content when information changes (routes, facilities, hours)
Test TTS functionality after website updates
Ensure backup methods remain available

Staff Training

Staff should know how to enable reading mode in different browsers
Staff should be able to describe routes verbally as backup
Staff should understand visitor needs may vary

Governance

Document your audio support in your maintenance checklist:

Which routes have audio descriptions
How audio content is generated (TTS vs. pre-recorded)
Update trigger events (renovations, signage changes, facility changes)
Testing frequency for TTS functionality

Resources

Audio Navigation Example - Working demonstration of audio route descriptions
Web Speech API - MDN Web Docs
Microsoft Edge Immersive Reader
WCAG 2.2 Success Criterion 1.2.1: Audio-only and Video-only (Prerecorded)
Web Accessibility Initiative - Making Audio and Video Media Accessible

Audio Descriptions and Text-to-Speech for Wayfinding

Why Audio Matters

Types of Audio Support

1. Browser-Native Text-to-Speech

Edge Reading Mode

Safari Reading Mode

Chrome and Firefox Extensions

2. Web Speech API

Benefits

Implementation Considerations

3. Pre-Recorded Audio

When to Use Pre-Recorded Audio

Maintenance Requirements

Implementing Text-to-Speech on Your Website

Minimal Implementation

Enhanced Implementation

Best Practices

Content Structure

Voice-Friendly Writing

Controls and User Experience

Audio Descriptions for Routes

Example: Accessible Toilet Route

Format Guidelines

Multi-Modal Support

Open-Source Text-to-Speech Libraries

ResponsiveVoice.js

Other Libraries

Considerations

Testing Your Audio Implementation

Operational Implications

Maintenance

Staff Training

Governance

Resources