Phone - Australia: 1300 553 313
Hotline - New Zealand: 0800 450 168
Chant Logo


Fine-tune Speech Synthesis Using Text-to-Speech Markup

Fine tune speech synthesis using text markup with VoiceMarkupKit 5

Avoid putting your end users to sleep with boring synthesized speech. You will be amazed at how adjusting the speed, adding pauses, injecting emphasis, and switching voices can break up the monotony of synthesized speech.

Text-to-speech (TTS) markup is text with imbedded indicators that control speech synthesis from the text. Speaking qualities such as the speed, pitch, emphasis, and word pronunciation may be tailored in reproducing speech from text.

A TTS grammar is a collection TTS markup. A text-to-speech engine (i.e., synthesizer) uses TTS markup to enhance its ability to synthesize speech from text and generate the audio for playback.

What is Markup Management?

Markup management enables you to:

  • fine-tune speech synthesis,
  • tailor synthesis for specific voices, and
  • integrate dynamic markup generation as part of deployed applications.

Applications benefits include:

  • enhanced quality of TTS playback,
  • added flexibility to use various synthesizers and tailor markup at runtime, and
  • expanded adaptability to run with available technology on the deployed system.

What is VoiceMarkupKit?

Chant VoiceMarkupKit is text-to-speech (TTS) markup language management software that enables you to generate TTS markup to enhance the playback quality when synthesizing.

The VoiceMarkupKit class library includes a voice markup management class that provides you a simple way to generate text-to-speech markup. Your application can markup text as part of its runtime operation to enable real-time customization and tailoring of your text-to-speech environment.

It simplifies the process of generating Microsoft SAPI 5, Nuance L&H Native Control Sequence, and W3C SSML markup language to use with your favorite speech synthesizers.

VoiceMarkupKit includes C++, C++Builder, Delphi, Java, .NET Framework, and Silverlight class library formats to support all your programming languages and sample projects for popular IDEs—such as the latest Visual Studio from Microsoft and RAD Studio from Embarcadero.

The class libraries can be integrated with 32-bit, 64-bit, and Universal Windows Platform (UWP) applications.



The goal of text-to-speech markup is to enhance the quality of the text-to-speech playback. With Chant VoiceMarkupKit, you can:

  • Generate markup language in Microsoft SAPI 5 XML, Nuance L&H Native Control Sequence, and W3C SSML syntax;
  • Generate pronunciation phonemes for Acapela, Cepstral, Microsoft SAPI 5, Microsoft Speech Platform, and Nuance Vocalizer synthesizers;
  • Dynamically switch among speech APIs and syntax formats.

Chant VoiceMarkupKit handles the complexities of generating text-to-speech markup for various markup syntax. This enables you to tailor speech synthesis to produce sounds in familiar dialects, speaking patterns, and accents of your end users. You can adjust TTS markup as needed for the synthesizer to enhance the playback quality when synthesizing.

Synthesizers (i.e., speech APIs) support unique markup syntax. VoiceMarkupKit supports the following synthesizers and their markup syntax:

Synthesizer Speech API Markup Syntax
Cepstral (all languages) Cepstral Swift W3C SSML
Microsoft SAPI 5 (all languages) SAPI 5 SAPI 5 XML Markup, W3C SSML
Microsoft MSP (all languages) MSP W3C SSML
Microsoft Universal Windows Platform (all languages) Windows Media W3C SSML
Nuance Vocalizer Automotive (all languages) Nuance Vocalizer Automotive L&H Native Control Sequence, SAPI 5 XML Markup
Nuance Vocalizer Expressive (all languages) Nuance Vocalizer Expressive L&H Native Control Sequence
Nuance Vocalizer Mobile (all languages) Nuance Vocalizer Mobile L&H Native Control Sequence
Nuance Vocalizer Network (all languages) Nuance Vocalizer Network L&H Native Control Sequence, SAPI 5 XML Markup, W3C SSML

By generating TTS markup at runtime, your application can maximize the quality of TTS playback and offer your end users the flexibility of using various synthesizers with your application.

Within Chant Developer Workbench, you can:

  • Create and edit documents with TTS markup;
  • Generate TTS markup;
  • Generate word pronunciation phonemes;
  • Edit word pronunciation phonemes (requires LexiconKit); and
  • Playback text with TTS markup (requires SpeechKit).

SSML Editing: Edit L&H Native Control Sequence, SAPI 5, and W3C Speech Synthesis Markup Language (SSML) faster with built-in intelliprompt that suggest valid markup syntax.

SSML Error Debugging: Automatic syntax checking displays visual cues and syntax error messages in the Error window. Click on the error to take you to the location of it in the document window

TTS Playback: Playback text-to-speech markup with a click of the button. Highlight specific text or playback the entire document.


Related Articles

Call MicroWay on 1300 553 313 or email for more information.


For more information please contact the MicroWay sales team: buynow
Head Office
MicroWay Pty Ltd
PO Box 84,
Braeside, Victoria, 3195, Australia
Ph: 1300 553 313
Fax: 1300 132 709
email: sales@microway.com.au
ABN: 56 129 024 825
Sydney Sales Office
MicroWay Pty Ltd
PO Box 1733,
Crows Nest, NSW 1585, Australia
Tel: 1300 553 313
Fax: 1300 132 709
email: sales@microway.com.au
ABN: 56 129 024 825
New Zealand Sales Office
MicroWay Pty Ltd (NZ)
PO Box 912026
Victoria Street West
Auckland 1142, New Zealand
Tel: 0800 450 168
email: sales@microway.co.nz

International: call +61 3 9580 1333, fax +61 3 9580 8995

© 1995-2021 MicroWay Pty Ltd. All Rights Reserved. Terms and Privacy Policy.