Phone - Australia: 1300 553 313
Hotline - New Zealand: 0800 450 168
Chant Logo


Fine-tune Speech Synthesis Using Text-to-Speech Markup

Fine tune speech synthesis using text markup with VoiceMarkupKit

Avoid putting your end users to sleep with boring synthesized speech. You will be amazed at how adjusting the speed, adding pauses, injecting emphasis, and switching voices can break up the monotony of synthesized speech.

Text-to-speech (TTS) markup is text with imbedded indicators that control speech synthesis from the text. Speaking qualities such as the speed, pitch, emphasis, and word pronunciation may be tailored in reproducing speech from text.

A TTS grammar is a collection TTS markup. A text-to-speech engine (i.e., synthesizer) uses TTS markup to enhance its ability to synthesize speech from text and generate the audio for playback.

What is Markup Management?

Markup management enables you to:

  • fine-tune speech synthesis,
  • tailor synthesis for specific voices, and
  • integrate dynamic markup generation as part of deployed applications.

Applications benefits include:

  • enhanced quality of TTS playback,
  • added flexibility to use various synthesizers and tailor markup at runtime, and
  • expanded adaptability to run with available technology on the deployed system.

What is VoiceMarkupKit?

Chant VoiceMarkupKit is text-to-speech (TTS) markup language management software that enables you to generate TTS markup to enhance the playback quality when synthesizing.

VoiceMarkupKit provides you a simple way to generate text-to-speech markup. Applications can markup text as part of its runtime operation to enable real-time customization and tailoring of your text-to-speech environment.

It simplifies the process of generating Acapela TTS Tag, Microsoft SAPI 5, and W3C SSML markup language to use with your favorite speech synthesizers.

VoiceMarkupKit includes C++, C++Builder, Delphi, Java, and .NET Framework, class library formats to support all your programming languages and sample projects for popular IDEs—such as the latest Visual Studio from Microsoft and RAD Studio from Embarcadero.

The class libraries can be integrated with 32-bit and 64-bit applications for Windows platforms.

Voice Markup Architecture

VoiceMarkupKit provides a simple way to generate text-to-speech markup. Applications markup text to enable real-time customization and tailoring of text-to-speech.

Applications can specify a markup language, the markup options, and generate markup prior to synthesis. Applications uses VoiceMarkupKit to manage the activities for generating the markup in the needed format. VoiceMarkupKit supports the following markup syntax:

VoiceMarkupKit encapsulates all of the technologies necessary to make the process of generating markup simple and efficient for your application.

Instantiate VoiceMarkupKit to generate markup within the application and destroy VoiceMarkupKit to release its resources when markup generation is no longer needed.

Feature Summary

The goal of text-to-speech markup is to enhance the quality of the text-to-speech playback. With Chant VoiceMarkupKit, you can:

Chant VoiceMarkupKit handles the complexities of generating text-to-speech markup for various markup syntax. Applications can tailor speech synthesis to produce sounds in familiar dialects, speaking patterns, and accents of end users. Applications can adjust TTS markup as needed for the synthesizer to enhance the playback quality when synthesizing.

Synthesizers (i.e. speech APIs) interpret different markup syntax. VoiceMarkupKit supports the following markup syntax:


Speech API Markup Syntax
Acapela TTS AcaTTS Tags
Cepstral Swift W3C SSML
CereProc CereVoice W3C SSML, CereVoice Tagset
Microsoft SAPI 5 SAPI 5 XML Markup, W3C SSML (SAPI 5.3+)
Microsoft Speech Platform W3C SSML
Microsoft .NET System.Speech W3C SSML
Microsoft .NET Microsoft.Speech W3C SSML
Microsoft WindowsMedia (UWP) W3C SSML

By generating TTS markup at runtime, your application can maximize the quality of TTS playback and offer your end users the flexibility of using various synthesizers with your application.

Within Chant Developer Workbench, you can:

  • Create and edit documents with TTS markup;
  • Generate TTS markup;
  • Generate word pronunciation phonemes;
  • Edit word pronunciation phonemes (requires LexiconKit); and
  • Playback text with TTS markup (requires SpeechKit).

SSML Editing: Edit L&H Native Control Sequence, SAPI 5, and W3C Speech Synthesis Markup Language (SSML) faster with built-in intelliprompt that suggest valid markup syntax.

SSML Error Debugging: Automatic syntax checking displays visual cues and syntax error messages in the Error window. Click on the error to take you to the location of it in the document window

TTS Playback: Playback text-to-speech markup with a click of the button. Highlight specific text or playback the entire document.


Call MicroWay on 1300 553 313 or email for more information.


For more information please contact the MicroWay sales team: buynow
Head Office
MicroWay Pty Ltd
PO Box 84,
Braeside, Victoria, 3195, Australia
Ph: 1300 553 313
Fax: 1300 132 709
email: sales@microway.com.au
ABN: 56 129 024 825
Sydney Sales Office
MicroWay Pty Ltd
PO Box 1733,
Crows Nest, NSW 1585, Australia
Tel: 1300 553 313
Fax: 1300 132 709
email: sales@microway.com.au
ABN: 56 129 024 825
New Zealand Sales Office
MicroWay Pty Ltd (NZ)
PO Box 912026
Victoria Street West
Auckland 1142, New Zealand
Tel: 0800 450 168
email: sales@microway.co.nz

International: call +61 3 9580 1333, fax +61 3 9580 8995

© 1995-2023 MicroWay Pty Ltd. All Rights Reserved. Terms and Privacy Policy.