What is Speech Management?
Speech management enables you to:
- control application functions by speaking rather than having to use a mouse or keyboard,
- capture data by speaking rather than typing, and
- prompt and confirm data capture with spoken or audio acknowledgement.
Applications benefits include:
- enhanced speed and accuracy of data capture,
- added flexibility of running applications in a variety of environments, and
- expanded operating scenarios for hands-free computing.
What is SpeechKit?
Chant SpeechKit handles the complexities of speech recognition and speech synthesis to minimise the programming necessary to develop software that speaks and listens.
It simplifies the process of managing Microsoft SAPI 5, Microsoft Speech Platform, Microsoft Windows Media (UWP), Nuance Dragon NaturallySpeaking, and Nuance Vocon 3200 recognizers, and managing Acapela, Cepstral, CereProc CereVoice, Microsoft SAPI 5, Microsoft Speech Platform, Microsoft Windows Media (UWP), and Nuance Vocalizer synthesizers.
SpeechKit includes C++, C++Builder, Delphi, Java, .NET Framework, and Silverlight class libraries to support all your programming languages and sample projects for popular IDEs—such as the latest Visual Studio from Microsoft and RAD Studio from Embarcadero.
The class libraries can be integrated with 32-bit, 64-bit, and Universal Windows Platform (UWP) applications.
Features
Chant SpeechKit handles the complexities of speech recognition and speech synthesis. The classes minimise the programming efforts necessary to construct software that speaks and listens.
A SpeechKit application can:
- Control application functions by speaking rather than having to use a mouse or keyboard;
- Prompt users for applicable data capture;
- Capture data by speaking rather than typing;
- Confirm data capture with spoken or audio acknowledgement;
- Transcribe audio buffers, files, and streams to text; and
- Synthesize speech to audio buffers, files, and streams.
Recognizers provide proprietary programming interfaces (i.e., APIs). SpeechKit supports the following recognizers and their APIs:
Recognizer |
Speech API |
Platforms |
Microsoft SAPI 5 (all languages) |
SAPI 5 |
Win64, Win32 |
Microsoft Speech Platform (all languages) |
MSP |
Win64, Win32 |
Microsoft Universal Windows Platform (all languages) |
Windows Media |
ARM, x86, x64 |
Nuance Dragon NaturallySpeaking (all languages) |
Dragon COM API |
Win64, Win32 |
Nuance VocCon 3200 V2 (all languages) |
VoCon 3200 V2 |
Win32 |
Nuance VocCon 3200 V3 (all languages) |
VoCon 3200 V3 |
Win32 |
Nuance VocCon 3200 V4 (all languages) |
VoCon 3200 V4 |
Win32 |
Synthesizers provide proprietary programming interfaces (i.e., APIs). SpeechKit supports the following synthesizers and their APIs:
Synthesizer |
Speech API |
Platforms |
Acapela (all languages) |
BabTTS |
Win64, Win32 |
Acepela (all languages) |
NSCAPI |
Win64, Win32 |
Cepstral (all languages) |
Cepstral Swift |
Win64, Win32 |
CereProc (all languages) |
CereVoice |
Win64, Win32 |
Microsoft SAPI 5 (all languages) |
SAPI 5 |
Win64, Win32 |
Microsoft Speech Platform (all languages) |
MSP |
Win64, Win32 |
Microsoft Universal Windows Platform (all languages) |
Windows Media |
ARM, x86, x64 |
Nuance Vocalizer Auotmotive (all languages) |
Vocalizer Auotmotive |
Win32 |
Nuance Vocalizer Expressive (all languages) |
Vocalizer Expressive |
Win64, Win32 |
Nuance Vocalizer Mobile (all languages) |
Vocalizer Mobile |
Win32 |
Nuance Vocalizer Network (all languages) |
Vocalizer Network |
Win64, Win32 |
Within Chant Developer Workbench, you can:
- Enumerate audio devices and speech engines for selection and testing of audio-, recognizer-, and synthesizer-specific features;
- Trace audio, recognition, and synthesis events;
- Support grammar activation and testing (requires GrammarKit); and
- Support TTS markup playback (requires VoiceMarkupKit).
Audio Device Management: Enumerate audio devices and inspect device properties.
Recognizer Management: Enumerate and test recognizers. Use the Speech Recognition window to recognize speech from a microphone, prerecorded audio, or simluate recognition from text. Trace recognition events in the Events window.
Synthesizer Management: Enumerate and test synthesizers. Use the Speech Synthesis window to synthesize text. Trace synthesis events in the Events window.
Related Articles
Call MicroWay on 1300 553 313 or email for more information.
For more information please contact
the MicroWay sales team: |
 |
Head Office
MicroWay Pty Ltd
PO Box 84,
Braeside, Victoria, 3195, Australia
Ph: 1300 553 313
Fax: 1300 132 709
email: sales@microway.com.au
ABN: 56 129 024 825 |
Sydney Sales Office
MicroWay Pty Ltd
PO Box 1733,
Crows Nest, NSW 1585, Australia
Tel: 1300 553 313
Fax: 1300 132 709
email: sales@microway.com.au
ABN: 56 129 024 825 |
New Zealand Sales Office
MicroWay Pty Ltd (NZ)
PO Box 912026
Victoria Street West
Auckland 1142, New Zealand
Tel: 0800 450 168
email: sales@microway.co.nz |
International: call
+61 3 9580 1333, fax +61 3 9580 8995
|
|