The Vecsys Research VoxSigma software suite for Linux
offers state-of-the-art performance for broadcast data and
conversational data in Arabic, Dutch, English, French, Italian,
Mandarin, Russian and Spanish. The VoxSigma API includes Unix
commands, C and C++ libraries, and import and export in XML format.
For more information, contact us.

Substantial advances in speech recognition technology have been
achieved over the last decade. This core technology, available in
multiple languages, serves as the basis for a range of applications
such as voice-interactive database access, as well as more demanding
tasks such as the transcription of broadcast data. Vecsys Research has
speech-to-text systems with vocabulary sizes up to 300K words for many
languages, including Arabic, Dutch, English, French, Italian, Mandarin,
Russian and Spanish, and is developing systems for more languages such
as Finnish, Greek, Polish and Portuguese.

Large vocabulary continuous speech recognition is a key technology
that can be used to enable content-based information access in audio
and video documents. Most of the linguistic information is encoded in
the audio channel of audiovisual data, which, once transcribed, can be
accessed using text-based tools. Via language identification, speech
recognition, and speaker recognition, spoken document retrieval can
support random access to relevant portions of audio documents using
specific criteria, reducing the time needed to identify recordings in
large multimedia databases. Applications include data mining,
news-on-demand, and media monitoring.
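
To illustrate the idea of random access by specific criteria, here is a minimal Python sketch of a text-based search over a time-stamped transcript. It is not the VoxSigma API or its XML format: the Segment structure, the find_segments function, the speaker and language labels, and the sample data are all hypothetical, chosen only to show how a keyword, a speaker label, and a language label could be combined to locate the matching portions of a recording.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Segment:
    """One transcribed stretch of audio (hypothetical structure, not a VoxSigma type)."""
    start: float    # start time in seconds
    end: float      # end time in seconds
    speaker: str    # label such as speaker recognition might assign
    language: str   # label such as language identification might assign
    text: str       # words produced by speech recognition


def find_segments(segments: List[Segment],
                  keyword: str,
                  speaker: Optional[str] = None,
                  language: Optional[str] = None) -> List[Segment]:
    """Return segments whose text contains the keyword (case-insensitive
    substring match) and that match the optional speaker and language labels."""
    needle = keyword.lower()
    hits = []
    for seg in segments:
        if needle not in seg.text.lower():
            continue
        if speaker is not None and seg.speaker != speaker:
            continue
        if language is not None and seg.language != language:
            continue
        hits.append(seg)
    return hits


# Invented sample transcript, for illustration only.
transcript = [
    Segment(0.0, 4.2, "spk1", "eng", "Good evening, here are the headlines."),
    Segment(4.2, 9.8, "spk1", "eng", "The election results were announced today."),
    Segment(9.8, 15.1, "spk2", "fre", "Les résultats ont été annoncés ce matin."),
    Segment(15.1, 20.0, "spk2", "eng", "Turnout in the election was high."),
]

# Print the time range of every passage where speaker spk2 mentions "election".
for seg in find_segments(transcript, "election", speaker="spk2"):
    print(f"{seg.start:.1f}-{seg.end:.1f}s [{seg.speaker}] {seg.text}")
```

Each hit carries its start and end times, so a player could jump directly to that portion of the audio instead of scanning the whole recording. A real system would likely use an index rather than a linear scan; the loop here is only meant to keep the sketch short.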