Sign up


Match Document Document Title
7266236 Accelerated handwritten symbol recognition in a pen based tablet computer  
The present invention provides a method and apparatus for accelerated handwritten symbol recognition in a pen based tablet computer. In one embodiment, handwritten symbols are translated into...
7263485 Robust detection and classification of objects in audio using limited training data  
A method (200) and apparatus (100) for classifying a homogeneous audio segment are disclosed. The homogeneous audio comprises a sequence of audio samples (x(n)). The method (200) starts by forming...
7263484 Phonetic searching  
An improved method and apparatus is disclosed which uses probabilistic techniques to map an input search string with a prestored audio file, and recognize certain portions of a search string...
7260532 Hidden Markov model generation apparatus and method with selection of number of states  
A model generation unit (17) is provided. The model generation unit includes an alignment module (80) arranged to receive pairs of sequences of parameter frame vectors from a buffer (16) and to...
7254538 Nonlinear mapping for feature extraction in automatic speech recognition  
The present invention successfully combines neural-net discriminative feature processing with Gaussian-mixture distribution modeling (GMM). By training one or more neural networks to generate...
7233891 Natural language sentence parser  
A method, computer program product, and apparatus for parsing consecutive sentences which includes tokenizing the words of the sentence and putting them through an iterative inductive processor....
7231019 Automatic identification of telephone callers based on voice characteristics  
A method and apparatus are provided for identifying a caller of a call from the caller to a recipient. A voice input is received from the caller, and characteristics of the voice input are applied...
7231352 Method for computer-supported speech recognition, speech recognition system and control device for controlling a technical system and telecommunications device  
The speech recognition rate which is necessary is determined for a selected speech recognition application. The information content of the feature vector components which is at least necessary to...
7225125 Speech recognition system trained with regional speech characteristics  
A speech recognition system uses speech recognition models which are specifically trained and optimized for users residing in a particular geographic area or region. The speech models are trained...
7216066 Method and apparatus for generating and managing a language model data structure  
A method is presented comprising assigning each of a plurality of segments comprising a received corpus to a node in a data structure denoting dependencies between nodes, and calculating a...
7209883 Factorial hidden markov model for audiovisual speech recognition  
A speech recognition method includes use of synchronous or asynchronous audio and a video data to enhance speech recognition probabilities. A two stream factorial hidden Markov model is trained and...
7209881 Preparing acoustic models by sufficient statistics and noise-superimposed speech data  
Noise-superimposed speech data is grouped according to acoustic similarity, and sufficient statistics are prepared using the speech data in each of the groups. A group acoustically similar to voice...
7203368 Embedded bayesian network for pattern recognition  
A pattern recognition procedure forms a hierarchical statistical model using a hidden Markov model and a coupled hidden Markov model. The hierarchical statistical model supports a pa 20 layer...
7188064 System and method for automatic semantic coding of free response data using Hidden Markov Model methodology  
A system and method for coding text data wherein a first group of text data is coded using a Viterbi algorithm using a Hidden Markov model. The Hidden Markov Model computes a probable coding...
7181399 Recognizing the numeric language in natural spoken dialogue  
A system for recognizing connected digits in natural spoken dialogue includes a speech recognition processor that receives unconstrained fluent input speech and produces a string of words that can...
7181391 Method, apparatus, and system for bottom-up tone integration to Chinese continuous speech recognition system  
According to one aspect of the invention, a method is provided in which knowledge about tone characteristics of a tonal syllabic language is used to model speech at various levels in a bottom-up...
7171043 Image recognition using hidden markov models and coupled hidden markov models  
An image processing system useful for facial recognition and security identification obtains an array of observation vectors from a facial image to be identified. A Viterbi algorithm is applied to...
7165028 Method of speech recognition resistant to convolutive distortion and additive distortion  
A speech recognizer operating in both ambient noise (additive distortion) and microphone changes (convolutive distortion) is provided. For each utterance to be recognized the recognizer system...
7165029 Coupled hidden Markov model for audiovisual speech recognition  
A speech recognition method includes use of synchronous or asynchronous audio and a video data to enhance speech recognition probabilities. A two stream coupled hidden Markov model is trained and...
7136852 Case-based reasoning similarity metrics implementation using user defined functions  
A database system and a method for case-based reasoning are disclosed. The database system includes an exemplar object within the database configured to accept and store a plurality of exemplar...
7136802 Method and apparatus for detecting prosodic phrase break in a text to speech (TTS) system  
Methods for processing speech data are described herein. In one aspect of the invention, an exemplary method includes receiving a text sentence comprising a plurality of words, each of the...
7127393 Dynamic semantic control of a speech recognition system  
A method and apparatus are provided for automatically recognizing words of spoken speech using a computer-based speech recognition system according to a dynamic semantic model. In an embodiment,...
7120580 Method and apparatus for recognizing speech in a noisy environment  
An apparatus and a concomitant method for speech recognition. In one embodiment, the present method is referred to as a “Dynamic Noise Compensation” (DNC) method where the method estimates the mod...
7103541 Microphone array signal enhancement using mixture models  
A system and method facilitating signal enhancement utilizing mixture models is provided. The invention includes a signal enhancement adaptive system having a speech model, a noise model and a...
7089185 Embedded multi-layer coupled hidden Markov model  
An arrangement is provided for embedded coupled hidden Markov model. To train an embedded coupled hidden Markov model, training data is first segmented into uniform segments at different layers of...
7089183 Accumulating transformations for hierarchical linear regression HMM adaptation  
A new iterative hierarchical linear regression method for generating a set of linear transforms to adapt HMM speech models to a new environment for improved speech recognition is disclosed. The...
7085720 Method for task classification using morphemes  
The invention concerns a method of task classification using morphemes which operates on the task objective of a user. The morphemes may be generated by clustering selected ones of the salient...
7076102 Video monitoring system employing hierarchical hidden markov model (HMM) event learning and classification  
A method and apparatus are disclosed for automatically learning and identifying events in image data using hierarchical HMMs to define and detect one or more events. The hierarchical HMMs include...
7076422 Modelling and processing filled pauses and noises in speech recognition  
A speech recognition system recognizes filled pause utterances made by a speaker. In one embodiment, an ergodic model is used to acoustically model filled pauses that provides flexibility allowing...
7069215 Systems and methods for extracting meaning from multimodal inputs using finite-state devices  
Finite-state systems and methods allow multiple input streams to be parsed and integrated by a single finite-state device. These systems and methods not only address multimodal recognition, but are...
7062433 Method of speech recognition with compensation for both channel distortion and background noise  
A method of speech recognition with compensation is provided by modifying HMM models trained on clean speech with cepstral mean normalization. For all speech utterances the MFCC vector is...
7058576 Method of calculating HMM output probability and speech recognition apparatus  
The invention relates to speech recognition based on HMM, in which speech recognition is performed by performing vector quantization and obtaining an output probability by table reference, and the...
7054814 Method and apparatus of selecting segments for speech synthesis by way of speech segment recognition  
A speech segment search unit searches a speech database for speech segments that satisfy a phonetic environment, and a HMM learning unit computes the HMMs of phonemes on the basis of the search...
7050974 Environment adaptation for speech recognition in a speech communication system  
A speech communication system comprising a speech input terminal and a speech recognition apparatus which can communicate with each other through a wire or wireless communication network wherein...
7035802 Recognition system using lexical trees  
The dynamic programming technique employs a lexical tree that is encoded in computer memory as a flat representation in which the nodes of each generation occupy contiguous memory locations. The...
7035801 Text language detection  
A method of determining the language of a text message received by a mobile telecommunications device indicates receiving an input text message at a mobile telecommunications device; analyzing the...
7027979 Method and apparatus for speech reconstruction within a distributed speech recognition system  
A method and apparatus for speech reconstruction within a distributed speech recognition system is provided herein. Missing MFCCs are reconstructed and utilized to generate speech. Particularly,...
7024360 System for reconstruction of symbols in a sequence  
A method of reconstructing a damaged sequence of symbols where some symbols are missing is provided in which statistical parameters of the sequence are used with confidence windowing techniques to...
7016837 Voice recognition system  
An initial combination HMM 16 is generated from a voice HMM 10 having multiplicative distortions and an initial noise HMM of additive noise, and at the same time, a Jacobian matrix J is calculated...
7013273 Speech recognition based captioning system  
A system and associated method of converting audio data from a television signal into textual data for display as a closed caption on an display device is provided. The audio data is decoded and...
7013276 Method of assessing degree of acoustic confusability, and system therefor  
Predicting speech recognizer confusion where utterances can be represented by any combination of text form and audio file. The utterances are represented with an intermediate representation that...
7003456 Methods and systems of routing utterances based on confidence estimates  
A computer-based method of routing a message to a system includes receiving a message, and processing the message using large-vocabulary continuous speech recognition to generate a string of text...
7003460 Method and apparatus for an adaptive speech recognition system utilizing HMM models  
In speech recognition, phonemes of a language are modelled by a hidden Markov model, whereby each status of the hidden Markov model is described by a probability density function. For speech...
6996525 Selecting one of multiple speech recognizers in a system based on performance predections resulting from experience  
A method for selecting a speech recognizer from a number of speech recognizers in a speech recognition system. The speech recognition system receives an audio stream from an application and derives...
6990442 Parsing with controlled tokenization  
The present invention provides a parsing technique wherein a parsing process provides feedback to a tokenizer to select an appropriate sub-tokenizer process corresponding to a grammar rule being...
6980954 Search method based on single triphone tree for large vocabulary continuous speech recognizer  
A search method based on a single triphone tree for large vocabulary continuous speech recognizer is disclosed in which speech signal are received. Tokens are propagated in a phonetic tree to...
6970819 Speech synthesis device  
The principal object of this invention is to provide a suitable control method for closing length with respect to phonemes (such as unvoiced plosive consonants) having a closing interval, and as a...
6963837 Attribute-based word modeling  
An attribute-based speech recognition system is described. A speech pre-processor receives input speech and produces a sequence of acoustic observations representative of the input speech. A...
6961703 Method for speech processing involving whole-utterance modeling  
A speech verification process involves comparison of enrollment and test speech data and an improved method of comparing the data is disclosed, wherein segmented frames of speech are analyzed...
6950796 Speech recognition by dynamical noise model adaptation  
The invention provides a Hidden Markov Model (132) based automated speech recognition system (100) that dynamically adapts to changing background noise by detecting long pauses in speech, and for...