|
Match
|
Document |
Document Title |
|
|
6882972 |
Method for recognizing speech to avoid over-adaptation during online speaker adaptation
To avoid an over-adaptation of a current acoustic model (CAM) to certain and frequently occuring words for speech phrases during on-line speaker adaptation of speech recognizers it is suggested to...
|
|
|
6879953 |
Speech recognition with request level determination
A recognition result determination section determines whether or not a character string identified by a speech recognition section through a speech recognition process includes a desire type...
|
|
|
6868382 |
Speech recognizer
The generic word label series used for recognition of words uttered by unspecified speakers are stored in the vocabulary label network accumulation processing. The speech of a particular speaker is...
|
|
|
6859774 |
Error corrective mechanisms for consensus decoding of speech
Techniques are described for decreasing the number of errors when consensus decoding is used during speech recognition. A number of corrective rules are applied to confusion sets that are extracted...
|
|
|
6856952 |
Detecting a characteristic of a resonating cavity responsible for speech
A characteristic of one or more human resonating cavities may be utilized to provide information for speech recognition, independent from the actual sounds produced. In one embodiment, information...
|
|
|
6850887 |
Speech recognition in noisy environments
Methods and apparatus for providing speech recognition in noisy environments. An energy level associated with audio input is ascertained, and a decision is rendered on whether to accept the at...
|
|
|
6850885 |
Method for recognizing speech
To increase the accuracy and the flexibility of a method for recognizing speech which employs a keyword spotting process on the basis of a combination of a keyword model (KM) and a garbage model...
|
|
|
6845357 |
Pattern recognition using an observable operator model
Data structures, systems, and methods are aspects of pattern recognition using observable operator models (OOMs). OOMs are more efficient than Hidden Markov Models (HMMs). A data structure for an...
|
|
|
6839668 |
Store speech, select vocabulary to recognize word
For operating a speech recognition facility, the following steps are executed: a stream of concatenated speech items is received and stored; various such received speech items are recognized; and a...
|
|
|
6839667 |
Method of speech recognition by presenting N-best word candidates
A method for performing speech recognition can include receiving user speech and determining a plurality of potential candidates. Each of the candidates can provide a textual interpretation of the...
|
|
|
6839671 |
Learning of dialogue states and language model of spoken information system
In this invention dialogue states for a dialogue model are created using a training corpus of example human—human dialogues. Dialogue states are modelled at the turn level rather than at the move...
|
|
|
6832190 |
Method and array for introducing temporal correlation in hidden markov models for speech recognition
In the recognition of spoken language, phonemes of the language are modelled by hidden Markov models. A modified hidden Markov model includes a conditional probability of a feature vector dependent...
|
|
|
6823308 |
Speech recognition accuracy in a multimodal input system
A speech recognition method for use in a multimodal input system comprises receiving a multimodal input comprising digitized speech as a first modality input and data in at least one further...
|
|
|
6804648 |
Impulsivity estimates of mixtures of the power exponential distrubutions in speech modeling
A parametric family of multivariate density functions formed by mixture models from univariate functions of the type exp(−|x| β ) for modeling acoustic feature vectores are used in automatic...
|
|
|
6782362 |
Speech recognition method and apparatus utilizing segment models
A method and apparatus determine the likelihood of a sequence of words based in part on a segment model. The segment model includes trajectory expressions formed as the product of a polynomial...
|
|
|
6778958 |
Symbol insertion apparatus and method
An apparatus and method are provided for the insertion of punctuation marks into appropriate positions in a sentence. An acoustic processor processes input utterances to extract voice data, and...
|
|
|
6778959 |
System and method for speech verification using out-of-vocabulary models
A system and method for speech verification using out-of-vocabulary models includes a speech recognizer that has a model bank with system vocabulary word models, a garbage model, and one or more...
|
|
|
6772116 |
Method of decoding telegraphic speech
A method of selecting a language model for decoding received user spoken utterances in a speech recognition system can include a series of steps. The steps can include computing confidence scores...
|
|
|
6754625 |
Augmentation of alternate word lists by acoustic confusability criterion
There is provided a method for augmenting an alternate word list generated by a speech recognition system. The alternate word list includes at least one potentially correct word for replacing a...
|
|
|
6754629 |
System and method for automatic voice recognition using mapping
A method and system that combines voice recognition engines and resolves differences between the results of individual voice recognition engines using a mapping function. Speaker independent voice...
|
|
|
6708150 |
Speech recognition apparatus and speech recognition navigation apparatus
A speech recognition apparatus includes: a speech input device; a storage device that stores a recognition word indicating a pronunciation of a word to undergo speech recognition; and a speech...
|
|
|
6704707 |
Method for automatically and dynamically switching between speech technologies
A method for switching between speech recognition technologies. The method includes reception of an initial recognition request accompanied by control information. Recognition characteristics are...
|
|
|
6701293 |
Combining N-best lists from multiple speech recognizers
A method and system for utilizing multiple speech recognizers. The speech system includes a port through which an input audio stream may be received, at least two recognizers that may convert the...
|
|
|
6694296 |
Method and apparatus for the recognition of spelled spoken words
The speech recognizer includes a dictation language model providing a dictation model output indicative of a likely word sequence recognized based on an input utterance. A spelling language model...
|
|
|
6691091 |
Method for additive and convolutional noise adaptation in automatic speech recognition using transformed matrices
A noise adaptation system and method provide for noise adaptation in a speech recognition system. The method includes the steps of generating a reference model based on a training speech signal,...
|
|
|
6691088 |
Method of determining parameters of a statistical language model
An apparatus and method of determining parameters of a statistical language model for automatic speech recognition systems using a training corpus are disclosed. To improve the perplexity and the...
|
|
|
6678658 |
Speech processing using conditional observable maximum likelihood continuity mapping
A computer implemented method enables the recognition of speech and speech characteristics. Parameters are initialized of first probability density functions that map between the symbols in the...
|
|
|
6671669 |
combined engine system and method for voice recognition
A method and system that combines voice recognition engines and resolves any differences between the results of individual voice recognition engines. A speaker independent (SI) Hidden Markov Model...
|
|
|
6662159 |
Recognizing speech data using a state transition model
Detecting an unknown word in input speech data reduces the search space and the memory capacity for the unknown word. For this purpose, an HMM data memory stores data describing a state transition...
|
|
|
6634887 |
Methods and systems for tutoring using a tutorial model with interactive dialog
Methods and systems are provided for tutoring a student in solving a problem described in the form of a dialog with a student involving questions posed to the student and analysis of student...
|
|
|
6633845 |
Music summarization system and method
The invention provides a method and apparatus for automatically generating a summary or key phrase for a song. The song, or a portion thereof, is digitized and converted into a sequence of feature...
|
|
|
6629069 |
Speech recognizer using database linking
A speech recogniser is provided for identifying entries in a database. Results from the recognition of a user's speech are combined with each other and optionally with reference to data in the...
|
|
|
6629066 |
Method and system for building and running natural language understanding systems
A computerized method for building and running natural language understanding systems, wherein a natural language understanding system takes a sentence as input and returns some representation of...
|
|
|
6598017 |
Method and apparatus for recognizing speech information based on prediction
An apparatus for recognizing sound information includes a sound recognition unit for recognizing sound information. A knowledge base stores knowledge concerning a type of data represented by the...
|
|
|
6598019 |
Evaluation method, apparatus, and recording medium using optimum template pattern determination method, apparatus and optimum template pattern
To improve the precision in correction of an input sentence by using a template pattern for model sentence. A plurality of template patterns for the model sentence are provided beforehand. Each of...
|
|
|
6591236 |
Method and system for determining available and alternative speech commands
A method and system for use with a computer speech recognition system to efficiently identify valid system commands to users. The method involves a series of steps including: receiving data...
|
|
|
6574595 |
Method and apparatus for recognition-based barge-in detection in the context of subword-based automatic speech recognition
Robust, multi-faceted sub-word method for rapidly and reliably detecting a barge-in condition of a speaker talking while an automated audio prompt is being played. This sub-word method allows for...
|
|
|
6574597 |
Fully expanded context-dependent networks for speech recognition
A large vocabulary speech recognizer including a combined weighted network of transducers reflecting fully expanded context-dependent modeling of pronunciations and language that can be used with a...
|
|
|
6571209 |
Disabling and enabling of subvocabularies in speech recognition systems
A method for designating a subvocabulary for speech recognition systems includes the steps of providing a vocabulary of words each having a flag with a first value, selecting words to be eliminated...
|
|
|
6571208 |
Context-dependent acoustic models for medium and large vocabulary speech recognition with eigenvoice training
A reduced dimensionality eigenvoice analytical technique is used during training to develop context-dependent acoustic models for allophones. The eigenvoice technique is also used during run time...
|
|
|
6553342 |
Tone based speech recognition
A method and apparatus for speech recognition involves classifying ( 38 ) a digitized speech segment according to whether the speech segment comprises voiced or unvoiced speech and utilizing that...
|
|
|
6542866 |
Speech recognition method and apparatus utilizing multiple feature streams
A method and apparatus is provided for using multiple feature streams in speech recognition. In the method and apparatus, a feature extractor generates at least two feature vectors for a segment of...
|
|
|
6539351 |
High dimensional acoustic modeling via mixtures of compound gaussians with linear transforms
A method is provided for generating a high dimensional density model within an acoustic model for one of a speech and a speaker recognition system. Acoustic data obtained from at least one speaker...
|
|
|
6539353 |
Confidence measures using sub-word-dependent weighting of sub-word confidence scores for robust speech recognition
A method and apparatus is provided for speech recognition. The method and apparatus convert an analog speech signal into a digital signal and extract at least one feature from the digital signal. A...
|
|
|
6535849 |
Method and system for generating semi-literal transcripts for speech recognition systems
A method for generating a semi-literal transcript from a partial transcript of recorded speech. The method includes augmenting the partial transcript with words from one of a filled pause model and...
|
|
|
6526379 |
Discriminative clustering methods for automatic speech recognition
The discriminative clustering technique tests a provided set of Gaussian distributions corresponding to an acoustic vector space. A distance metric, such as the Bhattacharyya distance, is used to...
|
|
|
6526380 |
Speech recognition system having parallel large vocabulary recognition engines
A huge vocabulary speech recognition system for recognizing a sequence of spoken words, having an input means for receiving a time-sequential input pattern representative of the sequence of spoken...
|
|
|
6523005 |
Method and configuration for determining a descriptive feature of a speech signal
A method and also a configuration for determining a descriptive feature of a speech signal, in which a first speech model is trained with a first time pattern and a second speech model is trained...
|
|
|
6519562 |
Dynamic semantic control of a speech recognition system
A method and apparatus are provided for automatically recognizing words of spoken speech using a computer-based speech recognition system according to a dynamic semantic model. In an embodiment,...
|
|
|
6510410 |
Method and apparatus for recognizing tone languages using pitch information
A method and an apparatus for automatic recognition of tone languages, employing the steps of converting the words of speech into an electrical signal, generating spectral features from the...
|