Matches 1 - 32 out of 32


Match Document Document Title
8145488 Parameter clustering and sharing for variable-parameter hidden markov models  
A speech recognition system uses Gaussian mixture variable-parameter hidden Markov models (VPHMMs) to recognize speech. The VPHMMs include Gaussian parameters that vary as a function of at least...
8140333 Probability density function compensation method for hidden markov model and speech recognition method and apparatus using the same  
A probability density function compensation method used for a continuous hidden Markov model and a speech recognition method and apparatus, the probability density function compensation method...
8140334 Apparatus and method for recognizing voice  
An apparatus and method for recognizing voice. The apparatus includes a feature vector extraction unit dividing an input voice signal into predetermined unit regions, and extracting feature vectors...
8078465 System and method for detection and analysis of speech  
Certain aspects and embodiments of the present invention are directed to systems and methods for monitoring and analyzing the language environment and the development of a key child. A key child's...
8014617 Decoding apparatus, dequantizing method, distribution determining method, and program thereof  
A decoding apparatus includes a random number generating section and a decoding section. The random number generating section generates random numbers according to distribution of original data...
8005306 Decoding apparatus, inverse quantization method, and computer readable medium  
A decoding apparatus includes a classification section, a distribution-information generation section and an inverse-quantization-value generation section. The classification section classifies...
8005666 Automatic system for temporal alignment of music audio signal with lyrics  
An automatic system for temporal alignment between a music audio signal and lyrics is provided. The automatic system can prevent accuracy for temporal alignment from being lowered due to the...
7941317 Low latency real-time speech transcription  
Systems and methods for low-latency real-time speech recognition/transcription. A discriminative feature extraction, such as a heteroscedastic discriminant analysis transform, in combination with a...
7930181 Low latency real-time speech transcription  
Systems and methods for low-latency real-time speech recognition/transcription. A discriminative feature extraction, such as a heteroscedastic discriminant analysis transform, in combination with a...
7805301 Covariance estimation for pattern recognition  
A reliable full covariance matrix estimation algorithm for pattern unit's state output distribution in pattern recognition system is discussed. An intermediate hierarchical tree structure is built...
7778463 Pattern recognition system, pattern recognition method, and pattern recognition program  
A pattern recognition system, pattern recognition method, and pattern recognition program capable of increasing the accuracy in computing the false acceptance probability and capable of ensuring a...
7571097 Method for training of subspace coded gaussian models  
A method for compressing multiple dimensional gaussian distributions with diagonal covariance matrixes includes clustering a plurality of gaussian distributions in a multiplicity of clusters for...
7505950 Soft alignment based on a probability of time alignment  
Systems and methods are provided for performing soft alignment in Gaussian mixture model (GMM) based and other vector transformations. Soft alignment may assign alignment probabilities to source...
7454336 Variational inference and learning for segmental switching state space models of hidden speech dynamics  
A system and method that facilitate modeling unobserved speech dynamics based upon a hidden dynamic speech model in the form of segmental switching state space model that employs model parameters...
7454341 Method, apparatus, and system for building a compact model for large vocabulary continuous speech recognition (LVCSR) system  
According to one aspect of the invention, a method is provided in which a mean vector set and a variance vector set of a set of N Gaussians are divided into multiple mean sub-vector sets and...
7263485 Robust detection and classification of objects in audio using limited training data  
A method (200) and apparatus (100) for classifying a homogeneous audio segment are disclosed. The homogeneous audio comprises a sequence of audio samples (x(n)). The method (200) starts by forming...
6633842 Speech recognition front-end feature extraction for noisy speech  
An estimate of clean speech vector, typically Mel-Frequency Cepstral Coefficient (MFCC) given its noisy observation is provided. The method makes use of two Gaussian mixtures. The first one is...
6633845 Music summarization system and method  
The invention provides a method and apparatus for automatically generating a summary or key phrase for a song. The song, or a portion thereof, is digitized and converted into a sequence of feature...
6526379 Discriminative clustering methods for automatic speech recognition  
The discriminative clustering technique tests a provided set of Gaussian distributions corresponding to an acoustic vector space. A distance metric, such as the Bhattacharyya distance, is used to...
6502072 Two-tier noise rejection in speech recognition  
A method and apparatus is provided for two-tier noise rejection in speech recognition. The method and apparatus convert an analog speech signal into a digital signal and extract features from the...
6327565 Speaker and environment adaptation based on eigenvoices  
A set of speaker dependent models is trained upon a comparatively large number of training speakers, one model per speaker, and model parameters are extracted in a predefined order to construct a...
6324510 Method and apparatus of hierarchically organizing an acoustic model for speech recognition and adaptation of the model to unseen domains  
A method of organizing an acoustic model for speech recognition is comprised of the steps of calculating a measure of acoustic dissimilarity of subphonetic units. A clustering technique is...
6223159 Speaker adaptation device and speech recognition device  
Voice feature quantity extractor extracts feature vector time-series data by acoustic feature quantity analysis of the speaker's voice. Reference speaker-dependent conversion factor computation...
6092045 Method and apparatus for speech recognition  
Comparing a series of observations representing unknown speech, to stored models representing known speech, the series of observations being divided into at least two blocks each comprising two or...
6076056 Speech recognition system for recognizing continuous and isolated speech  
Speech recognition is performed by receiving isolated speech training data indicative of a plurality of discretely spoken training words, and receiving continuous speech training data indicative of...
6047256 Device for generating a reference pattern with a continuous probability density function derived from feature code occurrence probability distribution  
In a system for recognizing a time sequence of feature vectors of a speech signal representative of an unknown utterance as one of a plurality of reference patterns, a generator (11) for generating...
6044344 Constrained corrective training for continuous parameter system  
A method is provided for training a statistical pattern recognition decoder on new data while preserving its accuracy of old, previously learned data. Previously learned data are represented as...
5991442 Method and apparatus for pattern recognition utilizing gaussian distribution functions  
The present invention provides a method and apparatus for performing pattern recognition on given information such as speech data or image data with a reduced amount of calculations of the degree...
5857169 Method and system for pattern recognition based on tree organized probability densities  
A time-sequential input pattern (20), which is derived from a continual physical quantity, such as speech is recognized. The system includes input means (30), which accesses the physical quantity...
5452397 Method and system for preventing entry of confusingly similar phases in a voice recognition system vocabulary list  
A method and system prevent the entry of confusingly similar phrases (60) in a vocabulary list (10) of a speaker-dependent voice recognition system. The method first receives (20, 30, 50) and...
5450523 Training module for estimating mixture Gaussian densities for speech unit models in speech recognition systems  
A model-training module generates mixture Gaussian density models from speech training data for continuous, or isolated word speech recognition systems. Speech feature sequences are labeled into...
4783804 Hidden Markov model speech recognition arrangement  
Markov model speech pattern templates are formed for speech analysis systems by analyzing identified speech patterns to generate frame sequences of acoustic feature signals representative thereof....
Matches 1 - 32 out of 32