| Match | Document | Document Title |
|
|
7620547 |
Spoken man-machine interface with speaker identification
The present invention provides a method for operating and/or for controlling a man-machine interface unit (MMI) for a finite user group environment. Utterances out of a group of users are repeatedly...
|
|
|
7617103 |
Incrementally regulated discriminative margins in MCE training for speech recognition
A method and apparatus for training an acoustic model are disclosed. A training corpus is accessed and converted into an initial acoustic model. Scores are calculated for a correct class and...
|
|
|
7617101 |
Method and system for utterance verification
A method and system for utterance verification is disclosed. It first extracts a sequence of feature vectors from a speech signal. At least one candidate string is obtained after speech recognition....
|
|
|
7603278 |
Segment set creating method and apparatus
A segment set before updating is read, and clustering that considers the phoneme environment is performed on it. For each cluster obtained by the clustering, a representative segment of a segment set...
|
|
|
7603276 |
Standard-model generation for speech recognition using a reference model
A standard model creating apparatus which provides a high-precision standard model used for pattern recognition such as speech recognition, character recognition, or image recognition using a...
|
|
|
7603272 |
System and method of word graph matrix decomposition
Disclosed is a system and method of decomposing a lattice transition matrix into a block diagonal matrix. The method is applicable to automatic speech recognition but can be used in other contexts...
|
|
|
7590537 |
Speaker clustering and adaptation method based on the HMM model variation information and its apparatus for speech recognition
A speech recognition method and apparatus perform speaker clustering and speaker adaptation using average model variation information over speakers while analyzing the quantity variation amount and...
|
|
|
7590536 |
Voice language model adjustment based on user affinity
Methods, systems and computer readable medium for improving the accuracy of voice processing are provided. Embodiments of the present invention generally provide methods, systems and articles of...
|
|
|
7587320 |
Automatic segmentation in speech synthesis
Systems and methods for automatically segmenting speech inventories. A set of Hidden Markov Models (HMMs) are initialized using bootstrap data. The HMMs are next re-estimated and aligned to produce...
|
|
|
7587319 |
Speech recognition circuit using parallel processors
A speech recognition circuit comprises an input buffer for receiving processed speech parameters. A lexical memory contains lexical data for word recognition. The lexical data comprises a plurality...
|
|
|
7587314 |
Single-codebook vector quantization for multiple-rate applications
This invention relates to a method, a device and a software application product for N-level quantization of vectors, wherein N is selectable prior to said quantization from a set of at least two...
|
|
|
7584100 |
Method and system for clustering using generalized sentence patterns
A method and system for clustering documents based on generalized sentence patterns of the topics of the documents is provided. A generalized sentence patterns (“GSP”) system identifies a...
|
|
|
7580836 |
Speaker adaptation using weighted feedback
In some embodiments, the invention includes calculating estimated weights for identified errors in recognition of utterances. Sections of the utterances are marked as being misrecognized and the...
|
|
|
7574358 |
Natural language system and method based on unisolated performance metric
A natural language business system and method is developed to understand the underlying meaning of a person's speech, such as during a transaction with the business system. The system includes a...
|
|
|
7571098 |
System and method of spoken language understanding using word confusion networks
Word lattices that are generated by an automatic speech recognition system are used to generate a modified word lattice that is usable by a spoken language understanding module. In one embodiment,...
|
|
|
7567903 |
Low latency real-time vocal tract length normalization
A method and apparatus for performing speech recognition are provided. A Vocal Tract Length Normalized acoustic model for a speaker is generated from training data. Speech recognition is performed...
|
|
|
7565213 |
Device and method for analyzing an information signal
A significant short-time spectrum is extracted from an information signal, the means for extracting being configured to extract such short-time spectra which come closer to a specific...
|
|
|
7562019 |
Automated testing of voice recognition software
A method and a system for testing a voice enabled application on a target device, the method including conducting one or more interactions with the target device, at least some of the interactions...
|
|
|
7562015 |
Distributed pattern recognition training method and system
A distributed pattern recognition training method includes providing data communication between at least one central pattern analysis node and a plurality of peripheral data analysis sites. The...
|
|
|
7542901 |
Methods and apparatus for generating dialog state conditioned language models
Techniques are provided for generating improved language modeling. Such improved modeling is achieved by conditioning a language model on a state of a dialog for which the language model is...
|
|
|
7533018 |
Tailored speaker-independent voice recognition system
A tailored speaker-independent voice recognition system has a speech recognition dictionary (360) with at least one word (371). That word (371) has at least two transcriptions (373), each...
|
|
|
7529668 |
System and method for implementing a refined dictionary for speech recognition
A system and method for implementing a refined dictionary for speech recognition includes a database analyzer that initially identifies first vocabulary words that are present in a training...
|
|
|
7529659 |
Method and apparatus for identifying an unknown work
A system for determining an identity of a received work. The system receives audio data for an unknown work. The audio data is divided into segments. The system generates a signature of the unknown...
|
|
|
7516071 |
Method of modeling single-enrollment classes in verification and identification tasks
In automatic pattern recognition, in the context of patterns being observed either in the same or a new environment, e.g. a new acoustic channel, as compared to the one seen during the previous...
|
|
|
7509259 |
Method of refining statistical pattern recognition models and statistical pattern recognizers
A device (800) performs statistical pattern recognition using model parameters that are refined by optimizing an objective function that includes a term for many items of training data for which...
|
|
|
7502736 |
Voice registration method and system, and voice recognition method and system based on voice registration method and system
Disclosed is a voice registration method for voice recognition, comprising the steps of analyzing a spectrum of a sound signal inputted from the outside; extracting predetermined language units for...
|
|
|
7499892 |
Information processing apparatus, information processing method, and program
An information processing apparatus includes a first learning unit adapted to learn a first SOM (self-organization map), based on a first parameter extracted from an observed value, a winner node...
|
|
|
7496693 |
Wireless enabled speech recognition (SR) portable device including a programmable user trained SR profile for transmission to external SR enabled PC
A method of interacting with a speech recognition (SR)-enabled personal computer (PC) is provided in which a user SR profile is transferred from a wireless-enabled device to the SR-enabled PC....
|
|
|
7496512 |
Refining of segmental boundaries in speech waveforms using contextual-dependent models
A method and apparatus are provided for refining segmental boundaries in speech waveforms. Contextual acoustic feature similarities are used as a basis for clustering adjacent phoneme speech units,...
|
|
|
7496509 |
Methods and apparatus for statistical biometric model migration
In large-scale deployments of speaker recognition systems the potential for legacy problems increases as the evolving technology may require configuration changes in the system thus invalidating...
|
|
|
7496508 |
Method of determining database entries
The invention relates to a method of determining database entries of a database (9) by means of an automatic dialog system (1) in which the following steps are provided:
1.1 temporary...
|
|
|
7496503 |
Timing of speech recognition over lossy transmission systems
Recognizing a stream of speech received as speech vectors over a lossy communications link includes constructing for a speech recognizer a series of speech vectors from packets received over a...
|
|
|
7480616 |
Information recognition device and information recognition method
Information relating to an amount of muscle activity is extracted from a myo-electrical signal by activity amount information extraction means, and information recognition is performed by activity...
|
|
|
7480615 |
Method of speech recognition using multimodal variational inference with switching state space models
A method of efficiently setting posterior probability parameters for a switching state space model begins by defining a window containing at least two but fewer than all of the frames. A separate...
|
|
|
7472061 |
Systems and methods for building a native language phoneme lexicon having native pronunciations of non-native words derived from non-native pronunciations
Systems and methods are provided for automatically building a native phonetic lexicon for a speech-based application trained to process a native (base) language, wherein the native phonetic lexicon...
|
|
|
7467087 |
Training and using pronunciation guessers in speech recognition
The error rate of a pronunciation guesser that guesses the phonetic spelling of words used in speech recognition is improved by causing its training to weigh letter-to-phoneme mappings used as data...
|
|
|
7467086 |
Methodology for generating enhanced demiphone acoustic models for speech recognition
A system and method for effectively performing speech recognition procedures includes enhanced demiphone acoustic models that a speech recognition engine utilizes to perform the speech recognition...
|
|
|
7460995 |
System for speech recognition
This invention provides a system for speech recognition comparing speech against stored character strings in memory. Speech is transformed into spoken character strings. To accelerate the...
|
|
|
7457749 |
Noise-robust feature extraction using multi-layer principal component analysis
Extracting features from signals for use in classification, retrieval, or identification of data represented by those signals uses a “Distortion Discriminant Analysis” (DDA) of a set of...
|
|
|
7457745 |
Method and apparatus for fast on-line automatic speaker/environment adaptation for speech/speaker recognition in the presence of changing environments
A fast on-line automatic speaker/environment adaptation suitable for speech/speaker recognition system, method and computer program product are presented. The system comprises a computer system...
|
|
|
7454338 |
Training wideband acoustic models in the cepstral domain using mixed-bandwidth training data and extended vectors for speech recognition
A method and apparatus are provided that generate values for a first set of dimensions of a feature vector from a speech signal. The values of the first set of dimensions are used to estimate...
|
|
|
7454337 |
Method of modeling single data class from multi-class data
The present invention is a method of modeling a single class of data from data containing multiple classes of data of the same type of data by first receiving a collection of data that includes...
|
|
|
7454334 |
Method and apparatus for automatically identifying animal species from their vocalizations
Relatively powerful hand-held computing devices, Digital Signal Processors, Audio signal processing technology, voice recognition technology, expert systems, Hidden Markov Models, and/or neural...
|
|
|
7451081 |
System and method of performing speech recognition based on a user identifier
Speech recognition models are dynamically re-configurable based on user information, application information, background information such as background noise and transducer information such as...
|
|
|
7444282 |
Method of setting optimum-partitioned classified neural network and method and apparatus for automatic labeling using optimum-partitioned classified neural network
A method of automatic labeling using an optimum-partitioned classified neural network includes searching for neural networks having minimum errors with respect to a number of L phoneme combinations...
|
|
|
7440894 |
Method and system for creation of voice training profiles with multiple methods with uniform server mechanism using heterogeneous devices
A system and method for creating user voice profiles enables a user to create a single user voice profile that is compatible with one or more voice servers. Such a system includes a training server...
|
|
|
7430509 |
Lattice encoding
Initially an embedding module (22) determines an embedding of a lattice in a two-dimensional plane. The embedding module (22) then processes the initial embedding to generate a planar graph in...
|
|
|
7428491 |
Method and system for obtaining personal aliases through voice recognition
Methods and systems for recognizing a spoken alias are disclosed. The present invention includes generating a plurality of alias variations based on a discoverable name and creating a phonetic...
|
|
|
7418385 |
Voice detection device
This voice detection device is composed of a myoelectric signal acquisition part for acquiring, from a plurality of regions, myoelectric signals generated at the time of a vocalization operation, a...
|
|
|
7418383 |
Noise robust speech recognition with a switching linear dynamic model
A unified, nonlinear, non-stationary, stochastic model is disclosed for estimating and removing effects of background noise on speech cepstra. Generally stated, the model is a union of dynamic...
|