Match
|
Document |
Document Title |
|
US20070005363 |
Location aware multi-modal multi-lingual device
Location-based technologies (e.g., global position system (GPS)) can be employed to facilitate providing multi-modal, multi-lingual location-based services. Identification of location can provide... |
|
US20060122834 |
Emotion detection device & method for use in distributed systems
A prosody analyzer enhances the interpretation of natural language utterances. The analyzer is distributed over a client/server architecture, so that the scope of emotion recognition processing... |
|
US20140337031 |
METHOD AND APPARATUS FOR DETECTING A TARGET KEYWORD
A method of detecting a target keyword for activating a function in an electronic device is disclosed. The method includes receiving an input sound starting from one of the plurality of portions... |
|
US20050209855 |
Speech signal processing apparatus and method, and storage medium
A speech segment search unit searches a speech database for speech segments that satisfy a phonetic environment, and a HMM learning unit computes the HMMs of phonemes on the basis of the search... |
|
US20080288255 |
System and method for quantifying, representing, and identifying similarities in data streams
A method of quantifying similarities between sequential data streams typically includes providing a pair of sequential data streams, designing a Hidden Markov Model (HMM) of at least a portion of... |
|
US20060058999 |
Voice model adaptation
Voice recognition tutoring software to assist in reading development includes method and system for generating a custom voice model. |
|
US20140365221 |
METHOD AND APPARATUS FOR SPEECH RECOGNITION
A computer-implemented method performed by a computerized device, a computerized apparatus and a computer program product for recognizing speech, the method comprising: receiving a signal;... |
|
US20100070274 |
APPARATUS AND METHOD FOR SPEECH RECOGNITION BASED ON SOUND SOURCE SEPARATION AND SOUND SOURCE IDENTIFICATION
An apparatus for a speech recognition based on source separation and identification includes: a sound source separator for separating mixed signals, which are input to two or more microphones,... |
|
US20080147404 |
System and methods for accent classification and adaptation
Speech is processed that may be colored by speech accent. A method for recognizing speech includes maintaining a model of speech accent that is established based on training speech data, wherein... |
|
US20060129399 |
Speech conversion system and method
The conversion of speech can be used to transform an utterance by a source speaker to match the speech characteristic of a target speaker. During a training phase, utterances corresponding to the... |
|
US20050165607 |
System and method to disambiguate and clarify user intention in a spoken dialog system
A system and method are disclosed for controlling the flow of a dialog within a spoken dialog service dialog management module. The method uses a dialog disambiguation rooted tree, the rooted tree... |
|
US20070033044 |
System and method for creating generalized tied-mixture hidden Markov models for automatic speech recognition
A system for, and method of, creating generalized tied-mixture hidden Markov models (HMMs) for noisy automatic speech recognition. In one embodiment, the system includes: (1) an HMM estimator and... |
|
US20100169094 |
SPEAKER ADAPTATION APPARATUS AND PROGRAM THEREOF
A speaker adaptation apparatus includes an acquiring unit configured to acquire an acoustic model including HMMs and decision trees for estimating what type of the phoneme or the word is included... |
|
US20050228668 |
System and method for automatic generation of dialog run time systems
A system and method for automatically generating a spoken dialog application is disclosed. In one embodiment, a graphical representation of a call flow is converted into a context free grammar... |
|
US20120130716 |
SPEECH RECOGNITION METHOD FOR ROBOT
A speech recognition method for a robot. The speech recognition method for the robot includes one fundamental acoustic model. Whenever the noisy environment and the speaker are changed, the speech... |
|
US20170116994 |
VOICE-AWAKING METHOD, ELECTRONIC DEVICE AND STORAGE MEDIUM
The disclosure are a voice-awaking method, electronic device and storage medium, and the method includes: extracting a voice feature from obtained current input voice; determining whether the... |
|
US20060100872 |
PATTERN RECOGNITION APPARATUS AND PATTERN RECOGNITION METHOD
A feature amount extracting unit extracts feature amount vectors of an inputted image. A probability distribution calculator calculates a probability distribution of feature amount vectors in a... |
|
US20070083373 |
Discriminative training of HMM models using maximum margin estimation for speech recognition
An improved discriminative training method is provided for hidden Markov models. The method includes: defining a measure of separation margin for the data; identifying a subset of training... |
|
US20070061144 |
Batch statistics process model method and system
A method is provided for process modeling. The method may include obtaining batch statistics data records associated with one or more input variables and one or more output parameters and... |
|
US20070038451 |
Voice recognition for large dynamic vocabularies
A voice recognition method includes: representing a vocabulary translated into a Markov network; decoding by means of a Viterbi algorithm; and pruning explored solutions. The vocabulary is... |
|
US20080162129 |
METHOD AND APPARATUS PERTAINING TO THE PROCESSING OF SAMPLED AUDIO CONTENT USING A MULTI-RESOLUTION SPEECH RECOGNITION SEARCH PROCESS
One provides (101) a plurality of frames of sampled audio content and then processes (102) that plurality of frames using a speech recognition search process that comprises, at least in part,... |
|
US20070088552 |
Method and a device for speech recognition
Method for speech recognition comprising inputting frames comprising samples of an audio signal; forming a feature vector comprising a first number of vector components for each frame; projecting... |
|
US20050049873 |
Dynamic ranges for viterbi calculations
A method of recognizing speech includes determining active ranges of states to be processed for each frame and performing recognition operations for each frame only on states within the active ranges. |
|
US20070129946 |
High quality speech reconstruction for a dialog method and system
An electronic device (400) for speech dialog includes functions that receive (405, 205) a speech phrase that includes an instantiated variable (315), generate pitch and voicing characteristics... |
|
US20060100874 |
Method for inducing a Hidden Markov Model with a similarity metric
A method for inducing a Hidden Markov Model (HMM) is provided. The method using a plurality of training observations and a distance function includes assigning at least one representative... |
|
US20050154589 |
Acoustic model creating method, acoustic model creating apparatus, acoustic model creating program, and speech recognition apparatus
Exemplary embodiments of the present invention enhance the recognition ability by optimizing state numbers of respective HMM's. Exemplary embodiments provide a description length computing unit to... |
|
US20050131694 |
Acoustic model creating method, acoustic model creating apparatus, acoustic model creating program, and speech recognition apparatus
Exemplary embodiments of the invention enhance the recognition ability by optimizing the distribution numbers for respective states that constitute an HMM (for example, a syllable HMM). Exemplary... |
|
US20120330664 |
METHOD AND APPARATUS FOR COMPUTING GAUSSIAN LIKELIHOODS
The present invention relates to a method and apparatus for computing Gaussian likelihoods. One embodiment of a method for processing a speech sample includes generating a feature vector for each... |
|
US20050027530 |
Audio-visual speaker identification using coupled hidden markov models
A phoneme and a viseme of a person may be modeled using a coupled hidden Markov model. The coupled hidden Markov model and a second model may be compared to identify the person. |
|
US20060031071 |
System and method for automatically implementing a finite state automaton for speech recognition
A system and method for automatically implementing a finite state automaton for speech recognition includes a finite state automaton generator that analyzes one or more input text sequences and... |
|
US20060136210 |
System and method for tying variance vectors for speech recognition
A system and method for implementing a speech recognition engine includes acoustic models that the speech recognition engine utilizes to perform speech recognition procedures. An acoustic model... |
|
US20050021337 |
HMM modification method
A HMM modification method for preventing an overfitting problem, reducing the number of parameters and avoiding gradient calculation by implementing a weighted loss function for misclassification... |
|
US20110071835 |
SMALL FOOTPRINT TEXT-TO-SPEECH ENGINE
Embodiments of small footprint text-to-speech engine are disclosed. In operation, the small footprint text-to-speech engine generates a set of feature parameters for an input text. The set of... |
|
US20080162128 |
METHOD AND APPARATUS PERTAINING TO THE PROCESSING OF SAMPLED AUDIO CONTENT USING A FAST SPEECH RECOGNITION SEARCH PROCESS
One provides (101) a plurality of frames of sampled audio content and then processes (102) that plurality of frames using a speech recognition search process that comprises, at least in part,... |
|
US20050216266 |
Incremental adjustment of state-dependent bias parameters for adaptive speech recognition
The mismatch between the distributions of acoustic models and features in speech recognition may cause performance degradation. A sequential bias adaptation (SBA) applies state or class dependent... |
|
US20100312561 |
Information Processing Apparatus, Information Processing Method, and Computer Program
An apparatus and a method for performing a grounding process using the POMDP are provided. The configuration is designed so that, in order to understand a request from a user through the... |
|
US20050021338 |
Recognition device and system
A recognition system that operates in an iterative process on detected features progressively setting a detection window for enhancing the recognition process. The process allows integration over... |
|
US20050033576 |
Task specific code generation for speech recognition decoding
A code generation program is provided that reads in the task-specific parameters of a speech recognition system and produces a source-language decoder program that is specialized to these... |
|
US20050256714 |
Sequential variance adaptation for reducing signal mismatching
The mismatch between the distributions of acoustic models and features in speech recognition may cause performance degradation. A sequential variance adaptation (SVA) adapts the covariances... |
|
US20060053013 |
Selection of a user language on purely acoustically controlled telephone
The user language of a device can be set to a user language by speaking the designation of the user language to be set. |
|
US20060155540 |
Method for data training
In a method for data training, training is performed using multiple entries of data in a database of web pages, libraries, patent documents, etc., in combination with data selected or labeled by a... |