|
Match
|
Document |
Document Title |
|
|
7630894 |
Frame erasure concealment technique for a bitstream-based feature extractor
A frame erasure concealment technique for a bitstream-based feature extractor in a speech recognition system particularly suited for use in a wireless communication system operates to “delete”...
|
|
|
7627468 |
Apparatus and method for extracting syllabic nuclei
An apparatus enabling automatic determination of a portion that reliably represents a feature of a speech waveform includes: an acoustic/prosodic analysis unit calculating, from data, distribution...
|
|
|
7620263 |
Anti-clipping method for image sharpness enhancement
An image processing system provides image enhancement and anti-clipping units. The anti-clipping unit for image sharpness enhancement, operates such that any shoot artifacts in the enhanced image...
|
|
|
7617102 |
Speaker identifying apparatus and computer program product
A speaker identifying apparatus includes: a module for performing a principal component analysis on predetermined vocal tract geometrical parameters of a plurality of speakers and calculating an...
|
|
|
7603278 |
Segment set creating method and apparatus
A segment set before updating is read, and clustering considering a phoneme environment is performed to it. For each cluster obtained by the clustering, a representative segment of a segment set...
|
|
|
7603274 |
Method and apparatus for determining the possibility of pattern recognition of time series signal
A method and apparatus for determining the possibility of pattern recognition of time series signal independent of a pattern recognition ratio is provided. The method for determining the...
|
|
|
7593842 |
Device and method for translating language
A device and method for translating language is disclosed. In one embodiment, for example, a method for providing a translated output signal derived from a speech input signal, comprises receiving...
|
|
|
7590605 |
Lattice matching
A system is described for matching lattices such as phoneme lattices generated by an automatic speech recognition unit. The system can be used to retrieve files from a database by comparing a query...
|
|
|
7590537 |
Speaker clustering and adaptation method based on the HMM model variation information and its apparatus for speech recognition
A speech recognition method and apparatus perform speaker clustering and speaker adaptation using average model variation information over speakers while analyzing the quantity variation amount and...
|
|
|
7587318 |
Correlating video images of lip movements with audio signals to improve speech recognition
A speech recognition device can include an audio signal receiver configured to receive audio signals from a speech source, a video signal receiver configured to receive video signals from the...
|
|
|
7574357 |
Applications of sub-audible speech recognition based upon electromyographic signals
Method and system for generating electromyographic or sub-audible signals (“SAWPs”) and for transmitting and recognizing the SAWPs that represent the original words and/or phrases. The SAWPs...
|
|
|
7571098 |
System and method of spoken language understanding using word confusion networks
Word lattices that are generated by an automatic speech recognition system are used to generate a modified word lattice that is usable by a spoken language understanding module. In one embodiment,...
|
|
|
7567903 |
Low latency real-time vocal tract length normalization
A method and apparatus for performing speech recognition are provided. A Vocal Tract Length Normalized acoustic model for a speaker is generated from training data. Speech recognition is performed...
|
|
|
7565213 |
Device and method for analyzing an information signal
A significant short-time spectrum is extracted from an information signal, the means for extracting being configured to extract such short-time spectra which come closer to a specific...
|
|
|
7562014 |
Active learning process for spoken dialog systems
A large amount of human labor is required to transcribe and annotate a training corpus that is needed to create and update models for automatic speech recognition (ASR) and spoken language...
|
|
|
7546236 |
Anomaly recognition method for data streams
This invention identifies anomalies in a data stream, without prior training, by measuring the difficulty in finding similarities between neighborhoods in the ordered sequence of elements. Data...
|
|
|
7533015 |
Signal enhancement via noise reduction for speech recognition
Provides speech enhancement techniques for extemporaneous noise without a noise interval and unknown extemporaneous noise. Signal enhancement includes: subtracting a given reference signal from an...
|
|
|
7529668 |
System and method for implementing a refined dictionary for speech recognition
A system and method for implementing a refined dictionary for speech recognition includes a database analyzer that initially identifies first vocabulary words that are present in a training...
|
|
|
7529665 |
Two stage utterance verification device and method thereof in speech recognition system
A two stage utterance verification device and a method thereof are provided. The two stage utterance verification method includes performing a first utterance verification function based on a SVM...
|
|
|
7529666 |
Minimum bayes error feature selection in speech recognition
In connection with speech recognition, the design of a linear transformation θε p×n , of rank p×n, which projects the features of a classifier xε n onto y=θxε p such as to achieve minimum...
|
|
|
7509256 |
Feature extraction apparatus and method and pattern recognition apparatus and method
It is intended to increase the recognition rate in speech recognition and image recognition. An observation vector as input data, which represents a certain point in the observation vector space,...
|
|
|
7505897 |
Generalized Lempel-Ziv compression for multimedia signals
The subject matter includes systems, engines, and methods for generalizing a class of Lempel-Ziv algorithms for lossy compression of multimedia. One implementation of the subject matter compresses...
|
|
|
7493257 |
Method and apparatus handling speech recognition errors in spoken dialogue systems
To handle portions of a recognized sentence having an error, a user is questioned about contents associated with portions. According to a user's answer, a result is obtained. Speech recognition...
|
|
|
7480615 |
Method of speech recognition using multimodal variational inference with switching state space models
A method of efficiently setting posterior probability parameters for a switching state space model begins by defining a window containing at least two but fewer than all of the frames. A separate...
|
|
|
7478045 |
Method and device for characterizing a signal and method and device for producing an indexed signal
In a method for characterizing a signal representing an audio content a measure is determined for a tonality of the signal, whereupon a statement is made about the audio content of the signal on...
|
|
|
7475012 |
Signal detection using maximum a posteriori likelihood and noise spectral difference
Robust signal detection against various types of background noise is implemented. According to a signal detection apparatus, the feature amount of an input signal sequence and the feature amount of...
|
|
|
7472063 |
Audio-visual feature fusion and support vector machine useful for continuous speech recognition
A speech recognition method includes several embodiments describing application of support vector machine analysis to a mouth region. Lip position can be accurately determined and used in...
|
|
|
7464031 |
Speech recognition utilizing multitude of speech features
In a speech recognition system, the combination of a log-linear model with a multitude of speech features is provided to recognize unknown speech utterances. The speech recognition system models...
|
|
|
7418383 |
Noise robust speech recognition with a switching linear dynamic model
A unified, nonlinear, non-stationary, stochastic model is disclosed for estimating and removing effects of background noise on speech cepstra. Generally stated, the model is a union of dynamic...
|
|
|
7418382 |
Structure skeletons for efficient voice navigation through generic hierarchical objects
A system and method for providing fast and efficient conversation navigation via a hierarchical structure (structure skeleton) which fully describes functions and services supported by a dialog...
|
|
|
7412383 |
Reducing time for annotating speech data to develop a dialog application
Systems and methods for annotating speech data. The present invention reduces the time required to annotate speech data by selecting utterances for annotation that will be of greatest benefit. A...
|
|
|
7392184 |
Arrangement of speaker-independent speech recognition
A method needed in speech recognition for forming a pronunciation model in a telecommunications system comprising at least one portable electronic device and server. The electronic device is...
|
|
|
7389228 |
Speaker adaptation of vocabulary for speech recognition
A phonetic vocabulary for a speech recognition system is adapted to a particular speaker's pronunciation. A speaker can be attributed specific pronunciation styles, which can be identified from...
|
|
|
7379867 |
Discriminative training of language models for text and speech classification
Methods are disclosed for estimating language models such that the conditional likelihood of a class given a word string, which is very well correlated with classification accuracy, is maximized....
|
|
|
7376553 |
Fractal harmonic overtone mapping of speech and musical sounds
An apparatus for signal processing based on an algorithm for representing harmonics in a fractal lattice. The apparatus includes a plurality of tuned segments, each tuned segment including a...
|
|
|
7376562 |
Method and apparatus for nonlinear frequency analysis of structured signals
The present invention relates to systems and methods for processing acoustic signals, such as music and speech. The method involves nonlinear frequency analysis of an incoming acoustic signal. In...
|
|
|
7369991 |
Speech recognition system, speech recognition method, speech synthesis system, speech synthesis method, and program product having increased accuracy
The object of the present invention is to keep a high success rate in recognition with a low-volume of sound signal, without being affected by noise. The speech recognition system comprises a...
|
|
|
7366666 |
Relative delta computations for determining the meaning of language inputs
A method for processing language input can include the step of determining at least two possible meanings for a language input. For each possible meaning, a probability that the possible meaning is...
|
|
|
7363200 |
Apparatus and method for isolating noise effects in a signal
A matrix includes samples associated with a first signal and samples associated with a second signal. The second signal includes a first portion associated with the first signal and a second...
|
|
|
7349844 |
Minimizing resource consumption for speech recognition processing with dual access buffering
In a processor system for audio processing, such as voice recognition and text-to-speech, a dedicated front-end processor, a core processor and a dedicated back-end processor are provided which are...
|
|
|
7337114 |
Speech recognition using discriminant features
Methods and arrangements for representing the speech waveform in terms of a set of abstract, linguistic distinctions in order to derive a set of discriminative features for use in a speech...
|
|
|
7337107 |
Perceptual harmonic cepstral coefficients as the front-end for speech recognition
Pitch estimation and classification into voiced, unvoiced and transitional speech were performed by a spectro-temporal auto-correlation technique. A peak picking formula was then employed. A...
|
|
|
7318029 |
Method and apparatus for a interactive voice response system
There is disclosed an interactive voice response system for prompting a user with feedback during speech recognition. A user who speaks too slowly or too quickly may speak even more slowly or...
|
|
|
7302388 |
Method and apparatus for detecting voice activity
Method and apparatus detect voice activity for spectrum or power efficiency purposes. The method determines and tracks the instant, minimum and maximum power levels of the input signal. The method...
|
|
|
7295977 |
Extracting classifying data in music from an audio bitstream
The method of the present invention utilizes machine-learning techniques, particularly Support Vector Machines in combination with a neural network, to process a unique machine-learning enabled...
|
|
|
7292981 |
Signal variation feature based confidence measure
A method for predicting a misrecognition in a speech recognition system, is based on; the insight that variations in a speech input signal are different depending on the origin of the signal being...
|
|
|
7292977 |
Systems and methods for providing online fast speaker adaptation in speech recognition
A system ( 230 ) performs speaker adaptation when performing speech recognition. The system ( 230 ) receives an audio segment and identifies the audio segment as a first audio segment or a...
|
|
|
7280963 |
Method for learning linguistically valid word pronunciations from acoustic data
A computerized method is provided for generating pronunciations for words and storing the pronunciations in a pronunciation dictionary. The method includes graphing sets of initial pronunciations;...
|
|
|
7277852 |
Method, system and storage medium for commercial and musical composition recognition and storage
A playlist generating method for generating a playlist of content from received broadcasted data is provided. The playlist generating method includes the steps of: extracting features of broadcast...
|
|
|
7272559 |
Noninvasive detection of neuro diseases
Noninvasive, remote methods and apparatus for detecting early phases of neuro diseases such as the non-tremor phase of Parkinson's disease, dyskinesia, dyslexia and neuroatrophy, etc., are...
|