|
Match
|
Document |
Document Title |
|
|
7610204 |
Selective enablement of speech recognition grammars
A method for processing speech audio in a network connected client device can include selecting a speech grammar for use in a speech recognition system in the network connected client device;...
|
|
|
7599837 |
Creating a speech recognition grammar for alphanumeric concepts
A method and system to generate a grammar adapted for use by a speech recognizer includes receiving a representation of an alphanumeric expression. For instance, the representation can take the...
|
|
|
7593845 |
Method and apparatus for identifying semantic structures from text
A method and apparatus for identifying a semantic structure from an input text forms at least two candidate semantic structures. A semantic score is determined for each candidate semantic structure...
|
|
|
7584102 |
Language model for use in speech recognition
Building a language model for use in speech recognition includes identifying without user interaction a source of text related to a user. Text is retrieved from the identified source of text and a...
|
|
|
7574358 |
Natural language system and method based on unisolated performance metric
A natural language business system and method is developed to understand the underlying meaning of a person's speech, such as during a transaction with the business system. The system includes a...
|
|
|
7571098 |
System and method of spoken language understanding using word confusion networks
Word lattices that are generated by an automatic speech recognition system are used to generate a modified word lattice that is usable by a spoken language understanding module. In one embodiment,...
|
|
|
7571096 |
Speech recognition using a state-and-transition based binary speech grammar with a last transition value
A computer-loadable data structure is provided that represents a state-and-transition-based description of a speech grammar. The data structure includes first and second transition entries that...
|
|
|
7555431 |
Method for processing speech using dynamic grammars
Speech data is processed with one or more dynamic grammars, to reduce latency and improve accuracy. Different speech grammars are used by a speech recognition process depending on a context...
|
|
|
7552055 |
Dialog component re-use in recognition systems
Controls are provided for a web server to generate client side markups that include recognition and/or audible prompting. The controls comprise elements of a dialog such as a question, answer,...
|
|
|
7552051 |
Method and apparatus for mapping multiword expressions to identifiers using finite-state networks
Multiword expressions are mapped to identifiers using finite-state networks. Each of a plurality of multiword expressions is encoded into a regular expression. Each regular expression encodes a...
|
|
|
7539616 |
Speaker authentication using adapted background models
Speaker authentication is performed by determining a similarity score for a test utterance and a stored training utterance. Computing the similarity score involves determining the sum of a group of...
|
|
|
7533019 |
System and method for unsupervised and active learning for automatic speech recognition
A system and method is provided for combining active and unsupervised learning for automatic speech recognition. This process enables a reduction in the amount of human supervision required for...
|
|
|
7533018 |
Tailored speaker-independent voice recognition system
A tailored speaker-independent voice recognition system has a speech recognition dictionary ( 360 ) with at least one word ( 371 ). That word ( 371 ) has at least two transcriptions ( 373 ), each...
|
|
|
7529666 |
Minimum bayes error feature selection in speech recognition
In connection with speech recognition, the design of a linear transformation θε p×n , of rank p×n, which projects the features of a classifier xε n onto y=θxε p such as to achieve minimum...
|
|
|
7529657 |
Configurable parameters for grammar authoring for speech recognition and natural language understanding
A method for authoring a grammar for use in a language processing application is provided. The method includes receiving at least one grammar configuration parameter relating to how to configure a...
|
|
|
7523034 |
Adaptation of Compound Gaussian Mixture models
Methods and arrangements for enhancing speech recognition in noisy environments, via providing at least one initial Compound Gaussian Mixture model, applying an adaptation algorithm to at least one...
|
|
|
7519534 |
Speech controlled access to content on a presentation medium
One embodiment of the invention provides television viewers with an intuitive and easy-to-use way to find the programs they want and to control their television viewing experience. In a further...
|
|
|
7519531 |
Speaker adaptive learning of resonance targets in a hidden trajectory model of speech coarticulation
A computer-implemented method is provided for training a hidden trajectory model, of a speech recognition system, which generates Vocal Tract Resonance (VTR) targets. The method includes obtaining...
|
|
|
7509259 |
Method of refining statistical pattern recognition models and statistical pattern recognizers
A device ( 800 ) performs statistical pattern recognition using model parameters that are refined by optimizing an objective function that includes a term for many items of training data for which...
|
|
|
7509258 |
Phonetic, syntactic and conceptual analysis driven speech recognition system and method
A new approach to speech recognition that reacts to concepts conveyed through speech, which shifts the balance of power in speech recognition from straight sound recognition and statistical models...
|
|
|
7505903 |
Speech recognition dictionary creation method and speech recognition dictionary creating device
A speech recognition dictionary creation method is provided for creating a speech recognition dictionary that is used for creating document data such as electronic mails through voice input in an...
|
|
|
7499858 |
Methods of information retrieval
A method of information retrieval using a hybrid interface is provided. The method includes providing a graphical user interface having a plurality of views with a list of potential targets within...
|
|
|
7499857 |
Adaptation of compressed acoustic models
The present invention is used to adapt acoustic models, quantized in subspaces, using adaptation training data (such as speaker-dependent training data). The acoustic model is compressed into...
|
|
|
7496508 |
Method of determining database entries
The invention relates to a method of determining database entries of a database ( 9 ) by means of an automatic dialog system ( 1 ) in which the following steps are provided:
1.1 temporary...
|
|
|
7487092 |
Interactive debugging and tuning method for CTTS voice building
A speech recognition device which can preferably be used for reducing the memory capacity required for speaker-independent speech recognition is provided. A matching unit loads speech models...
|
|
|
7487091 |
Speech recognition device for recognizing a word sequence using a switching speech model network
A speech recognition device which can preferably be used for reducing the memory capacity required for speaker-independent speech recognition is provided. A matching unit loads speech models...
|
|
|
7487087 |
Method of speech recognition using variational inference with switching state space models
A method is developed which includes 1) defining a switching state space model for a continuous valued hidden production-related parameter and the observed speech acoustics, and 2) approximating a...
|
|
|
7486807 |
Image retrieving device, method for adding keywords in image retrieving device, and computer program therefor
An image retrieving device for classifying and retrieving an image by detecting an object in the image and adding a keyword comprises an image storing section for storing the image which is...
|
|
|
7480615 |
Method of speech recognition using multimodal variational inference with switching state space models
A method of efficiently setting posterior probability parameters for a switching state space model begins by defining a window containing at least two but fewer than all of the frames. A separate...
|
|
|
7478038 |
Language model adaptation using semantic supervision
A method and apparatus are provided for adapting a language model. The method and apparatus provide supervised class-based adaptation of the language model utilizing in-domain semantic information.
|
|
|
7475016 |
Speech segment clustering and ranking
A system, method, and apparatus for identifying problematic speech segments is provided. The system includes a clustering module for generating a first cluster of one or more consecutive speech...
|
|
|
7475015 |
Semantic language modeling and confidence measurement
A system and method for speech recognition includes generating a set of likely hypotheses in recognizing speech, rescoring the likely hypotheses by using semantic content by employing semantic...
|
|
|
7472062 |
Efficient recursive clustering based on a splitting function derived from successive eigen-decompositions
Methods and arrangements for facilitating data clustering. From a set of input data, a predetermined number of non-overlapping subsets are created. The input data is split recursively to create the...
|
|
|
7472061 |
Systems and methods for building a native language phoneme lexicon having native pronunciations of non-native words derived from non-native pronunciations
Systems and methods are provided for automatically building a native phonetic lexicon for a speech-based application trained to process a native (base) language, wherein the native phonetic lexicon...
|
|
|
7467086 |
Methodology for generating enhanced demiphone acoustic models for speech recognition
A system and method for effectively performing speech recognition procedures includes enhanced demiphone acoustic models that a speech recognition engine utilizes to perform the speech recognition...
|
|
|
7454344 |
Language model architecture
An architectural design is disclosed wherein a single reusable language model component is shared by multiple applications. The language model component is loaded once for a plurality of...
|
|
|
7454341 |
Method, apparatus, and system for building a compact model for large vocabulary continuous speech recognition (LVCSR) system
According to one aspect of the invention, a method is provided in which a mean vector set and a variance vector set of a set of N Gaussians are divided into multiple mean sub-vector sets and...
|
|
|
7454336 |
Variational inference and learning for segmental switching state space models of hidden speech dynamics
A system and method that facilitate modeling unobserved speech dynamics based upon a hidden dynamic speech model in the form of segmental switching state space model that employs model parameters...
|
|
|
7451125 |
System and method for compiling rules created by machine learning program
A system, a method, and a machine-readable medium are provided. A group of linear rules and associated weights are provided as a result of machine learning. Each one of the group of linear rules is...
|
|
|
7437290 |
Automatic censorship of audio data for broadcast
An input audio data stream comprising speech is processed by an automatic censoring filter in either a real-time mode, or a batch mode, producing censored speech that has been altered so that...
|
|
|
7430509 |
Lattice encoding
Initially an embedding module ( 22 ) determines an embedding of a lattice in a two-dimensional plane. The embedding module ( 22 ) then processes the initial embedding to generate a planar graph in...
|
|
|
7424429 |
Information processing apparatus, information processing method, program, and storage medium
The correspondence between input fields and grammars is obtained (S 102 ), and a speech utterance example is displayed using a grammar corresponding to a portion (field) designated by an input...
|
|
|
7418387 |
Generic spelling mnemonics
A system and method for creating a mnemonics Language Model for use with a speech recognition software application, wherein the method includes generating an n-gram Language Model containing a...
|
|
|
7415409 |
Method to train the language model of a speech recognition system to convert and index voicemails on a search engine
A method and a related system to index voicemail documents by training a language model for a speaker or group of speakers by using existing emails and contact information on available repositories.
|
|
|
7406416 |
Representation of a deleted interpolation N-gram language model in ARPA standard format
A method and apparatus are provided for storing parameters of a deleted interpolation language model as parameters of a backoff language model. In particular, the parameters of the deleted...
|
|
|
7403896 |
Speech recognition system and program thereof
Speech recognition is performed by matching between a characteristic quantity of an inputted speech and a composite HMM obtained by synthesizing a speech HMM (hidden Markov model) and a noise HMM...
|
|
|
7401019 |
Phonetic fragment search in speech data
A method of searching audio data is provided including receiving a query defining multiple phonetic possibilities. The method also includes comparing the query with a lattice of phonetic hypotheses...
|
|
|
7398210 |
System and method for performing analysis on word variants
A computer-readable medium stores a first lexicon data structure for lexicon words. The first data structure includes a host form variant field containing a host form variant such as a clitic host...
|
|
|
7392185 |
Speech based learning/training system using semantic decoding
An intelligent query system for processing voiced-based queries is disclosed, which uses a combination of both statistical and semantic based processing to identify the question posed by the user...
|
|
|
7392184 |
Arrangement of speaker-independent speech recognition
A method needed in speech recognition for forming a pronunciation model in a telecommunications system comprising at least one portable electronic device and server. The electronic device is...
|