Matches 1 - 50 out of 433 1 2 3 4 5 6 7 8 9 >
Match Document Document Title
7610204 Selective enablement of speech recognition grammars  
A method for processing speech audio in a network connected client device can include selecting a speech grammar for use in a speech recognition system in the network connected client device;...
7599837 Creating a speech recognition grammar for alphanumeric concepts  
A method and system to generate a grammar adapted for use by a speech recognizer includes receiving a representation of an alphanumeric expression. For instance, the representation can take the...
7593845 Method and apparatus for identifying semantic structures from text  
A method and apparatus for identifying a semantic structure from an input text forms at least two candidate semantic structures. A semantic score is determined for each candidate semantic structure...
7584102 Language model for use in speech recognition  
Building a language model for use in speech recognition includes identifying without user interaction a source of text related to a user. Text is retrieved from the identified source of text and a...
7574358 Natural language system and method based on unisolated performance metric  
A natural language business system and method is developed to understand the underlying meaning of a person's speech, such as during a transaction with the business system. The system includes a...
7571098 System and method of spoken language understanding using word confusion networks  
Word lattices that are generated by an automatic speech recognition system are used to generate a modified word lattice that is usable by a spoken language understanding module. In one embodiment,...
7571096 Speech recognition using a state-and-transition based binary speech grammar with a last transition value  
A computer-loadable data structure is provided that represents a state-and-transition-based description of a speech grammar. The data structure includes first and second transition entries that...
7555431 Method for processing speech using dynamic grammars  
Speech data is processed with one or more dynamic grammars, to reduce latency and improve accuracy. Different speech grammars are used by a speech recognition process depending on a context...
7552055 Dialog component re-use in recognition systems  
Controls are provided for a web server to generate client side markups that include recognition and/or audible prompting. The controls comprise elements of a dialog such as a question, answer,...
7552051 Method and apparatus for mapping multiword expressions to identifiers using finite-state networks  
Multiword expressions are mapped to identifiers using finite-state networks. Each of a plurality of multiword expressions is encoded into a regular expression. Each regular expression encodes a...
7539616 Speaker authentication using adapted background models  
Speaker authentication is performed by determining a similarity score for a test utterance and a stored training utterance. Computing the similarity score involves determining the sum of a group of...
7533019 System and method for unsupervised and active learning for automatic speech recognition  
A system and method is provided for combining active and unsupervised learning for automatic speech recognition. This process enables a reduction in the amount of human supervision required for...
7533018 Tailored speaker-independent voice recognition system  
A tailored speaker-independent voice recognition system has a speech recognition dictionary ( 360 ) with at least one word ( 371 ). That word ( 371 ) has at least two transcriptions ( 373 ), each...
7529666 Minimum bayes error feature selection in speech recognition  
In connection with speech recognition, the design of a linear transformation θε p×n , of rank p×n, which projects the features of a classifier xε n onto y=θxε p such as to achieve minimum...
7529657 Configurable parameters for grammar authoring for speech recognition and natural language understanding  
A method for authoring a grammar for use in a language processing application is provided. The method includes receiving at least one grammar configuration parameter relating to how to configure a...
7523034 Adaptation of Compound Gaussian Mixture models  
Methods and arrangements for enhancing speech recognition in noisy environments, via providing at least one initial Compound Gaussian Mixture model, applying an adaptation algorithm to at least one...
7519534 Speech controlled access to content on a presentation medium  
One embodiment of the invention provides television viewers with an intuitive and easy-to-use way to find the programs they want and to control their television viewing experience. In a further...
7519531 Speaker adaptive learning of resonance targets in a hidden trajectory model of speech coarticulation  
A computer-implemented method is provided for training a hidden trajectory model, of a speech recognition system, which generates Vocal Tract Resonance (VTR) targets. The method includes obtaining...
7509259 Method of refining statistical pattern recognition models and statistical pattern recognizers  
A device ( 800 ) performs statistical pattern recognition using model parameters that are refined by optimizing an objective function that includes a term for many items of training data for which...
7509258 Phonetic, syntactic and conceptual analysis driven speech recognition system and method  
A new approach to speech recognition that reacts to concepts conveyed through speech, which shifts the balance of power in speech recognition from straight sound recognition and statistical models...
7505903 Speech recognition dictionary creation method and speech recognition dictionary creating device  
A speech recognition dictionary creation method is provided for creating a speech recognition dictionary that is used for creating document data such as electronic mails through voice input in an...
7499858 Methods of information retrieval  
A method of information retrieval using a hybrid interface is provided. The method includes providing a graphical user interface having a plurality of views with a list of potential targets within...
7499857 Adaptation of compressed acoustic models  
The present invention is used to adapt acoustic models, quantized in subspaces, using adaptation training data (such as speaker-dependent training data). The acoustic model is compressed into...
7496508 Method of determining database entries  
The invention relates to a method of determining database entries of a database ( 9 ) by means of an automatic dialog system ( 1 ) in which the following steps are provided: 1.1 temporary...
7487092 Interactive debugging and tuning method for CTTS voice building  
A speech recognition device which can preferably be used for reducing the memory capacity required for speaker-independent speech recognition is provided. A matching unit loads speech models...
7487091 Speech recognition device for recognizing a word sequence using a switching speech model network  
A speech recognition device which can preferably be used for reducing the memory capacity required for speaker-independent speech recognition is provided. A matching unit loads speech models...
7487087 Method of speech recognition using variational inference with switching state space models  
A method is developed which includes 1) defining a switching state space model for a continuous valued hidden production-related parameter and the observed speech acoustics, and 2) approximating a...
7486807 Image retrieving device, method for adding keywords in image retrieving device, and computer program therefor  
An image retrieving device for classifying and retrieving an image by detecting an object in the image and adding a keyword comprises an image storing section for storing the image which is...
7480615 Method of speech recognition using multimodal variational inference with switching state space models  
A method of efficiently setting posterior probability parameters for a switching state space model begins by defining a window containing at least two but fewer than all of the frames. A separate...
7478038 Language model adaptation using semantic supervision  
A method and apparatus are provided for adapting a language model. The method and apparatus provide supervised class-based adaptation of the language model utilizing in-domain semantic information.
7475016 Speech segment clustering and ranking  
A system, method, and apparatus for identifying problematic speech segments is provided. The system includes a clustering module for generating a first cluster of one or more consecutive speech...
7475015 Semantic language modeling and confidence measurement  
A system and method for speech recognition includes generating a set of likely hypotheses in recognizing speech, rescoring the likely hypotheses by using semantic content by employing semantic...
7472062 Efficient recursive clustering based on a splitting function derived from successive eigen-decompositions  
Methods and arrangements for facilitating data clustering. From a set of input data, a predetermined number of non-overlapping subsets are created. The input data is split recursively to create the...
7472061 Systems and methods for building a native language phoneme lexicon having native pronunciations of non-native words derived from non-native pronunciations  
Systems and methods are provided for automatically building a native phonetic lexicon for a speech-based application trained to process a native (base) language, wherein the native phonetic lexicon...
7467086 Methodology for generating enhanced demiphone acoustic models for speech recognition  
A system and method for effectively performing speech recognition procedures includes enhanced demiphone acoustic models that a speech recognition engine utilizes to perform the speech recognition...
7454344 Language model architecture  
An architectural design is disclosed wherein a single reusable language model component is shared by multiple applications. The language model component is loaded once for a plurality of...
7454341 Method, apparatus, and system for building a compact model for large vocabulary continuous speech recognition (LVCSR) system  
According to one aspect of the invention, a method is provided in which a mean vector set and a variance vector set of a set of N Gaussians are divided into multiple mean sub-vector sets and...
7454336 Variational inference and learning for segmental switching state space models of hidden speech dynamics  
A system and method that facilitate modeling unobserved speech dynamics based upon a hidden dynamic speech model in the form of segmental switching state space model that employs model parameters...
7451125 System and method for compiling rules created by machine learning program  
A system, a method, and a machine-readable medium are provided. A group of linear rules and associated weights are provided as a result of machine learning. Each one of the group of linear rules is...
7437290 Automatic censorship of audio data for broadcast  
An input audio data stream comprising speech is processed by an automatic censoring filter in either a real-time mode, or a batch mode, producing censored speech that has been altered so that...
7430509 Lattice encoding  
Initially an embedding module ( 22 ) determines an embedding of a lattice in a two-dimensional plane. The embedding module ( 22 ) then processes the initial embedding to generate a planar graph in...
7424429 Information processing apparatus, information processing method, program, and storage medium  
The correspondence between input fields and grammars is obtained (S 102 ), and a speech utterance example is displayed using a grammar corresponding to a portion (field) designated by an input...
7418387 Generic spelling mnemonics  
A system and method for creating a mnemonics Language Model for use with a speech recognition software application, wherein the method includes generating an n-gram Language Model containing a...
7415409 Method to train the language model of a speech recognition system to convert and index voicemails on a search engine  
A method and a related system to index voicemail documents by training a language model for a speaker or group of speakers by using existing emails and contact information on available repositories.
7406416 Representation of a deleted interpolation N-gram language model in ARPA standard format  
A method and apparatus are provided for storing parameters of a deleted interpolation language model as parameters of a backoff language model. In particular, the parameters of the deleted...
7403896 Speech recognition system and program thereof  
Speech recognition is performed by matching between a characteristic quantity of an inputted speech and a composite HMM obtained by synthesizing a speech HMM (hidden Markov model) and a noise HMM...
7401019 Phonetic fragment search in speech data  
A method of searching audio data is provided including receiving a query defining multiple phonetic possibilities. The method also includes comparing the query with a lattice of phonetic hypotheses...
7398210 System and method for performing analysis on word variants  
A computer-readable medium stores a first lexicon data structure for lexicon words. The first data structure includes a host form variant field containing a host form variant such as a clitic host...
7392185 Speech based learning/training system using semantic decoding  
An intelligent query system for processing voiced-based queries is disclosed, which uses a combination of both statistical and semantic based processing to identify the question posed by the user...
7392184 Arrangement of speaker-independent speech recognition  
A method needed in speech recognition for forming a pronunciation model in a telecommunications system comprising at least one portable electronic device and server. The electronic device is...
Matches 1 - 50 out of 433 1 2 3 4 5 6 7 8 9 >