Match Document Document Title
9043208 System, method and program product for providing automatic speech recognition (ASR) in a shared resource environment  
A speech recognition system, method of recognizing speech and a computer program product therefor. A client device identified with a context for an associated user selectively streams audio to a...
9043205 Dynamic language model  
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for speech recognition. One of the methods includes receiving a base language model for speech...
9037459 Selection of text prediction results by an accessory  
A method for entering text in a text input field using a non-keyboard type accessory includes selecting a character for entry into the text field presented by a portable computing device. The...
9031838 Method and apparatus for voice clarity and speech intelligibility detection and correction  
Systems, methods and apparatus are described herein for continuously measuring voice clarity and speech intelligibility by evaluating a plurality of telecommunications channels in real time. Voice...
9031840 Identifying media content  
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for receiving (i) audio data that encodes a spoken natural language query, and (ii) environmental...
9026446 System for generating captions for live video broadcasts  
An adaptive workflow system can be used to implement captioning projects, such as projects for creating captions or subtitles for live and non-live broadcasts. Workers can repeat words spoken...
9026438 Detecting barge-in in a speech dialogue system  
A method for detecting barge-in in a speech dialog system comprising determining whether a speech prompt is output by the speech dialog system, and detecting whether speech activity is present in...
9020816 Hidden markov model for speech processing with training method  
A method, system and apparatus are shown for identifying non-language speech sounds in a speech or audio signal. An audio signal is segmented and feature vectors are extracted from the segments of...
9020825 Voice gestures  
A voice gesture is determined from characteristics of an audio signal based on sound uttered by a user. The voice gesture may represent a command or parameters or a command, and may be context...
9020817 Using speech to text for detecting commercials and aligning edited episodes with transcripts  
Methods and apparatus, including computer program products, for using speech to text for detecting commercials and aligning edited episodes with transcripts. A method includes, receiving an...
9020808 Document summarization using noun and sentence ranking  
Systems and methods are provided for summarization of electronic text documents. Nouns and sentences are identified in a text document, and the most-prevalent nouns are further identified based on...
9014346 Methods and systems for touch-free call handling  
A method, apparatus and computer-readable medium for handling incoming calls destined for a called party. The method comprises detecting arrival of an incoming call destined for the called party...
9015046 Methods and apparatus for real-time interaction analysis in call centers  
A method and system for indicating in real time that an interaction is associated with a problem or issue, comprising: receiving a segment of an interaction in which a representative of the...
9015693 System and method for modifying and updating a speech recognition program  
The system provides a speech recognition program, an update website for updating a speech recognition program, and a way of storing data. A user may utilize an update website, to add, modify, and...
9009049 Recognition of speech with different accents  
Computer-based speech recognition can be improved by recognizing words with an accurate accent model. In order to provide a large number of possible accents, while providing real-time speech...
9009041 Systems and methods for improving the accuracy of a transcription using auxiliary data such as personal data  
A method is described for improving the accuracy of a transcription generated by an automatic speech recognition (ASR) engine. A personal vocabulary is maintained that includes replacement words....
9009043 Pattern processing system specific to a user group  
Methods and apparatus for identifying a user group in connection with user group-based speech recognition. An exemplary method comprises receiving, from a user, a user group identifier that...
9009042 Machine translation of indirect speech  
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating direct speech messages based on voice commands that include indirect speech...
9009040 Training a transcription system  
According to certain embodiments, training a transcription system includes accessing recorded voice data of a user from one or more sources. The recorded voice data comprises voice samples. A...
9009055 Hosted voice recognition system for wireless devices  
Methods, systems, and software for converting the audio input of a user of a hand-held client device or mobile phone into a textual representation by means of a backend server accessed by the...
9009044 Multiple subspace discriminative feature training  
Methods and apparatus related to speech recognition performed by a speech recognition device are disclosed. The speech recognition device can receive a plurality of samples corresponding to an...
9002708 Speech recognition system and method based on word-level candidate generation  
A speech recognition system and method based on word-level candidate generation are provided. The speech recognition system may include a speech recognition result verifying unit to verify a word...
9002707 Determining the position of the source of an utterance  
An information processing apparatus includes: a plurality of information input units; an event detection unit that generates event information including estimated position information and...
9002698 Speech translation apparatus, method and program  
According to one embodiment, a speech translation apparatus includes a speech recognition unit, a translation unit, a search unit and a selection unit. The speech recognition unit successively...
9002713 System and method for speech personalization by need  
Disclosed herein are systems, computer-implemented methods, and tangible computer-readable storage media for speaker recognition personalization. The method recognizes speech received from a...
8996381 Background speech recognition assistant  
In one embodiment, a method receives an acoustic input signal at a speech recognizer configured to recognize the acoustic input signal in an always on mode. A set of responses based on the...
8996369 System and method for transcribing audio files of various languages  
System, method and program product for transcribing an audio file included in or referenced by a web page. A language of text in the web page is determined. Then, voice recognition software of the...
8996371 Method and system for automatic domain adaptation in speech recognition applications  
A system and method for adapting a language model to a specific environment by receiving interactions captured the specific environment, generating a collection of documents from documents...
8996372 Using adaptation data with cloud-based speech recognition  
Speech recognition may be improved using data derived from an utterance. In some embodiments, audio data is received by a user device. Adaptation data may be retrieved from a data store accessible...
8996387 Release of transaction data  
For clearing transaction data selected for a processing, there is generated in a portable data carrier (1) a transaction acoustic signal (003; 103; 203) (S007; S107; S207) upon whose acoustic...
8996374 Senone scoring for multiple input streams  
Embodiments of the present invention include an apparatus, method, and system for calculating senone scores for multiple concurrent input speech streams. The method can include the following:...
8996386 Method and system for creating a voice recognition database for a mobile device using image processing and optical character recognition  
A method and system for controlling a mobile device from a head unit using voice control is disclosed. The head unit receives a graphical representation of a current user interface screen of the...
8996366 Multi-stage speaker adaptation  
A first gender-specific speaker adaptation technique may be selected based on characteristics of a first set of feature vectors that correspond to a first unit of input speech. The first set of...
8996363 Apparatus and method for determining a plurality of local center of gravity frequencies of a spectrum of an audio signal  
An apparatus for determining a plurality of local center-of-gravity frequencies of a spectrum of an audio signal includes an offset determiner, a frequency determiner and an iteration controller....
8996368 Online maximum-likelihood mean and variance normalization for speech recognition  
A feature transform for speech recognition is described. An input speech utterance is processed to produce a sequence of representative speech vectors. A time-synchronous speech recognition pass...
8996380 Methods and systems for synchronizing media  
Systems and methods of synchronizing media are provided. A client device may be used to capture a sample of a media stream being rendered by a media rendering source. The client device sends the...
8990078 Information presentation device associated with sound source separation  
An information presentation device includes an audio signal input unit configured to input an audio signal, an image signal input unit configured to input an image signal, an image display unit...
8990071 Telephony service interaction management  
A method for managing an interaction of a calling party to a communication partner is provided. The method includes automatically determining if the communication partner expects DTMF input. The...
8990085 System and method for handling repeat queries due to wrong ASR output by modifying an acoustic, a language and a semantic model  
Disclosed herein are systems, computer-implemented methods, and computer-readable storage media for handling expected repeat speech queries or other inputs. The method causes a computing device to...
8990077 Method and system for sharing portable voice profiles  
An embodiment of the present invention provides a speech recognition engine that utilizes portable voice profiles for converting recorded speech to text. Each portable voice profile includes...
8990079 Automatic calibration of command-detection thresholds  
When a voice-activated device or application is first started, the signal levels corresponding to spoken commands are initially unknown, making it difficult to set detection thresholds. The...
8990082 Non-scorable response filters for speech scoring systems  
A method for scoring non-native speech includes receiving a speech sample spoken by a non-native speaker and performing automatic speech recognition and metric extraction on the speech sample to...
8983835 Electronic device and server for processing voice message  
An electronic device includes a voice processing unit, a wireless communication unit, and a combining unit. The voice processing unit receives speech signals. The wireless communication unit sends...
8983838 Global speech user interface  
A global speech user interface (GSUI) comprises an input system to receive a user's spoken command, a feedback system along with a set of feedback overlays to give the user information on the...
8983836 Captioning using socially derived acoustic profiles  
Mechanisms for performing dynamic automatic speech recognition on a portion of multimedia content are provided. Multimedia content is segmented into homogeneous segments of content with regard to...
8983834 Multichannel audio coding  
Multiple channels of audio are combined either to a monophonic composite signal or to multiple channels of audio along with related auxiliary information from which multiple channels of audio are...
8983845 Third-party audio subsystem enhancement  
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for performing audio subsystem enhancement. In one aspect, a method includes: receiving a voice...
8977547 Voice recognition system for registration of stable utterances  
A voice recognition system includes: a voice input unit 11 for inputting a voice uttered a plurality of times; a registering voice data storage unit 12 for storing voice data uttered the plurality...
8972243 Parse information encoding in a finite state transducer  
In automatic speech recognition, certain parsing information, such as rules and tags, may be embedded into a finite state transducer (FST) to produce FST output that includes speech recognition...
8972254 Turbo processing for speech recognition with local-scale and broad-scale decoders  
Environmental recognition systems may improve recognition accuracy by leveraging local and nonlocal features in a recognition target. A local decoder may be used to analyze local features, and a...