Match Document Document Title
9043349 Image-based character recognition  
Various embodiments enable a device to perform tasks such as processing an image to recognize and locate text in the image, and providing the recognized text an application executing on the device...
9043210 Biometric voice command and control switching device and method of use  
A biometric voice command and control switching device has a microphone assembly for receiving a currently spoken challenge utterance and a reference utterance, and a voice processing circuit for...
9043207 Speaker recognition from telephone calls  
The present invention relates to a method for speaker recognition, comprising the steps of obtaining and storing speaker information for at least one target speaker; obtaining a plurality of...
9037469 Automated communication integrator  
An apparatus includes a plurality of applications and an integrator having a voice recognition module configured to identify at least one voice command from a user. The integrator is configured to...
9037470 Script compliance and quality assurance based on speech recognition and duration of interaction  
Apparatus and methods are provided for using automatic speech recognition to analyze a voice interaction and verify compliance of an agent reading a script to a client during the voice...
9031828 Systems and methods for multi-user multi-lingual communications  
Various embodiments described herein facilitate multi-lingual communications. The systems and methods of some embodiments may enable multi-lingual communications through different modes of...
9031829 Systems and methods for multi-user multi-lingual communications  
Various embodiments described herein facilitate multi-lingual communications. The systems and methods of some embodiments enable multi-lingual communications through different modes of...
9031842 Methods and devices for facilitating communications  
Methods and electronic devices for facilitating communications are described. In one aspect, a method for facilitating communications is described. The method includes: monitoring audio based...
9031614 Method and apparatus for secure electronic business card exchange  
An electronic business card is provided with voice data associated with the card owner. In some embodiments, the voice data is a digitized voice sample; in other embodiments, the voice data is a...
9026182 Communication device  
The communication device comprising a voice communication implementer, a multiple & real-time & chronological speech-to-text implementer, and a caller ID.
9026447 Command and control of devices and applications by voice using a communication base system  
A first communication path for receiving a communication is established. The communication includes speech, which is processed. A speech pattern is identified as including a voice-command. A...
9026431 Semantic parsing with multiple parsers  
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for semantic parsing with multiple parsers. One of the methods includes obtaining one or more...
9026438 Detecting barge-in in a speech dialogue system  
A method for detecting barge-in in a speech dialog system comprising determining whether a speech prompt is output by the speech dialog system, and detecting whether speech activity is present in...
9026442 System and method for adapting automatic speech recognition pronunciation by acoustic model restructuring  
Disclosed herein are systems, computer-implemented methods, and computer-readable storage media for recognizing speech by adapting automatic speech recognition pronunciation by acoustic model...
9020816 Hidden markov model for speech processing with training method  
A method, system and apparatus are shown for identifying non-language speech sounds in a speech or audio signal. An audio signal is segmented and feature vectors are extracted from the segments of...
9020818 Format based speech reconstruction from noisy signals  
Implementations of systems, method and devices described herein enable enhancing the intelligibility of a target voice signal included in a noisy audible signal received by a hearing aid device or...
9015044 Formant based speech reconstruction from noisy signals  
Implementations of systems, method and devices described herein enable enhancing the intelligibility of a target voice signal included in a noisy audible signal received by a hearing aid device or...
9015043 Choosing recognized text from a background environment  
A computer-implemented method includes receiving an electronic representation of one or more human voices, recognizing words in a first portion of the electronic representation of the one or more...
9014346 Methods and systems for touch-free call handling  
A method, apparatus and computer-readable medium for handling incoming calls destined for a called party. The method comprises detecting arrival of an incoming call destined for the called party...
9015046 Methods and apparatus for real-time interaction analysis in call centers  
A method and system for indicating in real time that an interaction is associated with a problem or issue, comprising: receiving a segment of an interaction in which a representative of the...
9009039 Noise adaptive training for speech recognition  
Technologies are described herein for noise adaptive training to achieve robust automatic speech recognition. Through the use of these technologies, a noise adaptive training (NAT) approach may...
9009025 Context-based utterance recognition  
In some implementations, a digital work provider may provide language model information related to a plurality of different contexts, such as a plurality of different digital works. For example,...
9002706 Cut and paste spoofing detection using dynamic time warping  
The invention refers to a method for comparing voice utterances, the method comprising the steps: extracting a plurality of features (201) from a first voice utterance of a given text sample and...
9002702 Confidence level assignment to information from audio transcriptions  
Embodiments of the present invention provide an approach for automatically assigning a confidence level to information extracted from a transcription of a voice recording. Specifically, in a...
9002710 System and method for applying dynamic contextual grammars and language models to improve automatic speech recognition accuracy  
The invention involves the loading and unloading of dynamic section grammars and language models in a speech recognition system. The values of the sections of the structured document are either...
9002707 Determining the position of the source of an utterance  
An information processing apparatus includes: a plurality of information input units; an event detection unit that generates event information including estimated position information and...
9002713 System and method for speech personalization by need  
Disclosed herein are systems, computer-implemented methods, and tangible computer-readable storage media for speaker recognition personalization. The method recognizes speech received from a...
9002705 Interactive device that recognizes input voice of a user and contents of an utterance of the user, and performs a response corresponding to the recognized contents  
The present invention provides an interactive device which allows quick utterance recognition results and sequential output thereof and which diminishes a recognition rate decrease even if user's...
9001976 Speaker adaptation  
A method for speaker adaptation includes receiving a plurality of media files, each associated with a call center agent of a plurality of call center agents and receiving a plurality of terms....
9002703 Community audio narration generation  
The community-based generation of audio narrations for a text-based work leverages collaboration of a community of people to provide human-voiced audio readings. During the community-based...
8996387 Release of transaction data  
For clearing transaction data selected for a processing, there is generated in a portable data carrier (1) a transaction acoustic signal (003; 103; 203) (S007; S107; S207) upon whose acoustic...
8996374 Senone scoring for multiple input streams  
Embodiments of the present invention include an apparatus, method, and system for calculating senone scores for multiple concurrent input speech streams. The method can include the following:...
8996382 Lips blockers, headsets and systems  
Systems and methods for inhibiting access to the lips of speaking person including a sound receiving device for receiving speech of a person speaking, the person having lips that move when the...
8996373 State detection device and state detecting method  
A state detection device includes: a first model generation unit to generate a first specific speaker model obtained by modeling speech features of a specific speaker in an undepressed state; a...
8996368 Online maximum-likelihood mean and variance normalization for speech recognition  
A feature transform for speech recognition is described. An input speech utterance is processed to produce a sequence of representative speech vectors. A time-synchronous speech recognition pass...
8990071 Telephony service interaction management  
A method for managing an interaction of a calling party to a communication partner is provided. The method includes automatically determining if the communication partner expects DTMF input. The...
8983838 Global speech user interface  
A global speech user interface (GSUI) comprises an input system to receive a user's spoken command, a feedback system along with a set of feedback overlays to give the user information on the...
8983836 Captioning using socially derived acoustic profiles  
Mechanisms for performing dynamic automatic speech recognition on a portion of multimedia content are provided. Multimedia content is segmented into homogeneous segments of content with regard to...
8982971 System for spectrum sensing of multi-carrier signals with equidistant sub-carriers  
A multi-carrier signal is typically comprised of many equidistant sub-carriers. This results in periodicity of spectrum within the bandwidth of such a multi-carrier signal. An unknown...
8983837 Alert mode management method and communication device having alert mode management function  
A computerized alert mode management method of a communication device, the communication device includes a sound capture unit. Vocal sounds of the environment around the communication device are...
8983207 Mitigating replay attacks using multiple-image authentication  
A technique for authenticating a user is described. During this authentication technique, an electronic device (such as a cellular telephone) captures multiple images of the user while the user...
8977547 Voice recognition system for registration of stable utterances  
A voice recognition system includes: a voice input unit 11 for inputting a voice uttered a plurality of times; a registering voice data storage unit 12 for storing voice data uttered the plurality...
8977555 Identification of utterance subjects  
Features are disclosed for generating markers for elements or other portions of an audio presentation so that a speech processing system may determine which portion of the audio presentation a...
8976906 Method for spectrum sensing of multi-carrier signals with equidistant sub-carriers  
A multi-carrier signal is typically comprised of many equidistant sub-carriers. This results in periodicity of spectrum within the bandwidth of such a multi-carrier signal. An unknown...
8977549 Natural language system and method based on unisolated performance metric  
A natural language business system and method is developed to understand the underlying meaning of a person's speech, such as during a transaction with the business system. The system includes a...
8976943 Voice phone-based method and system to authenticate users  
Provided is a method and a telephone-based system with voice-verification capabilities that enable a user to safely and securely conduct transactions with his or her online financial transaction...
8972265 Multiple voices in audio content  
A content customization service is disclosed. The content customization service may identify one or more speakers in an item of content, and map one or more portions of the item of content to a...
8972855 Method and apparatus for providing case restoration  
A method and apparatus for providing case restoration in a communication network are disclosed. For example, the method obtains one or more content sources from one or more information feeds, and...
8972259 System and method for teaching non-lexical speech effects  
A method and system for teaching non-lexical speech effects includes delexicalizing a first speech segment to provide a first prosodic speech signal and data indicative of the first prosodic...
8972260 Speech recognition using multiple language models  
In accordance with one embodiment, a method of generating language models for speech recognition includes identifying a plurality of utterances in training data corresponding to speech, generating...