Sign up


Match Document Document Title
8731930 Contextual voice query dilation to improve spoken web searching  
A method for contextual voice query dilation in a Spoken Web search includes determining a context in which a voice query is created, generating a set of multiple voice query terms based on the...
8731919 Methods and system for capturing voice files and rendering them searchable by keyword or phrase  
A system for capturing voice files and rendering them searchable, comprising one or more devices capable of capturing audio speech electronically, a recorder coupled to the devices for retrieving...
8731912 Delaying audio notifications  
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for audible alert tones are disclosed. The methods, systems, and apparatus include actions of...
8731937 Updating speech recognition models for contacts  
Systems, methods and apparatus for generating, distributing, and using speech recognition models. A shared speech processing facility is used to support speech recognition for a wide variety of...
8731926 Spoken term detection apparatus, method, program, and storage medium  
A spoken term detection apparatus includes: processing performed by a processor includes a feature extraction process extracting an acoustic feature from speech data accumulated in an accumulation...
8731929 Agent architecture for determining meanings of natural language utterances  
Systems and methods for receiving natural language queries and/or commands and execute the queries and/or commands. The systems and methods overcomes the deficiencies of prior art speech query and...
8731934 System and method for multi-modal audio mining of telephone conversations  
A system and method for the automated monitoring of inmate telephone calls as well as multi-modal search, retrieval and playback capabilities for said calls. A general term for such capabilities is...
8731936 Energy-efficient unobtrusive identification of a speaker  
Functionality is described herein for recognizing speakers in an energy-efficient manner. The functionality employs a heterogeneous architecture that comprises at least a first processing unit and...
8725511 Enhanced accuracy for speech recognition grammars  
Disclosed herein are methods and systems for recognizing speech. A method embodiment comprises comparing received speech with a precompiled grammar based on a database and if the received speech...
8725515 Electronic apparatus and method for controlling the electronic apparatus using voice  
An electronic apparatus includes a microphone, a processor, a motherboard, and a voice recognition microchip. The voice recognition microchip compares a voice command with a pre-stored voice...
8725517 System and dialog manager developed using modular spoken-dialog components  
A dialog manager and spoken dialog service having a dialog manager generated according to a method comprising selecting a top level flow controller based on application type, selecting available...
8725512 Method and system having hypothesis type variable thresholds  
A method (and system) for spoken dialog confirmation classifies a plurality of spoken dialog hypotheses, and assigns a threshold to each class of spoken dialog hypotheses.
8725497 System and method for detecting and correcting mismatched Chinese character  
A system and method for detecting and correcting mismatched Chinese characters in a phrase. The system comprises a database for the look-up of characters and Chinese phrases, a module to compare...
8724780 Voice interaction method of mobile terminal based on voiceXML and mobile terminal  
The present invention discloses a voice interaction method of a mobile terminal based on VoiceXML and a mobile terminal, which comprises: converting received voice information into a VoiceXML...
8725491 Mobile electronic device and associated method enabling identification of previously entered data for transliteration of an input  
An improved mobile electronic device and associated method enable the identification of previously-entered textual objects in one or more custom wordlists to identify possible transliterations of...
8725281 Recording and/or reproducing apparatus and recording apparatus  
A recording and/or reproducing apparatus includes a microphone, a semiconductor memory, an operating section and a controller. An output signal from the microphone is written in the semiconductor...
8725518 Automatic speech analysis  
A system for providing automatic quality management regarding a level of conformity to a specific accent, including, a recording system, a statistical model database with statistical models...
8725513 Providing expressive user interaction with a multimodal application  
Methods, apparatus, and products are disclosed for providing expressive user interaction with a multimodal application, the multimodal application operating in a multimodal browser on a multimodal...
8725514 Verifying a user using speaker verification and a multimodal web-based interface  
A method of verifying a user identity using a Web-based multimodal interface can include sending, to a remote client device, a multimodal markup language document that, when rendered by the remote...
8719015 Dialogue system and method for responding to multimodal input using calculated situation adaptability  
A dialogue system and a method for the same are disclosed. The dialogue system includes a multimodal input unit receiving speech and non-speech information of a user, a domain reasoner, which...
8718242 Method and apparatus for automatically building conversational systems  
A system and method provides a natural language interface to world-wide web content. Either in advance or dynamically, webpage content is parsed using a parsing algorithm. A person using a...
8719004 Systems and methods for punctuating voicemail transcriptions  
A system, method and software product punctuates voicemail transcription text. A transcription text of the voicemail message is generated and the pauses between words of the transcribed text are...
8719035 Method and apparatus for recognizing and reacting to user personality in accordance with speech recognition system  
Techniques are disclosed for recognizing user personality in accordance with a speech recognition system. For example, a technique for recognizing a personality trait associated with a user...
8719031 Dynamic access to external media content based on speaker content  
An audio conference is supplemented based on speaker content. Speaker content from at least one audio conference participant is monitored using a computer with a tangible non-transitory processor...
8719034 Displaying speech command input state information in a multimodal browser  
Methods, systems, and products are disclosed for displaying speech command input state information in a multimodal browser including displaying an icon representing a speech command type and...
8719032 Methods for presenting speech blocks from a plurality of audio input data streams to a user in an interface  
A clear picture of who is speaking in a setting where there are multiple input sources (e.g., a conference room with multiple microphones) can be obtained by comparing input channels against each...
8719029 File format, server, viewer device for digital comic, digital comic generation device  
A viewer device for a digital comic comprising: an information acquisition unit that acquires a digital comic in a file format for a digital comic viewed on a viewer device, the file format...
8719036 Voice dialogue system, method, and program  
A voice dialogue system executing an operation command inputted by a voice dialogue user which stores a history of the number of times each operation is executed. Upon the reception or detection of...
8719033 Greeting card having karaoke record feature and simultaneous playback  
A greeting card having an audio recording and playback device permits recording of a karaoke-style song to be played upon opening of the greeting card. A user sings along with a permanently...
8718622 Pervasive contact center  
Methods and systems that support the receipt of location data and/or touch data from a mobile communication device are provided. More particularly, a mobile customer service server is provided that...
8713593 Detection system and method for mobile device application  
A system and method for detecting a non-visual code using an application on a mobile device, where the application is capable of associating the non-visual code with at least one item contained in...
8712772 Method and system for processing dictated information  
A method and system for processing dictated information into a dynamic form are disclosed. The method comprises presenting an image (3) belonging to an image category to a user, dicatating a first...
8713119 Electronic devices with voice command and contextual data processing capabilities  
An electronic device may capture a voice command from a user. The electronic device may store contextual information about the state of the electronic device when the voice command is received. The...
8712777 Computerized information and display methods  
Methods for obtaining and displaying information, such as directions to a desires entity or organization. In one embodiment, the method makes use of a computerized apparatus configured to receive...
8712779 Information retrieval system, information retrieval method, and information retrieval program  
An information retrieval system comprises: a speech input unit for inputting speech; an information storage unit for storing information with which speech information, of a length with which text...
8712781 System and method for customized prompting  
A method for providing an audible prompt to a user within a vehicle. The method includes retrieving one or more data files from a memory device. The data files define certain characteristics of an...
8712778 Systems and methods for archiving and retrieving navigation points in a voice command platform  
A method and system for identifying, saving and utilizing bookmarks in a voice-command platform. The system allows a user to bookmark objects specified within voice-markup filed resulting in the...
8706473 System and method for insertion and removal of video objects  
An example method may include receiving a media stream from a first endpoint, where the media stream is intended for a second endpoint; processing the media stream according to at least one...
8706484 Voice recognition dictionary generation apparatus and voice recognition dictionary generation method  
A voice recognition dictionary generation apparatus and method for suppressing reduction of processing speed at the time of updating. The apparatus includes an input unit configured to receive a...
8706489 System and method for selecting audio contents by using speech recognition  
A system and method for selecting audio contents by using the speech recognition to obtain a textual phrase from a series of audio contents are provided. The system includes an output module...
8706471 Communication system using mixed translating while in multilingual communication  
A translation between a source language and a target language is disclosed. The source language items are divided, with primary and secondary source language items or named entities being...
8706501 Method and system for sharing speech processing resources over a communication network  
A method and system (40) for sharing speech processing resources (54) over a (communication network (21) for handling multiple client types (100, 101, etc.) and multiple media protocol types. The...
8706498 System for dynamic management of customer direction during live interaction  
A system for customer interaction includes a telephony-enabled device for receiving voice calls from customers, a voice recognition engine connected to the telephony-enabled device for monitoring...
8705705 Voice rendering of E-mail with tags for improved user experience  
Tags, such as XML tags, are inserted into email to separate email content from signature blocks, privacy notices and confidentiality notices, and to separate original email messages from replies...
8706499 Periodic ambient waveform analysis for enhanced social functions  
Client devices periodically capture ambient audio waveforms, generate waveform fingerprints, and upload the fingerprints to a server for analysis. The server compares the waveforms to a database of...
8706500 Establishing a multimodal personality for a multimodal application  
Methods, apparatus, and computer program products are described for establishing a multimodal personality for a multimodal application that include selecting, by the multimodal application,...
8706497 Speech signal restoration device and speech signal restoration method  
A synthesis filter 106 synthesizes a plurality of wide-band speech signals by combining wide-band phoneme signals and sound source signals from a speech signal code book 105, and a distortion...
8700406 Preserving audio data collection privacy in mobile devices  
Techniques are disclosed for using the hardware and/or software of the mobile device to obscure speech in the audio data before a context determination is made by a context awareness application...
8700396 Generating speech data collection prompts  
This document generally describes computer technologies relating to generating speech data collection prompts, such as textual scripts and/or textual scenarios. Speech data collection prompts for a...
8700407 Systems and methods for recognizing sound and music signals in high noise and distortion  
A method for recognizing an audio sample locates an audio file that closely matches the audio sample from a database indexing a large set of original recordings. Each indexed audio file is...