Match Document Document Title
7617094 Methods, apparatus, and products for identifying a conversation  
One aspect of the invention is a method of using a computer to identify a conversation. Another aspect is a method for an audio processing system that identifies conversations and enhances each...
7613610 Transcription data extraction  
A computer program product, for performing data determination from medical record transcriptions, resides on a computer-readable medium and includes computer-readable instructions for causing a...
7610202 Integrated system and method for mobile audio playback and dictation  
A method and system provides for a single-pass review and feedback of a document. During audio playback of the document to be reviewed, voice-activated recording of feedback and submission of...
7606706 System and method for storage and retrieval of personal communications in a broadband network  
A mechanism is provided to build and maintain a searchable database of communication content and related indicia information of all voice and multimedia (audio and video) communications in which a...
7603273 Simultaneous multi-user real-time voice recognition system  
This invention is a combination of software and hardware components and methodologies that enable voice recognition for multiple users simultaneously. It introduces the concept of a...
7599475 Method and apparatus for generic analytics  
A method and apparatus for revealing business or organizational aspects of an organization in audio signals captured from interactions, broadcasts or other sources. The method and apparatus...
7590542 Method of generating test scripts using a voice-capable markup language  
A method is disclosed for generating test scripts for testing an IVR from an IVR's voice-capable markup language application. The test scripts can be generated directly from the voice-capable...
7590541 HMI presentation layer configuration system  
A human-machine interface generation system comprises a voice recognition component that receives voice commands relating to generation of a human-machine interface within an industrial automation...
7590535 Method and system of handling the selection of alternates for recognized words  
Methods and systems for facilitating the selection of alternates for hand written word. Rules select words user based on operating modes and cursor positions and sequential orderings. User...
7590534 Method and apparatus for processing voice data  
One aspect of the present disclosure is a device for processing voice data associated with an application program. The application program has a form therein for entering voice data. The...
7590533 New-word pronunciation learning using a pronunciation graph  
A method and computer-readable medium convert the text of a word and a user's pronunciation of the word into a phonetic description to be added to a speech recognition lexicon. Initially, a...
7587319 Speech recognition circuit using parallel processors  
A speech recognition circuit comprises an input buffer for receiving processed speech parameters. A lexical memory contains lexical data for word recognition. The lexical data comprises a plurality...
7584102 Language model for use in speech recognition  
Building a language model for use in speech recognition includes identifying without user interaction a source of text related to a user. Text is retrieved from the identified source of text and a...
7580835 Question-answering method, system, and program for answering question input by speech  
Disclosed is a question answering method for answering a question by using a text database storing text data in conjunction with a speech database storing speech data. In this method, a speech...
7577569 Combined speech recognition and text-to-speech generation  
Text-to-speech (TTS) generation is used in conjunction with large vocabulary speech recognition to say words selected by the speech recognition. The software for performing the large vocabulary...
7574356 System and method for spelling recognition using speech and non-speech input  
A system and method for non-speech input or keypad-aided word and spelling recognition is disclosed. The method comprises performing spelling recognition via automatic speech recognition (ASR) on...
7558730 Speech recognition and transcription among users having heterogeneous protocols  
A system is disclosed for facilitating speech recognition and transcription among users employing incompatible protocols for generating, transcribing, and exchanging speech. The system includes a...
7546529 Method and system for providing alternatives for text derived from stochastic input sources  
A computer-implemented method for providing a candidate list of alternatives for a text selection containing text from multiple input sources, each of which can be stochastic (such as a speech...
7539086 System and method for the secure, real-time, high accuracy conversion of general-quality speech into text  
The system is designed to interface with external devices and services, to transcribe audio that may be stored elsewhere such as a wireless phone'voice mail, or occurring between two or more...
7526431 Speech recognition using ambiguous or phone key spelling and/or filtering  
Alphabetic filtering of the speech recognition of words uses a key press to indicate a desired character in an alphabetic filter string, where each key press represents two or more letters. The key...
7526429 Spelled speech recognition method and system accounting for possible misrecognized characters  
Caller interface systems and methods are described. In one aspect, a sequence of recognized characters beginning with a first recognized character and ending with a last recognized character is...
7519347 Method and device for noise detection  
A system and method for detecting cell phone noise induced in telecommunication equipment, especially in microphones and other unshielded electronic units connected to a communication terminal. A...
7519165 Method for providing digital notification and receiving responses  
A method to provide a digital notification and response to groups of users comprising storing user contact data for at least one group of users, user selected priority information, and user...
7516070 Method for simultaneously creating audio-aligned final and verbatim text with the assistance of a speech recognition program as may be useful in form completion using a verbal entry method  
A system and method for creating a final text from an audio file. This has particular utility in completing forms with speech-to-text conversion. The system and method includes transcribing the...
7515770 Information processing method and apparatus  
In order to associate image data with speech data, a character detection unit detects a text region from the image data, and a character recognition unit recognizes a character from the text...
7512537 NLP tool to dynamically create movies/animated scenes  
The subject invention provides a unique system and method that facilitates integrating natural language input and graphics in a cooperative manner. In particular, as natural language input is...
7505903 Speech recognition dictionary creation method and speech recognition dictionary creating device  
A speech recognition dictionary creation method is provided for creating a speech recognition dictionary that is used for creating document data such as electronic mails through voice input in an...
7505902 Discrimination of components of audio signals based on multiscale spectro-temporal modulations  
An audio signal ( 172 ) representative of an acoustic signal is provided to an auditory model ( 105 ). The auditory model ( 105 ) produces a high-dimensional feature set based on physiological...
7505901 Intelligent acoustic microphone fronted with speech recognizing feedback  
Voice command operated systems are being installed in modern motor vehicles with increasing frequency. Such systems should be operable by various vehicle occupants and from various seating...
7499859 Video device with voice-assisted system  
A video device with a voice-assisted system is provided by using a voice command to adjust the images. The voice-assisted system includes a voice recognition engine and a control unit. The voice...
7490042 Methods and apparatus for adapting output speech in accordance with context of communication  
A technique for producing speech output in an automatic dialog system in accordance with a detected context is provided. Communication is received from a user at the automatic dialog system. A...
7487440 Reusable voiceXML dialog components, subdialogs and beans  
Systems and methods for building speech-based applications using reusable dialog components based on VoiceXML (Voice eXtensible Markup Language). VoiceXML reusable dialog components can be used for...
7487086 Transcript alignment  
An approach to alignment of transcripts with recorded audio is tolerant of moderate transcript inaccuracies, untranscribed speech, and significant non-speech noise. In one aspect, a number of...
7487085 Method and system of building a grammar rule with baseforms generated dynamically from user utterances  
A method ( 200 ) of building a grammar with baseforms generated dynamically from user utterances can include the steps of recording ( 205 ) a user utterance, generating ( 210 ) a baseform using the...
7480613 Method of supporting the proof-reading of speech-recognized text with a replay speed adapted to the recognition reliability  
The invention relates to a method of supporting the proof-reading of a text ( 2, 30 ) obtained in particular by speech recognition from a speech signal ( 1 ), of which at least one text component...
7478044 Facilitating navigation of voice data  
A system, system, and program for facilitating navigation of voice data are provided. Tokens are added to voice data based on predefined content criteria. Then, bidirectional scanning of the voice...
7471964 Mobile communication terminal  
When reproducing a multimedia file received by a receiving section, a control section controls a sound reproduction section to reproduce a sound on the basis of sound data. Further, the control...
7467087 Training and using pronunciation guessers in speech recognition  
The error rate of a pronunciation guesser that guesses the phonetic spelling of words used in speech recognition is improved by causing its training to weigh letter-to-phoneme mappings used as data...
7461001 Speech-to-speech generation system and method  
An expressive speech-to-speech generation system and method which can generate expressive speech output by using expressive parameters extracted from the original speech signal to drive the...
7457756 Method of generating time-frequency signal representation preserving phase information  
A method of generating a time-frequency representation of a signal that preserves phase information by receiving the signal, calculating a joint time-frequency domain of the signal, estimating...
7457749 Noise-robust feature extraction using multi-layer principal component analysis  
Extracting features from signals for use in classification, retrieval, or identification of data represented by those signals uses a “Distortion Discriminant Analysis” (DDA) of a set of...
7457466 Method and system of handling the selection of alternates for recognized words  
Methods and systems for facilitating the selection of alternates for hand written word. Rules select words user based on operating modes and cursor positions and sequential orderings. User...
7451084 Cell phone having an information-converting function  
A voice discriminating tag for making a reception voice and a transmission voice distinguishable is added to the voice inputted into a cell phone. Further, a volume discriminating tag is added...
7447625 Method for generating text script of high efficiency  
This proposal presents performance indices and search criteria for the text script generation in the design of corpus-based TTS systems. Based on our criteria a new search method is presented to...
7444287 Efficient monitoring system and method  
A method, article of manufacture, and apparatus for monitoring a location having a plurality of audio sensors and video sensors are disclosed. In an embodiment, this comprises receiving auditory...
7444285 Method and system for sequential insertion of speech recognition results to facilitate deferred transcription services  
Methods and systems are disclosed for the sequential insertion of speech recognition results for deferred transcription services. Speech recognition results are used in the creation of resultant...
7440895 System and method for tuning and testing in a speech recognition system  
Systems and methods for improving the performance of a speech recognition system. In some embodiments a tuner module and/or a tester module are configured to cooperate with a speech recognition...
7437287 Apparatus and method for voice recognition and displaying of characters in mobile telecommunication system  
An apparatus and method for speech recognition and character displaying in a mobile telecommunication system. In a speech recognition and character displaying apparatus for a mobile phone, an RF...
7433818 Subscriber terminal for providing speech-text encoding and telephony service  
A subscriber terminal is provided for speech-to-text translation. Speech packets are received at a broadband telephony interface and stored in a buffer. The speech packets are processed and textual...
7433490 System and method for real time lip synchronization  
A novel method for synchronizing the lips of a sketched face to an input voice. The lip synchronization system and method approach is to use training video as much as possible when the input voice...