Match Document Document Title
US20120166191 ELECTRONIC BOOK WITH VOICE EMULATION FEATURES  
A method and system for providing text-to-audio conversion of an electronic book displayed on a viewer. A user selects a portion of displayed text and converts it into audio. The text-to-audio...
US20140046661 APPARATUSES, METHODS AND SYSTEMS TO PROVIDE TRANSLATIONS OF INFORMATION INTO SIGN LANGUAGE OR OTHER FORMATS  
Some embodiments provide methods of providing a translation of information to a translated format comprising: receiving information in a first format; identifying the first format, where in the...
US20110112834 Communication method and terminal  
A communication method and terminal assist hearing and speech impaired persons. The communication method includes generating a text by combining at least one character input in a text call mode....
US20130041663 COMMUNICATION APPLICATION FOR CONDUCTING CONVERSATIONS INCLUDING MULTIPLE MEDIA TYPES IN EITHER A REAL-TIME MODE OR A TIME-SHIFTED MODE  
A communication application configured to support a conversation among participants over a communication network. The communication application is configured to (i) support one or more media types...
US20110184730 MULTI-DIMENSIONAL DISAMBIGUATION OF VOICE COMMANDS  
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for processing voice commands. In one aspect, a method includes receiving an audio signal at a...
US20150058006 PHONETIC ALIGNMENT FOR USER-AGENT DIALOGUE RECOGNITION  
A method for speech to text transcription uses a knowledge base containing solution descriptions, each describing, in words, a solution to a respective problem. An audio recording of a dialogue...
US20130124202 METHOD AND APPARATUS FOR PROCESSING SCRIPTS AND RELATED DATA  
Provided in some embodiments is a method including receiving ordered script words are indicative of dialogue words to be spoken, receiving audio data corresponding to at least a portion of the...
US20090319266 MULTIMODAL INPUT USING SCRATCHPAD GRAPHICAL USER INTERFACE TO EDIT SPEECH TEXT INPUT WITH KEYBOARD INPUT  
A system and method for multimodal input into an application program. The method may include performing speech recognition on speech audio input to thereby produce recognized speech text input for...
US20110054895 UTILIZING USER TRANSMITTED TEXT TO IMPROVE LANGUAGE MODEL IN MOBILE DICTATION APPLICATION  
In embodiments of the present invention improved capabilities are described for utilizing user transmitted text to improve language modeling in converting voice to text on a mobile communication...
US20110276325 Training A Transcription System  
According to certain embodiments, training a transcription system includes accessing recorded voice data of a user from one or more sources. The recorded voice data comprises voice samples. A...
US20130144603 ENHANCED VOICE CONFERENCING WITH HISTORY  
Techniques for ability enhancement are described. Some embodiments provide an ability enhancement facilitator system (“AEFS”) configured to enhance voice conferencing among multiple speakers. Some...
US20110231379 SEARCH ENGINE INFERENCE BASED VIRTUAL ASSISTANCE  
Techniques described herein generally relate to real time inference based systems. Example embodiments may set forth devices, methods, and computer programs related to search engine inference...
US20140330562 Method and Apparatus for Obtaining Information from the Web  
An intelligent conversation system augmenting a conversation between two or more individuals uses a speech to text block configured to convert voices of the conversation into text, a determination...
US20150228279 LANGUAGE MODELS USING NON-LINGUISTIC CONTEXT  
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for language models using non-linguistic context. In some implementations, context data...
US20150046158 METHOD AND APPARATUS FOR VOICE MODIFICATION DURING A CALL  
A method for voice modification during a telephone call comprising receiving a source audio signal associated with at least one participant, wherein the source audio signal comprises a voice of...
US20100174543 SYSTEMS FOR DISPLAYING CONVERSIONS OF TEXT EQUIVALENTS  
Embodiments of the invention include a system for displaying an audit diagram. The system includes a monitor capable of electronically displaying the audit diagram. The monitor includes a text...
US20070136064 MOBILE PERSONAL COMPUTER WITH MOVEMENT SENSOR  
A mobile personal computer including a case, a display device, a speech recognition system, a movement sensor, a microprocessor, and a power source. The case is sized for handling by a single,...
US20050125223 Audio-visual highlights detection using coupled hidden markov models  
A method uses probabilistic fusion to detect highlights in videos using both audio and visual information. Specifically, the method uses coupled hidden Markov models (CHMMs). Audio labels are...
US20080091694 Transcriptional dictation  
A method of dictation in which authors may assign their own words or phrases (UnifiedWords) to introduce various subjects/document elements in a document to be dictated. Each element is terminated...
US20140350930 Real Time Generation of Audio Content Summaries  
Audio content is converted to text using speech recognition software. The text is then associated with a distinct voice or a generic placeholder label if no distinction can be made. From the text...
US20120179465 REAL TIME GENERATION OF AUDIO CONTENT SUMMARIES  
Audio content is converted to text using speech recognition software. The text is then associated with a distinct voice or a generic placeholder label if no distinction can be made. From the text...
US20130211833 TECHNIQUES FOR OVERLAYING A CUSTOM INTERFACE ONTO AN EXISTING KIOSK INTERFACE  
Techniques for overlaying a custom interface onto an existing kiosk interface are provided. An event is detected that triggers a kiosk to process an agent that overlays, and without modifying, the...
US20060241943 Medical vocabulary templates in speech recognition  
A system of templates of words and terms use in medicine and surgery by physicians for optimizing the outcomes of speech recognition process of converting digital voice data produced by physicians...
US20110282687 Clinical Data Reconciliation as Part of a Report Generation Solution  
An automated system updates electronic medical records (EMRs) based on dictated reports, without requiring manual data entry into on-screen forms. A dictated report is transcribed by an automatic...
US20140316780 METHOD AND SYSTEM FOR PROVIDING AN AUTOMATED WEB TRANSCRIPTION SERVICE  
A system, method and computer readable medium that provides an automated web transcription service is disclosed. The method may include receiving input speech from a user using a communications...
US20090037171 Real-time voice transcription system  
The real-time voice transcription system provides a speech recognition system and method that includes use of speech and spatial-temporal acoustic data to enhance speech recognition probabilities...
US20140032215 Asynchronous Video Interview System  
Aspects of an asynchronous video interview system and related techniques include a server that receives a plurality of pre-recorded video prompts, generates an interview script, transmits a video...
US20130226578 ASYNCHRONOUS VIDEO INTERVIEW SYSTEM  
Aspects of an asynchronous video interview system and related techniques include a server that receives a plurality of pre-recorded video prompts, generates an interview script, transmits a video...
US20110276327 VOICE-TO-EXPRESSIVE TEXT  
A method including receiving a vocal input including words spoken by a user; determining vocal characteristics associated with the vocal input mapping the vocal characteristics to textual...
US20140249815 METHOD, APPARATUS AND COMPUTER PROGRAM PRODUCT FOR PROVIDING TEXT INDEPENDENT VOICE CONVERSION  
An apparatus for providing text independent voice conversion may include a first voice conversion model and a second voice conversion model. The first voice conversion model may be trained with...
US20100121629 Method and apparatus for translating speech during a call  
A translation platform allows a client using a first language to communicate via translated voice and/or text to at least a second client using a second language. A control server uses various...
US20100100377 GENERATING AND PROCESSING FORMS FOR RECEIVING SPEECH DATA  
A system and method for dynamically generating and processing forms for receiving data, such as text-based data or speech data provided over a telephone, mobile device, via a computer and...
US20080052069 INTEGRATED SPEECH RECOGNITION, CLOSED CAPTIONING, AND TRANSLATION SYSTEM AND METHOD  
A system and method that integrates automated voice recognition technology and speech-to-text technology with automated translation and closed captioning technology to provide translations of...
US20070198258 Method and portable device for inputting characters by using voice recognition  
A method for inputting characters by using voice recognition, which is applied to a portable device, and comprises the steps of: collecting at least one external voice to convert into a voice data...
US20120245935 ELECTRONIC DEVICE AND SERVER FOR PROCESSING VOICE MESSAGE  
An electronic device includes a voice processing unit, a wireless communication unit, and a combining unit. The voice processing unit receives speech signals. The wireless communication unit sends...
US20070136067 Audio dialogue system and voice browsing method  
An audio dialog system and a voice browsing method are described. An audio input unit (12) acquires an audio input signal. Speech recognition means (20) convert the audio input signal into text...
US20120296646 MULTI-MODE TEXT INPUT  
Concepts and technologies are described herein for multi-mode text input. In accordance with the concepts and technologies disclosed herein, content is received. The content can include one or...
US20140379337 METHOD AND SYSTEM FOR TESTING CLOSED CAPTION CONTENT OF VIDEO ASSETS  
A method and system for monitoring video assets provided by a multimedia content distribution network includes testing closed captions provided in output video signals. A video and audio portion...
US20120143606 METHOD AND SYSTEM FOR TESTING CLOSED CAPTION CONTENT OF VIDEO ASSETS  
A method and system for monitoring video assets provided by a multimedia content distribution network includes testing closed captions provided in output video signals. A video and audio portion...
US20130117022 Personalized Vocabulary for Digital Assistant  
Methods, systems, and computer readable storage medium related to operating an intelligent digital assistant are disclosed. A text string is obtained from a speech input received from a user. The...
US20130066630 AUDIO TRANSCRIPTION GENERATOR AND EDITOR  
A system for correcting errors in automatically generated audio transcriptions includes an audio recorder, a computerized transcription generator, a voice recording, a collection of link data,...
US20060259301 High quality thai text-to-phoneme converter  
An improved, high-quality Thai text-to-phoneme converter. Syllabification is performed strictly according to the Thai pronunciation rules. Initial vowels, Thai syllable structures, special vowels,...
US20080071533 Automatic generation of statistical language models for interactive voice response applications  
A Statistical Language Model (SLM) that can be used in an ASR for Interactive Voice Response (IVR) systems in general and Natural Language Speech Applications (NLSAs) in particular can be created...
US20080208596 AUTOMATED INTERPRETATION OF CLINICAL ENCOUNTERS WITH CULTURAL CUES  
A method, system and a computer program product for an automated interpretation and translation are disclosed. An automated interpretation occurs by receiving language-based content from a user....
US20090138262 SYSTEMS AND METHODS TO INDEX AND SEARCH VOICE SITES  
A method comprises crawling and indexing voice sites and storing results in an index; receiving a search request in voice from a user via a telephone; performing speech recognition on the voice...
US20050240406 Speech recognition computing device display with highlighted text  
A computing device having a display screen and a speech recognition module, and related method of operation. Received audio input is processed with the speech recognition module to convert the...
US20110173267 Spoken email-audio file integrated with text message as a new way of email for communication  
A method of creating, transferring and comprehending the audio file integrated with text message in email system as a new way of email for communication. It makes working with email much more...
US20060195319 Method for converting phonemes to written text and corresponding computer system and computer program  
Method for converting phonemes to written text and corresponding computer system and computer program. In languages having a low correspondence between sounds and letters, converting phonemes to...
US20110270609 REAL-TIME SPEECH-TO-TEXT CONVERSION IN AN AUDIO CONFERENCE SESSION  
Various embodiments of systems, methods, and computer programs are disclosed for providing real-time resources to participants in an audio conference session. One embodiment is a method for...
US20150106089 Name Based Initiation of Speech Recognition  
A computer-implemented method includes listening for audio name information indicative of a name of a computer, with the computer configured to listen for the audio name information in a first...