Document  Document Title
9043213 Speech recognition and synthesis utilizing context dependent acoustic models containing decision trees  
A speech recognition method including the steps of receiving a speech input from a known speaker of a sequence of observations and determining the likelihood of a sequence of words arising from...
9037466 Email administration for rendering email on a digital audio player  
Methods, systems, and computer program products are provided for email administration for rendering email on a digital audio player. Embodiments include retrieving an email message; extracting...
9037467 Speech effects  
A method of complementing a spoken text. The method including receiving text data representative of a natural language text, receiving effect control data including at least one effect control...
9026445 Text-to-speech user's voice cooperative server for instant messaging clients  
A system and method to allow an author of an instant message to enable and control the production of audible speech to the recipient of the message. The voice of the author of the message is...
9020821 Apparatus and method for editing speech synthesis, and computer readable medium  
An acquisition unit analyzes a text, and acquires phonemic and prosodic information. An editing unit edits a part of the phonemic and prosodic information. A speech synthesis unit converts the...
9009051 Apparatus, method, and program for reading aloud documents based upon a calculated word presentation order  
According to one embodiment, a reading aloud support apparatus includes a reception unit, a first extraction unit, a second extraction unit, an acquisition unit, a generation unit, a presentation...
9009050 System and method for cloud-based text-to-speech web services  
Disclosed herein are systems, methods, and non-transitory computer-readable storage media for generating speech. One variation of the method is from a server side, and another variation of the...
9009042 Machine translation of indirect speech  
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating direct speech messages based on voice commands that include indirect speech...
9009040 Training a transcription system  
According to certain embodiments, training a transcription system includes accessing recorded voice data of a user from one or more sources. The recorded voice data comprises voice samples. A...
9009200 Method of searching text based on two computer hardware processing properties: indirect memory addressing and ASCII encoding  
A method and process for searching and inserting a word or set of words in a large data set for real-time data intensive search applications using memory banks is disclosed. Traditional search...
9008598 Broadcast channel identification  
An apparatus including a memory for associating at least one user defined channel identifier with at least one selection item of the apparatus and a control unit coupled to the memory, the control...
9002712 Voice-interactive marketplace providing promotion and promotion tracking, loyalty reward and redemption, and other features  
The invention provides a system, method, and business model for an information system and service having business self-promotion, promotion and promotion tracking, loyalty or frequent participant...
9002703 Community audio narration generation  
The community-based generation of audio narrations for a text-based work leverages collaboration of a community of people to provide human-voiced audio readings. During the community-based...
8996384 Transforming components of a web page to voice prompts  
Embodiments of the invention address the deficiencies of the prior art by providing a method, apparatus, and program product for converting components of a web page to voice prompts for a user....
8996377 Blending recorded speech with text-to-speech output for specific domains  
A text-to-speech (TTS) engine combines recorded speech with synthesized speech from a TTS synthesizer based on text input. The TTS engine receives the text input and identifies the domain for the...
8996376 Intelligent text-to-speech conversion  
Techniques for improved text-to-speech processing are disclosed. The improved text-to-speech processing can convert text from an electronic document into an audio output that includes speech...
8990089 Text to speech synthesis for texts with foreign language inclusions  
A speech output is generated from a text input written in a first language and containing inclusions in a second language. Words in the native language are pronounced with a native pronunciation...
8990087 Providing text to speech from digital content on an electronic device  
A method for providing text to speech from digital content in an electronic device is described. Digital content including a plurality of words and a pronunciation database is received....
8983842 Apparatus, process, and program for combining speech and audio data  
There is provided a speech processing apparatus including: a data obtaining unit which obtains music progression data defining a property of one or more time points or one or more time periods...
8983835 Electronic device and server for processing voice message  
An electronic device includes a voice processing unit, a wireless communication unit, and a combining unit. The voice processing unit receives speech signals. The wireless communication unit sends...
8977555 Identification of utterance subjects  
Features are disclosed for generating markers for elements or other portions of an audio presentation so that a speech processing system may determine which portion of the audio presentation a...
8977551 Parametric speech synthesis method and system  
The present invention provides a parametric speech synthesis method and a parametric speech synthesis system. The method comprises sequentially processing each frame of speech of each phone in a...
8977552 Method and system for enhancing a speech database  
A system, method and computer readable medium that enhances a speech database for speech synthesis is disclosed. The method may include labeling audio files in a primary speech database,...
8972260 Speech recognition using multiple language models  
In accordance with one embodiment, a method of generating language models for speech recognition includes identifying a plurality of utterances in training data corresponding to speech, generating...
8965767 System and method for synthetic voice generation and modification  
Disclosed herein are systems, methods, and non-transitory computer-readable storage media for generating a synthetic voice. A system configured to practice the method combines a first database of...
8965761 Differential dynamic content delivery with text display in dependence upon simultaneous speech  
Differential dynamic content delivery including providing a session document for a presentation, wherein the session document includes a session grammar and a session structured document;...
8965768 System and method for automatic detection of abnormal stress patterns in unit selection synthesis  
Disclosed herein are systems, methods, and non-transitory computer-readable storage media for detecting and correcting abnormal stress patterns in unit-selection speech synthesis. A system...
8965769 Markup assistance apparatus, method and program  
According to one embodiment, a markup assistance apparatus includes an acquisition unit, a first calculation unit, a detection unit and a presentation unit. The acquisition unit acquires a feature...
8959021 Single interface for local and remote speech synthesis  
Features are disclosed for providing a consistent interface for local and distributed text to speech (TTS) systems. Some portions of the TTS system, such as voices and TTS engine components, may...
8954328 Systems and methods for document narration with multiple characters having multiple moods  
Disclosed are techniques and systems to provide a narration of a text in multiple different voices. Further disclosed are techniques and systems for providing a plurality of characters at least...
8954329 Methods and apparatus for acoustic disambiguation by insertion of disambiguating textual information  
Techniques for disambiguating at least one text segment from at least one acoustically similar word and/or phrase. The techniques include identifying at least one text segment, in a textual...
8954335 Speech translation system, control device, and control method  
Appropriate processing results or appropriate apparatuses can be selected with a control device that selects the most probable speech recognition result by using speech recognition scores received...
8949122 Stored phrase reutilization when testing speech recognition  
A set of audio phrases and corresponding phrase characteristics can be maintained, such as in a database. The phrase characteristics can include a translation of speech in the associated audio...
8947683 Printing apparatus and method for controlling printing apparatus  
A method for controlling a printing apparatus includes holding a plurality of jobs including a cover job having print data for a cover and a content job having print data for content, which are...
8949128 Method and apparatus for providing speech output for speech-enabled applications  
Techniques for providing speech output for speech-enabled applications. A synthesis system receives from a speech-enabled application a text input including a text transcription of a desired...
8949126 Creating statistical language models for spoken CAPTCHAs  
Methods for creating statistical language models (SLMs) for spoken Completely Automated Turing Tests for Telling Computers and Humans Apart (CAPTCHAs) are disclosed. In these methods, candidate...
8942983 Method of speech synthesis  
The present invention relates to a method of text-based speech synthesis, wherein at least one portion of a text is specified; the intonation of each portion is determined; target speech sounds...
8942982 Semiconductor integrated circuit device and electronic instrument  
A semiconductor integrated circuit device including: a storage section which temporarily stores a command and text data input from the outside; a speech synthesis section which synthesizes a...
8934652 Visual presentation of speaker-related information  
Techniques for ability enhancement are described. Some embodiments provide an ability enhancement facilitator system (“AEFS”) configured to determine and present speaker-related information based...
8935385 System and method of multimodality-appended rich media comments  
A system of multimodality-appended rich media comments is provided. The system includes a server and an electronic device. The electronic device includes a network access module, at least one...
8930192 Computer-based grapheme-to-speech conversion using a pointing device  
Methods, systems and apparatus for a computer-based grapheme-to-speech conversion using a pointing device. In one aspect the method of grapheme-to-speech conversion comprises the steps of...
8924216 System and method for synchronizing sound and manually transcribed text  
A method for synchronizing sound data and text data, said text data being obtained by manual transcription of said sound data during playback of the latter. The proposed method comprises the steps...
8918313 Replay apparatus, signal processing apparatus, and signal processing method  
A method of selectively performing signal processing in a first mode and in a second mode. In the first mode, a noise cancel signal having a signal characteristic to cancel an external noise...
8918322 Personalized text-to-speech services  
A personalized text-to-speech (pTTS) system provides a method for converting text data to speech data utilizing a pTTS template representing the voice characteristics of an individual. A memory...
8918323 Contextual conversion platform for generating prioritized replacement text for spoken content output  
A contextual conversion platform, and method for converting text-to-speech, are described that can convert content of a target to spoken content. Embodiments of the contextual conversion platform...
8914290 Systems and methods for dynamically improving user intelligibility of synthesized speech in a work environment  
Method and apparatus that dynamically adjusts operational parameters of a text-to-speech engine in a speech-based system. A voice engine or other application of a device provides a mechanism to...
8914291 Method and apparatus for generating synthetic speech with contrastive stress  
Techniques for generating synthetic speech with contrastive stress. In one aspect, a speech-enabled application generates a text input including a text transcription of a desired speech output,...
8914277 Speech and language translation of an utterance  
According to example configurations, a speech-processing system parses an uttered sentence into segments. The speech-processing system translates each of the segments in the uttered sentence into...
8909528 Method and system for prompt construction for selection from a list of acoustically confusable items in spoken dialog systems  
A method (and system) of determining confusable list items and resolving this confusion in a spoken dialog system includes receiving user input, processing the user input and determining if a list...
8909538 Enhanced interface for use with speech recognition  
Improved methods of presenting speech prompts to a user as part of an automated system that employs speech recognition or other voice input are described. The invention improves the user interface...