Document  Document Title
8554565 Speech segment processor  
According to one embodiment, a speech synthesizer generates a speech segment sequence and synthesizes speech by connecting speech segments of the generated speech segment sequence. If a speech...
8548809 Voice guidance system and voice guidance method using the same  
A voice guidance system for providing a guidance by voice concerning operations of an information processing apparatus, comprises a detector that detects that a predetermined function of the...
8542839 Audio processing apparatus and method of mobile device  
An audio processing apparatus and method for a mobile device are provided. The audio processing apparatus and method may appropriately determine sound source localizations corresponding to a voice...
8538743 Disambiguating text that is to be converted to speech using configurable lexeme based rules  
A software language including language constructs for disambiguating text that is to be converted to speech using configurable lexeme based rules. The language can include at least one conditional...
8527276 Speech synthesis using deep neural networks  
A method and system is disclosed for speech synthesis using deep neural networks. A neural network may be trained to map input phonetic transcriptions of training-time text strings into...
8527281 Method and apparatus for sculpting synthesized speech  
Methods and systems for sculpting synthesized speech using a graphic user interface are disclosed. An operator enters a stream of text that is used to produce a stream of target phonetic-units....
8527258 Simultaneous interpretation system  
A simultaneous interpretation system includes headsets for inputting and outputting voice, and a portable terminal for receiving an original language voice speech signal to be interpreted that is...
8527283 Method and apparatus for estimating high-band energy in a bandwidth extension system  
A method (100) includes receiving (101) an input digital audio signal comprising a narrow-band signal. The input digital audio signal is processed (102) to generate a processed digital audio...
8527275 Transforming a tactually selected user input into an audio output  
A contextual input device includes a plurality of tactually discernable keys disposed in a predetermined configuration which replicates a particular relationship among a plurality of items...
8527273 Systems and methods for determining the N-best strings  
Systems and methods for identifying the N-best strings of a weighted automaton. A potential for each state of an input automaton to a set of destination states of the input automaton is first...
8521535 Biochemical analyzer having microprocessing apparatus with expandable voice capacity  
A biochemical analyzer having a microprocessing apparatus with expandable voice capacity is characterized in that a driving module is installed in a data processor and a voice carrier is...
8521513 Localization for interactive voice response systems  
A language-neutral speech grammar extensible markup language (GRXML) document and a localized response document are used to build a localized GRXML document. The language-neutral GRXML document...
8515749 Speech-to-speech translation  
Systems and methods for facilitating communication including recognizing speech in a first language represented in a first audio signal; forming a first text representation of the speech;...
8515759 Apparatus and method for synthesizing an output signal  
An apparatus for synthesizing a rendered output signal having a first audio channel and a second audio channel includes a decorrelator stage for generating a decorrelator signal based on a downmix...
8510112 Method and system for enhancing a speech database  
A system, method and computer readable medium that enhances a speech database for speech synthesis is disclosed. The method may include labeling audio files in a primary speech database,...
8510113 Method and system for enhancing a speech database  
A system, method and computer readable medium that enhances a speech database for speech synthesis is disclosed. The method may include labeling audio files in a primary speech database,...
8504368 Synthetic speech text-input device and program  
A synthetic speech text-input device is provided that allows a user to intuitively know an amount of an input text that can be fit in a desired duration. A synthetic speech text-input device 1...
8498860 Modulation device, modulation method, demodulation device, and demodulation method  
A modulation device including: a modulation unit for modulating a carrier in an audible sound range by an encoded transmission signal to generate a modulated signal; a masker sound generation unit...
8498867 Systems and methods for selection and use of multiple characters for document narration  
Disclosed are techniques and systems to provide a narration of a text in multiple different voices. Further disclosed are techniques and systems for generating an audible output in which different...
8498866 Systems and methods for multiple language document narration  
Disclosed are techniques and systems to provide a narration of a text in multiple different languages where the portions of the text narrated using the different voices associated with different...
8494849 Method and apparatus for transmitting speech data to a remote device in a distributed speech recognition system  
A method of transmitting speech data to a remote device in a distributed speech recognition system, includes the steps of: dividing an input speech signal into frames; calculating, for each frame,...
8489400 System and method for audibly presenting selected text  
Disclosed herein are methods for presenting speech from a selected text that is on a computing device. This method includes presenting text on a touch-sensitive display and having that text size...
8484026 Portable audio control system and audio control device thereof  
A portable audio control system that controls an audio signal transmitted from an electronic device, including an earphone device and an audio control device. The audio control device includes an...
8484035 Modification of voice waveforms to change social signaling  
A method of altering a social signaling characteristic of a speech signal. A statistically large number of speech samples created by different speakers in different tones of voice are evaluated to...
8484027 Method for live remote narration of a digital book  
A method for narrating a digital book includes retrievably storing first data relating to narration of the digital book by a first end-user. The first data is then provided to a user device having...
8484028 Systems and methods for document navigation with a text-to-speech engine  
A system for visually navigating a document in conjunction with a text-to-speech (“TTS”) engine presents a visual display of a region of interest that is related to the text of the document that is...
8478582 Server for automatically scoring opinion conveyed by text message containing pictorial-symbols  
A server is disclosed for computing a score of an opinion that a message in a text file is expected to convey regarding a subject to be evaluated, wherein the message is written using literal...
8478597 Method and system for assessing pronunciation difficulties of non-native speakers  
The present disclosure presents a useful metric for assessing the relative difficulty which non-native speakers face in pronouncing a given utterance and a method and systems for using such a...
8468020 Speech synthesis apparatus and method wherein more than one speech unit is acquired from continuous memory region by one access  
An apparatus for synthesizing a speech including a waveform memory that stores a plurality of speech unit waveforms, an information memory that correspondingly stores speech unit information and...
8468017 Multi-stage quantization method and device  
The invention discloses a multi-stage quantization method, which includes the following steps: obtaining a reference codebook according to a previous stage codebook; obtaining a current stage...
8456420 Audible list traversal  
Many embodiments may comprise logic such as hardware and/or code to implement a user interface for traversal of long sorted lists, via audible mapping of the lists, using sensor-based gesture...
8457967 Automatic evaluation of spoken fluency  
A procedure to automatically evaluate the spoken fluency of a speaker by prompting the speaker to talk on a given topic, recording the speaker's speech to get a recorded sample of speech, and then...
8452600 Assisted reader  
An electronic reading device for reading ebooks and other digital media items combines a touch surface electronic reading device with accessibility technology to provide a visually impaired user...
8447592 Methods and apparatus for formant-based voice systems  
In one aspect, a method of processing a voice signal to extract information to facilitate training a speech synthesis model is provided. The method comprises acts of detecting a plurality of...
8447604 Method and apparatus for processing scripts and related data  
Provided in some embodiments is a method including receiving ordered script words that are indicative of dialogue words to be spoken, receiving audio data corresponding to at least a portion of the...
8447609 Adjustment of temporal acoustical characteristics  
Embodiments may be a standalone module or part of mobile devices, desktop computers, servers, stereo systems, or any other systems that might benefit from condensed audio presentations of item...
8447613 Robot and server with optimized message decoding  
A method for optimizing message transmission and decoding comprises: reading data from a memory of an originating device, the data comprising information regarding the originating device; encoding...
8447610 Method and apparatus for generating synthetic speech with contrastive stress  
Techniques for generating synthetic speech with contrastive stress. In one aspect, a speech-enabled application generates a text input including a text transcription of a desired speech output,...
8442423 Testing within digital media items  
A digital media item, such as an electronic book (eBook), may include testing content. The testing content may include questions about the content of the digital media item. When a user is...
8433575 Augmenting an audio signal via extraction of musical features and obtaining of media fragments  
A system and method is described in which a multimedia story is rendered to a consumer in dependence on features extracted from an audio signal representing for example a musical selection of the...
8433573 Prosody modification device, prosody modification method, and recording medium storing prosody modification program  
A prosody modification device includes: a real voice prosody input part that receives real voice prosody information extracted from an utterance of a human; a regular prosody generating part that...
8433574 Hosted voice recognition system for wireless devices  
Methods, systems, and software for converting the audio input of a user of a hand-held client device or mobile phone into a textual representation by means of a backend server accessed by the...
8433369 Mobile terminal and method of using text data obtained as result of voice recognition  
A mobile terminal has a sound obtaining unit configured to obtain a sound signal; a voice recognition unit configured to recognize the sound signal and convert the sound signal into a text data; a...
8428952 Text-to-speech user's voice cooperative server for instant messaging clients  
A system and method to allow an author of an instant message to enable and control the production of audible speech to the recipient of the message. The voice of the author of the message is...
8423365 Contextual conversion platform  
A contextual conversion platform, and method for converting text-to-speech, are described that can convert content of a target to spoken content. Embodiments of the contextual conversion platform...
8423366 Automatically training speech synthesizers  
A method includes receiving, by a system, a voice recording associated with a user, transcribing, the voice recording into text that includes a group of words, and storing an association between a...
8422641 Distributed record server architecture for recording call sessions over a VoIP network  
Devices, systems, and methods for recording call sessions over a VoIP network using a distributed record server architecture are disclosed. An example recording device for recording segments of a...
8412528 Back-end database reorganization for application-specific concatenative text-to-speech systems  
The present invention relates to computer-generated text-to-speech conversion. It relates in particular to a method and system for updating a Concatenative Text-To-Speech (CTTS) system with a...
8412529 Method and system for enhancing verbal communication sessions  
An approach is provided for enhancing verbal communication sessions. A verbal component of a communication session is converted into textual information. The converted textual information is...
8401856 Automatic normalization of spoken syllable duration  
A very common problem is that when people speak a language other than the one to which they are accustomed, syllables can be spoken for longer or shorter than the listener would regard as...