Match Document Document Title
8024193 Methods and apparatus related to pruning for concatenative text-to-speech synthesis  
The present invention provides, among other things, automatic identification of near-redundant units in a large TTS voice table, identifying which units are distinctive enough to keep and which...
8024192 Time-warping of decoded audio signal after packet loss  
A technique is described for use in a decoder configured to decode a series of frames representing an encoded audio signal. The technique is for transitioning between a lost frame and one or more...
8024194 Dynamic switching between local and remote speech rendering  
A multimodal browser for rendering a multimodal document on an end system defining a host can include a visual browser component for rendering visual content, if any, of the multimodal document,...
8019605 Reducing recording time when constructing a concatenative TTS voice using a reduced script and pre-recorded speech assets  
The present invention discloses a system and a method for creating a reduced script, which is read by a voice talent to create a concatenative text-to-speech (TTS) voice. The method can...
8019276 Audio transmission method and system  
An audio transmission method and system. The method includes detecting by a computing system, a wireless device belonging to a user. The computing system enables a connection between the wireless...
8015012 Data-driven global boundary optimization  
Portions from segment boundary regions of a plurality of speech segments are extracted. Each segment boundary region is based on a corresponding initial unit boundary. Feature vectors that...
8014650 Feedback of out-of-range signals  
Briefly, in accordance with one or more embodiments, relatively immediate feedback may be provided for out of range signals. Embodiments may include, subsequent to an edit of at least a portion of...
8015009 Speech derived from text in computer presentation applications  
A computer system comprising hardware and software elements; the hardware elements including a processor, a display means and a speaker, the software elements comprising a speech synthesizer, a...
8015011 Generating objectively evaluated sufficiently natural synthetic speech from text by using selective paraphrases  
A synthetic speech system includes a phoneme segment storage section for storing multiple phoneme segment data pieces; a synthesis section for generating voice data from text by reading phoneme...
8010368 Surgical system controlling apparatus and surgical system controlling method  
In this invention, a voice recognition engine 110 outputs to a controlling section 103 a matching state of a voice input signal as an error code. Then, the controlling section 103 determines the...
8005678 Re-phasing of decoder states after packet loss  
A technique is described herein for updating a state of a decoder configured to decode a series of frames representing an encoded audio signal. In accordance with the technique, an output audio...
8005676 Speech analysis using statistical learning  
Included are embodiments for providing speech analysis. At least one embodiment of a method includes receiving audio data associated with a communication and providing the at least one phoneme in...
7991616 Speech synthesizer  
The present invention is a speech synthesizer that generates speech data of text including a fixed part and a variable part, in combination with recorded speech and rule-based synthetic speech....
7987093 Speech synthesizing device, speech synthesizing system, language processing device, speech synthesizing method and recording medium  
A speech synthesizing device, the device includes: a text accepting unit for accepting text data; an extracting unit for extracting a special character including a pictographic character, a face...
7983918 Audio instruction system and method  
A device and method for assisting a human user in performing processes includes a speaker that provides audible instructions to the user corresponding to multiple tasks associated with performing...
7983399 Remote notification system and method and intelligent agent therefor  
The invention relates to remote access systems and methods using automatic speech recognition to access a computer system. The invention also relates to an intelligent agent resident on the...
7966186 System and method for blending synthetic voices  
A system and method for generating a synthetic text-to-speech TTS voice are disclosed. A user is presented with at least one TTS voice and at least one voice characteristic. A new synthetic TTS...
7966185 Application of emotion-based intonation and prosody to speech in text-to-speech systems  
A text-to-speech system that includes an arrangement for accepting text input, an arrangement for providing synthetic speech output, and an arrangement for imparting emotion-based features to...
7962341 Method and apparatus for labelling speech  
A method for the prosodic labelling of speech including performing a first analysis step using data from an audio file, wherein the audio file is analysed as a plurality of frames positioned at...
7953600 System and method for hybrid speech synthesis  
A speech synthesis system receives symbolic input describing an utterance to be synthesized. In one embodiment, different portions of the utterance are constructed from different sources, one of...
7953590 Using separate recording channels for speech-to-speech translation systems  
A system and method for speech-to-speech translation using a translation system includes designating separate input channels for each of a plurality of speakers. In response to speech from a first...
7953601 Method and apparatus for preparing a document to be read by text-to-speech reader  
There is disclosed a method and system for preparing a document to be read by a text-to-speech reader. The method can include identifying two or more voice types available to the text-to-speech...
7949520 Adaptive filter pitch extraction  
An enhancement system extracts pitch from a processed speech signal. The system estimates the pitch of voiced speech by deriving filter coefficients of an adaptive filter and using the obtained...
7949651 Disambiguating residential listing search results  
A directory assistance system includes a directory database and a search engine. The search engine is configured to search the directory database for a first set of residential listings based on...
7945447 Sound coding device and sound coding method  
A sound coding device having a monaural/stereo scalable structure and capable of efficiently coding stereo sound. even when the correlation between the channel signals of a stereo signal is small....
7941795 System for updating and outputting speech data  
A system for outputting speech from speech data that may include an application, an internal speech data module, and an external speech data module is provided. The internal speech module stores...
7933772 System and method for triphone-based unit selection for visual speech synthesis  
A system and method for generating a video sequence having mouth movements synchronized with speech sounds are disclosed. The system utilizes a database of n-phones as the smallest selectable...
7930172 Global boundary-centric feature extraction and associated discontinuity metrics  
Portions from time-domain speech segments are extracted. Feature vectors that represent the portions in a vector space are created. The feature vectors incorporate phase information of the...
7925304 Audio manipulation systems and methods  
An audio manipulation method is provided. A call associated with a mobile device is received. Whether an audio manipulation flag is set for the mobile device is determined. An audio signal from...
7921013 System and method for sending multi-media messages using emoticons  
A system and method of providing sender-customization of multi-media messages through the use of emoticons is disclosed. The sender inserts the emoticons into a text message. As an animated face...
7917352 Language processing system  
A language processing system including: a forbidden word memory part that stores a forbidden word; a sequence candidate generator that generates a plurality of word sequence candidates where each...
7912727 Apparatus and method for integrated phrase-based and free-form speech-to-speech translation  
An apparatus and method that integrates both phrase-based and free-form speech-to-speech translation approaches using probability models. The starting step of the method is to receive vocal...
7912718 Method and system for enhancing a speech database  
A system, method and computer readable medium that enhances a speech database for speech synthesis is disclosed. The method may include labeling audio files in a primary speech database and a...
7912708 Method for controlling duration in speech synthesis  
The present invention relates to a method of synthesizing of a speech signal, comprising: —assigning of a first identifier to a first class of intervals of an original speech signal and assigning...
7912719 Speech synthesis device and speech synthesis method for changing a voice characteristic  
A speech synthesis device, in which the sound quality is not significantly degraded when generating a synthesized sound, includes a target element information generation unit (102), an element...
7908143 Dialog call-flow optimization  
The present invention is concerned with reorganizing dialog call-flow in the presence of resource constraints. A call-flow has a set of dialogs. The set of grammars in a given call-flow set of...
7899672 Method and system for generating synthesized speech based on human recording  
A method and system that incorporates human recording with a TTS system to generate synthesized speech with high quality by searching over a database of pre-recorded utterances to select an...
7895041 Text to speech interactive voice response system  
A text to speech interactive voice response system is operable within a personal computer having a processor, data storage means and an operating system. The system comprises an input subsystem...
7890331 System and method for generating audio-visual summaries for audio-visual program content  
The invention describes a system (1) for generating audio-visual summaries for audio-visual program content (3). The system comprises a search unit (4) for locating a pre-generated text summary...
7890330 Voice recording tool for creating database used in text to speech synthesis system  
A method records verbal expressions of a person for use in a vehicle navigation system. The vehicle navigation system has a database including a map and text describing street names and points of...
7890332 Information processing apparatus and user interface control method  
An information processing apparatus can set one of a plurality of setting values for a setting item. A guidance holding unit holds guidance information for voice output for each of the plurality...
7885815 System and method for access to multimedia structures  
A system for access to multimedia structures has telephone sets capable of connecting to a telephone network, a storage device capable of storing a plurality of multimedia structures representing...
7881934 Method and system for adjusting the voice prompt of an interactive system based upon the user's state  
The voice prompt of an interactive system is adjusted based upon a state of a user. An utterance of the user is received, and the state of the user is determined based upon signal processing of...
7873520 Method and apparatus for tagtoe reminders  
A network-based text-to-speech (TTS) TagToe alert system is configured to take a user's textual and/or multimedia input to a TagToe user interface to schedule delivery of text-to-speech-converted...
7869999 Systems and methods for selecting from multiple phonectic transcriptions for text-to-speech synthesis  
A system and method for generating synthetic speech, which operates in a computer implemented Text-To-Speech system. The system comprises at least a speaker database that has been previously...
7869577 Remote access system and method and intelligent agent therefor  
The invention relates to remote access systems and methods using automatic speech recognition to access a computer system. The invention also relates to an intelligent agent resident on the...
7870142 Text to grammar enhancements for media files  
A control system in a vehicle for extracting meta data from a digital media storage device over a communication link. The system includes a communication module for establishing a communication...
7865365 Personalized voice playback for screen reader  
A method, system, and computer program product is disclosed for customizing a synthesized voice based upon audible input voice data. The input voice data is typically in the form of one or more...
7865366 Data preparation for media browsing  
A system is described which includes a content retriever to retrieve and format data and a media file playlist generated by the content retriever from the data. The media file playlist includes a...
7860705 Methods and apparatus for context adaptation of speech-to-speech translation systems  
A technique for context adaptation of a speech-to-speech translation system is provided. A plurality of sets of paralinguistic attribute values is obtained from a plurality of input signals. Each...