Match Document Document Title
7617282 Apparatus for converting e-mail data into audio data and method therefor  
An apparatus and method are provided for converting an e-mail data into an audio data. The apparatus includes a communication connector connected with a communication line for controlling a...
7617105 Converting text-to-speech and adjusting corpus  
A method for text to speech conversion and for adjusting a corpus. The method includes a text analysis step for parsing the text to obtain descriptive prosody annotations of the text based on a TTS...
7613613 Method and system for converting text to lip-synchronized speech in real time  
A method and system for presenting lip-synchronized speech corresponding to the text received in real time is provided. A lip synchronization system provides an image of a character that is to be...
7610202 Integrated system and method for mobile audio playback and dictation  
A method and system provides for a single-pass review and feedback of a document. During audio playback of the document to be reviewed, voice-activated recording of feedback and submission of...
7610201 Method and apparatus for synthesizing speech  
A method for synthesizing speech includes an obtaining step of obtaining a speech message, and a resuming step of resuming speech output of the speech message according to resumption data...
7606710 Method for text-to-pronunciation conversion  
A method for text-to-pronunciation conversion includes a process for searching grapheme-phoneme segments and a three-stage process of text-to-pronunciation conversion. This method looks for a...
7603278 Segment set creating method and apparatus  
A segment set before updating is read, and clustering considering a phoneme environment is performed to it. For each cluster obtained by the clustering, a representative segment of a segment set...
7590542 Method of generating test scripts using a voice-capable markup language  
A method is disclosed for generating test scripts for testing an IVR from an IVR's voice-capable markup language application. The test scripts can be generated directly from the voice-capable...
7590540 Method and system for statistic-based distance definition in text-to-speech conversion  
A method for distance definition in a text-to-speech conversion system by applying Gaussian Mixture Model (GMM) to a distance definition. According to an embodiment, the text that is to be...
7590539 System and method for email notification  
Email subscribers are notified of the receipt of new email messages when they are not at their computers via voice or page. An email notification server polls the email server corresponding to the...
7584105 Method and system for aligning natural and synthetic video to speech synthesis  
According to MPEG-4's TTS architecture, facial animation can be driven by two streams simultaneously—text, and Facial Animation Parameters. In this architecture, text input is sent to a...
7584104 Method and system for training a text-to-speech synthesis system using a domain-specific speech database  
A system, method and computer readable medium that trains a text-to-speech synthesis system for use in speech synthesis is disclosed. The method may include recording audio files of one or more...
7584103 Automated extraction of semantic content and generation of a structured document from speech  
Techniques are disclosed for automatically generating structured documents based on speech, including identification of relevant concepts and their interpretation. In one embodiment, a structured...
7580842 System and method of providing a spoken dialog interface to a website  
Disclosed is a system and method for generating a spoken dialog service from website data. Spoken dialog components typically include an automatic speech recognition module, a language...
7577569 Combined speech recognition and text-to-speech generation  
Text-to-speech (TTS) generation is used in conjunction with large vocabulary speech recognition to say words selected by the speech recognition. The software for performing the large vocabulary...
7577568 Methods and system for creating voice files using a VoiceXML application  
Methods and systems for automating the assembly or creation of audio files for providing to listeners or for use in voice interactive services are provided. A voice application script is prepared...
7574361 Radio audio indicator  
A user interface for a communication device includes a light emitting diode (LED) ( 200 ) providing both a transmit-carrier indicator and transmit-audio feedback to the user. By varying the...
7574360 Unit selection module and method of chinese text-to-speech synthesis  
A unit selection module for Chinese Text-to-Speech (TTS) synthesis includes a probabilistic context free grammar (PCFG) parser, a latent semantic indexing (LSI) module, and a modified...
7571099 Voice synthesis device  
A voice synthesis device for generating synthetic voice having great freedom in voice quality and good sound quality from text data is provided. The voice synthesis device is provided with: voice...
7567908 Differential dynamic content delivery with text display in dependence upon simultaneous speech  
Differential dynamic content delivery including providing a session document for a presentation, wherein the session document includes a session grammar and a session structured document; selecting...
7567905 Method for identifying and verifying an element using a voice system  
An identification and verification method for identifying and verifying an element. The method uses a voice system. The method includes the steps of associating at least one of an at least 4 unit...
7565293 Seamless hybrid computer human call service  
A Voice User Interface is provided for interactively responding in a synthesized voice to a call from a human caller, a Text to Speech system by which text entered by an agent and interactive data...
7565292 Quantitative model for formant dynamics and contextually assimilated reduction in fluent speech  
A method of identifying a sequence of formant trajectory values is provided in which a sequence of target values are identified for a formant as step functions. The target values and the duration...
7558733 System and method for dialog caching  
A system for providing efficient data handling for automated dialogs and the resulting data is disclosed. The system for providing efficient data handling for automated dialogs and the resulting...
7558732 Method and system for computer-aided speech synthesis  
Method and system for computer-aided speed synthesis for synthesizing electronic text by performing a predefined series of rules-based analyses in a predefined order, each of the analyses operating...
7558727 Method of synthesis for a steady sound signal  
The present invention relates to a method of synthesizing a first sound signal based on a second sound signal, the first sound signal having a required first fundamental frequency and the second...
7552052 Voice synthesis apparatus and method  
A plurality of voice segments, each including one or more phonemes are acquired in a time-serial manner, in correspondence with desired singing or speaking words. As necessary, a boundary is...
7548977 Client / server application task allocation based upon client resources  
A software method for allocating application tasks between a client and a server can include the step of detecting client-based computing resources for executing at least one application task. At...
7548858 System and method for selective audible rendering of data to a user based on user input  
A method of rendering information is provided and includes rendering data to a user, identifying a first object and a second object in a query, and accessing the document to identify semantic tags...
7546241 Speech synthesis method and apparatus, and dictionary generation method and apparatus  
In a speech synthesis process, micro-segments are cut from acquired waveform data and a window function. The obtained micro-segments are re-arranged to implement a desired prosody, and superposed...
7529672 Speech synthesis using concatenation of speech waveforms  
A method of synthesizing a speech signal by providing a first speech unit signal having an end interval and a second speech unit signal having a front interval, wherein at least some of the periods...
7526430 Speech synthesis apparatus  
A speech synthesis apparatus, which can embed unchangeable additional information into synthesized speech without causing a deterioration of speech quality and restriction by bands, includes a...
7523036 Text-to-speech synthesis system  
The present invention is intended to provide a text-to-speech synthesis apparatus, including a storage for storing phoneme data of a plurality of speakers; a selector for selecting one of the...
7523035 Hands-free circuit and method for communicating with a wireless device  
A hands-free circuit ( 10 ) and method produces audio information ( 90 ) corresponding to voice tag information ( 60 ) stored either in the hands-free circuit ( 10 ) or in a wireless device ( 320...
7516073 Electronic-book read-aloud device and electronic-book read-aloud method  
A control unit of an electronic-book read-aloud device reads book data and electronic-book data from an electronic bookmark and stores the read data in a storage unit. Further, the control unit...
7512217 System and method for communicating with instant messaging clients using a telephone  
Systems and methods allow a user at a telephone to communicate with an instant messaging client. The instant messaging client may be identified by a PIN that is entered on the telephone. Input from...
7505911 Combined speech recognition and sound recording  
A handheld device with both large-vocabulary speech recognition and audio recoding allows users to switch between at least two of the following three modes: (1) recording audio without...
7502740 Communication apparatus  
A communication apparatus includes a registration unit for registering setting data specifying a type of email message to be read, a synthetic-speech output unit for outputting resulting speech...
7502739 Intonation generation method, speech synthesis apparatus using the method and voice server  
In generation of an intonation pattern of a speech synthesis, a speech synthesis system is capable of providing a highly natural speech and capable of reproducing speech characteristics of a...
7499686 Method and apparatus for multi-sensory speech enhancement on a mobile device  
A mobile device is provided that includes a digit input that can be manipulated by a user's fingers or thumb, an air conduction microphone and an alternative sensor that provides an alternative...
7496693 Wireless enabled speech recognition (SR) portable device including a programmable user trained SR profile for transmission to external SR enabled PC  
A method of interacting with a speech recognition (SR)-enabled personal computer (PC) is provided in which a user SR profile is transferred from a wireless-enabled device to the SR-enabled PC....
7496513 Combined input processing for a computing device  
Input is received from at least two different input sources. Information from these sources are combined together to provide a result. In a particular example, input from one source corresponds to...
7496498 Front-end architecture for a multi-lingual text-to-speech system  
A text processing system for processing multi-lingual text for a speech synthesizer includes a first language dependent module for performing at least one of text and prosody analysis on a portion...
7490040 Method and apparatus for preparing a document to be read by a text-to-speech reader  
There is disclosed a method and system for preparing a document to be read by a text-to-speech reader. The method can include identifying two or more voice types available to the text-to-speech...
7490039 Text to speech system and method having interactive spelling capabilities  
A method for audibly spelling a word in an audio file includes playing an audio file to a user, receiving a command to spell a word in the audio file from the user, identifying a textual word in a...
7487440 Reusable voiceXML dialog components, subdialogs and beans  
Systems and methods for building speech-based applications using reusable dialog components based on VoiceXML (Voice eXtensible Markup Language). VoiceXML reusable dialog components can be used for...
7487092 Interactive debugging and tuning method for CTTS voice building  
A speech recognition device which can preferably be used for reducing the memory capacity required for speaker-independent speech recognition is provided. A matching unit loads speech models...
7483834 Method and apparatus for audio navigation of an information appliance  
The invention includes an apparatus and method of providing information using an information appliance coupled to a network. The method includes storing text files in a database at a remote...
7483832 Method and system for customizing voice translation of text to speech  
A method and system of customizing voice translation of a text to speech includes digitally recording speech samples of a known speaker, correlating each of the speech samples with a standardized...
7483522 Sponsored information distribution method and apparatus  
A database having information sought by a consumer, a database containing consumer attributes, and a database of advertising messages are made responsive to telephone calls placed to an interactive...