Match Document Document Title
7200560 Portable reading device with display capability  
A hand held device that captures information with the capability to read only the captured information, display only the captured information, or simultaneously read and display the captured...
7197462 System and method for information access  
To convert the structure of a Web page into contents that a user can easily listen to, and for permitting the user to access and obtain information without have to perform and navigation, in a...
7191132 Speech synthesis apparatus and method  
A speech synthesiser is provided with a dialog-style selection arrangement responsive to a factor affecting intelligibility of speech output by the apparatus to select a dialog style intended to...
7184958 Speech synthesis method  
A speech synthesis method subjects a reference speech signal to windowing to extract a speech pitch wave having a window function of a window length double a pitch period of the reference speech...
7177817 Automatic generation of voice content for a voice response system  
In one embodiment, the invention provides a method for building a voice response system. The method comprises developing voice content for the voice response system, the voice content including...
7177801 Speech transfer over packet networks using very low digital data bandwidths  
A method of communicating speech across a communication link using very low digital data bandwidth is disclosed, having the steps of: translating speech into text at a source terminal;...
7174294 Speech platform architecture  
A speech platform architecture is described that provides standardized methods of interaction for users across multiple speech-enabled applications. Listener objects corresponding to speech-enabled...
7174296 Transcription service stopping automatic transcription  
A transcription system including a transcription device for the automatic transcription of dictated material and additionally employs transcribers who manually transcribe some of the dictated...
7174295 User interface for text to speech conversion  
An electronic device which includes a user interface having a display for displaying text and a speech synthesiser including a loudspeaker, arranged to convert an input, dependent upon a text, to...
7168953 Trainable videorealistic speech animation  
A method and apparatus for videorealistic, speech animation is disclosed. A human subject is recorded using a video camera as he/she utters a predetermined speech corpus. After processing the...
7171363 Pick-by-line system and method  
The invention is a method for picking an object from a source transport device vehicle using a mobile computer with text-to-speech software adapted for communication between a pick-by-line server...
7165030 Concatenative speech synthesis using a finite-state transducer  
A method for concatenative speech synthesis includes a processing stage that selects segments based on their symbolic labeling in an efficient graph-based search, which uses a finite-state...
7165032 Unsupervised data-driven pronunciation modeling  
Pronunciation for an input word is modeled by generating a set of candidate phoneme strings having pronunciations close to the input word in an orthographic space. Phoneme sub-strings in the set...
7162424 Method and system for defining a sequence of sound modules for synthesis of a speech signal in a tonal language  
The invention relates to a method for defining a sequence of sound modules for synthesis of a speech signal in a tonal language corresponding to a sequence of speech modules. The method according...
7143038 Speech synthesis system  
A speech synthesizing system producing a speech of an improved quality of voice by selecting a combination of speech segment most suitable for a synthesis speech unit sequence. The speech...
7139711 Noise filtering utilizing non-Gaussian signal statistics  
The present invention is directed to a method and system for capturing an information signal from within a noisy background utilizing a non-Gaussian model for the a priori statistics of the...
7139710 Audio synthesis of a currently tuned frequency  
A system 10 is provided which is receptive to selective tuning at particular frequencies. The system 10 includes a display device 15 , an audio synthesizer 14 , and a controller 20 . The...
7138982 Information processing apparatus and information output controlling method  
The invention provides an information processing apparatus wherein playback of display data and playback of audio data relating to the display data are changed over in an associated relationship...
7136816 System and method for predicting prosodic parameters  
A method for generating a prosody model that predicts prosodic parameters is disclosed. Upon receiving text annotated with acoustic features, the method comprises generating first classification...
7136811 Low bandwidth speech communication using default and personal phoneme tables  
A voice coding and decoding system 300 and method uses a personal phoneme table ( 320, 344 ) associated with a voice signature identifier ( 348 ) to permit encoding of true sounding voice by...
7133535 System and method for real time lip synchronization  
A novel method for synchronizing the lips of a sketched face to an input voice. The lip synchronization system and method approach is to use training video as much as possible when the input voice...
7127397 Method of training a computer system via human voice input  
A method of training a computer system via human voice input from a human teacher is provided. In one embodiment, the method includes presenting a text spelling of an unknown word and receiving a...
7127396 Method and apparatus for speech synthesis without prosody modification  
A speech synthesizer is provided that concatenates stored samples of speech units without modifying the prosody of the samples. The present invention is able to achieve a high level of naturalness...
7127400 Methods and systems for personal interactive voice response  
A personal interactive voice response system with a web-based interface allowing the user to specify treatment of incoming calls based on voice or touchtone responses provided by the calling party....
7124082 Phonetic speech-to-text-to-speech system and method  
A speech-to-text-to-speech for use with on-line and real time transmission of speech with a small bandwidth from a source to a destination. A speech is received and broken down to phonemes, which...
7124365 Method and device for detecting an event in a program of a video and/or audio signal and for providing the program to a display upon detection of the event  
A device for automatically providing program information to a television display upon detection of an event in the program. The event being a user definable event. The device also provides...
7120583 Information presentation system, information presentation apparatus, control method thereof and computer readable memory  
An information presentation computer receiving news articles distributed from an information distribution computer performs voice synthesis based on text information included in received send data,...
7117159 Method and system for dynamic control over modes of operation of voice-processing in a voice command platform  
A method and system for dynamically controlling a voice-processing mechanism in a voice command platform. The platform receives a specification during a voice command session with a user and...
7117155 Coarticulation method for audio-visual text-to-speech synthesis  
A method for generating animated sequences of talking heads in text-to-speech applications wherein a processor samples a plurality of frames comprising image samples. Representative parameters are...
7113909 Voice synthesizing method and voice synthesizer performing the same  
A stereotypical sentence is synthesized into a voice of an arbitrary speech style. A third party is able to prepare prosody data and a user of a terminal device having a voice synthesizing part can...
7110950 Method and system for aligning natural and synthetic video to speech synthesis  
According to MPEG-4's TTS architecture, facial animation can be driven by two streams simultaneously—text, and Facial Animation Parameters. In this architecture, text input is sent to a...
7110945 Interactive book  
An electronic interactive book allowing a child to learn the pronunciation of various words. The book will be provided with a permanent or impermanent display onto which a plurality of words,...
7107216 Grapheme-phoneme conversion of a word which is not contained as a whole in a pronunciation lexicon  
In a method for grapheme-phoneme conversion of a word which is not contained as a whole in a pronunciation lexicon, the word is firstly decomposed into subwords. The subwords are transcribed and...
7103548 Audio-form presentation of text messages  
A text message generated at a sending device is converted into audio form by a message-conversion system for delivery to a target recipient. This conversion is effected in a manner enabling...
7103154 Automatic transmission of voice-to-text converted voice message  
A voice messaging system includes an input device to accept a destination electronic messaging address, a voice-to-text converter to convert or transcribe a received voice message into a converted...
7099826 Text-to-speech synthesis system  
The present invention is intended to provide a text-to-speech synthesis apparatus, including a storage for storing phoneme data of a plurality of speakers; a selector for selecting one of the...
7096183 Customizing the speaking style of a speech synthesizer based on semantic analysis  
A method is provided for customizing the speaking style of a speech synthesizer. The method includes: receiving input text; determining semantic information for the input text; determining a...
7092884 Method of nonvisual enrollment for speech recognition  
In a speech recognition system, a method of nonvisual enrollment comprising playing an audio representation of an enrollment script. As the enrollment is playing, shadowed speech from a user can be...
7092873 Method of upgrading a data stream of multimedia data  
For upgrading a data stream of multimedia data, which comprises features with textual description, a set of phonetic translation hints is included in the data stream, which specifies the phonetic...
7092928 Intelligent portal engine  
A human-computer interface system and methods for providing intelligent, adaptive, multimodal interaction with users while accomplishing tasks on their behalf in some particular domain or...
7089188 Method to expand inputs for word or document searching  
An electronic document searching system or word searching system which when given an input, expands the input as a function of acoustic similarity and/or word sequence occurrence frequency. Results...
7088853 Robot apparatus, method and device for recognition of letters or characters, control program and recording medium  
A plural number of letters or characters, inferred from the results of letter/character recognition of an image photographed by a CCD camera ( 20 ), a plural number of kana readings inferred from...
7085718 Method for speaker-identification using application speech  
It is suggested to include application speech (AS) into the set of identification speech data (ISD) for training a speaker-identification process so as to make possible a reduction of the set of...
7080015 Synchronization control apparatus and method, and recording medium  
In a synchronization control apparatus, a voice-language-information generating section generates the voice language information of a word which a robot utters. A voice synthesizing section...
7076426 Advance TTS for facial animation  
An enhanced system is achieved by allowing bookmarks which can specify that the stream of bits that follow corresponds to phonemes and a plurality of prosody information, including duration...
7069216 Corpus-based prosody translation system  
A method of prosody translation is given. A target input symbol sequence is provided, including a first set of speech prosody descriptors. An instance-based learning algorithm is applied to a...
7062437 Audio renderings for expressing non-audio nuances  
Methods, systems, computer program products, and methods of doing business by adapting audio renderings of non-audio messages (for example, e-mail messages that are processed by a text-to-speech...
7062438 Speech synthesis method and apparatus, program, recording medium and robot apparatus  
A sentence or a singing is to be synthesized with a natural speech close to the human voice. To this end, singing metrical data are formed in a tag processing unit 211 in a singing synthesis unit...
7058889 Synchronizing text/visual information with audio playback  
A method of synchronizing visual information with audio playback includes the steps of selecting a desired audio file from a list stored in memory associated with a display device, sending a signal...
7047194 Method and device for co-articulated concatenation of audio segments  
The invention provides a method, apparatus, and a computer program stored on a data carrier that generates synthesized acoustical data by concatenating audio segments of sounds to reproduce a...