|
Match
|
Document |
Document Title |
|
|
7200560 |
Portable reading device with display capability
A hand held device that captures information with the capability to read only the captured information, display only the captured information, or simultaneously read and display the captured...
|
|
|
7197462 |
System and method for information access
To convert the structure of a Web page into contents that a user can easily listen to, and for permitting the user to access and obtain information without have to perform and navigation, in a...
|
|
|
7191132 |
Speech synthesis apparatus and method
A speech synthesiser is provided with a dialog-style selection arrangement responsive to a factor affecting intelligibility of speech output by the apparatus to select a dialog style intended to...
|
|
|
7184958 |
Speech synthesis method
A speech synthesis method subjects a reference speech signal to windowing to extract a speech pitch wave having a window function of a window length double a pitch period of the reference speech...
|
|
|
7177817 |
Automatic generation of voice content for a voice response system
In one embodiment, the invention provides a method for building a voice response system. The method comprises developing voice content for the voice response system, the voice content including...
|
|
|
7177801 |
Speech transfer over packet networks using very low digital data bandwidths
A method of communicating speech across a communication link using very low digital data bandwidth is disclosed, having the steps of: translating speech into text at a source terminal;...
|
|
|
7174294 |
Speech platform architecture
A speech platform architecture is described that provides standardized methods of interaction for users across multiple speech-enabled applications. Listener objects corresponding to speech-enabled...
|
|
|
7174296 |
Transcription service stopping automatic transcription
A transcription system including a transcription device for the automatic transcription of dictated material and additionally employs transcribers who manually transcribe some of the dictated...
|
|
|
7174295 |
User interface for text to speech conversion
An electronic device which includes a user interface having a display for displaying text and a speech synthesiser including a loudspeaker, arranged to convert an input, dependent upon a text, to...
|
|
|
7168953 |
Trainable videorealistic speech animation
A method and apparatus for videorealistic, speech animation is disclosed. A human subject is recorded using a video camera as he/she utters a predetermined speech corpus. After processing the...
|
|
|
7171363 |
Pick-by-line system and method
The invention is a method for picking an object from a source transport device vehicle using a mobile computer with text-to-speech software adapted for communication between a pick-by-line server...
|
|
|
7165030 |
Concatenative speech synthesis using a finite-state transducer
A method for concatenative speech synthesis includes a processing stage that selects segments based on their symbolic labeling in an efficient graph-based search, which uses a finite-state...
|
|
|
7165032 |
Unsupervised data-driven pronunciation modeling
Pronunciation for an input word is modeled by generating a set of candidate phoneme strings having pronunciations close to the input word in an orthographic space. Phoneme sub-strings in the set...
|
|
|
7162424 |
Method and system for defining a sequence of sound modules for synthesis of a speech signal in a tonal language
The invention relates to a method for defining a sequence of sound modules for synthesis of a speech signal in a tonal language corresponding to a sequence of speech modules. The method according...
|
|
|
7143038 |
Speech synthesis system
A speech synthesizing system producing a speech of an improved quality of voice by selecting a combination of speech segment most suitable for a synthesis speech unit sequence. The speech...
|
|
|
7139711 |
Noise filtering utilizing non-Gaussian signal statistics
The present invention is directed to a method and system for capturing an information signal from within a noisy background utilizing a non-Gaussian model for the a priori statistics of the...
|
|
|
7139710 |
Audio synthesis of a currently tuned frequency
A system 10 is provided which is receptive to selective tuning at particular frequencies. The system 10 includes a display device 15 , an audio synthesizer 14 , and a controller 20 . The...
|
|
|
7138982 |
Information processing apparatus and information output controlling method
The invention provides an information processing apparatus wherein playback of display data and playback of audio data relating to the display data are changed over in an associated relationship...
|
|
|
7136816 |
System and method for predicting prosodic parameters
A method for generating a prosody model that predicts prosodic parameters is disclosed. Upon receiving text annotated with acoustic features, the method comprises generating first classification...
|
|
|
7136811 |
Low bandwidth speech communication using default and personal phoneme tables
A voice coding and decoding system 300 and method uses a personal phoneme table ( 320, 344 ) associated with a voice signature identifier ( 348 ) to permit encoding of true sounding voice by...
|
|
|
7133535 |
System and method for real time lip synchronization
A novel method for synchronizing the lips of a sketched face to an input voice. The lip synchronization system and method approach is to use training video as much as possible when the input voice...
|
|
|
7127397 |
Method of training a computer system via human voice input
A method of training a computer system via human voice input from a human teacher is provided. In one embodiment, the method includes presenting a text spelling of an unknown word and receiving a...
|
|
|
7127396 |
Method and apparatus for speech synthesis without prosody modification
A speech synthesizer is provided that concatenates stored samples of speech units without modifying the prosody of the samples. The present invention is able to achieve a high level of naturalness...
|
|
|
7127400 |
Methods and systems for personal interactive voice response
A personal interactive voice response system with a web-based interface allowing the user to specify treatment of incoming calls based on voice or touchtone responses provided by the calling party....
|
|
|
7124082 |
Phonetic speech-to-text-to-speech system and method
A speech-to-text-to-speech for use with on-line and real time transmission of speech with a small bandwidth from a source to a destination. A speech is received and broken down to phonemes, which...
|
|
|
7124365 |
Method and device for detecting an event in a program of a video and/or audio signal and for providing the program to a display upon detection of the event
A device for automatically providing program information to a television display upon detection of an event in the program. The event being a user definable event. The device also provides...
|
|
|
7120583 |
Information presentation system, information presentation apparatus, control method thereof and computer readable memory
An information presentation computer receiving news articles distributed from an information distribution computer performs voice synthesis based on text information included in received send data,...
|
|
|
7117159 |
Method and system for dynamic control over modes of operation of voice-processing in a voice command platform
A method and system for dynamically controlling a voice-processing mechanism in a voice command platform. The platform receives a specification during a voice command session with a user and...
|
|
|
7117155 |
Coarticulation method for audio-visual text-to-speech synthesis
A method for generating animated sequences of talking heads in text-to-speech applications wherein a processor samples a plurality of frames comprising image samples. Representative parameters are...
|
|
|
7113909 |
Voice synthesizing method and voice synthesizer performing the same
A stereotypical sentence is synthesized into a voice of an arbitrary speech style. A third party is able to prepare prosody data and a user of a terminal device having a voice synthesizing part can...
|
|
|
7110950 |
Method and system for aligning natural and synthetic video to speech synthesis
According to MPEG-4's TTS architecture, facial animation can be driven by two streams simultaneously—text, and Facial Animation Parameters. In this architecture, text input is sent to a...
|
|
|
7110945 |
Interactive book
An electronic interactive book allowing a child to learn the pronunciation of various words. The book will be provided with a permanent or impermanent display onto which a plurality of words,...
|
|
|
7107216 |
Grapheme-phoneme conversion of a word which is not contained as a whole in a pronunciation lexicon
In a method for grapheme-phoneme conversion of a word which is not contained as a whole in a pronunciation lexicon, the word is firstly decomposed into subwords. The subwords are transcribed and...
|
|
|
7103548 |
Audio-form presentation of text messages
A text message generated at a sending device is converted into audio form by a message-conversion system for delivery to a target recipient. This conversion is effected in a manner enabling...
|
|
|
7103154 |
Automatic transmission of voice-to-text converted voice message
A voice messaging system includes an input device to accept a destination electronic messaging address, a voice-to-text converter to convert or transcribe a received voice message into a converted...
|
|
|
7099826 |
Text-to-speech synthesis system
The present invention is intended to provide a text-to-speech synthesis apparatus, including a storage for storing phoneme data of a plurality of speakers; a selector for selecting one of the...
|
|
|
7096183 |
Customizing the speaking style of a speech synthesizer based on semantic analysis
A method is provided for customizing the speaking style of a speech synthesizer. The method includes: receiving input text; determining semantic information for the input text; determining a...
|
|
|
7092884 |
Method of nonvisual enrollment for speech recognition
In a speech recognition system, a method of nonvisual enrollment comprising playing an audio representation of an enrollment script. As the enrollment is playing, shadowed speech from a user can be...
|
|
|
7092873 |
Method of upgrading a data stream of multimedia data
For upgrading a data stream of multimedia data, which comprises features with textual description, a set of phonetic translation hints is included in the data stream, which specifies the phonetic...
|
|
|
7092928 |
Intelligent portal engine
A human-computer interface system and methods for providing intelligent, adaptive, multimodal interaction with users while accomplishing tasks on their behalf in some particular domain or...
|
|
|
7089188 |
Method to expand inputs for word or document searching
An electronic document searching system or word searching system which when given an input, expands the input as a function of acoustic similarity and/or word sequence occurrence frequency. Results...
|
|
|
7088853 |
Robot apparatus, method and device for recognition of letters or characters, control program and recording medium
A plural number of letters or characters, inferred from the results of letter/character recognition of an image photographed by a CCD camera ( 20 ), a plural number of kana readings inferred from...
|
|
|
7085718 |
Method for speaker-identification using application speech
It is suggested to include application speech (AS) into the set of identification speech data (ISD) for training a speaker-identification process so as to make possible a reduction of the set of...
|
|
|
7080015 |
Synchronization control apparatus and method, and recording medium
In a synchronization control apparatus, a voice-language-information generating section generates the voice language information of a word which a robot utters. A voice synthesizing section...
|
|
|
7076426 |
Advance TTS for facial animation
An enhanced system is achieved by allowing bookmarks which can specify that the stream of bits that follow corresponds to phonemes and a plurality of prosody information, including duration...
|
|
|
7069216 |
Corpus-based prosody translation system
A method of prosody translation is given. A target input symbol sequence is provided, including a first set of speech prosody descriptors. An instance-based learning algorithm is applied to a...
|
|
|
7062437 |
Audio renderings for expressing non-audio nuances
Methods, systems, computer program products, and methods of doing business by adapting audio renderings of non-audio messages (for example, e-mail messages that are processed by a text-to-speech...
|
|
|
7062438 |
Speech synthesis method and apparatus, program, recording medium and robot apparatus
A sentence or a singing is to be synthesized with a natural speech close to the human voice. To this end, singing metrical data are formed in a tag processing unit 211 in a singing synthesis unit...
|
|
|
7058889 |
Synchronizing text/visual information with audio playback
A method of synchronizing visual information with audio playback includes the steps of selecting a desired audio file from a list stored in memory associated with a display device, sending a signal...
|
|
|
7047194 |
Method and device for co-articulated concatenation of audio segments
The invention provides a method, apparatus, and a computer program stored on a data carrier that generates synthesized acoustical data by concatenating audio segments of sounds to reproduce a...
|