|
Match
|
Document |
Document Title |
|
|
7617106 |
Error detection for speech to text transcription systems
A method, a system and a computer program product detects errors within text generated by a speech to text transcription system. The transcribed text is re-transformed into an artificial speech...
|
|
|
7613612 |
Voice synthesizer of multi sounds
In a voice synthesizer, an envelope acquisition portion obtains a spectral envelope of a reference frequency spectrum of a given voice. A spectrum acquisition portion obtains a collective frequency...
|
|
|
7610201 |
Method and apparatus for synthesizing speech
A method for synthesizing speech includes an obtaining step of obtaining a speech message, and a resuming step of resuming speech output of the speech message according to resumption data...
|
|
|
7610200 |
System and method for controlling sound data
A system and method for controlling access to parameter blocks of a sound processor. According to the method and system disclosed herein, the present invention includes a host, a sound processor...
|
|
|
7606709 |
Voice converter with extraction and modification of attribute data
An apparatus is constructed for converting an input voice signal into an output voice signal according to a target voice signal. In the apparatus, an input device provides the input voice signal...
|
|
|
7603280 |
Speech output apparatus, speech output method, and program
A speech output apparatus is disclosed, which can allow the user to easily catch synthetic speech when the synthetic speech is output upon being superposed on a music output. The apparatus output...
|
|
|
7603278 |
Segment set creating method and apparatus
A segment set before updating is read, and clustering considering a phoneme environment is performed to it. For each cluster obtained by the clustering, a representative segment of a segment set...
|
|
|
7599838 |
Speech animation with behavioral contexts for application scenarios
Methods and systems, including computer program products, for speech animation. The system includes a speech animation server and one or more speech animation clients. The speech animation server...
|
|
|
7596497 |
Speech synthesis apparatus and speech synthesis method
A speech synthesis apparatus and a speech synthesis method, in which a waveform of a desired formant shape may be generated with a small volume of computing operations. A voiced sound generating...
|
|
|
7590626 |
Distributional similarity-based models for query correction
A distributional similarity between a word of a search query and a term of a candidate word sequences is used to determine an error model probability that describes the probability of the search...
|
|
|
7590540 |
Method and system for statistic-based distance definition in text-to-speech conversion
A method for distance definition in a text-to-speech conversion system by applying Gaussian Mixture Model (GMM) to a distance definition. According to an embodiment, the text that is to be...
|
|
|
7587320 |
Automatic segmentation in speech synthesis
Systems and methods for automatically segmenting speech inventories. A set of Hidden Markov Models (HMMs) are initialized using bootstrap data. The HMMs are next re-estimated and aligned to produce...
|
|
|
7587319 |
Speech recognition circuit using parallel processors
A speech recognition circuit comprises an input buffer for receiving processed speech parameters. A lexical memory contains lexical data for word recognition. The lexical data comprises a plurality...
|
|
|
7587310 |
Sound processor architecture using single port memory unit
A system and method for implementing a sound processor. The sound processor includes a first voice engine, a second voice engine, and at least one single-port memory unit. An operation of the first...
|
|
|
7584104 |
Method and system for training a text-to-speech synthesis system using a domain-specific speech database
A system, method and computer readable medium that trains a text-to-speech synthesis system for use in speech synthesis is disclosed. The method may include recording audio files of one or more...
|
|
|
7580839 |
Apparatus and method for voice conversion using attribute information
A speech processing apparatus according to an embodiment of the invention includes a conversion-source-speaker speech-unit database; a voice-conversion-rule-learning-data generating means; and a...
|
|
|
7577568 |
Methods and system for creating voice files using a VoiceXML application
Methods and systems for automating the assembly or creation of audio files for providing to listeners or for use in voice interactive services are provided. A voice application script is prepared...
|
|
|
7571104 |
Dynamic real-time cross-fading of voice prompts
A system and method are provided for creating shorter more natural sounding voice messages and prompts from a plurality of pre-recorded sound segments, the prerecorded sound segments are...
|
|
|
7571099 |
Voice synthesis device
A voice synthesis device for generating synthetic voice having great freedom in voice quality and good sound quality from text data is provided. The voice synthesis device is provided with: voice...
|
|
|
7565293 |
Seamless hybrid computer human call service
A Voice User Interface is provided for interactively responding in a synthesized voice to a call from a human caller, a Text to Speech system by which text entered by an agent and interactive data...
|
|
|
7565291 |
Synthesis-based pre-selection of suitable units for concatenative speech
The instructions on the computer-readable medium control a computing device to perform the steps: selecting at least one phoneme from a triphone unit selection database as at least candidate...
|
|
|
7562018 |
Speech synthesis method and speech synthesizer
A language processing portion ( 31 ) analyzes a text from a dialogue processing section ( 20 ) and transforms the text to information on pronunciation and accent. A prosody generation portion ( 32...
|
|
|
7558732 |
Method and system for computer-aided speech synthesis
Method and system for computer-aided speed synthesis for synthesizing electronic text by performing a predefined series of rules-based analyses in a predefined order, each of the analyses operating...
|
|
|
7555433 |
Voice generator, method for generating voice, and navigation apparatus
A main controller feeds a spelling translator with a text item representing a place name stored in a map database. The spelling translator translates the spelling of the text item according to...
|
|
|
7552053 |
Techniques for aiding speech-to-speech translation
Techniques for assisting in translation are provided. A speech recognition hypothesis is obtained, corresponding to a source language utterance. Information retrieval is performed on a supplemental...
|
|
|
7552052 |
Voice synthesis apparatus and method
A plurality of voice segments, each including one or more phonemes are acquired in a time-serial manner, in correspondence with desired singing or speaking words. As necessary, a boundary is...
|
|
|
7546241 |
Speech synthesis method and apparatus, and dictionary generation method and apparatus
In a speech synthesis process, micro-segments are cut from acquired waveform data and a window function. The obtained micro-segments are re-arranged to implement a desired prosody, and superposed...
|
|
|
7542905 |
Method for synthesizing a voice waveform which includes compressing voice-element data in a fixed length scheme and expanding compressed voice-element data of voice data sections
A method for synthesizing a voice waveform includes compressing voice-element data in a fixed length scheme that uses data from a preceding or succeeding frame. The compressed voice-element data of...
|
|
|
7533021 |
Speech processing for telephony API
Systems, methods, and structures are discussed that enhance media processing. One aspect of the present invention includes a data structure to enhance media processing. The data structure includes...
|
|
|
7529674 |
Speech animation
Methods and systems, including computer program products, for speech animation. The system includes a speech animation engine and a client application in communication with the speech animation...
|
|
|
7529672 |
Speech synthesis using concatenation of speech waveforms
A method of synthesizing a speech signal by providing a first speech unit signal having an end interval and a second speech unit signal having a front interval, wherein at least some of the periods...
|
|
|
7526430 |
Speech synthesis apparatus
A speech synthesis apparatus, which can embed unchangeable additional information into synthesized speech without causing a deterioration of speech quality and restriction by bands, includes a...
|
|
|
7523037 |
Data synthesis apparatus and program
A data synthesis apparatus detects the start of a period of voice waveform data, stores the voice waveform data in a first storage device, starting with its part indicative of the start of the...
|
|
|
7523035 |
Hands-free circuit and method for communicating with a wireless device
A hands-free circuit ( 10 ) and method produces audio information ( 90 ) corresponding to voice tag information ( 60 ) stored either in the hands-free circuit ( 10 ) or in a wireless device ( 320...
|
|
|
7516073 |
Electronic-book read-aloud device and electronic-book read-aloud method
A control unit of an electronic-book read-aloud device reads book data and electronic-book data from an electronic bookmark and stores the read data in a storage unit. Further, the control unit...
|
|
|
7507894 |
Sound data encoding apparatus and sound data decoding apparatus
A processing load at the time of playing back sound data having a loop part is reduced. A sound data encoding apparatus comprises a block dividing means that divides the sound data into blocks...
|
|
|
7502740 |
Communication apparatus
A communication apparatus includes a registration unit for registering setting data specifying a type of email message to be read, a synthetic-speech output unit for outputting resulting speech...
|
|
|
7487093 |
Text structure for voice synthesis, voice synthesis method, voice synthesis apparatus, and computer program thereof
In a voice synthesis apparatus, by bounding a desired range of input text to be output by, e.g., a start tag “<morphing type=“emotion” start=“happy” end=“angry”>” and end tag...
|
|
|
7487092 |
Interactive debugging and tuning method for CTTS voice building
A speech recognition device which can preferably be used for reducing the memory capacity required for speaker-independent speech recognition is provided. A matching unit loads speech models...
|
|
|
7483834 |
Method and apparatus for audio navigation of an information appliance
The invention includes an apparatus and method of providing information using an information appliance coupled to a network. The method includes storing text files in a database at a remote...
|
|
|
7483832 |
Method and system for customizing voice translation of text to speech
A method and system of customizing voice translation of a text to speech includes digitally recording speech samples of a known speaker, correlating each of the speech samples with a standardized...
|
|
|
7478047 |
Interactive character system
A system and method for controlling a synthetic character using a control system displays the character engaged in an activity, receiving a first input from a user, determines whether the input is...
|
|
|
7478039 |
Stochastic modeling of spectral adjustment for high quality pitch modification
Natural-sounding synthesized speech is obtained from pieced elemental speech units that have their super-class identities known (e.g. phoneme type), and their line spectral frequencies (LSF) set in...
|
|
|
7475016 |
Speech segment clustering and ranking
A system, method, and apparatus for identifying problematic speech segments is provided. The system includes a clustering module for generating a first cluster of one or more consecutive speech...
|
|
|
7475007 |
Expression extraction device, expression extraction method, and recording medium
Provided is an expression extraction device for extracting evaluation expressions from text having descriptions on evaluations of a specific evaluation target, which includes a registered...
|
|
|
7472066 |
Automatic speech segmentation and verification using segment confidence measures
An automatic speech segmentation and verification system and method is disclosed, which has a known text script and a recorded speech corpus corresponding to the known text script. A speech unit...
|
|
|
7472065 |
Generating paralinguistic phenomena via markup in text-to-speech synthesis
Converting marked-up text into a synthesized stream includes providing marked-up text to a processor-based system, converting the marked-up text into a text stream including vocabulary items,...
|
|
|
7460997 |
Method and system for preselection of suitable units for concatenative speech
A system and method for improving the response time of text-to-speech synthesis utilizes “triphone contexts” (i.e., triplets comprising a central phoneme and its immediate context) as the basic...
|
|
|
7457752 |
Method and apparatus for controlling the operation of an emotion synthesizing device
Method and apparatus for controlling the operation of an emotion synthesizing device, notably of the type where the emotion is conveyed by a sound, having at least one input parameter whose value...
|
|
|
7457748 |
Method of automatic processing of a speech signal
Method of automatically processing a speech signal which comprises the steps of:
determining a sequence of probability models corresponding to a given text; determining a sequence of...
|