Match Document Document Title
8868422 Storing a representative speech unit waveform for speech synthesis based on searching for similar speech units  
According to one embodiment, a method for editing speech is disclosed. The method can generate speech information from a text. The speech information includes phonologic information and prosody...
8868425 System and method for providing network coordinated conversational services  
A system and method for providing automatic and coordinated sharing of conversational resources, e.g., functions and arguments, between network-connected servers and devices and their...
8868431 Recognition dictionary creation device and voice recognition device  
A recognition dictionary creation device identifies the language of a reading of an inputted text which is a target to be registered and adds a reading with phonemes in the language identified...
8862471 Establishing a multimodal advertising personality for a sponsor of a multimodal application  
Establishing a multimodal advertising personality for a sponsor of a multimodal application, including associating one or more vocal demeanors with a sponsor of a multimodal application and...
8862461 Fraud detection using text analysis  
In one embodiment, a method executed by at least one processor includes receiving text from submitted by a user. The method also includes determining a text score for the received text by...
8862478 Speech translation system, first terminal apparatus, speech recognition server, translation server, and speech synthesis server  
In conventional network-type speech translation systems, devices or models for recognizing or synthesizing speech cannot be changed in accordance with speakers' attributes, and therefore, accuracy...
8856007 Use text to speech techniques to improve understanding when announcing search results  
Disclosed are apparatus and methods for generating synthesized utterances related to output of commands. A command is received at a computing device. A textual output for the command is determined...
8856008 Training and applying prosody models  
Techniques for training and applying prosody models for speech synthesis are provided. A speech recognition engine processes audible speech to produce text annotated with prosody information. A...
8849666 Conference call service with speech processing for heavily accented speakers  
Speech recognition processing captures phonemes of words in a spoken speech string and retrieves text of words corresponding to particular combinations of phonemes from a phoneme dictionary. A...
8838450 Presentation of written works based on character identities and attributes  
A method is provided for presenting a written work. A character identity is recognized within a written work. Presentation information for the written work, such as a graphical scheme or an...
8838451 System, methods and automated technologies for translating words into music and creating music pieces  
Systems, methods and computer program products are provided for translating a natural language into music. Through systematic parsing, music compositions can be created. These compositions can be...
8825491 System and method to use text-to-speech to prompt whether text-to-speech output should be added during installation of a program on a computer system normally controlled through a user interactive display  
An auditory user interactive interface to an application program being installed in the computer controlled system. A routine in an object, in an application program being installed in the...
8825483 Apparatus and method for transforming audio characteristics of an audio recording  
A method of audio processing comprises composing one or more transformation profiles for transforming audio characteristics of an audio recording and then generating for the or each transformation...
8825484 Character input apparatus equipped with auto-complete function, method of controlling the character input apparatus, and storage medium  
A character input apparatus which makes it possible to suppress degradation of use-friendliness in a case where a visually disabled user inputs characters using an auto-complete function. In the...
8823523 Refrigerator having input voice commands and output voice messages  
A refrigerator is provided. The refrigerator includes a voice recognition unit for recognizing a voice of a name of food, a memory for storing location information of the food received in a...
8825485 Text to speech method and system converting acoustic units to speech vectors using language dependent weights for a selected language  
A text-to-speech method for use in a plurality of languages, including: inputting text in a selected language; dividing the inputted text into a sequence of acoustic units; converting the sequence...
8825486 Method and apparatus for generating synthetic speech with contrastive stress  
Techniques for generating synthetic speech with contrastive stress. In one aspect, a speech-enabled application generates a text input including a text transcription of a desired speech output,...
8805687 System and method for generalized preselection for unit selection synthesis  
Disclosed herein are systems, computer-implemented methods, and computer-readable storage media for unit selection synthesis. The method causes a computing device to add a supplemental phoneset to...
8805695 Bandwidth expansion method and apparatus  
A bandwidth expansion method and apparatus are disclosed, where the method includes: estimating a bandwidth of at least one decoded frame of a whole-band signal, so as to obtain an estimated...
8798998 Pre-saved data compression for TTS concatenation cost  
Pre-saved concatenation cost data is compressed through speech segment grouping. Speech segments are assigned to a predefined number of groups based on their concatenation cost values with other...
8793128 Speech signal processing system, speech signal processing method and speech signal processing method program using noise environment and volume of an input speech signal at a time point  
A speech signal processing system that includes a speech input unit for inputting a speech signal; input speech storage unit for storing an input speech signal that is the speech signal inputted...
8788268 Speech synthesis from acoustic units with default values of concatenation cost  
A speech synthesis system can select recorded speech fragments, or acoustic units, from a very large database of acoustic units to produce artificial speech. When a pair of acoustic units in the...
8789040 Converting non-natively executable programs to downloadable executable programs  
In an embodiment, a data processing method comprises receiving, from a first computer, and storing at a service provider computer, a copy of a non-natively-executable computer program; generating...
8781836 Hearing assistance system for providing consistent human speech  
Broadly speaking, the embodiments disclosed herein describe an apparatus, system, and method that allows a user of a hearing assistance system to perceive consistent human speech. The consistent...
8781844 Audio coding  
A method for encoding an audio signal including: processing a selected subset of a lower series of samples forming a lower frequency spectral band of the audio signal and a higher series of...
8775185 Speech samples library for text-to-speech and methods and apparatus for generating and using same  
A method for converting translating text into speech with a speech sample library is provided. The method comprises converting translating an input text to a sequence of triphones; determining...
8768703 Methods and apparatus to present a video program to a visually impaired person  
Methods and apparatus to present a video program to a visually impaired person are disclosed. An example method comprises detecting a text portion of a media stream including a video stream, the...
8768691 Sound encoding device and sound encoding method  
A sound encoder for efficiently encoding stereophonic sound. A prediction parameter analyzer determines a delay difference D and an amplitude ratio g of a first-channel sound signal with respect...
8768704 Methods and systems for automated generation of nativized multi-lingual lexicons  
An input signal that includes linguistic content in a first language may be received by a computing device. The linguistic content may include text or speech. Based on an acoustic feature...
8768711 Method and apparatus for voice-enabling an application  
A method of voice-enabling an application for command and control and content navigation can include the application dynamically generating a markup language fragment specifying a command and...
8768696 Speech recognition circuit using parallel processors  
A speech recognition circuit comprises a memory containing lexical data for word recognition, the lexical data comprising a plurality of lexical data structures stored in each of a plurality of...
8768702 Multi-tiered voice feedback in an electronic device  
This invention is directed to providing voice feedback to a user of an electronic device. Because each electronic device display may include several speakable elements (i.e., elements for which...
8768701 Prosodic mimic method and apparatus  
A method and apparatus for synthesizing audible phrases (words) that includes capturing a spoken utterance, which may be a word, and extracting prosodic information (parameters) there from, then...
8751562 Systems and methods for pre-rendering an audio representation of textual content for subsequent playback  
A system configured to pre-render an audio representation of textual content for subsequent playback includes a network, a source server, and a requesting device. The source server is configured...
8751235 Annotating phonemes and accents for text-to-speech system  
A system that outputs phonemes and accents of texts. The system has a storage section storing a first corpus in which spellings, phonemes, and accents of a text input beforehand are recorded...
8751237 Text-to-speech device and text-to-speech method  
A sound control section (114) selects and outputs a text-to-speech item from items included in program information multiplexed with a broadcast signal; and starts or stops outputting the...
8751239 Method, apparatus and computer program product for providing text independent voice conversion  
An apparatus for providing text independent voice conversion may include a first voice conversion model and a second voice conversion model. The first voice conversion model may be trained with...
8751236 Devices and methods for speech unit reduction in text-to-speech synthesis systems  
A device may receive a plurality of speech sounds that are indicative of pronunciations of a first linguistic term. The device may determine concatenation features of the plurality of speech...
8744851 Method and system for enhancing a speech database  
A system, method and computer readable medium that enhances a speech database for speech synthesis is disclosed. The method may include labeling audio files in a primary speech database,...
8744848 Methods and systems for training dictation-based speech-to-text systems using recorded samples  
A method and apparatus useful to train speech recognition engines is provided. Many of today's speech recognition engines require training to particular individuals to accurately convert speech to...
8744852 Spoken interfaces  
A spoken interface is described for assisting a visually impaired user to obtain audible information and interact with elements displayed on a display screen. The spoken interface also enables...
8744853 Speaker-adaptive synthesized voice  
An objective is to provide a technique for accurately reproducing features of a fundamental frequency of a target-speaker's voice on the basis of only a small amount of learning data. A learning...
8738280 Methods for activity reduction in pedestrian-to-vehicle communication networks  
Methods for pedestrian unit (PU) communication activity reduction in pedestrian-to-vehicle communication networks include obtaining safety risk information for a pedestrian at risk for involvement...
8731931 System and method for unit selection text-to-speech using a modified Viterbi approach  
Disclosed herein are systems, methods, and non-transitory computer-readable storage media for speech synthesis. A system practicing the method receives a set of ordered lists of speech units, for...
8731933 Speech synthesis apparatus and method utilizing acquisition of at least two speech unit waveforms acquired from a continuous memory region by one access  
A speech synthesizing apparatus includes a selector configured to select a plurality of speech units for synthesizing a speech of a phoneme sequence by referring to speech unit information stored...
8731913 Scaled window overlap add for mixed signals  
A method for overlap-adding signals useful for performing frame loss concealment (FLC) in an audio decoder as well as in other applications. The method uses a dynamic mix of windows to overlap two...
8731932 System and method for synthetic voice generation and modification  
Disclosed herein are systems, methods, and non-transitory computer-readable storage media for generating a synthetic voice. A system configured to practice the method combines a first database of...
8731943 Systems, methods and automated technologies for translating words into music and creating music pieces  
Systems, methods and computer program products are provided for translating a natural language into music. Through systematic parsing, music compositions can be created. These compositions can be...
8725513 Providing expressive user interaction with a multimodal application  
Methods, apparatus, and products are disclosed for providing expressive user interaction with a multimodal application, the multimodal application operating in a multimodal browser on a multimodal...
8725505 Verb error recovery in speech recognition  
A computer implemented method and system for speech recognition are provided. The method and system generally maintain a set of verbs for speech recognition commands. Upon recognizing utterance of...