Match Document Document Title
9037466 Email administration for rendering email on a digital audio player  
Methods, systems, and computer program products are provided for email administration for rendering email on a digital audio player. Embodiments include retrieving an email message; extracting...
9026445 Text-to-speech user's voice cooperative server for instant messaging clients  
A system and method to allow an author of an instant message to enable and control the production of audible speech to the recipient of the message. The voice of the author of the message is...
9020821 Apparatus and method for editing speech synthesis, and computer readable medium  
An acquisition unit analyzes a text, and acquires phonemic and prosodic information. An editing unit edits a part of the phonemic and prosodic information. A speech synthesis unit converts the...
9015032 Multilingual speech recognition and public announcement  
Embodiments of the present invention provide a system, method, and program product to deliver an announcement to people, such as a public announcement. A computer receives input representative of...
9009051 Apparatus, method, and program for reading aloud documents based upon a calculated word presentation order  
According to one embodiment, a reading aloud support apparatus includes a reception unit, a first extraction unit, a second extraction unit, an acquisition unit, a generation unit, a presentation...
9002711 Speech synthesis apparatus and method  
According to an embodiment, a speech synthesis apparatus includes a selecting unit configured to select speaker's parameters one by one for respective speakers and obtain a plurality of speakers'...
9002703 Community audio narration generation  
The community-based generation of audio narrations for a text-based work leverages collaboration of a community of people to provide human-voiced audio readings. During the community-based...
8996387 Release of transaction data  
For clearing transaction data selected for a processing, there is generated in a portable data carrier (1) a transaction acoustic signal (003; 103; 203) (S007; S107; S207) upon whose acoustic...
8996377 Blending recorded speech with text-to-speech output for specific domains  
A text-to-speech (TTS) engine combines recorded speech with synthesized speech from a TTS synthesizer based on text input. The TTS engine receives the text input and identifies the domain for the...
8990089 Text to speech synthesis for texts with foreign language inclusions  
A speech output is generated from a text input written in a first language and containing inclusions in a second language. Words in the native language are pronounced with a native pronunciation...
8990087 Providing text to speech from digital content on an electronic device  
A method for providing text to speech from digital content in an electronic device is described. Digital content including a plurality of words and a pronunciation database is received....
8983835 Electronic device and server for processing voice message  
An electronic device includes a voice processing unit, a wireless communication unit, and a combining unit. The voice processing unit receives speech signals. The wireless communication unit sends...
8983841 Method for enhancing the playback of information in interactive voice response systems  
A network communication node includes an audio outputter that outputs an audible representation of data to be provided to a requester. The network communication node also includes a processor that...
8977551 Parametric speech synthesis method and system  
The present invention provides a parametric speech synthesis method and a parametric speech synthesis system. The method comprises sequentially processing each frame of speech of each phone in a...
8977550 Information providing apparatus and information providing method  
Part units of speech information are arranged in a predetermined order to generate a sentence unit of a speech information set. To each of a plurality of speech part units of the speech...
8977552 Method and system for enhancing a speech database  
A system, method and computer readable medium that enhances a speech database for speech synthesis is disclosed. The method may include labeling audio files in a primary speech database,...
8972248 Band broadening apparatus and method  
A band broadening apparatus includes a processor configured to analyze a fundamental frequency based on an input signal bandlimited to a first band, generate a signal that includes a second band...
8972265 Multiple voices in audio content  
A content customization service is disclosed. The content customization service may identify one or more speakers in an item of content, and map one or more portions of the item of content to a...
8965767 System and method for synthetic voice generation and modification  
Disclosed herein are systems, methods, and non-transitory computer-readable storage media for generating a synthetic voice. A system configured to practice the method combines a first database of...
8965773 Coding with noise shaping in a hierarchical coder  
A method is provided for hierarchical coding of a digital audio signal comprising, for a current frame of the input signal: a core coding, delivering a scalar quantization index for each sample of...
8965768 System and method for automatic detection of abnormal stress patterns in unit selection synthesis  
Disclosed herein are systems, methods, and non-transitory computer-readable storage media for detecting and correcting abnormal stress patterns in unit-selection speech synthesis. A system...
8965764 Electronic apparatus and voice recognition method for the same  
Disclosed are an electronic apparatus and a voice recognition method for the same. The voice recognition method for the electronic apparatus includes: receiving an input voice of a user;...
8965769 Markup assistance apparatus, method and program  
According to one embodiment, a markup assistance apparatus includes an acquisition unit, a first calculation unit, a detection unit and a presentation unit. The acquisition unit acquires a feature...
8959021 Single interface for local and remote speech synthesis  
Features are disclosed for providing a consistent interface for local and distributed text to speech (TTS) systems. Some portions of the TTS system, such as voices and TTS engine components, may...
8959022 System for media correlation based on latent evidences of audio  
A method for determining a relatedness between a query video and a database video is provided. A processor extracts an audio stream from the query video to produce a query audio stream, extracts...
8954328 Systems and methods for document narration with multiple characters having multiple moods  
Disclosed are techniques and systems to provide a narration of a text in multiple different voices. Further disclosed are techniques and systems for providing a plurality of characters at least...
8954335 Speech translation system, control device, and control method  
Appropriate processing results or appropriate apparatuses can be selected with a control device that selects the most probable speech recognition result by using speech recognition scores received...
8949123 Display apparatus and voice conversion method thereof  
The voice conversion method of a display apparatus includes: in response to the receipt of a first video frame, detecting one or more entities from the first video frame; in response to the...
8949128 Method and apparatus for providing speech output for speech-enabled applications  
Techniques for providing speech output for speech-enabled applications. A synthesis system receives from a speech-enabled application a text input including a text transcription of a desired...
8942983 Method of speech synthesis  
The present invention relates to a method of text-based speech synthesis, wherein at least one portion of a text is specified; the intonation of each portion is determined; target speech sounds...
8930200 Vector joint encoding/decoding method and vector joint encoder/decoder  
A vector joint encoding/decoding method and a vector joint encoder/decoder are provided, more than two vectors are jointly encoded, and an encoding index of at least one vector is split and then...
8924217 Communication converter for converting audio information/textual information to corresponding textual information/audio information  
A communication converter is described for converting among speech signals and textual information, permitting communication between telephone users and textual instant communications users.
8918313 Replay apparatus, signal processing apparatus, and signal processing method  
A method of selectively performing signal processing in a first mode and in a second mode. In the first mode, a noise cancel signal having a signal characteristic to cancel an external noise...
8917876 Earguard monitoring system  
SPL monitoring systems are provided. A SPL monitoring system includes an audio transducer configured to receive sound pressure, a logic circuit which calculates a safe time duration over which a...
8918322 Personalized text-to-speech services  
A personalized text-to-speech (pTTS) system provides a method for converting text data to speech data utilizing a pTTS template representing the voice characteristics of an individual. A memory...
8918323 Contextual conversion platform for generating prioritized replacement text for spoken content output  
A contextual conversion platform, and method for converting text-to-speech, are described that can convert content of a target to spoken content. Embodiments of the contextual conversion platform...
8914290 Systems and methods for dynamically improving user intelligibility of synthesized speech in a work environment  
Method and apparatus that dynamically adjusts operational parameters of a text-to-speech engine in a speech-based system. A voice engine or other application of a device provides a mechanism to...
8914291 Method and apparatus for generating synthetic speech with contrastive stress  
Techniques for generating synthetic speech with contrastive stress. In one aspect, a speech-enabled application generates a text input including a text transcription of a desired speech output,...
8909538 Enhanced interface for use with speech recognition  
Improved methods of presenting speech prompts to a user as part of an automated system that employs speech recognition or other voice input are described. The invention improves the user interface...
8898066 Multi-lingual text-to-speech system and method  
A multi-lingual text-to-speech system and method processes a text to be synthesized via an acoustic-prosodic model selection module and an acoustic-prosodic model mergence module, and obtains a...
8892442 System and method for answering a communication notification  
Disclosed herein are systems, methods, and computer readable-media for answering a communication notification. The method for answering a communication notification comprises receiving a...
8892230 Multicore system, control method of multicore system, and non-transitory readable medium storing program  
A multicore system 2 includes a main system program 610 that operates on a first processor core 61 and stores synthesized audio data, which is mixed audio data, to a buffer for DMA transfer 63, a...
8892440 Electronic device and control method thereof  
Disclosed are an electronic device and a control method thereof, The electronic device includes a text-to-speech unit which converts a text into an audio signal, an audio output unit which outputs...
8886542 Voice interactive service system and method for providing different speech-based services  
A voice interactive service system provides different speech-based services to a plurality of users. Using a communication terminal, the services are accessed via a telecommunication network...
8886537 Method and system for text-to-speech synthesis with personalized voice  
A method and system are provided for text-to-speech synthesis with personalized voice. The method includes receiving an incidental audio input (403) of speech in the form of an audio communication...
8886539 Prosody generation using syllable-centered polynomial representation of pitch contours  
The present invention discloses a parametrical representation of prosody based on polynomial expansion coefficients of the pitch contour near the center of each syllable. The said syllable pitch...
8880495 Search query expansion and group search  
Audio information is recorded in an overwriteable circular buffer of a computing device. Construction of a search query is initiated by receiving a user input. The user input includes one or more...
8880401 Communication converter for converting audio information/textual information to corresponding textual information/audio information  
A communication converter is described for converting among speech signals and textual information, permitting communication between telephone users and textual instant communications users.
8874443 System and method for generating natural language phrases from user utterances in dialog systems  
Embodiments of a dialog system that employs a corpus-based approach to generate responses based on a given number of semantic constraint-value pairs are described. The system makes full use of the...
8874444 Simulated conversation by pre-recorded audio navigator  
A method is provided for a simulated conversation by a pre-recorded audio navigator, with particular application to informational and entertainment settings. A monitor may utilize a navigation...