Multilingual Artificial Intelligence is a guide for non-computer science specialists and learners looking to explore the implementation of AI technologies to solve real-life problems involving language data.
Bringing together leading scholars and practitioners, Rethinking Writing Education in the Age of Generative AI offers a timely exploration of pressing issues in writing pedagogies within an increasingly AI-mediated educational landscape.
In knowledge-based natural language generation, issues of formal knowledge representation meet with the linguistic problems of choosing the most appropriate verbalization in a particular situation of utterance.
Dictation systems, read-aloud software for the blind, speech control of machinery, geographical information systems with speech input and output, and educational software with `talking head' artificial tutorial agents are already on the market.
Parsing technology traditionally consists of two branches, which correspond to the two main application areas of context-free grammars and their generalizations.
Speech--to--Speech Translation: a Massively Parallel Memory-Based Approach describes one of the world's first successful speech--to--speech machine translation systems.
Reversible grammar allows computational models to be built that are equally well suited for the analysis and generation of natural language utterances.
In this brief, the authors discuss recently explored spectral (sub-segmental and pitch synchronous) and prosodic (global and local features at word and syllable levels in different parts of the utterance) features for discerning emotions in a robust manner.
"e;Mobile Speech and Advanced Natural Language Solutions"e; presents the discussion of the most recent advances in intelligent human-computer interaction, including fascinating new study findings on talk-in-interaction, which is the province of conversation analysis, a subfield in sociology/sociolinguistics, a new and emerging area in natural language understanding.
Data driven methods have long been used in Automatic Speech Recognition (ASR) and Text-To-Speech (TTS) synthesis and have more recently been introduced for dialogue management, spoken language understanding, and Natural Language Generation.
Cross-Word Modeling for Arabic Speech Recognition utilizes phonological rules in order to model the cross-word problem, a merging of adjacent words in speech caused by continuous speech, to enhance the performance of continuous speech recognition systems.
Extraction and Representation of Prosodic Features for Speech Processing Applications deals with prosody from speech processing point of view with topics including: The significance of prosody for speech processing applicationsWhy prosody need to be incorporated in speech processing applicationsDifferent methods for extraction and representation of prosody for applications such as speech synthesis, speaker recognition, language recognition and speech recognitionThis book is for researchers and students at the graduate level.
Predicting Prosody from Text for Text-to-Speech Synthesis covers the specific aspects of prosody, mainly focusing on how to predict the prosodic information from linguistic text, and then how to exploit the predicted prosodic knowledge for various speech applications.
Cognitive and Computational Strategies for Word Sense Disambiguation examines cognitive strategies by humans and computational strategies by machines, for WSD in parallel.
Pervasive and ubiquitous, machine translation systems have been transforming communication and understanding across languages and cultures on a historical scale.
This comprehensive reference work provides an overview of the concepts, methodologies, and applications in computational linguistics and natural language processing (NLP).
Adaptive Multimodal Interactive Systems introduces a general framework for adapting multimodal interactive systems and comprises a detailed discussion of each of the steps required for adaptation.
Audio Signal Processing for Next-Generation Multimedia Communication Systems presents cutting-edge digital signal processing theory and implementation techniques for problems including speech acquisition and enhancement using microphone arrays, new adaptive filtering algorithms, multichannel acoustic echo cancellation, sound source tracking and separation, audio coding, and realistic sound stage reproduction.
It has been said of the brothers Wilhelm and Alexander von Humboldt that between them they were the last people to have known all that there was to know, to have had a mastery of the best that contemporary science knew and to have made significant contributions, to be that rare thing Renaissance men.
In its nine chapters, this book provides an overview of the state-of-the-art and best practice in several sub-fields of evaluation of text and speech systems and components.
Automated question answering - the ability of a machine to answer questions, simple or complex, posed in ordinary human language - is one of today's most exciting technological developments.