close
1.

図書

図書
Edited by Jean-Philippe Thiran, Ferran Marqués, Hervé Bourlard
出版情報: Oxford : Academic Press, 2010  xiv, 328 p. ; 24 cm
シリーズ名: EURASIP and Academic Press Series in Signal and Image Processing
所蔵情報: loading…
目次情報: 続きを見る
Preface
Introduction / Jean-Philippe Thiran ; Ferran Marqués ; Hervé Bourlard1:
Signal Processing, Modelling and Related Mathematical Tools / Part I:
Statistical Machine Learning for HCI / Samy Bengio2:
Introduction to Statistical Learning / 2.1:
Types of Problem / 2.2.1:
Function Space / 2.2.2:
Loss Functions / 2.2.3:
Expected Risk and Empirical Risk / 2.2.4:
Statistical Learning Theory / 2.2.5:
Support Vector Machines for Binary Classification / 2.3:
Hidden Markov Models for Speech Recognition / 2.4:
Speech Recognition / 2.4.1:
Markovian Processes / 2.4.2:
Hidden Markov Models / 2.4.3:
Inference and Learning with HMMs / 2.4.4:
HMMs for Speech Recognition / 2.4.5:
Conclusion / 2.5:
References
Speech Processing / Thierry Dutoit ; Stéphane Dupont3:
Feature Extraction / 3.1:
Acoustic Modelling / 3.2.2:
Language Modelling / 3.2.3:
Decoding / 3.2.4:
Multiple Sensors / 3.2.5:
Confidence Measures / 3.2.6:
Robustness / 3.2.7:
Speaker Recognition / 3.3:
Overview / 3.3.1:
Text-to-Speech Synthesis / 3.3.2:
Natural Language Processing for Speech Synthesis / 3.4.1:
Concatenative Synthesis with a Fixed Inventory / 3.4.2:
Unit Selection-Based Synthesis / 3.4.3:
Statistical Parametric Synthesis / 3.4.4:
Conclusions / 3.5:
Natural Language and Dialogue Processing / Olivier Pietquin4:
Natural Language Understanding / 4.1:
Syntactic Parsing / 4.2.1:
Semantic Parsing / 4.2.2:
Contextual Interpretation / 4.2.3:
Natural Language Generation / 4.3:
Document Planning / 4.3.1:
Microplanning / 4.3.2:
Surface Realisation / 4.3.3:
Dialogue Processing / 4.4:
Discourse Modelling / 4.4.1:
Dialogue Management / 4.4.2:
Degrees of Initiative / 4.4.3:
Evaluation / 4.4.4:
Image and Video Processing Tools for HCI / Montse Pardàs ; Verónica Vilaplana ; Cristian Canton-Ferrer4.5:
Face Analyses / 5.1:
Face Detection / 5.2.1:
Face Tracking / 5.2.2:
Facial Feature Detection and Tracking / 5.2.3:
Gaze Analysis / 5.2.4:
Face Recognition / 5.2.5:
Facial Expression Recognition / 5.2.6:
Hand-Gesture Analysis / 5.3:
Head Orientation Analysis and FoA Estimation / 5.4:
Head Orientation Analysis / 5.4.1:
Focus of Attention Estimation / 5.4.2:
Body Gesture Analysis / 5.5:
Processing of Handwriting and Sketching Dynamics / Claus Vielhauer5.6:
History of Handwriting Modality and the Acquisition of Online Handwriting Signals / 6.1:
Basics in Acquisition, Examples for Sensors / 6.3:
Analysis of Online Handwriting and Sketching Signals / 6.4:
Overview of Recognition Goals in HCI / 6.5:
Sketch Recognition for User Interface Design / 6.6:
Similarity Search in Digital Ink / 6.7:
Summary and Perspectives for Handwriting and Sketching in HCI / 6.8:
Multimodal Signal Processing and Modelling / Part II:
Basic Concepts of Multimodal Analysis / Mihai Curban7:
Defining Multimodality / 7.1:
Advantages of Multimodal Analysis / 7.2:
Multimodal Information Fusion / Norman Poh ; Josef Kittler7.3:
Levels of Fusion / 8.1:
Adaptive versus Non-Adaptive Fusion / 8.3:
Other Design Issues / 8.4:
Modality Integration Methods / Mihai Gurban ; jean-Philippe Thiran8.5:
Multimodal Fusion for AVSR / 9.1:
Types of Fusion / 9.2.1:
Multistream HMMs / 9.2.2:
Stream Reliability Estimates / 9.2.3:
Multimodal Speaker Localisation / 9.3:
A Multimodal Recognition Framework for Joint Modality Compensation and Fusion / Konstantinos Moustakas ; Savvas Argyropoulos ; Dimitrios Tzovaras9.4:
Joint Modality Recognition and Applications / 10.1:
A New Joint Modality Recognition Scheme / 10.3:
Concept / 10.3.1:
Theoretical Background / 10.3.2:
Joint Modality Audio-Visual Speech Recognition / 10.4:
Signature Extraction Stage / 10.4.1:
Recognition Stage / 10.4.2:
Joint Modality Recognition in Biometrics / 10.5:
Results / 10.5.1:
References|204 / 10.6:
Managing Multimodal Data, Metadata and Annotations: Challenges and Solutions / Andrei Popescu-Belis11:
Setting the Stage: Concepts and Projects / 11.1:
Metadate-versusAnnotations / 11.2.l:
Examples of Large Multimodal Collections / 11.2.2:
Capturing and Recording Multimodal Data / 11.3:
Capture Devices / 11.3.1:
Synchronisation / 11.3.2:
Activity Types in Multimodal Corpora / 11.3.3:
Examples of Set-ups and Raw Data / 11.3.4:
Reference Metadata and Annotations / 11.4:
Gathering Metadata: Methods / 11.4.1:
Metadata for the AMI Corpus / 11.4.2:
Reference Annotations: Procedure and Tools / 11.4.3:
Data Storage and Access / 11.5:
Exchange Formats for Metadata and Annotations / 111.5.1:
Data Servers / 111.5.2:
Accessing Annotated Multimodal Data / 111.5.3:
Conclusions and Perspectives / 11.6:
Multimodal Human-Computer and Human-to-Human Interaction / Part III:
Multimodal Input / Natalie Ruiz ; Fang Chen ; Sharon Oviatt12:
Advantages of Multimodal Input Interfaces / 12.1:
State-of-the-Art Multimodal Input Systems / 12.2.1:
Multimodality, Cognition and Performance / 12.3:
Multimodal Perception and Cognition / 12.3.1:
Cognitive Load and Performance / 12.3.2:
Understanding Multimodal Input Behaviour / 12.4:
Theoretical Frameworks / 12.4.1:
Interpretation of Multimodal Input Patterns / 12.4.2:
Adaptive Multimodal Interfaces / 12.5:
Designing Multimodal Interfaces that Manage Users' Cognitive Load / 12.5.1:
Designing Low-Load Multimodal Interfaces for Education / 12.5.2:
Conclusions and Future Directions / 12.6:
MuItimodal Output: Facial Motion, Gestures and Synthesised Speech Synchronisation / Igor S. Pand ić13:
Basic AV Speech Synthesis / 13.1:
The Animation System / 13.3:
Coarticulation / 13.4:
Extended AV Speech Synthesis / 13.5:
Data-Driven Approaches / 13.5.1:
Rule-Based Approaches / 13.5.2:
Embodied Conversational Agents / 13.6:
TTS Timing Issues / 13.7:
On-the-Fly Synchronisation / 13.7.1:
A Priori Synchronisation / 13.7.2:
Interactive Representations of Multimodal Databases / Stéphane Marchand-Maillet ; Donn Morrison ; Enikö Szekely ; Eric Bruno13.8:
Multimodal Data Representation / 14.1:
Multimodal Data Access / 14.3:
Browsing as Extension of the Query Formulation Mechanism / 14.3.1:
Browsing for the Exploration of the Content Space / 14.3.2:
Alternative Representations / 14.3.3:
Commercial Impact / 14.3.4:
Gaining Semantic from User Interaction / 14.4:
Multimodal Interactive Retrieval / 14.4.1:
Crowdsourcing / 14.4.2:
Conclusion and Discussion / 14.5:
Modelling Interest in Face-to-Face Conversations from Multimodal Nonverbal Behaviour / Daniel Catica-Perez15:
Perspectives on Interest Modelling / 15.1:
Computing Interest from Audio Cues / 15.3:
Computing interest from Multimodal Cues / 15.4:
Other Concepts Related to Interest / 15.5:
Concluding Remarks / 15.6:
Index
Preface
Introduction / Jean-Philippe Thiran ; Ferran Marqués ; Hervé Bourlard1:
Signal Processing, Modelling and Related Mathematical Tools / Part I:
2.

電子ブック

EB
Ben Gold, Nelson Morgan, Dan Ellis ; with contributions from Hervé Bourlard ... [et al.]
出版情報: Wiley Online Library, 2011  1 online resource (xxii, 661p.)
所蔵情報: loading…
目次情報: 続きを見る
Preface To The 2011 Edition
Introduction / Chapter 1:
Historical Background / Part I:
Synthetic A Udio: A Brief History / Chapter 2:
Speech Analysis And Synthesis Overview / Chapter 3:
Brief History Of Automatic Speech Recognition / Chapter 4:
Speech-Recognition Overview / Chapter 5:
Mathematical Background / Part II:
Digital Signal Processing / Chapter 6:
Digital Filtersand Discrete Fourier Transform / Chapter 7:
Pattern Classification / Chapter 8:
Statistical Pattern Classification / Chapter 9:
Acoustics / Part III:
Wave Basics / Chapter 10:
Acoustic Tube Modeling Of Speech Production / Chapter 11:
Musical Instrument Acoustics / Chapter 12:
Room Acoustics / Chapter 13:
Auditory Perception / Part IV:
Ear Physiology / Chapter 14:
Psychoacoustics / Chapter 15:
Models Of Pitch Perception / Chapter 16:
Speech Perception / Chapter 17:
Human Speech Recognition / Chapter 18:
Speech Features / Part V:
The Auditory System As A Filter Bank / Chapter 19:
The Cepstrum As A Spectral Analyzer / Chapter 20:
Linear Prediction / Chapter 21:
A Utomatic Speech Recognition / Part VI:
Feature Extraction For Asr / Chapter 22:
Linguistic Categories For Speech Recognition / Chapter 23:
Deterministic Sequence Recognition For Asr / Chapter 24:
Statistical Sequence Recognition / Chapter 25:
Statistical Model Training / Chapter 26:
Discriminant Acoustic Probability Estimation / Chapter 27:
Acoustic Model Training: Further Topics / Chapter 28:
Speech Recognition And Understanding / Chapter 29:
Synthesis And Coding / Part VII:
Speech Synthesis / Chapter 30:
Pitch Detection / Chapter 31:
Vocoders / Chapter 32:
Low-Rate Vocoders / Chapter 33:
Medium-Rate And High-Rate Vocoders / Chapter 34:
Perceptual A Udio Coding / Chapter 35:
Other Applications / Part VIII:
Some Aspects Of Computer Music Synthesis / Chapter 36:
Music Signal Analysis / Chapter 37:
Music Retrieval / Chapter 38:
Source Separation / Chapter 39:
Speech Transformations / Chapter 40:
Speaker Verification / Chapter 41:
Speaker Diarization / Chapter 42:
Preface To The 2011 Edition
Introduction / Chapter 1:
Historical Background / Part I:
文献の複写および貸借の依頼を行う
 文献複写・貸借依頼