Preface |
Introduction / Jean-Philippe Thiran ; Ferran Marqués ; Hervé Bourlard 1: |
Signal Processing, Modelling and Related Mathematical Tools / Part I: |
Statistical Machine Learning for HCI / Samy Bengio 2: |
Introduction to Statistical Learning / 2.1: |
Types of Problem / 2.2.1: |
Function Space / 2.2.2: |
Loss Functions / 2.2.3: |
Expected Risk and Empirical Risk / 2.2.4: |
Statistical Learning Theory / 2.2.5: |
Support Vector Machines for Binary Classification / 2.3: |
Hidden Markov Models for Speech Recognition / 2.4: |
Speech Recognition / 2.4.1: |
Markovian Processes / 2.4.2: |
Hidden Markov Models / 2.4.3: |
Inference and Learning with HMMs / 2.4.4: |
HMMs for Speech Recognition / 2.4.5: |
Conclusion / 2.5: |
References |
Speech Processing / Thierry Dutoit ; Stéphane Dupont 3: |
Feature Extraction / 3.1: |
Acoustic Modelling / 3.2.2: |
Language Modelling / 3.2.3: |
Decoding / 3.2.4: |
Multiple Sensors / 3.2.5: |
Confidence Measures / 3.2.6: |
Robustness / 3.2.7: |
Speaker Recognition / 3.3: |
Overview / 3.3.1: |
Text-to-Speech Synthesis / 3.3.2: |
Natural Language Processing for Speech Synthesis / 3.4.1: |
Concatenative Synthesis with a Fixed Inventory / 3.4.2: |
Unit Selection-Based Synthesis / 3.4.3: |
Statistical Parametric Synthesis / 3.4.4: |
Conclusions / 3.5: |
Natural Language and Dialogue Processing / Olivier Pietquin 4: |
Natural Language Understanding / 4.1: |
Syntactic Parsing / 4.2.1: |
Semantic Parsing / 4.2.2: |
Contextual Interpretation / 4.2.3: |
Natural Language Generation / 4.3: |
Document Planning / 4.3.1: |
Microplanning / 4.3.2: |
Surface Realisation / 4.3.3: |
Dialogue Processing / 4.4: |
Discourse Modelling / 4.4.1: |
Dialogue Management / 4.4.2: |
Degrees of Initiative / 4.4.3: |
Evaluation / 4.4.4: |
Image and Video Processing Tools for HCI / Montse Pardàs ; Verónica Vilaplana ; Cristian Canton-Ferrer 5: |
Face Analysis / 5.1: |
Face Detection / 5.2.1: |
Face Tracking / 5.2.2: |
Facial Feature Detection and Tracking / 5.2.3: |
Gaze Analysis / 5.2.4: |
Face Recognition / 5.2.5: |
Facial Expression Recognition / 5.2.6: |
Hand-Gesture Analysis / 5.3: |
Head Orientation Analysis and FoA Estimation / 5.4: |
Head Orientation Analysis / 5.4.1: |
Focus of Attention Estimation / 5.4.2: |
Body Gesture Analysis / 5.5: |
Processing of Handwriting and Sketching Dynamics / Claus Vielhauer 6: |
History of Handwriting Modality and the Acquisition of Online Handwriting Signals / 6.1: |
Basics in Acquisition, Examples for Sensors / 6.3: |
Analysis of Online Handwriting and Sketching Signals / 6.4: |
Overview of Recognition Goals in HCI / 6.5: |
Sketch Recognition for User Interface Design / 6.6: |
Similarity Search in Digital Ink / 6.7: |
Summary and Perspectives for Handwriting and Sketching in HCI / 6.8: |
Multimodal Signal Processing and Modelling / Part II: |
Basic Concepts of Multimodal Analysis / Mihai Gurban 7: |
Defining Multimodality / 7.1: |
Advantages of Multimodal Analysis / 7.2: |
Multimodal Information Fusion / Norman Poh ; Josef Kittler 8: |
Levels of Fusion / 8.1: |
Adaptive versus Non-Adaptive Fusion / 8.3: |
Other Design Issues / 8.4: |
Modality Integration Methods / Mihai Gurban ; Jean-Philippe Thiran 9: |
Multimodal Fusion for AVSR / 9.1: |
Types of Fusion / 9.2.1: |
Multistream HMMs / 9.2.2: |
Stream Reliability Estimates / 9.2.3: |
Multimodal Speaker Localisation / 9.3: |
A Multimodal Recognition Framework for Joint Modality Compensation and Fusion / Konstantinos Moustakas ; Savvas Argyropoulos ; Dimitrios Tzovaras 10: |
Joint Modality Recognition and Applications / 10.1: |
A New Joint Modality Recognition Scheme / 10.3: |
Concept / 10.3.1: |
Theoretical Background / 10.3.2: |
Joint Modality Audio-Visual Speech Recognition / 10.4: |
Signature Extraction Stage / 10.4.1: |
Recognition Stage / 10.4.2: |
Joint Modality Recognition in Biometrics / 10.5: |
Results / 10.5.1: |
References / 10.6: |
Managing Multimodal Data, Metadata and Annotations: Challenges and Solutions / Andrei Popescu-Belis 11: |
Setting the Stage: Concepts and Projects / 11.1: |
Metadata versus Annotations / 11.2.1: |
Examples of Large Multimodal Collections / 11.2.2: |
Capturing and Recording Multimodal Data / 11.3: |
Capture Devices / 11.3.1: |
Synchronisation / 11.3.2: |
Activity Types in Multimodal Corpora / 11.3.3: |
Examples of Set-ups and Raw Data / 11.3.4: |
Reference Metadata and Annotations / 11.4: |
Gathering Metadata: Methods / 11.4.1: |
Metadata for the AMI Corpus / 11.4.2: |
Reference Annotations: Procedure and Tools / 11.4.3: |
Data Storage and Access / 11.5: |
Exchange Formats for Metadata and Annotations / 11.5.1: |
Data Servers / 11.5.2: |
Accessing Annotated Multimodal Data / 11.5.3: |
Conclusions and Perspectives / 11.6: |
Multimodal Human-Computer and Human-to-Human Interaction / Part III: |
Multimodal Input / Natalie Ruiz ; Fang Chen ; Sharon Oviatt 12: |
Advantages of Multimodal Input Interfaces / 12.1: |
State-of-the-Art Multimodal Input Systems / 12.2.1: |
Multimodality, Cognition and Performance / 12.3: |
Multimodal Perception and Cognition / 12.3.1: |
Cognitive Load and Performance / 12.3.2: |
Understanding Multimodal Input Behaviour / 12.4: |
Theoretical Frameworks / 12.4.1: |
Interpretation of Multimodal Input Patterns / 12.4.2: |
Adaptive Multimodal Interfaces / 12.5: |
Designing Multimodal Interfaces that Manage Users' Cognitive Load / 12.5.1: |
Designing Low-Load Multimodal Interfaces for Education / 12.5.2: |
Conclusions and Future Directions / 12.6: |
Multimodal Output: Facial Motion, Gestures and Synthesised Speech Synchronisation / Igor S. Pandžić 13: |
Basic AV Speech Synthesis / 13.1: |
The Animation System / 13.3: |
Coarticulation / 13.4: |
Extended AV Speech Synthesis / 13.5: |
Data-Driven Approaches / 13.5.1: |
Rule-Based Approaches / 13.5.2: |
Embodied Conversational Agents / 13.6: |
TTS Timing Issues / 13.7: |
On-the-Fly Synchronisation / 13.7.1: |
A Priori Synchronisation / 13.7.2: |
Interactive Representations of Multimodal Databases / Stéphane Marchand-Maillet ; Donn Morrison ; Enikö Szekely ; Eric Bruno 14: |
Multimodal Data Representation / 14.1: |
Multimodal Data Access / 14.3: |
Browsing as Extension of the Query Formulation Mechanism / 14.3.1: |
Browsing for the Exploration of the Content Space / 14.3.2: |
Alternative Representations / 14.3.3: |
Commercial Impact / 14.3.4: |
Gaining Semantic from User Interaction / 14.4: |
Multimodal Interactive Retrieval / 14.4.1: |
Crowdsourcing / 14.4.2: |
Conclusion and Discussion / 14.5: |
Modelling Interest in Face-to-Face Conversations from Multimodal Nonverbal Behaviour / Daniel Gatica-Perez 15: |
Perspectives on Interest Modelling / 15.1: |
Computing Interest from Audio Cues / 15.3: |
Computing Interest from Multimodal Cues / 15.4: |
Other Concepts Related to Interest / 15.5: |
Concluding Remarks / 15.6: |
Index |