Search records | 東京工業大学附属図書館蔵書検索

図書

1. Multimodal signal processing : theory and applications for human-computer interaction

Edited by Jean-Philippe Thiran, Ferran Marqués, Hervé Bourlard

出版情報:	Oxford : Academic Press, 2010 xiv, 328 p. ; 24 cm
シリーズ名:	EURASIP and Academic Press Series in Signal and Image Processing
子書誌情報:	loading…
所蔵情報:	loading…

目次情報: 続きを見る

Preface

Introduction / Jean-Philippe Thiran ; Ferran Marqués ; Hervé Bourlard1：

Signal Processing, Modelling and Related Mathematical Tools / Part I：

Statistical Machine Learning for HCI / Samy Bengio2：

Introduction to Statistical Learning / 2.1：

Types of Problem / 2.2.1：

Function Space / 2.2.2：

Loss Functions / 2.2.3：

Expected Risk and Empirical Risk / 2.2.4：

Statistical Learning Theory / 2.2.5：

Support Vector Machines for Binary Classification / 2.3：

Hidden Markov Models for Speech Recognition / 2.4：

Speech Recognition / 2.4.1：

Markovian Processes / 2.4.2：

Hidden Markov Models / 2.4.3：

Inference and Learning with HMMs / 2.4.4：

HMMs for Speech Recognition / 2.4.5：

Conclusion / 2.5：

References

Speech Processing / Thierry Dutoit ; Stéphane Dupont3：

Feature Extraction / 3.1：

Acoustic Modelling / 3.2.2：

Language Modelling / 3.2.3：

Decoding / 3.2.4：

Multiple Sensors / 3.2.5：

Confidence Measures / 3.2.6：

Robustness / 3.2.7：

Speaker Recognition / 3.3：

Overview / 3.3.1：

Text-to-Speech Synthesis / 3.3.2：

Natural Language Processing for Speech Synthesis / 3.4.1：

Concatenative Synthesis with a Fixed Inventory / 3.4.2：

Unit Selection-Based Synthesis / 3.4.3：

Statistical Parametric Synthesis / 3.4.4：

Conclusions / 3.5：

Natural Language and Dialogue Processing / Olivier Pietquin4：

Natural Language Understanding / 4.1：

Syntactic Parsing / 4.2.1：

Semantic Parsing / 4.2.2：

Contextual Interpretation / 4.2.3：

Natural Language Generation / 4.3：

Document Planning / 4.3.1：

Microplanning / 4.3.2：

Surface Realisation / 4.3.3：

Dialogue Processing / 4.4：

Discourse Modelling / 4.4.1：

Dialogue Management / 4.4.2：

Degrees of Initiative / 4.4.3：

Evaluation / 4.4.4：

Image and Video Processing Tools for HCI / Montse Pardàs ; Verónica Vilaplana ; Cristian Canton-Ferrer4.5：

Face Analyses / 5.1：

Face Detection / 5.2.1：

Face Tracking / 5.2.2：

Facial Feature Detection and Tracking / 5.2.3：

Gaze Analysis / 5.2.4：

Face Recognition / 5.2.5：

Facial Expression Recognition / 5.2.6：

Hand-Gesture Analysis / 5.3：

Head Orientation Analysis and FoA Estimation / 5.4：

Head Orientation Analysis / 5.4.1：

Focus of Attention Estimation / 5.4.2：

Body Gesture Analysis / 5.5：

Processing of Handwriting and Sketching Dynamics / Claus Vielhauer5.6：

History of Handwriting Modality and the Acquisition of Online Handwriting Signals / 6.1：

Basics in Acquisition, Examples for Sensors / 6.3：

Analysis of Online Handwriting and Sketching Signals / 6.4：

Overview of Recognition Goals in HCI / 6.5：

Sketch Recognition for User Interface Design / 6.6：

Similarity Search in Digital Ink / 6.7：

Summary and Perspectives for Handwriting and Sketching in HCI / 6.8：

Multimodal Signal Processing and Modelling / Part II：

Basic Concepts of Multimodal Analysis / Mihai Curban7：

Defining Multimodality / 7.1：

Advantages of Multimodal Analysis / 7.2：

Multimodal Information Fusion / Norman Poh ; Josef Kittler7.3：

Levels of Fusion / 8.1：

Adaptive versus Non-Adaptive Fusion / 8.3：

Other Design Issues / 8.4：

Modality Integration Methods / Mihai Gurban ; jean-Philippe Thiran8.5：

Multimodal Fusion for AVSR / 9.1：

Types of Fusion / 9.2.1：

Multistream HMMs / 9.2.2：

Stream Reliability Estimates / 9.2.3：

Multimodal Speaker Localisation / 9.3：

A Multimodal Recognition Framework for Joint Modality Compensation and Fusion / Konstantinos Moustakas ; Savvas Argyropoulos ; Dimitrios Tzovaras9.4：

Joint Modality Recognition and Applications / 10.1：

A New Joint Modality Recognition Scheme / 10.3：

Concept / 10.3.1：

Theoretical Background / 10.3.2：

Joint Modality Audio-Visual Speech Recognition / 10.4：

Signature Extraction Stage / 10.4.1：

Recognition Stage / 10.4.2：

Joint Modality Recognition in Biometrics / 10.5：

Results / 10.5.1：

References|204 / 10.6：

Managing Multimodal Data, Metadata and Annotations: Challenges and Solutions / Andrei Popescu-Belis11：

Setting the Stage: Concepts and Projects / 11.1：

Metadate-versusAnnotations / 11.2.l：

Examples of Large Multimodal Collections / 11.2.2：

Capturing and Recording Multimodal Data / 11.3：

Capture Devices / 11.3.1：

Synchronisation / 11.3.2：

Activity Types in Multimodal Corpora / 11.3.3：

Examples of Set-ups and Raw Data / 11.3.4：

Reference Metadata and Annotations / 11.4：

Gathering Metadata: Methods / 11.4.1：

Metadata for the AMI Corpus / 11.4.2：

Reference Annotations: Procedure and Tools / 11.4.3：

Data Storage and Access / 11.5：

Exchange Formats for Metadata and Annotations / 111.5.1：

Data Servers / 111.5.2：

Accessing Annotated Multimodal Data / 111.5.3：

Conclusions and Perspectives / 11.6：

Multimodal Human-Computer and Human-to-Human Interaction / Part III：

Multimodal Input / Natalie Ruiz ; Fang Chen ; Sharon Oviatt12：

Advantages of Multimodal Input Interfaces / 12.1：

State-of-the-Art Multimodal Input Systems / 12.2.1：

Multimodality, Cognition and Performance / 12.3：

Multimodal Perception and Cognition / 12.3.1：

Cognitive Load and Performance / 12.3.2：

Understanding Multimodal Input Behaviour / 12.4：

Theoretical Frameworks / 12.4.1：

Interpretation of Multimodal Input Patterns / 12.4.2：

Adaptive Multimodal Interfaces / 12.5：

Designing Multimodal Interfaces that Manage Users' Cognitive Load / 12.5.1：

Designing Low-Load Multimodal Interfaces for Education / 12.5.2：

Conclusions and Future Directions / 12.6：

MuItimodal Output: Facial Motion, Gestures and Synthesised Speech Synchronisation / Igor S. Pand ić13：

Basic AV Speech Synthesis / 13.1：

The Animation System / 13.3：

Coarticulation / 13.4：

Extended AV Speech Synthesis / 13.5：

Data-Driven Approaches / 13.5.1：

Rule-Based Approaches / 13.5.2：

Embodied Conversational Agents / 13.6：

TTS Timing Issues / 13.7：

On-the-Fly Synchronisation / 13.7.1：

A Priori Synchronisation / 13.7.2：

Interactive Representations of Multimodal Databases / Stéphane Marchand-Maillet ; Donn Morrison ; Enikö Szekely ; Eric Bruno13.8：

Multimodal Data Representation / 14.1：

Multimodal Data Access / 14.3：

Browsing as Extension of the Query Formulation Mechanism / 14.3.1：

Browsing for the Exploration of the Content Space / 14.3.2：

Alternative Representations / 14.3.3：

Commercial Impact / 14.3.4：

Gaining Semantic from User Interaction / 14.4：

Multimodal Interactive Retrieval / 14.4.1：

Crowdsourcing / 14.4.2：

Conclusion and Discussion / 14.5：

Modelling Interest in Face-to-Face Conversations from Multimodal Nonverbal Behaviour / Daniel Catica-Perez15：

Perspectives on Interest Modelling / 15.1：

Computing Interest from Audio Cues / 15.3：

Computing interest from Multimodal Cues / 15.4：

Other Concepts Related to Interest / 15.5：

Concluding Remarks / 15.6：

Index

Preface

Introduction / Jean-Philippe Thiran ; Ferran Marqués ; Hervé Bourlard1：

Signal Processing, Modelling and Related Mathematical Tools / Part I：

電子ブック

ＥＢ

2. Speech and audio signal processing : processing and perception of speech and music. 2nd ed (: electronic bk)

Ben Gold, Nelson Morgan, Dan Ellis ; with contributions from Hervé Bourlard ... [et al.]

出版情報:	Wiley Online Library, 2011 1 online resource (xxii, 661p.)
子書誌情報:	loading…
所蔵情報:	loading…

目次情報: 続きを見る

Preface To The 2011 Edition

Introduction / Chapter 1：

Historical Background / Part I：

Synthetic A Udio: A Brief History / Chapter 2：

Speech Analysis And Synthesis Overview / Chapter 3：

Brief History Of Automatic Speech Recognition / Chapter 4：

Speech-Recognition Overview / Chapter 5：

Mathematical Background / Part II：

Digital Signal Processing / Chapter 6：

Digital Filtersand Discrete Fourier Transform / Chapter 7：

Pattern Classification / Chapter 8：

Statistical Pattern Classification / Chapter 9：

Acoustics / Part III：

Wave Basics / Chapter 10：

Acoustic Tube Modeling Of Speech Production / Chapter 11：

Musical Instrument Acoustics / Chapter 12：

Room Acoustics / Chapter 13：

Auditory Perception / Part IV：

Ear Physiology / Chapter 14：

Psychoacoustics / Chapter 15：

Models Of Pitch Perception / Chapter 16：

Speech Perception / Chapter 17：

Human Speech Recognition / Chapter 18：

Speech Features / Part V：

The Auditory System As A Filter Bank / Chapter 19：

The Cepstrum As A Spectral Analyzer / Chapter 20：

Linear Prediction / Chapter 21：

A Utomatic Speech Recognition / Part VI：

Feature Extraction For Asr / Chapter 22：

Linguistic Categories For Speech Recognition / Chapter 23：

Deterministic Sequence Recognition For Asr / Chapter 24：

Statistical Sequence Recognition / Chapter 25：

Statistical Model Training / Chapter 26：

Discriminant Acoustic Probability Estimation / Chapter 27：

Acoustic Model Training: Further Topics / Chapter 28：

Speech Recognition And Understanding / Chapter 29：

Synthesis And Coding / Part VII：

Speech Synthesis / Chapter 30：

Pitch Detection / Chapter 31：

Vocoders / Chapter 32：

Low-Rate Vocoders / Chapter 33：

Medium-Rate And High-Rate Vocoders / Chapter 34：

Perceptual A Udio Coding / Chapter 35：

Other Applications / Part VIII：

Some Aspects Of Computer Music Synthesis / Chapter 36：

Music Signal Analysis / Chapter 37：

Music Retrieval / Chapter 38：

Source Separation / Chapter 39：

Speech Transformations / Chapter 40：

Speaker Verification / Chapter 41：

Speaker Diarization / Chapter 42：

Preface To The 2011 Edition

Introduction / Chapter 1：

Historical Background / Part I：

文献の複写および貸借の依頼を行う

文献複写・貸借依頼

絞り込み条件: