Search records | Tokyo Institute of Technology Library Catalog Search

Book

1. Computer speech technology (: alk. paper)

Robert D. Rodman

Publication:	Boston : Artech House, c1999 xv, 344 p. ; 24 cm
Series:	The Artech House signal processing library

Table of contents:

Preface

About Speech / 1：

Introduction / 1.1：

How Speech Is Produced / 1.2：

The Vocal Tract / 1.2.1：

Articulatory Phonetics / 1.2.2：

Phonetic Alphabets / 1.2.3：

Prosody and Suprasegmentals / 1.2.4：

Syllables / 1.2.5：

Dialects / 1.2.6：

Languages (Other Than English) / 1.2.7：

Acoustic Phonetics / 1.3：

Phonemics / 1.4：

Articulatory Processes / 1.5：

References

Representing Speech in the Computer / 2：

Microphones / 2.1：

Sampling / 2.3：

Sampling Rate / 2.3.1：

Quantization / 2.3.2：

Speech Digitization / 2.4：

Wave Form Coders / 2.4.1：

Voice Coders (Vocoders) / 2.4.2：

The Frequency Domain / 2.5：

The Game of Jumble: Spectrum-Cepstrum, Frequency-Quefrency, Filtering-Liftering / 2.5.1：

Spectrograms: A Hybrid Representation of Speech / 2.5.2：

Speech Recognition / 3：

Speech Recognition: What It Is; What It Isn't / 3.1：

Why Is Speech Recognition Easy for Us and Difficult for Our Computers? / 3.3：

A Brief History of Speech Recognition / 3.4：

The Era of ARPA / 3.4.1：

After ARPA / 3.4.2：

Three Dimensions of Speech Recognition / 3.5：

Continuous Versus Noncontinuous / 3.5.1：

Speaker-Independent Versus Speaker-Dependent / 3.5.2：

Vocabulary Size / 3.5.3：

Tradeoffs and Interactions / 3.5.4：

Units of Speech Recognition / 3.6：

Words and Phrases / 3.6.1：

Phonemes / 3.6.2：

Diphones and Triphones / 3.6.4：

Representing the Units / 3.7：

Acoustic Features / 3.7.1：

Comparing the Units / 3.8：

Dynamic Time Warping (DTW) / 3.8.1：

Hidden Markov Models (HMMs) / 3.8.2：

Future Challenges I / 3.9：

Artificial Neural Networks (ANNs) / 3.9.1：

Errors / 3.10：

Types of Errors / 3.10.1：

Error Tolerances / 3.10.2：

Performance Evaluation of Speech Recognizers / 3.11：

Error Rates / 3.11.1：

Other Factors / 3.11.2：

Error Reduction / 3.12：

Environmental Effects / 3.12.1：

Human Factors / 3.12.2：

Subsetting / 3.12.3：

Vocabulary Selection / 3.12.4：

Error Detection and Correction / 3.13：

Feedback Systems / 3.13.1：

Higher Levels of Linguistic Knowledge / 3.13.2：

Automatic Error Correction / 3.13.3：

Future Challenges II / 3.14：

Speech Synthesis / 4：

Introduction and History / 4.1：

Parametric Coding (Electronic Synthesis) / 4.2：

Parameters of Parametric Speech Synthesis / 4.2.1：

Input Units of Parametric Speech Synthesis / 4.2.2：

Concatenative Synthesis / 4.3：

Allophone Concatenation / 4.3.1：

Diphone Concatenation / 4.3.2：

Demisyllable Concatenation / 4.3.3：

Waveform of Concatenative Units / 4.3.4：

Text-to-Speech Processing / 4.4：

Rules and Exceptions / 4.4.1：

Morphological Analysis / 4.4.2：

Articulation Effects / 4.4.3：

Prosody / 4.4.4：

Special Problems / 4.4.5：

Concept-to-Speech / 4.5：

Languages of the World / 4.6：

Performance Evaluation / 4.7：

Intelligibility / 4.7.1：

Comprehensibility / 4.7.2：

Pleasantness/Naturalness / 4.7.3：

Future Challenges / 4.8：

Speaker Recognition, Language Identification, and Lip Synchronization / 5：

Speaker Recognition / 5.1：

Speaker Recognition Versus Speech Recognition / 5.1.1：

Types of Speaker Recognition / 5.1.2：

Text-Dependent, Text-Independent, and Text-Prompted Speaker Recognition / 5.1.3：

"Voiceprints" / 5.1.4：

Methods of Speaker Recognition / 5.1.5：

Noise / 5.1.6：

Performance Evaluation of Speaker Recognition Systems / 5.1.7：

Co-channel Speaker Separation / 5.2：

Language Identification / 5.3：

Four Computational Approaches to Language Identification / 5.3.1：

Performance Evaluation of Language Identification Systems / 5.3.2：

Lip Synchronization / 5.4：

Visemes / 5.4.1：

Mapping Directly From the Speech Signal to Mouth Shapes / 5.4.2：

Applications in Speech Recognition / 6：

Criteria for a Viable Speech Recognition Application / 6.1：

Hands Busy, Eyes Busy / 6.1.1：

Remoteness / 6.1.2：

Miniaturization / 6.1.3：

2001 Won't Be 2001 / 6.2：

The Role of Human Factors in Speech Recognition Applications / 6.3：

Application Areas / 6.4：

Assistive Technology / 6.4.1：

Telecommunications / 6.4.2：

Command and Control / 6.4.3：

Data Entry and Retrieval / 6.4.4：

Education / 6.4.5：

Applications in Speech Synthesis / 7：

"At the Tone, the Time Will Be..." / 7.1：

When To Use Text-to-Speech; When To Use Digitally Recorded Speech / 7.2：

Interactive Voice Response Systems (IVRs) / 7.3：

Human Factors Revisited / 7.4：

Aid for Persons With Disabilities / 7.5：

Emergency Scenarios / 7.5.2：

En Masse Advisories / 7.5.4：

Information Retrieval / 7.5.5：

Information Reporting / 7.5.6：

Electronic Mail and Fax Readers / 7.5.7：

In the Dark / 7.5.8：

Toys and Games / 7.5.9：

Transportation / 7.5.10：

Government Services / 7.5.11：

Disguise / 7.5.12：

Applications in Speaker Recognition, Language Identification, and Lip Synchronization / 8：

Applications in Speaker Recognition / 8.1：

Access / 8.1.1：

Authentication / 8.1.2：

Monitoring / 8.1.3：

Fraud Prevention / 8.1.4：

Forensics / 8.1.5：

Personal Services / 8.1.6：

Applications in Language Identification / 8.2：

Communications Monitoring / 8.2.1：

Public Information Systems / 8.2.3：

Applications in Automatic Lip Synching / 8.3：

Animation / 8.3.1：

Glossary

About the Author

Index

Book

2. Robustness in automatic speech recognition : fundamentals and applications

Jean-Claude Junqua, Jean-Paul Haton ; foreword by Hisashi Wakita

Publication:	Boston : Kluwer Academic Publishers, c1996 xxx, 440 p. ; 25 cm
Series:	The Kluwer international series in engineering and computer science ; SECS 341. VLSI, computer architecture, and digital signal processing

Book

3. Acoustical and environmental robustness in automatic speech recognition

by Alejandro Acero ; foreword by Richard M. Stern

Publication:	Boston : Kluwer Academic Publishers, c1993 xx, 186 p. ; 25 cm
Series:	The Kluwer international series in engineering and computer science ; SECS 201. VLSI, computer architecture and digital signal processing

Table of contents:

List of Figures

List of Tables

Foreword

Introduction / 1：

Experimental Procedure / 2：

Frequency Domain Processing / 3：

The SDCN Algorithm / 4：

The CDCN Algorithm / 5：

Other Algorithms / 6：

Frequency Normalization / 7：

Summary of Results / 8：

Conclusions / 9：

Glossary / Appendix I：

Signal Processing in Sphinx / Appendix II：

The Bilinear Transform / Appendix III：

Spectral Estimation Issues / Appendix IV：

MMSE Estimation in the CDCN Algorithm / Appendix V：

Maximum Likelihood via the EM Algorithm / Appendix VI：

Estimation of Noise and Spectral Tilt / Appendix VII：

Vocabulary and Pronunciation Dictionary / Appendix VIII：

References

Index
