Gold / Morgan / Ellis Speech and Audio Signal Processing

Processing and Perception of Speech and Music
2. Auflage 2011
ISBN: 978-1-118-14291-2
Verlag: John Wiley & Sons
Format: PDF
Kopierschutz: Adobe DRM (»Systemvoraussetzungen)

Häufig gestellte Fragen zu E-Books

E-Book, Englisch, 688 Seiten, E-Book

Speech Audio Signal Processing
2. Auflage 2011, 978-0-470-19536-9, Buch

Processing and Perception of Speech and Music

E-Book, Englisch, 688 Seiten, E-Book

ISBN: 978-1-118-14291-2
Verlag: John Wiley & Sons
Format: PDF
Kopierschutz: Adobe DRM (»Systemvoraussetzungen)

Häufig gestellte Fragen zu E-Books

115,99 €

(inkl. MwSt.)

versandkostenfreie Lieferung
sofort verfügbar

When Speech and Audio Signal Processing published in 1999,it stood out from its competition in its breadth of coverage andits accessible, intutiont-based style. This book was aimed atindividual students and engineers excited about the broad span ofaudio processing and curious to understand the availabletechniques. Since then, with the advent of the iPod in 2001,the field of digital audio and music has exploded, leading to amuch greater interest in the technical aspects of audioprocessing.
This Second Edition will update and revise the originalbook to augment it with new material describing both the enablingtechnologies of digital music distribution (most significantly theMP3) and a range of exciting new research areas in automatic musiccontent processing (such as automatic transcription, musicsimilarity, etc.) that have emerged in the past five years, drivenby the digital music revolution.
New chapter topics include:
* Psychoacoustic Audio Coding, describing MP3 and relatedaudio coding schemes based on psychoacoustic masking ofquantization noise
* Music Transcription, including automatically derivingnotes, beats, and chords from music signals.
* Music Information Retrieval, primarily focusing onaudio-based genre classification, artist/style identification, andsimilarity estimation.
* Audio Source Separation, including multi-microphonebeamforming, blind source separation, and the perception-inspiredtechniques usually referred to as Computational Auditory SceneAnalysis (CASA).

Gold / Morgan / Ellis Speech and Audio Signal Processing jetzt bestellen!

Autoren/Hrsg.

Gold, Ben

Morgan, Nelson

Ellis, Dan

Weitere Infos & Material

Inhaltsverzeichnis

PREFACE TO THE 2011 EDITION xxi
CHAPTER 1 INTRODUCTION 1
PART I HISTORICAL BACKGROUND
CHAPTER 2 SYNTHETIC A UDIO: A BRIEF HISTORY 9
CHAPTER 3 SPEECH ANALYSIS AND SYNTHESIS OVERVIEW 21
CHAPTER 4 BRIEF HISTORY OF AUTOMATIC SPEECH RECOGNITION 40
CHAPTER 5 SPEECH-RECOGNITION OVERVIEW 59
PART II MATHEMATICAL BACKGROUND
CHAPTER 6 DIGITAL SIGNAL PROCESSING 73
CHAPTER 7 DIGITAL FILTERSAND DISCRETE FOURIER TRANSFORM 87
CHAPTER 8 PATTERN CLASSIFICATION 105
CHAPTER 9 STATISTICAL PATTERN CLASSIFICATION 124
PART III ACOUSTICS
CHAPTER 10 WAVE BASICS 141
CHAPTER 11 ACOUSTIC TUBE MODELING OF SPEECH PRODUCTION 152
CHAPTER 12 MUSICAL INSTRUMENT ACOUSTICS 158
CHAPTER 13 ROOM ACOUSTICS 179
PART IV AUDITORY PERCEPTION
CHAPTER 14 EAR PHYSIOLOGY 193
CHAPTER 15 PSYCHOACOUSTICS 209
CHAPTER 16 MODELS OF PITCH PERCEPTION 218
CHAPTER 17 SPEECH PERCEPTION 232
CHAPTER 18 HUMAN SPEECH RECOGNITION 250
PART V SPEECH FEATURES
CHAPTER 19 THE AUDITORY SYSTEM AS A FILTER BANK 263
CHAPTER 20 THE CEPSTRUM AS A SPECTRAL ANALYZER 277
CHAPTER 21 LINEAR PREDICTION 286
PART VI A UTOMATIC SPEECH RECOGNITION
CHAPTER 22 FEATURE EXTRACTION FOR ASR 301
CHAPTER 23 LINGUISTIC CATEGORIES FOR SPEECH RECOGNITION 319
CHAPTER 24 DETERMINISTIC SEQUENCE RECOGNITION FOR ASR 337
CHAPTER 25 STATISTICAL SEQUENCE RECOGNITION 350
CHAPTER 26 STATISTICAL MODEL TRAINING 364
CHAPTER 27 DISCRIMINANT ACOUSTIC PROBABILITY ESTIMATION 381
CHAPTER 28 ACOUSTIC MODEL TRAINING: FURTHER TOPICS 394
CHAPTER 29 SPEECH RECOGNITION AND UNDERSTANDING 416
PART VII SYNTHESIS AND CODING
CHAPTER 30 SPEECH SYNTHESIS 431
CHAPTER 31 PITCH DETECTION 455
CHAPTER 32 VOCODERS 473
CHAPTER 33 LOW-RATE VOCODERS 493
CHAPTER 34 MEDIUM-RATE AND HIGH-RATE VOCODERS 505
CHAPTER 35 PERCEPTUAL A UDIO CODING 531
PART VIII OTHER APPLICATIONS
CHAPTER 36 SOME ASPECTS OF COMPUTER MUSIC SYNTHESIS 553
CHAPTER 37 MUSIC SIGNAL ANALYSIS 567
CHAPTER 38 MUSIC RETRIEVAL 581
CHAPTER 39 SOURCE SEPARATION 59
CHAPTER 40 SPEECH TRANSFORMATIONS 617
CHAPTER 41 SPEAKER VERIFICATION 633
CHAPTER 42 SPEAKER DIARIZATION 644

Über Autor(innen)

The late Ben Gold consulted at Massachusetts Institute ofTechnology and Lincoln Laboratory and taught at the University ofCalifornia at Berkeley. He was the author of Digital Processingof Signals and the coauthor of Theory and Applications ofDigital Signal Processing. Dr. Gold was an IEEE Fellow, memberof the National Academy of Engineering, and recipient of severalIEEE awards.
Nelson Morgan is the Director of the InternationalComputer Science Institute, an independent, not-for profit researchlaboratory affiliated with the University of California atBerkeley. Dr. Morgan is also Professor-in-Residence in theElectrical Engineering and Computer Sciences Department at UCBerkeley. Dr. Morgan is an IEEE Fellow.
Dan Ellis is Associate Professor in the ElectricalEngineering Department of Columbia University. Dr. Ellis'sLaboratory for Recognition and Organization of Speech and Audio(LabROSA) investigates how to extract high-level information fromaudio, including speech recognition, music description, andenvironmental sound processing.

Produktsicherheit

Fragen zum Artikel?

Ihre Fragen, Wünsche oder Anmerkungen

Vorname*

Nachname*

Ihre E-Mail-Adresse*

Kundennr.

Ihre Nachricht*

Lediglich mit * gekennzeichnete Felder sind Pflichtfelder.

Wenn Sie die im Kontaktformular eingegebenen Daten durch Klick auf den nachfolgenden Button übersenden, erklären Sie sich damit einverstanden, dass wir Ihr Angaben für die Beantwortung Ihrer Anfrage verwenden. Selbstverständlich werden Ihre Daten vertraulich behandelt und nicht an Dritte weitergegeben. Sie können der Verwendung Ihrer Daten jederzeit widersprechen. Das Datenhandling bei Sack Fachmedien erklären wir Ihnen in unserer Datenschutzerklärung.

115,99 € (inkl. MwSt.)

sofort verfügbar

Webcode: www2.sack.de/t7615