skip to content
Spoken language processing : a guide to theory, algorithm, and system development. Preview this item
ClosePreview this item

Spoken language processing : a guide to theory, algorithm, and system development.

Author: Xuedong Huang; Alex Acero; Hsiao-Wuen Hon
Publisher: Estados Unidos : Prentice Hall, ©2001.
Edition/Format:   Print book : EnglishView all editions and formats

This title is a guide to building systems that interact with the user via speech as well as other modalities. The fundamentals of speech recognition, text to speech and dialogue processing are  Read more...


(not yet rated) 0 with reviews - Be the first.

More like this


Find a copy in the library

&AllPage.SpinnerRetrieving; Finding libraries that hold this item...


Document Type: Book
All Authors / Contributors: Xuedong Huang; Alex Acero; Hsiao-Wuen Hon
ISBN: 0130226165 9780130226167
OCLC Number: 926434379
Description: 980 páginas
Contents: (NOTE: Each chapter ends with Historical Perspective and Further Reading.) 1. Introduction. Motivations. Spoken Language System Architecture. Book Organization. Target Audiences. I. FUNDAMENTAL THEORY. 2. Spoken language Structure. Sound and Human Speech Systems. Phonetics and Phonology. Syllables and Words. Syntax and Semantics. 3. Probability, Statistics, and Information Theory. Probability Theory. Estimation Theory. Significance Testing. Information Theory. 4. Pattern Recognition. Bayes' Decision Theory. How to Construct Classifiers. Discriminative Training. Unsupervised Estimation Methods. Classification and Regression Trees. II. SPEECH PROCESSING. 5. Digital Signal Processing. Digital Signals and Systems. Continuous-Frequency Transforms. Discrete-Frequency Transforms. Digital Filters and Windows. Digital Processing of Analog Signals. Multirate Signal Processing. Filterbanks. Stochastic Processes. 6. Speech Signal Representations. Short-Time Fourier Analysis. Acoustical Model of Speech Production. Linear Predictive Coding. Cepstral Processing. Perceptually Motivated Representations. Formant Frequencies. The Role of Pitch. 7. Speech Coding. Speech Coders Attributes. Scalar Waveform Coders. Scalar Frequency Domain Coders. Code Excited Linear Prediction (CELP). Low-Brit Speech Coders. III. SPEECH RECOGNITION. 8. Hidden Markov Models. The Markov Chain. Definition of the Hidden Markov Model. Continuous and Semicontinuous HMMs. Practical Issues in Using HMMs. HMM Limitations. 9. Acoustic Modeling. Variability in the Speech Signal. How to Measure Speech Recognition Errors. Signal Processing-Extracting Features. Phonectic Modeling-Selecting Appropriate Units. Acoustic Modeling-Scoring Acoustic Features. Adaptive Techniques-Minimizing Mismatches. Confidence Measures: Measuring the Reliability. Other Techniques. Case Study: Whisper. 10. Environmental Robustness. The Acoustical Environment. Acoustical Transducers. Adaptive Echo Cancellation (AEC). Multimicrophone Speech Enhancement. Environment Compensation Preprocessing. Environment Model Adaptation. Modeling Nonstationary Noise. 11. Language Modeling. Formal Language Theory. Stochastic Language Models. Complexity Measure of Language Models. N-Gram Smoothing. Adaptive Language Models. Practical Issues. 12. Basic Search Algorithms. Basic Search Algorithms. Search Algorithms for Speech Recognition. Language Model States. Time-Synchronous Viterbi Beam Search. Stack Decoding (A Search). 13. Large-Vocabulary Search Algorithms. Efficient Manipulation of a Tree Lexicon. Other Efficient Search Techniques. N-Best and Multipass Search Strategies. Search-Algorithm Evaluation. Case Study-Microsoft Whisper. IV. TEXT-TO-SPEECH SYSTEMS. 14. Text and Phonetic Analysis. Modules and Data Flow. Lexicon. Document Structured Detection. Text Normalization. Linguistic Analysis. Homograph Disambiguation. Morphological Analysis. Letter-to-Sound Conversion. Evaluation. Case Study: Festival. 15. Prosody. The Role of Understanding. Prosody Generation Schematic. Speaking Style. Symbolic Prosody. Duration Assignment. Pitch Generation. Prosody Markup Languages. Prosody Evaluation. 16. Speech Synthesis. Attributes of Speech Synthesis. Formant Speech Synthesis. Concatenative Speech Synthesis. Prosodic Modification of Speech. Source-Filter Models for Prosody Modification. Evaluation of TTS Systems. V. SPOKEN LANGUAGE SYSTEMS. 17. Spoken Language Understanding. Written vs. Spoken Languages. Dialog Structure. Semantic Representation. Sentence Interpretation. Discourse Analysis. Dialog Management. Response Generation and Rendition. Evaluation. Case Study-Dr. Who. 18. Applications and User Interfaces. Application Architecture. Typical Applications. Speech Interface Design. Internationalization. Case Study-MIPAD. Index.


User-contributed reviews
Retrieving GoodReads reviews...
Retrieving DOGObooks reviews...


Be the first.
Confirm this request

You may have already requested this item. Please select Ok if you would like to proceed with this request anyway.

Linked Data

Primary Entity

<> # Spoken language processing : a guide to theory, algorithm, and system development.
    a schema:CreativeWork, schema:Book ;
   library:oclcnum "926434379" ;
   library:placeOfPublication <> ; # Estados Unidos
   schema:about <> ; # Lingüística computacional
   schema:about <> ; # Procesamiento de lenguaje natural
   schema:about <> ; # Reconocimiento de modelos
   schema:about <> ; # Modelos del lenguaje
   schema:about <> ;
   schema:about <> ; # Algoritmos de busqueda
   schema:about <> ; # Modelos ocultos de markov
   schema:about <> ; # Reconocimiento automático de la voz
   schema:about <> ; # Inteligencia artificial
   schema:author <> ; # Hsiao-Wuen Hon
   schema:author <> ; # Xuedong Huang
   schema:author <> ; # Alex Acero
   schema:bookFormat bgn:PrintBook ;
   schema:copyrightYear "2001" ;
   schema:exampleOfWork <> ;
   schema:inLanguage "en" ;
   schema:name "Spoken language processing : a guide to theory, algorithm, and system development." ;
   schema:productID "926434379" ;
   schema:publication <> ;
   schema:publisher <> ; # Prentice Hall
   schema:workExample <> ;
   wdrs:describedby <> ;

Related Entities

<> # Alex Acero
    a schema:Person ;
   schema:familyName "Acero" ;
   schema:givenName "Alex" ;
   schema:name "Alex Acero" ;

<> # Hsiao-Wuen Hon
    a schema:Person ;
   schema:familyName "Hon" ;
   schema:givenName "Hsiao-Wuen" ;
   schema:name "Hsiao-Wuen Hon" ;

<> # Xuedong Huang
    a schema:Person ;
   schema:familyName "Huang" ;
   schema:givenName "Xuedong" ;
   schema:name "Xuedong Huang" ;

<> # Algoritmos de busqueda
    a schema:Intangible ;
   schema:name "Algoritmos de busqueda" ;

<> # Inteligencia artificial
    a schema:Intangible ;
   schema:name "Inteligencia artificial" ;

<> # Lingüística computacional
    a schema:Intangible ;
   schema:name "Lingüística computacional" ;

<> # Modelos del lenguaje
    a schema:Intangible ;
   schema:name "Modelos del lenguaje" ;

<> # Modelos ocultos de markov
    a schema:Intangible ;
   schema:name "Modelos ocultos de markov" ;

<> # Procesamiento de lenguaje natural
    a schema:Intangible ;
   schema:name "Procesamiento de lenguaje natural" ;

<> # Reconocimiento automático de la voz
    a schema:Intangible ;
   schema:name "Reconocimiento automático de la voz" ;

<> # Reconocimiento de modelos
    a schema:Intangible ;
   schema:name "Reconocimiento de modelos" ;

    a schema:ProductModel ;
   schema:isbn "0130226165" ;
   schema:isbn "9780130226167" ;

    a genont:InformationResource, genont:ContentTypeGenericResource ;
   schema:about <> ; # Spoken language processing : a guide to theory, algorithm, and system development.
   schema:dateModified "2017-03-31" ;
   void:inDataset <> ;

Content-negotiable representations

Close Window

Please sign in to WorldCat 

Don't have an account? You can easily create a free account.