skip to content
Quality of synthetic speech : perceptual dimensions, influencing factors, and instrumental assessment Preview this item
ClosePreview this item
Checking...

Quality of synthetic speech : perceptual dimensions, influencing factors, and instrumental assessment

Author: Florian Hinterleitner
Publisher: Singapore : Springer, [2017]
Series: T-labs series in telecommunication services.
Edition/Format:   eBook : Document : EnglishView all editions and formats
Summary:
This book reviews research towards perceptual quality dimensions of synthetic speech, compares these findings with the state of the art, and derives a set of five universal perceptual quality dimensions for TTS signals. They are: (i) naturalness of voice, (ii) prosodic quality, (iii) fluency and intelligibility, (iv) absence of disturbances, and (v) calmness. Moreover, a test protocol for the efficient  Read more...
Rating:

(not yet rated) 0 with reviews - Be the first.

Subjects
More like this

 

Find a copy online

Links to this item

Find a copy in the library

&AllPage.SpinnerRetrieving; Finding libraries that hold this item...

Details

Genre/Form: Electronic books
Additional Physical Format: Printed edition:
Material Type: Document, Internet resource
Document Type: Internet Resource, Computer File
All Authors / Contributors: Florian Hinterleitner
ISBN: 9789811037344 9811037345
OCLC Number: 982121294
Description: 1 online resource.
Contents: Acknowledgements; Contents; Acronyms; Abstract; 1 Introduction; 1.1 Motivation; 1.2 Outline; References; 2 Speech Synthesis; 2.1 Setup of a Speech Synthesizer; 2.1.1 Natural Language Processing (NLP); 2.1.2 Prosody Generation; 2.1.3 Concatenation and Generation of Speech-Signal Parameters; 2.1.4 Speech Signal Generation; 2.2 The Mary Text-to-Speech System (MaryTTS); References; 3 Auditory and Instrumental Quality Evaluation Metrics; 3.1 What Is Perceptual Quality?; 3.2 Taxonomy for the Quality Assessment of Synthetic Speech; 3.2.1 Glass Box Versus Black Box. 3.2.2 Laboratory Versus Field Studies3.2.3 Linguistic Versus Acoustic; 3.2.4 Auditory Versus Instrumental; 3.3 Auditory Quality Evaluation Metrics; 3.3.1 Functional TestsThe content of this section has previously been published in a slightly different version in [6].; 3.3.2 Judgment TestsParts of the content of this section have previously been published in a slightly different version in [13] and [6].; 3.4 Instrumental Quality Evaluation Metrics; 3.4.1 Reference-Based MeasuresParts of the content of this section have previously been published in a slightly different version in [21]. 3.4.2 Reference-Free MeasuresReferences; 4 Perceptual Quality Dimensions; 4.1 State-of-the-Art Perceptual Quality DimensionsParts of the content of this section have previously been published in a slightly different version in [1].; 4.1.1 Study: Kraft and Portele (Kraft1995); 4.1.2 Study: Mayo et al. I (Mayo2005); 4.1.3 Study: Viswanathan and Viswanathan (Vis2005); 4.1.4 Study: Seget (Seget2007); 4.1.5 Study: Hinterleitner (Hint2010); 4.1.6 Study: Mayo et al. II (Mayo2011); 4.1.7 Restrictions of Discussed Studies. 4.2 Semantic Differential and Factor AnalysisParts of the content of this section have previously been published in a slightly different version in [13].4.2.1 Experimental Setup; 4.2.2 Statistical Analysis; 4.3 Sorting Task and Multidimensional ScalingParts of the content of this section have previously been published in a slightly different version in [16].; 4.3.1 Experimental Setup; 4.3.2 Statistical Analysis; 4.4 Summary of the SD/FA and ST/MDS StudiesParts of the content of this section have previously been published in a slightly different version in [16]. 4.5 4.5 Universal Perceptual Quality Dimensions4.5.1 Naturalness of Voice; 4.5.2 Prosodic Quality; 4.5.3 Fluency and Intelligibility; 4.5.4 Absence of Disturbances; 4.5.5 Calmness; 4.5.6 Instructions for TTS Quality Assessment; 4.6 Summary; References; 5 Influencing Factors on Perceptual Quality; 5.1 Influence of the ApplicationParts of the content of this section have previously been published in a slightly different version in [1].; 5.1.1 Pretest; 5.1.2 Main TestThe content of this section has previously been published in a slightly different version in [10].; 5.1.3 Conclusions.
Series Title: T-labs series in telecommunication services.
Responsibility: Florian Hinterleitner.

Abstract:

This book reviews research towards perceptual quality dimensions of synthetic speech, compares these findings with the state of the art, and derives a set of five universal perceptual quality  Read more...

Reviews

User-contributed reviews
Retrieving GoodReads reviews...
Retrieving DOGObooks reviews...

Tags

Be the first.
Confirm this request

You may have already requested this item. Please select Ok if you would like to proceed with this request anyway.

Linked Data


Primary Entity

<http://www.worldcat.org/oclc/982121294> # Quality of synthetic speech : perceptual dimensions, influencing factors, and instrumental assessment
    a schema:Book, schema:CreativeWork, schema:MediaObject ;
    library:oclcnum "982121294" ;
    library:placeOfPublication <http://id.loc.gov/vocabulary/countries/si> ;
    schema:about <http://experiment.worldcat.org/entity/work/data/4009894848#Topic/text_to_speech_software> ; # Text-to-speech software
    schema:about <http://dewey.info/class/006.454/e23/> ;
    schema:about <http://experiment.worldcat.org/entity/work/data/4009894848#Topic/speech_synthesis> ; # Speech synthesis
    schema:about <http://experiment.worldcat.org/entity/work/data/4009894848#Topic/speech_processing_systems> ; # Speech processing systems
    schema:about <http://experiment.worldcat.org/entity/work/data/4009894848#Topic/computers_general> ; # COMPUTERS--General
    schema:about <http://experiment.worldcat.org/entity/work/data/4009894848#Topic/telecommunication> ; # Telecommunication
    schema:about <http://experiment.worldcat.org/entity/work/data/4009894848#Topic/technology_&_engineering_telecommunications> ; # TECHNOLOGY & ENGINEERING--Telecommunications
    schema:bookFormat schema:EBook ;
    schema:creator <http://experiment.worldcat.org/entity/work/data/4009894848#Person/hinterleitner_florian> ; # Florian Hinterleitner
    schema:datePublished "2017" ;
    schema:description "This book reviews research towards perceptual quality dimensions of synthetic speech, compares these findings with the state of the art, and derives a set of five universal perceptual quality dimensions for TTS signals. They are: (i) naturalness of voice, (ii) prosodic quality, (iii) fluency and intelligibility, (iv) absence of disturbances, and (v) calmness. Moreover, a test protocol for the efficient indentification of those dimensions in a listening test is introduced. Furthermore, several factors influencing these dimensions are examined. In addition, different techniques for the instrumental quality assessment of TTS signals are introduced, reviewed and tested. Finally, the requirements for the integration of an instrumental quality measure into a concatenative TTS system are examined."@en ;
    schema:description "Acknowledgements; Contents; Acronyms; Abstract; 1 Introduction; 1.1 Motivation; 1.2 Outline; References; 2 Speech Synthesis; 2.1 Setup of a Speech Synthesizer; 2.1.1 Natural Language Processing (NLP); 2.1.2 Prosody Generation; 2.1.3 Concatenation and Generation of Speech-Signal Parameters; 2.1.4 Speech Signal Generation; 2.2 The Mary Text-to-Speech System (MaryTTS); References; 3 Auditory and Instrumental Quality Evaluation Metrics; 3.1 What Is Perceptual Quality?; 3.2 Taxonomy for the Quality Assessment of Synthetic Speech; 3.2.1 Glass Box Versus Black Box."@en ;
    schema:exampleOfWork <http://worldcat.org/entity/work/id/4009894848> ;
    schema:genre "Electronic books"@en ;
    schema:inLanguage "en" ;
    schema:isPartOf <http://experiment.worldcat.org/entity/work/data/4009894848#Series/t_labs_series_in_telecommunication_services> ; # T-labs series in telecommunication services.
    schema:isSimilarTo <http://worldcat.org/entity/work/data/4009894848#CreativeWork/> ;
    schema:name "Quality of synthetic speech : perceptual dimensions, influencing factors, and instrumental assessment"@en ;
    schema:productID "982121294" ;
    schema:url <https://link.springer.com/openurl?genre=book&isbn=978-981-10-3733-7> ;
    schema:url <https://grinnell.idm.oclc.org/login?url=http://link.springer.com/10.1007/978-981-10-3734-4> ;
    schema:url <http://dx.doi.org/10.1007/978-981-10-3734-4> ;
    schema:url <https://0-link-springer-com.pugwash.lib.warwick.ac.uk/book/10.1007/978-981-10-3734-4> ;
    schema:url <http://link.springer.com/10.1007/978-981-10-3734-4> ;
    schema:url <http://public.eblib.com/choice/publicfullrecord.aspx?p=4838376> ;
    schema:url <http://search.ebscohost.com/login.aspx?direct=true&scope=site&db=nlebk&db=nlabk&AN=1500577> ;
    schema:workExample <http://worldcat.org/isbn/9789811037344> ;
    schema:workExample <http://dx.doi.org/10.1007/978-981-10-3734-4> ;
    wdrs:describedby <http://www.worldcat.org/title/-/oclc/982121294> ;
    .


Related Entities

<http://experiment.worldcat.org/entity/work/data/4009894848#Person/hinterleitner_florian> # Florian Hinterleitner
    a schema:Person ;
    schema:familyName "Hinterleitner" ;
    schema:givenName "Florian" ;
    schema:name "Florian Hinterleitner" ;
    .

<http://experiment.worldcat.org/entity/work/data/4009894848#Series/t_labs_series_in_telecommunication_services> # T-labs series in telecommunication services.
    a bgn:PublicationSeries ;
    schema:hasPart <http://www.worldcat.org/oclc/982121294> ; # Quality of synthetic speech : perceptual dimensions, influencing factors, and instrumental assessment
    schema:name "T-labs series in telecommunication services." ;
    schema:name "T-labs series in telecommunication services" ;
    .

<http://experiment.worldcat.org/entity/work/data/4009894848#Topic/computers_general> # COMPUTERS--General
    a schema:Intangible ;
    schema:name "COMPUTERS--General"@en ;
    .

<http://experiment.worldcat.org/entity/work/data/4009894848#Topic/speech_processing_systems> # Speech processing systems
    a schema:Intangible ;
    schema:name "Speech processing systems"@en ;
    .

<http://experiment.worldcat.org/entity/work/data/4009894848#Topic/technology_&_engineering_telecommunications> # TECHNOLOGY & ENGINEERING--Telecommunications
    a schema:Intangible ;
    schema:name "TECHNOLOGY & ENGINEERING--Telecommunications"@en ;
    .

<http://experiment.worldcat.org/entity/work/data/4009894848#Topic/telecommunication> # Telecommunication
    a schema:Intangible ;
    schema:name "Telecommunication"@en ;
    .

<http://experiment.worldcat.org/entity/work/data/4009894848#Topic/text_to_speech_software> # Text-to-speech software
    a schema:Intangible ;
    schema:name "Text-to-speech software"@en ;
    .

<http://worldcat.org/entity/work/data/4009894848#CreativeWork/>
    a schema:CreativeWork ;
    schema:description "Printed edition:" ;
    schema:isSimilarTo <http://www.worldcat.org/oclc/982121294> ; # Quality of synthetic speech : perceptual dimensions, influencing factors, and instrumental assessment
    .

<http://worldcat.org/isbn/9789811037344>
    a schema:ProductModel ;
    schema:isbn "9811037345" ;
    schema:isbn "9789811037344" ;
    .

<http://www.worldcat.org/title/-/oclc/982121294>
    a genont:InformationResource, genont:ContentTypeGenericResource ;
    schema:about <http://www.worldcat.org/oclc/982121294> ; # Quality of synthetic speech : perceptual dimensions, influencing factors, and instrumental assessment
    schema:dateModified "2018-01-31" ;
    void:inDataset <http://purl.oclc.org/dataset/WorldCat> ;
    .


Content-negotiable representations

Close Window

Please sign in to WorldCat 

Don't have an account? You can easily create a free account.