skip to content
Bitext alignment Preview this item
ClosePreview this item
Checking...

Bitext alignment

Author: Jörg Tiedemann
Publisher: San Rafael, Calif. (1537 Fourth Street, San Rafael, CA 94901 USA) : Morgan & Claypool, ©2011.
Series: Synthesis lectures on human language technologies, lecture #14.
Edition/Format:   eBook : Document : EnglishView all editions and formats
Summary:
This book provides an overview of various techniques for the alignment of bitexts. It describes general concepts and strategies that can be applied to map corresponding parts in parallel documents on various levels of granularity. Bitexts are valuable linguistic resources for many different research fields and practical applications. The most predominant application is machine translation, in particular, statistical  Read more...
Rating:

(not yet rated) 0 with reviews - Be the first.

Subjects
More like this

Find a copy online

Links to this item

Find a copy in the library

&AllPage.SpinnerRetrieving; Finding libraries that hold this item...

Details

Genre/Form: Electronic books
Additional Physical Format: Print version:
Tiedemann, Jorg.
Bitext alignment.
[S.l.] : M & C, 2011
(OCoLC)741023825
Material Type: Document, Internet resource
Document Type: Internet Resource, Computer File
All Authors / Contributors: Jörg Tiedemann
ISBN: 9781608455119 1608455114 9781608455102 1608455106
OCLC Number: 742535715
Description: 1 online resource (viii, 153 pages) : illustrations.
Contents: Preface --
Acknowledgments --
1. Introduction --
Applications --
Further readings --
2. Basic concepts and terminology --
Bitext and alignment --
Alignment and segmentation --
Alignment spaces and constraints --
Correlations and cues --
Alignment models and search algorithms --
Evaluation of bitext alignment --
Summary and further reading --
3. Building parallel corpora --
Document alignment --
Mining the web --
Extracting parallel data from comparable corpora --
Summary and further reading --
4. Sentence alignment --
Length-based approaches --
Lexical matching approaches --
Combined and resource-specific techniques --
Summary and further reading --
5. Word alignment --
Generative alignment models --
Constraints and heuristics --
Discriminative alignment models --
Translation spotting and bilingual lexicon induction --
Summary and further reading --
6. Phrase and tree alignment --
Parallel treebanks and tree alignment --
Hierarchical alignment and transduction grammars --
Summary and further reading --
7. Concluding remarks --
Final recommendations --
A. Resources & tools --
Bibliography --
Author's biography.
Series Title: Synthesis lectures on human language technologies, lecture #14.
Responsibility: Jörg Tiedemann.
More information:

Abstract:

Provides an overview of various techniques for the alignment of bitexts. The text describes general concepts and strategies that can be applied to map corresponding parts in parallel documents on  Read more...

Reviews

Editorial reviews

Publisher Synopsis

"Overall, Bitext Alignment is a very well written book which comprehensively addresses all aspects of bitext alignment. It is self-contained and requires only a basic prior knowledge of the theory of Read more...

 
User-contributed reviews
Retrieving GoodReads reviews...
Retrieving DOGObooks reviews...

Tags

Be the first.
Confirm this request

You may have already requested this item. Please select Ok if you would like to proceed with this request anyway.

Linked Data


Primary Entity

<http://www.worldcat.org/oclc/742535715> # Bitext alignment
    a schema:CreativeWork, schema:MediaObject, schema:Book ;
    library:oclcnum "742535715" ;
    library:placeOfPublication <http://id.loc.gov/vocabulary/countries/cau> ;
    library:placeOfPublication <http://experiment.worldcat.org/entity/work/data/932317553#Place/san_rafael_calif_1537_fourth_street_san_rafael_ca_94901_usa> ; # San Rafael, Calif. (1537 Fourth Street, San Rafael, CA 94901 USA)
    rdfs:comment "Warning: This malformed URI has been treated as a string - 'https://ebookcentral.proquest.com/lib/unt/detail.action?docID=881222";'" ;
    schema:about <http://dewey.info/class/418.020285/e22/> ;
    schema:about <http://experiment.worldcat.org/entity/work/data/932317553#Thing/sentence_alignment> ; # Sentence alignment
    schema:about <http://experiment.worldcat.org/entity/work/data/932317553#Thing/word_alignment> ; # Word alignment
    schema:about <http://experiment.worldcat.org/entity/work/data/932317553#Thing/alignment> ; # Alignment
    schema:about <http://experiment.worldcat.org/entity/work/data/932317553#Thing/transduction_grammars> ; # Transduction grammars
    schema:about <http://experiment.worldcat.org/entity/work/data/932317553#Thing/lexicon_induction> ; # Lexicon induction
    schema:about <http://experiment.worldcat.org/entity/work/data/932317553#Thing/bitexts> ; # Bitexts
    schema:about <http://experiment.worldcat.org/entity/work/data/932317553#Thing/parallel_corpora> ; # Parallel corpora
    schema:about <http://id.worldcat.org/fast/1004851> ; # Machine translating
    schema:about <http://id.worldcat.org/fast/871998> ; # Computational linguistics
    schema:about <http://experiment.worldcat.org/entity/work/data/932317553#Topic/language_arts_&_disciplines_translating_&_interpreting> ; # LANGUAGE ARTS & DISCIPLINES--Translating & Interpreting
    schema:about <http://experiment.worldcat.org/entity/work/data/932317553#Thing/statistical_machine_translation> ; # Statistical machine translation
    schema:about <http://experiment.worldcat.org/entity/work/data/932317553#Thing/tree_alignment> ; # Tree alignment
    schema:about <http://experiment.worldcat.org/entity/work/data/932317553#Thing/text_mining> ; # Text mining
    schema:bookFormat schema:EBook ;
    schema:copyrightYear "2011" ;
    schema:creator <http://viaf.org/viaf/53859662> ; # Jörg Tiedemann
    schema:datePublished "2011" ;
    schema:description "This book provides an overview of various techniques for the alignment of bitexts. It describes general concepts and strategies that can be applied to map corresponding parts in parallel documents on various levels of granularity. Bitexts are valuable linguistic resources for many different research fields and practical applications. The most predominant application is machine translation, in particular, statistical machine translation. However, there are various other threads that can be followed which may be supported by the rich linguistic knowledge implicitly stored in parallel resources. Bitexts have been explored in lexicography, word sense disambiguation, terminology extraction, computer-aided language learning and translation studies to name just a few. The book covers the essential tasks that have to be carried out when building parallel corpora starting from the collection of translated documents up to sub-sentential alignments. In particular, it describes various approaches to document alignment, sentence alignment, word alignment and tree structure alignment. It also includes a list of resources and a comprehensive review of the literature on alignment techniques."@en ;
    schema:description "Preface -- Acknowledgments -- 1. Introduction -- Applications -- Further readings -- 2. Basic concepts and terminology -- Bitext and alignment -- Alignment and segmentation -- Alignment spaces and constraints -- Correlations and cues -- Alignment models and search algorithms -- Evaluation of bitext alignment -- Summary and further reading -- 3. Building parallel corpora -- Document alignment -- Mining the web -- Extracting parallel data from comparable corpora -- Summary and further reading -- 4. Sentence alignment -- Length-based approaches -- Lexical matching approaches -- Combined and resource-specific techniques -- Summary and further reading -- 5. Word alignment -- Generative alignment models -- Constraints and heuristics -- Discriminative alignment models -- Translation spotting and bilingual lexicon induction -- Summary and further reading -- 6. Phrase and tree alignment -- Parallel treebanks and tree alignment -- Hierarchical alignment and transduction grammars -- Summary and further reading -- 7. Concluding remarks -- Final recommendations -- A. Resources & tools -- Bibliography -- Author's biography."@en ;
    schema:exampleOfWork <http://worldcat.org/entity/work/id/932317553> ;
    schema:genre "Electronic books"@en ;
    schema:inLanguage "en" ;
    schema:isPartOf <http://worldcat.org/issn/1947-4059> ; # Synthesis lectures on human language technologies ;
    schema:isSimilarTo <http://www.worldcat.org/oclc/741023825> ;
    schema:name "Bitext alignment"@en ;
    schema:productID "742535715" ;
    schema:publication <http://www.worldcat.org/title/-/oclc/742535715#PublicationEvent/san_rafael_calif_1537_fourth_street_san_rafael_ca_94901_usa_morgan_&_claypool_2011> ;
    schema:publisher <http://experiment.worldcat.org/entity/work/data/932317553#Agent/morgan_&_claypool> ; # Morgan & Claypool
    schema:url <http://www.morganclaypool.com/doi/abs/10.2200/S00367ED1V01Y201106HLT014> ;
    schema:url <http://search.ebscohost.com/login.aspx?direct=true&scope=site&db=nlebk&db=nlabk&AN=440434> ;
    schema:url "https://ebookcentral.proquest.com/lib/unt/detail.action?docID=881222";" ;
    schema:url <http://public.eblib.com/choice/publicfullrecord.aspx?p=881222> ;
    schema:url <http://dx.doi.org/10.2200/S00367ED1V01Y201106HLT014> ;
    schema:workExample <http://worldcat.org/isbn/9781608455102> ;
    schema:workExample <http://dx.doi.org/10.2200/S00367ED1V01Y201106HLT014> ;
    schema:workExample <http://worldcat.org/isbn/9781608455119> ;
    wdrs:describedby <http://www.worldcat.org/title/-/oclc/742535715> ;
    .


Related Entities

<http://experiment.worldcat.org/entity/work/data/932317553#Agent/morgan_&_claypool> # Morgan & Claypool
    a bgn:Agent ;
    schema:name "Morgan & Claypool" ;
    .

<http://experiment.worldcat.org/entity/work/data/932317553#Place/san_rafael_calif_1537_fourth_street_san_rafael_ca_94901_usa> # San Rafael, Calif. (1537 Fourth Street, San Rafael, CA 94901 USA)
    a schema:Place ;
    schema:name "San Rafael, Calif. (1537 Fourth Street, San Rafael, CA 94901 USA)" ;
    .

<http://experiment.worldcat.org/entity/work/data/932317553#Thing/lexicon_induction> # Lexicon induction
    a schema:Thing ;
    schema:name "Lexicon induction" ;
    .

<http://experiment.worldcat.org/entity/work/data/932317553#Thing/parallel_corpora> # Parallel corpora
    a schema:Thing ;
    schema:name "Parallel corpora" ;
    .

<http://experiment.worldcat.org/entity/work/data/932317553#Thing/sentence_alignment> # Sentence alignment
    a schema:Thing ;
    schema:name "Sentence alignment" ;
    .

<http://experiment.worldcat.org/entity/work/data/932317553#Thing/statistical_machine_translation> # Statistical machine translation
    a schema:Thing ;
    schema:name "Statistical machine translation" ;
    .

<http://experiment.worldcat.org/entity/work/data/932317553#Thing/transduction_grammars> # Transduction grammars
    a schema:Thing ;
    schema:name "Transduction grammars" ;
    .

<http://experiment.worldcat.org/entity/work/data/932317553#Thing/tree_alignment> # Tree alignment
    a schema:Thing ;
    schema:name "Tree alignment" ;
    .

<http://experiment.worldcat.org/entity/work/data/932317553#Thing/word_alignment> # Word alignment
    a schema:Thing ;
    schema:name "Word alignment" ;
    .

<http://experiment.worldcat.org/entity/work/data/932317553#Topic/language_arts_&_disciplines_translating_&_interpreting> # LANGUAGE ARTS & DISCIPLINES--Translating & Interpreting
    a schema:Intangible ;
    schema:name "LANGUAGE ARTS & DISCIPLINES--Translating & Interpreting"@en ;
    .

<http://id.worldcat.org/fast/1004851> # Machine translating
    a schema:Intangible ;
    schema:name "Machine translating"@en ;
    .

<http://id.worldcat.org/fast/871998> # Computational linguistics
    a schema:Intangible ;
    schema:name "Computational linguistics"@en ;
    .

<http://viaf.org/viaf/53859662> # Jörg Tiedemann
    a schema:Person ;
    schema:familyName "Tiedemann" ;
    schema:givenName "Jörg" ;
    schema:name "Jörg Tiedemann" ;
    .

<http://worldcat.org/isbn/9781608455102>
    a schema:ProductModel ;
    schema:isbn "1608455106" ;
    schema:isbn "9781608455102" ;
    .

<http://worldcat.org/isbn/9781608455119>
    a schema:ProductModel ;
    schema:isbn "1608455114" ;
    schema:isbn "9781608455119" ;
    .

<http://worldcat.org/issn/1947-4059> # Synthesis lectures on human language technologies ;
    a bgn:PublicationSeries ;
    schema:hasPart <http://www.worldcat.org/oclc/742535715> ; # Bitext alignment
    schema:issn "1947-4059" ;
    schema:name "Synthesis lectures on human language technologies ;" ;
    schema:name "Synthesis lectures on human language technologies," ;
    .

<http://www.worldcat.org/oclc/741023825>
    a schema:CreativeWork ;
    rdfs:label "Bitext alignment." ;
    schema:description "Print version:" ;
    schema:isSimilarTo <http://www.worldcat.org/oclc/742535715> ; # Bitext alignment
    .


Content-negotiable representations

Close Window

Please sign in to WorldCat 

Don't have an account? You can easily create a free account.