
Bitext alignment

Author: Jörg Tiedemann
Publisher: San Rafael, Calif. (1537 Fourth Street, San Rafael, CA 94901 USA) : Morgan & Claypool, ©2011.
Series: Synthesis lectures on human language technologies, lecture #14.
Edition/Format: eBook : Document : English
Summary:
This book provides an overview of various techniques for the alignment of bitexts. It describes general concepts and strategies that can be applied to map corresponding parts in parallel documents on various levels of granularity. Bitexts are valuable linguistic resources for many different research fields and practical applications. The most predominant application is machine translation, in particular, statistical machine translation.
Details

Genre/Form: Electronic books
Additional Physical Format: Print version:
Tiedemann, Jorg.
Bitext alignment.
[S.l.] : M & C, 2011
(OCoLC)741023825
Material Type: Document, Internet resource
Document Type: Internet Resource, Computer File
All Authors / Contributors: Jörg Tiedemann
ISBN: 9781608455119 1608455114 9781608455102 1608455106
OCLC Number: 742535715
Description: 1 online resource (viii, 153 pages) : illustrations.
Contents: Preface --
Acknowledgments --
1. Introduction --
Applications --
Further readings --
2. Basic concepts and terminology --
Bitext and alignment --
Alignment and segmentation --
Alignment spaces and constraints --
Correlations and cues --
Alignment models and search algorithms --
Evaluation of bitext alignment --
Summary and further reading --
3. Building parallel corpora --
Document alignment --
Mining the web --
Extracting parallel data from comparable corpora --
Summary and further reading --
4. Sentence alignment --
Length-based approaches --
Lexical matching approaches --
Combined and resource-specific techniques --
Summary and further reading --
5. Word alignment --
Generative alignment models --
Constraints and heuristics --
Discriminative alignment models --
Translation spotting and bilingual lexicon induction --
Summary and further reading --
6. Phrase and tree alignment --
Parallel treebanks and tree alignment --
Hierarchical alignment and transduction grammars --
Summary and further reading --
7. Concluding remarks --
Final recommendations --
A. Resources & tools --
Bibliography --
Author's biography.
Series Title: Synthesis lectures on human language technologies, lecture #14.
Responsibility: Jörg Tiedemann.
Linked Data


Primary Entity

<http://www.worldcat.org/oclc/742535715> # Bitext alignment
    a schema:CreativeWork, schema:MediaObject, schema:Book ;
   library:oclcnum "742535715" ;
   library:placeOfPublication <http://id.loc.gov/vocabulary/countries/cau> ;
   library:placeOfPublication <http://experiment.worldcat.org/entity/work/data/932317553#Place/san_rafael_calif_1537_fourth_street_san_rafael_ca_94901_usa> ; # San Rafael, Calif. (1537 Fourth Street, San Rafael, CA 94901 USA)
   schema:about <http://dewey.info/class/418.020285/e22/> ;
   schema:about <http://experiment.worldcat.org/entity/work/data/932317553#Thing/sentence_alignment> ; # Sentence alignment
   schema:about <http://experiment.worldcat.org/entity/work/data/932317553#Thing/word_alignment> ; # Word alignment
   schema:about <http://experiment.worldcat.org/entity/work/data/932317553#Thing/alignment> ; # Alignment
   schema:about <http://experiment.worldcat.org/entity/work/data/932317553#Thing/transduction_grammars> ; # Transduction grammars
   schema:about <http://experiment.worldcat.org/entity/work/data/932317553#Thing/lexicon_induction> ; # Lexicon induction
   schema:about <http://experiment.worldcat.org/entity/work/data/932317553#Thing/bitexts> ; # Bitexts
   schema:about <http://experiment.worldcat.org/entity/work/data/932317553#Thing/parallel_corpora> ; # Parallel corpora
   schema:about <http://id.worldcat.org/fast/1004851> ; # Machine translating
   schema:about <http://id.worldcat.org/fast/871998> ; # Computational linguistics
   schema:about <http://experiment.worldcat.org/entity/work/data/932317553#Topic/language_arts_&_disciplines_translating_&_interpreting> ; # LANGUAGE ARTS & DISCIPLINES--Translating & Interpreting
   schema:about <http://experiment.worldcat.org/entity/work/data/932317553#Thing/statistical_machine_translation> ; # Statistical machine translation
   schema:about <http://experiment.worldcat.org/entity/work/data/932317553#Thing/tree_alignment> ; # Tree alignment
   schema:about <http://experiment.worldcat.org/entity/work/data/932317553#Thing/text_mining> ; # Text mining
   schema:bookFormat schema:EBook ;
   schema:copyrightYear "2011" ;
   schema:creator <http://viaf.org/viaf/53859662> ; # Jörg Tiedemann
   schema:datePublished "2011" ;
   schema:description "This book provides an overview of various techniques for the alignment of bitexts. It describes general concepts and strategies that can be applied to map corresponding parts in parallel documents on various levels of granularity. Bitexts are valuable linguistic resources for many different research fields and practical applications. The most predominant application is machine translation, in particular, statistical machine translation. However, there are various other threads that can be followed which may be supported by the rich linguistic knowledge implicitly stored in parallel resources. Bitexts have been explored in lexicography, word sense disambiguation, terminology extraction, computer-aided language learning and translation studies to name just a few. The book covers the essential tasks that have to be carried out when building parallel corpora starting from the collection of translated documents up to sub-sentential alignments. In particular, it describes various approaches to document alignment, sentence alignment, word alignment and tree structure alignment. It also includes a list of resources and a comprehensive review of the literature on alignment techniques."@en ;
   schema:description "Preface -- Acknowledgments -- 1. Introduction -- Applications -- Further readings -- 2. Basic concepts and terminology -- Bitext and alignment -- Alignment and segmentation -- Alignment spaces and constraints -- Correlations and cues -- Alignment models and search algorithms -- Evaluation of bitext alignment -- Summary and further reading -- 3. Building parallel corpora -- Document alignment -- Mining the web -- Extracting parallel data from comparable corpora -- Summary and further reading -- 4. Sentence alignment -- Length-based approaches -- Lexical matching approaches -- Combined and resource-specific techniques -- Summary and further reading -- 5. Word alignment -- Generative alignment models -- Constraints and heuristics -- Discriminative alignment models -- Translation spotting and bilingual lexicon induction -- Summary and further reading -- 6. Phrase and tree alignment -- Parallel treebanks and tree alignment -- Hierarchical alignment and transduction grammars -- Summary and further reading -- 7. Concluding remarks -- Final recommendations -- A. Resources & tools -- Bibliography -- Author's biography."@en ;
   schema:exampleOfWork <http://worldcat.org/entity/work/id/932317553> ;
   schema:genre "Electronic books"@en ;
   schema:inLanguage "en" ;
   schema:isPartOf <http://worldcat.org/issn/1947-4059> ; # Synthesis lectures on human language technologies ;
   schema:isSimilarTo <http://www.worldcat.org/oclc/741023825> ;
   schema:name "Bitext alignment"@en ;
   schema:productID "742535715" ;
   schema:publication <http://www.worldcat.org/title/-/oclc/742535715#PublicationEvent/san_rafael_calif_1537_fourth_street_san_rafael_ca_94901_usa_morgan_&_claypool_2011> ;
   schema:publisher <http://experiment.worldcat.org/entity/work/data/932317553#Agent/morgan_&_claypool> ; # Morgan & Claypool
   schema:url <http://www.morganclaypool.com/doi/abs/10.2200/S00367ED1V01Y201106HLT014> ;
   schema:url <http://search.ebscohost.com/login.aspx?direct=true&scope=site&db=nlebk&db=nlabk&AN=440434> ;
   schema:url <http://public.eblib.com/choice/publicfullrecord.aspx?p=881222> ;
   schema:url <http://uri.idm.oclc.org/login?url=http://dx.doi.org/10.2200/S00367ED1V01Y201106HLT014> ;
   schema:workExample <http://worldcat.org/isbn/9781608455102> ;
   schema:workExample <http://dx.doi.org/10.2200/S00367ED1V01Y201106HLT014> ;
   schema:workExample <http://worldcat.org/isbn/9781608455119> ;
   wdrs:describedby <http://www.worldcat.org/title/-/oclc/742535715> ;
    .


Related Entities

<http://experiment.worldcat.org/entity/work/data/932317553#Agent/morgan_&_claypool> # Morgan & Claypool
    a bgn:Agent ;
   schema:name "Morgan & Claypool" ;
    .

<http://experiment.worldcat.org/entity/work/data/932317553#Place/san_rafael_calif_1537_fourth_street_san_rafael_ca_94901_usa> # San Rafael, Calif. (1537 Fourth Street, San Rafael, CA 94901 USA)
    a schema:Place ;
   schema:name "San Rafael, Calif. (1537 Fourth Street, San Rafael, CA 94901 USA)" ;
    .

<http://experiment.worldcat.org/entity/work/data/932317553#Thing/lexicon_induction> # Lexicon induction
    a schema:Thing ;
   schema:name "Lexicon induction" ;
    .

<http://experiment.worldcat.org/entity/work/data/932317553#Thing/parallel_corpora> # Parallel corpora
    a schema:Thing ;
   schema:name "Parallel corpora" ;
    .

<http://experiment.worldcat.org/entity/work/data/932317553#Thing/sentence_alignment> # Sentence alignment
    a schema:Thing ;
   schema:name "Sentence alignment" ;
    .

<http://experiment.worldcat.org/entity/work/data/932317553#Thing/statistical_machine_translation> # Statistical machine translation
    a schema:Thing ;
   schema:name "Statistical machine translation" ;
    .

<http://experiment.worldcat.org/entity/work/data/932317553#Thing/transduction_grammars> # Transduction grammars
    a schema:Thing ;
   schema:name "Transduction grammars" ;
    .

<http://experiment.worldcat.org/entity/work/data/932317553#Topic/language_arts_&_disciplines_translating_&_interpreting> # LANGUAGE ARTS & DISCIPLINES--Translating & Interpreting
    a schema:Intangible ;
   schema:name "LANGUAGE ARTS & DISCIPLINES--Translating & Interpreting"@en ;
    .

<http://id.worldcat.org/fast/1004851> # Machine translating
    a schema:Intangible ;
   schema:name "Machine translating"@en ;
    .

<http://id.worldcat.org/fast/871998> # Computational linguistics
    a schema:Intangible ;
   schema:name "Computational linguistics"@en ;
    .

<http://viaf.org/viaf/53859662> # Jörg Tiedemann
    a schema:Person ;
   schema:familyName "Tiedemann" ;
   schema:givenName "Jörg" ;
   schema:name "Jörg Tiedemann" ;
    .

<http://worldcat.org/isbn/9781608455102>
    a schema:ProductModel ;
   schema:isbn "1608455106" ;
   schema:isbn "9781608455102" ;
    .

<http://worldcat.org/isbn/9781608455119>
    a schema:ProductModel ;
   schema:isbn "1608455114" ;
   schema:isbn "9781608455119" ;
    .

<http://worldcat.org/issn/1947-4059> # Synthesis lectures on human language technologies ;
    a bgn:PublicationSeries ;
   schema:hasPart <http://www.worldcat.org/oclc/742535715> ; # Bitext alignment
   schema:issn "1947-4059" ;
   schema:name "Synthesis lectures on human language technologies ;" ;
   schema:name "Synthesis lectures on human language technologies," ;
    .

<http://www.worldcat.org/oclc/741023825>
    a schema:CreativeWork ;
   rdfs:label "Bitext alignment." ;
   schema:description "Print version:" ;
   schema:isSimilarTo <http://www.worldcat.org/oclc/742535715> ; # Bitext alignment
    .

