skip to content
Building and Using Comparable Corpora Preview this item
ClosePreview this item
Checking...

Building and Using Comparable Corpora

Author: Serge Sharoff; Reinhard Rapp; Pierre Zweigenbaum; Pascale Fung
Publisher: Berlin, Heidelberg Imprint: Springer 2013
Edition/Format:   eBook : Document : EnglishView all editions and formats
Publication:Building and using comparable corpora.
Summary:
The 1990s saw a paradigm change in the use of corpus-driven methods in NLP. In the field of multilingual NLP (such as machine translation and terminology mining) this implied the use of parallel corpora. However, parallel resources are relatively scarce: many more texts are produced daily by native speakers of any given language than translated. This situation resulted in a natural drive towards the use of  Read more...
Rating:

(not yet rated) 0 with reviews - Be the first.

Subjects
More like this

Find a copy online

Find a copy in the library

&AllPage.SpinnerRetrieving; Finding libraries that hold this item...

Details

Genre/Form: Online-Publikation
Material Type: Document, Internet resource
Document Type: Internet Resource, Computer File
All Authors / Contributors: Serge Sharoff; Reinhard Rapp; Pierre Zweigenbaum; Pascale Fung
ISBN: 9783642201271 364220127X 9783642201288 3642201288
OCLC Number: 867051371
Description: 1 online resource (XII, 335 Seiten) 70 Illustrationen, 14 Illustrationen in color
Contents: Preface - Building and Using Comparable Corpora. S.Sharoff, R.Rapp, P.Zweigenbaum.- Overviewing Important Aspects of the Last 20 Years of Research in Comparable Corpora.- S.Sharoff, R.Rapp, P.Zweigenbaum.- Part I: Compiling and Measuring Comparable Corpora.- Multilingual Corpus Collection. S.Shi, P.Fung.- Automatic Comparable Web Corpora Collection and Bilingual Terminology Extraction for Specialized Dictionary Making. A.Gurrutxaga, I.Leturia, I.San Vicente, X.Saralegi.- Statistical Comparability: Methodological Caveats. R.Koehler.- Methods for Collection and Evaluation of Comparable Documents. M.Lestari Paramita, D.Guthrie, E.Kanoulas, R.Gaizauskas, P.Clough and M.Sanderson.- Measuring the Distance between Comparable Corpora between Languages. S.Sharoff.- Exploiting Comparable Corpora for Lexicon Extraction: Measuring and Improving Corpus Quality. B.Li, E.Gaussier.- Statistical Corpus and Language Comparison on Comparable Corpora. T.Eckart, U.Quasthoff.- Comparable Multilingual Patents as Large-scale Parallel Corpora. B.Lu and B.Tsou.- Part II: Using Comparable Corpora.- Extracting Parallel Phrases from Comparable Data. S.Hewavitharana, S.Vogel.- Exploiting Comparable Corpora. D.S.Munteanu, D.Marcu.- Paraphrase Detection in Comparable Monolingual Corpora. L.Deleger, B.Cartoni, P.Zweigenbaum.- Information Network Construction and Alignment from Automatically Acquired Comparable Corpora. H.Ji, W.-P.Lin.- Bilingual Terminology Mining from Comparable Corpora. B.Daille, E.Morin.- The Place of Comparable Corpora in Providing Terminological Reference Information to Online Translators: A Strategic Framework. K.Kageura, T.Abekawa.- Old Needs, New Solutions: Comparable Corpora for Language Professionals. S.Bernardini, A.Ferraresi.- Exploiting the Incomparability of Comparable Corpora for Contrastive Linguistics and Translation Studies. S.Neumann, S.Hansen-Schirra.
Responsibility: edited by Serge Sharoff, Reinhard Rapp, Pierre Zweigenbaum, Pascale Fung
More information:

Abstract:

In the field of multilingual NLP (such as machine translation and terminology mining) this implied the use of parallel corpora. Nevertheless, this research direction has not produced a single  Read more...

Reviews

Editorial reviews

Publisher Synopsis

"I would like to recommend 'Building and Using Comparable ... to those who are working with or are interested in multilingual and monolingual comparable corpora. ... it is easy to say that the notion Read more...

 
User-contributed reviews
Retrieving GoodReads reviews...
Retrieving DOGObooks reviews...

Tags

Be the first.
Confirm this request

You may have already requested this item. Please select Ok if you would like to proceed with this request anyway.

Linked Data


\n\n

Primary Entity<\/h3>\n
<http:\/\/www.worldcat.org\/oclc\/867051371<\/a>> # Building and Using Comparable Corpora<\/span>\n\u00A0\u00A0\u00A0\u00A0a \nschema:MediaObject<\/a>, schema:Book<\/a>, schema:CreativeWork<\/a> ;\u00A0\u00A0\u00A0\nlibrary:oclcnum<\/a> \"867051371<\/span>\" ;\u00A0\u00A0\u00A0\nlibrary:placeOfPublication<\/a> <http:\/\/experiment.worldcat.org\/entity\/work\/data\/1785695596#Place\/berlin_heidelberg<\/a>> ; # Berlin, Heidelberg<\/span>\n\u00A0\u00A0\u00A0\nlibrary:placeOfPublication<\/a> <http:\/\/id.loc.gov\/vocabulary\/countries\/gw<\/a>> ;\u00A0\u00A0\u00A0\nschema:about<\/a> <http:\/\/id.loc.gov\/authorities\/subjects\/sh89003285<\/a>> ; # Computer science<\/span>\n\u00A0\u00A0\u00A0\nschema:about<\/a> <http:\/\/id.worldcat.org\/fast\/872451<\/a>> ; # Computer science<\/span>\n\u00A0\u00A0\u00A0\nschema:about<\/a> <http:\/\/id.worldcat.org\/fast\/871998<\/a>> ; # Computational linguistics<\/span>\n\u00A0\u00A0\u00A0\nschema:about<\/a> <http:\/\/id.loc.gov\/authorities\/subjects\/sh85077224<\/a>> ; # Computational linguistics<\/span>\n\u00A0\u00A0\u00A0\nschema:about<\/a> <http:\/\/dewey.info\/class\/006.35\/<\/a>> ;\u00A0\u00A0\u00A0\nschema:about<\/a> <http:\/\/id.worldcat.org\/fast\/1154842<\/a>> ; # Translators (Computer programs)<\/span>\n\u00A0\u00A0\u00A0\nschema:author<\/a> <http:\/\/viaf.org\/viaf\/204731113<\/a>> ; # Pierre Zweigenbaum<\/span>\n\u00A0\u00A0\u00A0\nschema:author<\/a> <http:\/\/viaf.org\/viaf\/3146574840138150693<\/a>> ; # Pascale Fung<\/span>\n\u00A0\u00A0\u00A0\nschema:author<\/a> <http:\/\/viaf.org\/viaf\/173964850<\/a>> ; # Serge Sharoff<\/span>\n\u00A0\u00A0\u00A0\nschema:author<\/a> <http:\/\/viaf.org\/viaf\/44932114<\/a>> ; # Reinhard Rapp<\/span>\n\u00A0\u00A0\u00A0\nschema:bookFormat<\/a> schema:EBook<\/a> ;\u00A0\u00A0\u00A0\nschema:datePublished<\/a> \"2013<\/span>\" ;\u00A0\u00A0\u00A0\nschema:description<\/a> \"The 1990s saw a paradigm change in the use of corpus-driven methods in NLP. In the field of multilingual NLP (such as machine translation and terminology mining) this implied the use of parallel corpora. However, parallel resources are relatively scarce: many more texts are produced daily by native speakers of any given language than translated. This situation resulted in a natural drive towards the use of comparable corpora, i.e. non-parallel texts in the same domain or genre. Nevertheless, this research direction has not produced a single authoritative source suitable for researchers and students coming to the field. The proposed volume providesa reference source, identifying the state of the art in the field as well as future trends. The book is intended for specialists and students in natural language processing, machine translation and computer-assisted translation.<\/span>\" ;\u00A0\u00A0\u00A0\nschema:exampleOfWork<\/a> <http:\/\/worldcat.org\/entity\/work\/id\/1785695596<\/a>> ;\u00A0\u00A0\u00A0\nschema:genre<\/a> \"Online-Publikation<\/span>\" ;\u00A0\u00A0\u00A0\nschema:inLanguage<\/a> \"en<\/span>\" ;\u00A0\u00A0\u00A0\nschema:name<\/a> \"Building and Using Comparable Corpora<\/span>\" ;\u00A0\u00A0\u00A0\nschema:productID<\/a> \"867051371<\/span>\" ;\u00A0\u00A0\u00A0\nschema:publication<\/a> <http:\/\/www.worldcat.org\/title\/-\/oclc\/867051371#PublicationEvent\/berlin_heidelbergimprint_springer2013<\/a>> ;\u00A0\u00A0\u00A0\nschema:publisher<\/a> <http:\/\/experiment.worldcat.org\/entity\/work\/data\/1785695596#Agent\/imprint_springer<\/a>> ; # Imprint: Springer<\/span>\n\u00A0\u00A0\u00A0\nschema:url<\/a> <http:\/\/swbplus.bsz-bw.de\/bsz39953346xcov.htm<\/a>> ;\u00A0\u00A0\u00A0\nschema:url<\/a> <http:\/\/scans.hebis.de\/HEBCGI\/show.pl?33493147_toc.html<\/a>> ;\u00A0\u00A0\u00A0\nschema:url<\/a> <https:\/\/doi.org\/10.1007\/978-3-642-20128-8<\/a>> ;\u00A0\u00A0\u00A0\nschema:workExample<\/a> <http:\/\/worldcat.org\/isbn\/9783642201271<\/a>> ;\u00A0\u00A0\u00A0\nschema:workExample<\/a> <http:\/\/worldcat.org\/isbn\/9783642201288<\/a>> ;\u00A0\u00A0\u00A0\nschema:workExample<\/a> <http:\/\/dx.doi.org\/10.1007\/978-3-642-20128-8<\/a>> ;\u00A0\u00A0\u00A0\nwdrs:describedby<\/a> <http:\/\/www.worldcat.org\/title\/-\/oclc\/867051371<\/a>> ;\u00A0\u00A0\u00A0\u00A0.\n\n\n<\/div>\n\n

Related Entities<\/h3>\n
<http:\/\/dewey.info\/class\/006.35\/<\/a>>\u00A0\u00A0\u00A0\u00A0a \nschema:Intangible<\/a> ;\u00A0\u00A0\u00A0\u00A0.\n\n\n<\/div>\n
<http:\/\/dx.doi.org\/10.1007\/978-3-642-20128-8<\/a>>\u00A0\u00A0\u00A0\u00A0a \nschema:IndividualProduct<\/a> ;\u00A0\u00A0\u00A0\u00A0.\n\n\n<\/div>\n
<http:\/\/experiment.worldcat.org\/entity\/work\/data\/1785695596#Agent\/imprint_springer<\/a>> # Imprint: Springer<\/span>\n\u00A0\u00A0\u00A0\u00A0a \nbgn:Agent<\/a> ;\u00A0\u00A0\u00A0\nschema:name<\/a> \"Imprint: Springer<\/span>\" ;\u00A0\u00A0\u00A0\u00A0.\n\n\n<\/div>\n
<http:\/\/experiment.worldcat.org\/entity\/work\/data\/1785695596#Place\/berlin_heidelberg<\/a>> # Berlin, Heidelberg<\/span>\n\u00A0\u00A0\u00A0\u00A0a \nschema:Place<\/a> ;\u00A0\u00A0\u00A0\nschema:name<\/a> \"Berlin, Heidelberg<\/span>\" ;\u00A0\u00A0\u00A0\u00A0.\n\n\n<\/div>\n
<http:\/\/id.loc.gov\/authorities\/subjects\/sh85077224<\/a>> # Computational linguistics<\/span>\n\u00A0\u00A0\u00A0\u00A0a \nschema:Intangible<\/a> ;\u00A0\u00A0\u00A0\nschema:name<\/a> \"Computational linguistics<\/span>\" ;\u00A0\u00A0\u00A0\u00A0.\n\n\n<\/div>\n
<http:\/\/id.loc.gov\/authorities\/subjects\/sh89003285<\/a>> # Computer science<\/span>\n\u00A0\u00A0\u00A0\u00A0a \nschema:Intangible<\/a> ;\u00A0\u00A0\u00A0\nschema:name<\/a> \"Computer science<\/span>\" ;\u00A0\u00A0\u00A0\u00A0.\n\n\n<\/div>\n
<http:\/\/id.loc.gov\/vocabulary\/countries\/gw<\/a>>\u00A0\u00A0\u00A0\u00A0a \nschema:Place<\/a> ;\u00A0\u00A0\u00A0\ndcterms:identifier<\/a> \"gw<\/span>\" ;\u00A0\u00A0\u00A0\u00A0.\n\n\n<\/div>\n
<http:\/\/id.worldcat.org\/fast\/1154842<\/a>> # Translators (Computer programs)<\/span>\n\u00A0\u00A0\u00A0\u00A0a \nschema:Intangible<\/a> ;\u00A0\u00A0\u00A0\nschema:name<\/a> \"Translators (Computer programs)<\/span>\" ;\u00A0\u00A0\u00A0\u00A0.\n\n\n<\/div>\n
<http:\/\/id.worldcat.org\/fast\/871998<\/a>> # Computational linguistics<\/span>\n\u00A0\u00A0\u00A0\u00A0a \nschema:Intangible<\/a> ;\u00A0\u00A0\u00A0\nschema:name<\/a> \"Computational linguistics<\/span>\" ;\u00A0\u00A0\u00A0\u00A0.\n\n\n<\/div>\n
<http:\/\/id.worldcat.org\/fast\/872451<\/a>> # Computer science<\/span>\n\u00A0\u00A0\u00A0\u00A0a \nschema:Intangible<\/a> ;\u00A0\u00A0\u00A0\nschema:name<\/a> \"Computer science<\/span>\" ;\u00A0\u00A0\u00A0\u00A0.\n\n\n<\/div>\n
<http:\/\/viaf.org\/viaf\/173964850<\/a>> # Serge Sharoff<\/span>\n\u00A0\u00A0\u00A0\u00A0a \nschema:Person<\/a> ;\u00A0\u00A0\u00A0\nschema:familyName<\/a> \"Sharoff<\/span>\" ;\u00A0\u00A0\u00A0\nschema:givenName<\/a> \"Serge<\/span>\" ;\u00A0\u00A0\u00A0\nschema:name<\/a> \"Serge Sharoff<\/span>\" ;\u00A0\u00A0\u00A0\u00A0.\n\n\n<\/div>\n
<http:\/\/viaf.org\/viaf\/204731113<\/a>> # Pierre Zweigenbaum<\/span>\n\u00A0\u00A0\u00A0\u00A0a \nschema:Person<\/a> ;\u00A0\u00A0\u00A0\nschema:familyName<\/a> \"Zweigenbaum<\/span>\" ;\u00A0\u00A0\u00A0\nschema:givenName<\/a> \"Pierre<\/span>\" ;\u00A0\u00A0\u00A0\nschema:name<\/a> \"Pierre Zweigenbaum<\/span>\" ;\u00A0\u00A0\u00A0\u00A0.\n\n\n<\/div>\n
<http:\/\/viaf.org\/viaf\/3146574840138150693<\/a>> # Pascale Fung<\/span>\n\u00A0\u00A0\u00A0\u00A0a \nschema:Person<\/a> ;\u00A0\u00A0\u00A0\nschema:familyName<\/a> \"Fung<\/span>\" ;\u00A0\u00A0\u00A0\nschema:givenName<\/a> \"Pascale<\/span>\" ;\u00A0\u00A0\u00A0\nschema:name<\/a> \"Pascale Fung<\/span>\" ;\u00A0\u00A0\u00A0\u00A0.\n\n\n<\/div>\n
<http:\/\/viaf.org\/viaf\/44932114<\/a>> # Reinhard Rapp<\/span>\n\u00A0\u00A0\u00A0\u00A0a \nschema:Person<\/a> ;\u00A0\u00A0\u00A0\nschema:familyName<\/a> \"Rapp<\/span>\" ;\u00A0\u00A0\u00A0\nschema:givenName<\/a> \"Reinhard<\/span>\" ;\u00A0\u00A0\u00A0\nschema:name<\/a> \"Reinhard Rapp<\/span>\" ;\u00A0\u00A0\u00A0\u00A0.\n\n\n<\/div>\n
<http:\/\/worldcat.org\/isbn\/9783642201271<\/a>>\u00A0\u00A0\u00A0\u00A0a \nschema:ProductModel<\/a> ;\u00A0\u00A0\u00A0\nschema:isbn<\/a> \"364220127X<\/span>\" ;\u00A0\u00A0\u00A0\nschema:isbn<\/a> \"9783642201271<\/span>\" ;\u00A0\u00A0\u00A0\u00A0.\n\n\n<\/div>\n
<http:\/\/worldcat.org\/isbn\/9783642201288<\/a>>\u00A0\u00A0\u00A0\u00A0a \nschema:ProductModel<\/a> ;\u00A0\u00A0\u00A0\nschema:isbn<\/a> \"3642201288<\/span>\" ;\u00A0\u00A0\u00A0\nschema:isbn<\/a> \"9783642201288<\/span>\" ;\u00A0\u00A0\u00A0\u00A0.\n\n\n<\/div>\n
<http:\/\/www.worldcat.org\/title\/-\/oclc\/867051371<\/a>>\u00A0\u00A0\u00A0\u00A0a \ngenont:InformationResource<\/a>, genont:ContentTypeGenericResource<\/a> ;\u00A0\u00A0\u00A0\nschema:about<\/a> <http:\/\/www.worldcat.org\/oclc\/867051371<\/a>> ; # Building and Using Comparable Corpora<\/span>\n\u00A0\u00A0\u00A0\nschema:dateModified<\/a> \"2020-10-14<\/span>\" ;\u00A0\u00A0\u00A0\nvoid:inDataset<\/a> <http:\/\/purl.oclc.org\/dataset\/WorldCat<\/a>> ;\u00A0\u00A0\u00A0\u00A0.\n\n\n<\/div>\n
<http:\/\/www.worldcat.org\/title\/-\/oclc\/867051371#PublicationEvent\/berlin_heidelbergimprint_springer2013<\/a>>\u00A0\u00A0\u00A0\u00A0a \nschema:PublicationEvent<\/a> ;\u00A0\u00A0\u00A0\nschema:location<\/a> <http:\/\/experiment.worldcat.org\/entity\/work\/data\/1785695596#Place\/berlin_heidelberg<\/a>> ; # Berlin, Heidelberg<\/span>\n\u00A0\u00A0\u00A0\nschema:organizer<\/a> <http:\/\/experiment.worldcat.org\/entity\/work\/data\/1785695596#Agent\/imprint_springer<\/a>> ; # Imprint: Springer<\/span>\n\u00A0\u00A0\u00A0\nschema:startDate<\/a> \"2013<\/span>\" ;\u00A0\u00A0\u00A0\u00A0.\n\n\n<\/div>\n\n

Content-negotiable representations<\/p>\n