Bültmann & Gerriets
Bitext Alignment
by Jörg Tiedemann
Publisher: Springer International Publishing
Series: Synthesis Lectures on Human Language Technologies
Hardcover
ISBN: 978-3-031-01014-9
Published: 1 June 2011
Language: English
Dimensions: 235 mm (H) x 191 mm (W) x 10 mm (D)
Weight: 327 grams
Length: 168 pages

Price: €37.44
Free shipping (domestic)


This title is printed on demand, so it is expected to arrive at our store around 22 October.

Deliveries within the city are usually made the same day.
Shipping to other destinations by Post/DHL usually takes 1-2 days.

Biographical Note

Jörg Tiedemann is currently employed as a visiting professor of computational linguistics at the Department of Linguistics and Philology at Uppsala University, Sweden. He received his Ph.D. from the same department in 2003. His work focuses mainly on machine translation, question answering, and data mining from multilingual resources. He initiated and still maintains OPUS, a freely available collection of parallel corpora that is widely used and appreciated.

Table of Contents

Introduction / Basic Concepts and Terminology / Building Parallel Corpora / Sentence Alignment / Word Alignment / Phrase and Tree Alignment / Concluding Remarks

Description

This book provides an overview of various techniques for the alignment of bitexts. It describes general concepts and strategies that can be applied to map corresponding parts in parallel documents at various levels of granularity. Bitexts are valuable linguistic resources for many different research fields and practical applications. The predominant application is machine translation, in particular statistical machine translation. However, the rich linguistic knowledge implicitly stored in parallel resources can support many other lines of work. Bitexts have been explored in lexicography, word sense disambiguation, terminology extraction, computer-aided language learning, and translation studies, to name just a few. The book covers the essential tasks that have to be carried out when building parallel corpora, from the collection of translated documents up to sub-sentential alignments. In particular, it describes various approaches to document alignment, sentence alignment, word alignment, and tree structure alignment. It also includes a list of resources and a comprehensive review of the literature on alignment techniques.

