Show Less
Restricted access

Translation Studies and Translation Practice: Proceedings of the 2nd International TRANSLATA Conference, 2014

Part 1

Series:

Edited By Lew N. Zybatow, Andy Stauder and Michael Ustaszewski

TRANSLATA II was the second in a series of triennial conferences on Translation and Interpreting Studies, held at the University of Innsbruck. The series is conceptualized as a forum for Translation Studies research. The contributions to this volume focus on humo(u)r translation, legal translation, and human-machine interaction in translation. The contributors also regard computer-aided translation, specialised translation, terminology as well as audiovisual translation and professional aspects in translation and interpreting.

Show Summary Details
Restricted access

Are Classical Principles of Corpus Compiling Applicable to Parallel Corpora of Literary Texts? (Mikhail Mikhailov)

Extract

Mikhail Mikhailov, University of Tampere

Are Classical Principles of Corpus Compiling Applicable to Parallel Corpora of Literary Texts?

Abstract: The main principles of compiling corpora cannot be applied to parallel corpora. Such corpora are more useful when composed of full texts, they should include diachronic data and retranslations, as well as works by prominent authors and translators. The reliable data can be obtained via subcorpora.

Electronic text corpora of all kinds – collections of whole texts, text fragments, transcripts of recorded speech, etc. – are becoming so common that research that does not use corpus data arouses suspicion. Availability of online text archives (documents archives, newspapers, fiction books, etc.) makes it possible to automate collecting the data. Although the problem of corpus availability is still far from being resolved, monolingual corpus linguistics is progressing rapidly. ‘National’ corpora (BNC, ANC, the Czech National Corpus, the Russian National Corpus, etc.) include hundreds of millions of running words, and Sketch Engine corpora are even bigger.

Research using multilingual corpora is less encouraging. Multilingual language resources are much more limited and more modest in size. The reason is obvious: it is far easier to obtain a large number of texts in one language than to find texts with corresponding versions in several languages. Besides, automation of collecting data for multilingual corpora is more difficult to handle. However, multilingual text corpora are being compiled; sometimes their compilation is automated (Koehn 2005), their sizes increase, the language repertoire is being...

You are not authenticated to view the full text of this chapter or article.

This site requires a subscription or purchase to access the full text of books or journals.

Do you have any questions? Contact us.

Or login to access all content.