English and Spanish discourse markers in translation: Corpus analysis and annotation

  1. Lavid-López, Julia (Universidad Complutense de Madrid, Madrid, Spain; ROR 02p0gd045)

Book:
Corpora in Translation and Contrastive Research in the Digital Age: Recent advances and explorations
  1. Lavid-López, Julia (ed.)
  2. Maíz-Arévalo, Carmen (ed.)
  3. Zamorano-Mansilla, Juan Rafael (ed.)

Publisher: John Benjamins

ISBN: 978-90-272-0918-4

Year of publication: 2021

Pages: 177-207

Type: Book chapter

Abstract

The study and annotation of discourse markers (DMs) in the context of translation is a much-needed and challenging task, not only for descriptive translation studies but also for Natural Language Processing (NLP) applications. Their various meanings are difficult to identify and annotate, even for trained human experts. This chapter proposes a methodology for the analysis and annotation of DMs, using three highly frequent English DMs (in fact, actually and really) and their translations into Spanish as a case study. The methodology consists of an initial corpus analysis phase followed by a corpus annotation phase. The corpus analysis provides qualitative and quantitative information on the meanings of these DMs by examining their translations in large parallel corpora. The corpus annotation phase specifies the annotation procedure, which can be generalized to other DMs and other language pairs and can form the basis for large-scale cross-linguistic annotation of DMs.