English and Spanish discourse markers in translationCorpus analysis and annotation

  1. Lavid-López, Julia 1
  1. 1 Universidad Complutense de Madrid
    info

    Universidad Complutense de Madrid

    Madrid, España

    ROR 02p0gd045

Libro:
Corpora in Translation and Contrastive Research in the Digital Age: Recent advances and explorations
  1. Lavid-López, Julia (ed. lit.)
  2. Maíz-Arévalo, Carmen (ed. lit.)
  3. Zamorano-Mansilla, Juan Rafael (ed. lit.)

Editorial: John Benjamins

ISBN: 978-90-272-0918-4

Año de publicación: 2021

Páginas: 177-207

Tipo: Capítulo de Libro

Resumen

The study and annotation of discourse markers (DMs) in the context of translation is a much needed and challenging task not only for descriptive translation studies, but also for Natural Language Processing (NLP) applications. Their various meanings are difficult to identify and annotate, even for trained human experts. In this chapter, a methodology for the analysis and annotation of DMs is proposed, using three highly frequent DMs in English -in fact, actually and really- and their translations into Spanish as a case study. The methodology consists of an initial corpus analysis phase followed by a corpus annotation phase. The corpus analysis provides qualitative and quantitative information on the meanings of these DMs by looking at their translations in large parallel corpora. The corpus annotation phase specifies the annotation procedure, which can be generalized to other DMs and to other language pairs, and form the basis for large-scale cross-linguistic annotation of DMs.