NaturTtSML: Un esquema de anotación para la mejora de la naturalidad en los sistemas de síntesis de voz

Albert González Lamaña; Antonio Pareja Lora

NaturTtSMLUn esquema de anotación para la mejora de la naturalidad en los sistemas de síntesis de voz

Albert González Lamaña ¹
Antonio Pareja Lora ²

1 Universidad Nacional de Educación a Distancia

Universidad Nacional de Educación a Distancia

Madrid, España

ROR https://ror.org/02msb5n36
2 Universidad Complutense de Madrid

Universidad Complutense de Madrid

Madrid, España

ROR 02p0gd045

Journal:

E-Aesla

ISSN: 2444-197X

Year of publication: 2018

Issue: 4

Pages: 375-390

Type: Article

DIALNET GOOGLE SCHOLAR Open access editor

More publications in: E-Aesla

Abstract

Even though the output of current Text-to-Speech (TtS) systems is usually quite comprehensible, this output is quite often too monotonic. One of the main causes for this problem is that a TtS normally cannot actually understand plain text. Thus, the TtS system must be provided with the way to read it, in order to supply a more natural and expressive speech. So far, some annotation languages and schemes have been developed towards this end; however, they are partial and/or focus in different areas of expressive speech. This paper presents an annotation scheme (NaturTtSML) that merges and makes interoperate all of them altogether in just one scheme.

Data source: Dialnet

NaturTtSMLUn esquema de anotación para la mejora de la naturalidad en los sistemas de síntesis de voz

Universidad Nacional de Educación a Distancia

Universidad Complutense de Madrid

Abstract