2018-02-28T14:33:34Z
2018-02-28T14:33:34Z
2017-03-01
2018-02-28T14:33:34Z
In this paper, we present the process in the construction of SF-EPEC, a 300,000-word corpus syntactically annotated that aims to be a Gold Standard for the surface syntactic processing of Basque. First, the tagset designed for this purpose is described; being Basque an agglutinative language, sometimes complex syntactic tags were needed. We also account for the different phases in the construction of SF-EPEC.
Article
Versió publicada
Anglès
Sociedad Española para el Procesamiento del Lenguaje Natural (SEPLN)
Reproducció del document publicat a: http://journal.sepln.org/sepln/ojs/ojs/index.php/pln/article/view/5421
Procesamiento del lenguaje natural , 2017, num. 58, p. 125-132
(c) Aduriz, Itziar et al., 2017