Building the Gold Standard for the surface syntax of Basque

Publication date

2018-02-28T14:33:34Z

2017-03-01

Abstract

In this paper, we present the construction of SF-EPEC, a syntactically annotated 300,000-word corpus that aims to be a Gold Standard for the surface syntactic processing of Basque. First, we describe the tagset designed for this purpose; because Basque is an agglutinative language, complex syntactic tags were sometimes needed. We also describe the different phases in the construction of SF-EPEC.

Document Type

Article


Published version

Language

English

Subjects and keywords

Syntax; Basque language

Publisher

Sociedad Española para el Procesamiento del Lenguaje Natural (SEPLN)

Related items

Reproduction of the document published at: http://journal.sepln.org/sepln/ojs/ojs/index.php/pln/article/view/5421

Procesamiento del Lenguaje Natural, 2017, no. 58, pp. 125-132

Rights

(c) Aduriz, Itziar et al., 2017
