Discurse Marker Characterisation Via Clustering: Extrapolation from Supervised to Unsupervised Corpora

Publication date

2019-03-11T15:10:09Z

2019-03-11T15:10:09Z

2002

2019-03-11T15:10:10Z

Abstract

In this paper we will show how clustering techniques provide empirical evidence for a characterisation of Discourse Markers (DMs) that helps in overcoming the lack of consensus and reduces the cost of building NLP resources based on DMs. By comparison of classifications from hand-tagged and unsupervised corpora we are capable of grounding a notion of DM prototypicality, from which reliable classifications can be obtained from fully unsupervised corpora.

Document Type

Article


Published version

Language

English

Publisher

Sociedad Española para el Procesamiento del Lenguaje Natural (SEPLN)

Related items

Reproducció del document publicat a: http://journal.sepln.org/sepln/ojs/ojs/index.php/pln/article/view/3257

Procesamiento del lenguaje natural , 2002, num. 29, p. 223-230

Recommended citation

This citation was generated automatically.

Rights

(c) Alonso, Laura et al., 2002

This item appears in the following Collection(s)