Title:
|
Discurse Marker Characterisation Via Clustering: Extrapolation from Supervised to Unsupervised Corpora
|
Author:
|
Alonso, Laura; Castellón Masalles, Irene; Padró, Lluís; Gibert, Karina
|
Other authors:
|
Universitat de Barcelona |
Abstract:
|
In this paper we will show how clustering techniques provide empirical evidence for a characterisation of Discourse Markers (DMs) that helps in overcoming the lack of consensus and reduces the cost of building NLP resources based on DMs. By comparison of classifications from hand-tagged and unsupervised corpora we are capable of grounding a notion of DM prototypicality, from which reliable classifications can be obtained from fully unsupervised corpora. |
Subject(s):
|
-Tractament del llenguatge natural (Informàtica) -Marcadors del discurs -Natural language processing (Computer science) -Discourse markers |
Rights:
|
(c) Alonso, Laura et al., 2002
|
Document type:
|
Article Article - Published version |
Published by:
|
Sociedad Española para el Procesamiento del Lenguaje Natural (SEPLN)
|
Share:
|
|