2019-03-11T15:10:09Z
2019-03-11T15:10:09Z
2002
2019-03-11T15:10:10Z
In this paper we will show how clustering techniques provide empirical evidence for a characterisation of Discourse Markers (DMs) that helps in overcoming the lack of consensus and reduces the cost of building NLP resources based on DMs. By comparison of classifications from hand-tagged and unsupervised corpora we are capable of grounding a notion of DM prototypicality, from which reliable classifications can be obtained from fully unsupervised corpora.
Article
Versió publicada
Anglès
Tractament del llenguatge natural (Informàtica); Marcadors del discurs; Natural language processing (Computer science); Discourse markers
Sociedad Española para el Procesamiento del Lenguaje Natural (SEPLN)
Reproducció del document publicat a: http://journal.sepln.org/sepln/ojs/ojs/index.php/pln/article/view/3257
Procesamiento del lenguaje natural , 2002, num. 29, p. 223-230
(c) Alonso, Laura et al., 2002