Discurse Marker Characterisation Via Clustering: Extrapolation from Supervised to Unsupervised Corpora

Alonso, Laura; Castellón Masalles, Irene; Padró, Lluís; Gibert, Karina; Alonso, Laura; Castellón Masalles, Irene; Padró, Lluís; Gibert, Karina

Discurse Marker Characterisation Via Clustering: Extrapolation from Supervised to Unsupervised Corpora

Author

Alonso, Laura

Castellón Masalles, Irene

Padró, Lluís

Gibert, Karina

Publication date

2019-03-11T15:10:09Z

2002

2019-03-11T15:10:10Z

Abstract

In this paper we will show how clustering techniques provide empirical evidence for a characterisation of Discourse Markers (DMs) that helps in overcoming the lack of consensus and reduces the cost of building NLP resources based on DMs. By comparison of classifications from hand-tagged and unsupervised corpora we are capable of grounding a notion of DM prototypicality, from which reliable classifications can be obtained from fully unsupervised corpora.

Document Type

Article

Published version

Language

English

Subjects and keywords

Tractament del llenguatge natural (Informàtica); Marcadors del discurs; Natural language processing (Computer science); Discourse markers

Publisher

Sociedad Española para el Procesamiento del Lenguaje Natural (SEPLN)

Related items

Reproducció del document publicat a: http://journal.sepln.org/sepln/ojs/ojs/index.php/pln/article/view/3257

Procesamiento del lenguaje natural , 2002, num. 29, p. 223-230

Recommended citation

This citation was generated automatically.

Export

DIDL MARC MARC_CCUC METS OAI_DC ORE QDC RDF

Rights

This item appears in the following Collection(s)

Filologia Catalana i Lingüística General [949]

ISGlobal - Institut de Salut Global de Barcelona [60793]

Discurse Marker Characterisation Via Clustering: Extrapolation from Supervised to Unsupervised Corpora

Author

Publication date

Share

Abstract

Document Type

Language

Subjects and keywords

Publisher

Related items

Recommended citation

Export

Rights

This item appears in the following Collection(s)