Comparing distributional semantic models for identifying groups of semantically related words

Kovatchev, Venelin; Salamó Llorente, Maria; Martí Antonin, M. Antònia; Kovatchev, Venelin; Salamó Llorente, Maria; Martí Antonin, M. Antònia

Comparing distributional semantic models for identifying groups of semantically related words

Autor/a

Kovatchev, Venelin

Salamó Llorente, Maria

Martí Antonin, M. Antònia

Fecha de publicación

2019-02-22T15:15:48Z

2016-09-15

2019-02-22T15:15:48Z

Resumen

Distributional Semantic Models (DSM) are growing in popularity in Computational Linguistics. DSM use corpora of language use to automatically induce formal representations of word meaning. This article focuses on one of the applications of DSM: identifying groups of semantically related words. We compare two models for obtaining formal representations: a well known approach (CLUTO) and a more recently introduced one (Word2Vec). We compare the two models with respect to the PoS coherence and the semantic relatedness of the words within the obtained groups. We also proposed a way to improve the results obtained by Word2Vec through corpus preprocessing. The results show that: a) CLUTO outperformsWord2Vec in both criteria for corpora of medium size; b) The preprocessing largely improves the results for Word2Vec with respect to both criteria.

Tipo de documento

Artículo

Versión publicada

Lengua

Inglés

Materias y palabras clave

Tractament del llenguatge natural (Informàtica); Semàntica; Natural language processing (Computer science); Semantics

Publicado por

Sociedad Española para el Procesamiento del Lenguaje Natural (SEPLN)

Documentos relacionados

Reproducció del document publicat a: http://journal.sepln.org/sepln/ojs/ojs/index.php/pln/article/view/5343

Procesamiento del lenguaje natural , 2016, num. 57, p. 109-116

Citación recomendada

Esta citación se ha generado automáticamente.

Exportar

DIDL MARC MARC_CCUC METS OAI_DC ORE QDC RDF

Derechos

Este ítem aparece en la(s) siguiente(s) colección(ones)

Filologia Catalana i Lingüística General [949]

ISGlobal - Institut de Salut Global de Barcelona [60807]

Comparing distributional semantic models for identifying groups of semantically related words

Autor/a

Fecha de publicación

Compartir

Resumen

Tipo de documento

Lengua

Materias y palabras clave

Publicado por

Documentos relacionados

Citación recomendada

Exportar

Derechos

Este ítem aparece en la(s) siguiente(s) colección(ones)