DISCOver: DIStributional approach based on syntactic dependencies for discovering COnstructions

dc.contributor.author
Martí Antonin, M. Antònia
dc.contributor.author
Taulé Delor, Mariona
dc.contributor.author
Kovatchev, Venelin
dc.contributor.author
Salamó Llorente, Maria
dc.date.issued
2020-10-21T13:05:16Z
dc.date.issued
2020-10-21T13:05:16Z
dc.date.issued
2019-01-04
dc.date.issued
2020-10-21T13:05:16Z
dc.identifier
1613-7027
dc.identifier
https://hdl.handle.net/2445/171321
dc.identifier
683887
dc.description.abstract
One of the goals in Cognitive Linguistics is the automatic identification and analysis of constructions, since they are fundamental linguistic units for understanding language. This article presents DISCOver, an unsupervised methodology for the automatic discovery of lexico-syntactic patterns that can be considered as candidates for constructions. This methodology follows a distributional semantic approach. Concretely, it is based on our proposed pattern-construction hypothesis: those contexts that are relevant to the definition of a cluster of semantically related words tend to be (part of) lexico-syntactic constructions. Our proposal uses Distributional Semantic Models for modelling the context taking into account syntactic dependencies. After a clustering process, we linked all those clusters with strong relationships and we use them as a source of information for deriving lexico-syntactic patterns, obtaining a total number of 220,732 candidates from a 100 million token corpus of Spanish. We evaluated the patterns obtained intrinsically, applying statistical association measures and they were also evaluated qualitatively by experts. Our results were superior to the baseline in both quality and quantity in all cases. While our experiments have been carried out using a Spanish corpus, this methodology is language independent and only requires a large corpus annotated with the parts of speech and dependencies to be applied.
dc.format
33 p.
dc.format
application/pdf
dc.format
application/pdf
dc.language
eng
dc.publisher
De Gruyter Mouton
dc.relation
Reproducció del document publicat a: https://doi.org/10.1515/cllt-2018-0028
dc.relation
Corpus Linguistics and Linguistic Theory, 2019
dc.relation
https://doi.org/10.1515/cllt-2018-0028
dc.rights
(c) Martí Antonin, M. Antònia et al., 2019
dc.rights
info:eu-repo/semantics/openAccess
dc.source
Articles publicats en revistes (Filologia Catalana i Lingüística General)
dc.subject
Gramàtica cognitiva
dc.subject
Models lingüístics
dc.subject
Semàntica
dc.subject
Cognitive grammar
dc.subject
Linguistic models
dc.subject
Semantics
dc.title
DISCOver: DIStributional approach based on syntactic dependencies for discovering COnstructions
dc.type
info:eu-repo/semantics/article
dc.type
info:eu-repo/semantics/publishedVersion


Files in this item

FilesSizeFormatView

There are no files associated with this item.

This item appears in the following Collection(s)