Title:
|
Distributional vectors encode referential attributes
|
Author:
|
Boleda, Gemma; Gupta, Abhijeet; Baroni, Marco; Padó, Sebastian
|
Abstract:
|
Distributional methods have proven to excel at capturing fuzzy, graded aspects of meaning (Italy is more similar to Spain than to Germany). In contrast, it is difficult to extract the values of more specific attributes of word referents from distributional representations, attributes of the kind typically found in structured knowledge bases (Italy has 60 million inhabitants). In this paper, we pursue the hypothesis that distributional vectors also implicitly encode referential attributes.
We show that a standard supervised regression model is in fact sufficient to retrieve such attributes to a reasonable degree of accuracy: When evaluated on the prediction of both categorical and numeric attributes of countries and cities, the model consistently reduces baseline error by 30%, and is not far from the upper bound. Further analysis suggests that our model is able to “objectify” distributional representations for entities, anchoring them more firmly in the external world in measurable ways. |
Abstract:
|
This project has received funding from the European Union’s Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie grant agreement No 655577 (LOVe); ERC 2011 Starting Independent Research Grant n. 283554 (COMPOSES); DFG (SFB 732, Project D10); and Spanish MINECO (grant FFI2013-41301-P). |
Subject(s):
|
-Reference -Distributed representations -Semantics -Computational semantics -Computational Linguistics -Natural Language Processing |
Rights:
|
© ACL, Creative Commons Attribution-NonCommercial-ShareAlike3.0
https://creativecommons.org/licenses/by-nc-sa/3.0/
|
Document type:
|
Conference Object Article - Published version |
Published by:
|
ACL (Association for Computational Linguistics)
|
Share:
|
|