Distributional vectors encode referential attributes

Publication date

2017-08-25T17:17:14Z

2015

Abstract

Distributional methods have proven to excel at capturing fuzzy, graded aspects of meaning (Italy is more similar to Spain than to Germany). In contrast, it is difficult to extract the values of more specific attributes of word referents from distributional representations, attributes of the kind typically found in structured knowledge bases (Italy has 60 million inhabitants). In this paper, we pursue the hypothesis that distributional vectors also implicitly encode referential attributes. We show that a standard supervised regression model is in fact sufficient to retrieve such attributes to a reasonable degree of accuracy: When evaluated on the prediction of both categorical and numeric attributes of countries and cities, the model consistently reduces baseline error by 30%, and is not far from the upper bound. Further analysis suggests that our model is able to “objectify” distributional representations for entities, anchoring them more firmly in the external world in measurable ways.
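
The abstract describes the method only as "a standard supervised regression model" that maps distributional vectors to attribute values and is compared against a baseline. The following is a minimal sketch of that general idea, not the authors' implementation. It assumes ridge regression, tiny synthetic 3-dimensional vectors, a made-up linear rule for the numeric attribute, and a mean-value baseline; all function and variable names (fit_ridge, solve, predict, hidden_attribute) are hypothetical.

```python
# Sketch only: ridge regression from (synthetic) entity vectors to a numeric
# attribute, compared with a predict-the-training-mean baseline.
# All data below is invented for illustration; none comes from the paper.

def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b_i] for row, b_i in zip(A, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= f * M[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):  # back substitution
        x[i] = (M[i][n] - sum(M[i][j] * x[j] for j in range(i + 1, n))) / M[i][i]
    return x

def fit_ridge(X, y, lam=1e-6):
    """Minimise ||Xw - y||^2 + lam * ||w||^2, with a constant bias feature appended."""
    rows = [list(x) + [1.0] for x in X]
    d = len(rows[0])
    # Normal equations: (X^T X + lam * I) w = X^T y
    A = [[sum(r[i] * r[j] for r in rows) + (lam if i == j else 0.0)
          for j in range(d)] for i in range(d)]
    b = [sum(r[i] * t for r, t in zip(rows, y)) for i in range(d)]
    return solve(A, b)

def predict(w, x):
    """Apply the fitted weights to one vector (bias feature appended)."""
    return sum(wi * xi for wi, xi in zip(w, list(x) + [1.0]))

def hidden_attribute(x):
    """Assumed synthetic rule standing in for a real attribute such as population."""
    return 2.0 * x[0] - 1.0 * x[1] + 0.5 * x[2] + 3.0

# Synthetic "distributional vectors" for six unnamed training entities.
train_X = [[1, 0, 0], [0, 1, 0], [0, 0, 1], [1, 1, 0], [0, 1, 1], [1, 0, 1]]
train_y = [hidden_attribute(x) for x in train_X]

w = fit_ridge(train_X, train_y)

# One held-out entity: compare the regression with the mean baseline.
held_out = [1, 1, 1]
gold = hidden_attribute(held_out)
model_error = abs(predict(w, held_out) - gold)
baseline_error = abs(sum(train_y) / len(train_y) - gold)

print("model error:", model_error)
print("baseline error:", baseline_error)
```

Because the synthetic attribute is an exact linear function of the vectors, the regression recovers it almost perfectly here. Real distributional vectors are high-dimensional and noisy, so the gap to the baseline would be smaller and would have to be measured on held-out data.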


This project has received funding from the European Union’s Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie grant agreement No 655577 (LOVe); ERC 2011 Starting Independent Research Grant n. 283554 (COMPOSES); DFG (SFB 732, Project D10); and Spanish MINECO (grant FFI2013-41301-P).

Document Type

Conference object


Published version

Language

English

Publisher

ACL (Association for Computational Linguistics)

Related items

Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing. Lisbon: Association for Computational Linguistics; 2015. p. 12-21

info:eu-repo/grantAgreement/EC/H2020/655577

info:eu-repo/grantAgreement/EC/FP7/283554

info:eu-repo/grantAgreement/ES/1PE/FFI2013-41301-P

Rights

© ACL, Creative Commons Attribution-NonCommercial-ShareAlike 3.0

https://creativecommons.org/licenses/by-nc-sa/3.0/