Jeff Mitchell and Mirella Lapata. 2010. Composition in Distributional Models of Semantics. To appear in Cognitive Science.

Vector-based models of word meaning have become increasingly popular in cognitive science. The appeal of these models lies in their ability to represent meaning simply by using distributional information under the assumption that words occurring within similar contexts are semantically similar. Despite their widespread use, vector-based models are typically directed at representing words in isolation and methods for constructing representations for phrases or sentences have received little attention in the literature. This is in marked contrast to experimental evidence (e.g.,~in sentential priming) suggesting that semantic similarity is more complex than simply a relation between isolated words. This article proposes a framework for representing the meaning of word combinations in vector space. Central to our approach is vector composition which we operationalize in terms of additive and multiplicative functions. Under this framework, we introduce a wide range of composition models which we evaluate empirically on a phrase similarity task.


@article{Mitchell:Lapata:2010,
   author = {Jeff Mitchell and Mirella Lapata},
   title = {Composition in Distributional Models of Semantics},
   journal = {Cognitive Science},
   year = {2010},
   note = {To appear}
}