Semantic projection recovers rich human knowledge of multiple object features from word embeddings.
Published Web Location
https://doi.org/10.1038/s41562-022-01316-8

Abstract
How is knowledge about word meaning represented in the mental lexicon? Current computational models infer word meanings from lexical co-occurrence patterns. They learn to represent words as vectors in a multidimensional space, wherein words that are used in more similar linguistic contexts (that is, are more semantically related) are located closer together. However, whereas inter-word proximity captures only overall relatedness, human judgements are highly context dependent. For example, dolphins and alligators are similar in size but differ in dangerousness. Here, we use a domain-general method to extract context-dependent relationships from word embeddings: semantic projection of word vectors onto lines that represent features such as size (the line connecting the words "small" and "big") or danger ("safe" to "dangerous"), analogous to mental scales. This method recovers human judgements across various object categories and properties. Thus, the geometry of word embeddings explicitly represents a wealth of context-dependent world knowledge.
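To make the method concrete, here is a minimal sketch of semantic projection in Python. The toy vectors below are hypothetical stand-ins for real pretrained embeddings (e.g., GloVe or word2vec), and the function name is our own; the projection logic follows the abstract's description of a feature line connecting two pole words, such as "small" and "big".

```python
import numpy as np

# Hypothetical toy vectors; in practice these would be pretrained word
# embeddings with hundreds of dimensions.
embeddings = {
    "small":     np.array([0.1, 0.9, 0.2]),
    "big":       np.array([0.9, 0.1, 0.8]),
    "dolphin":   np.array([0.6, 0.4, 0.6]),
    "alligator": np.array([0.6, 0.3, 0.7]),
}

def semantic_projection(word, pole_low, pole_high, emb):
    """Project a word vector onto the line from pole_low to pole_high.

    Returns the word's normalized position along that feature line:
    values near 0 lie at the pole_low end (e.g., 'small'), values
    near 1 at the pole_high end (e.g., 'big').
    """
    line = emb[pole_high] - emb[pole_low]  # direction of the mental scale
    return float(np.dot(emb[word] - emb[pole_low], line) / np.dot(line, line))

# Rank two animals on a size scale defined by "small" and "big".
for animal in ("dolphin", "alligator"):
    score = semantic_projection(animal, "small", "big", embeddings)
    print(f"{animal}: size score = {score:.2f}")
```

The scalar scores this produces can then be compared (for example, by rank correlation) against human ratings of the same objects on the same feature, which is how recovery of context-dependent judgements can be assessed.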