Die Sprache und ihre Wissenschaft zwischen Tradition und Innovation / Language and its Study between Tradition and Innovation
Akten des 45. Linguistischen Kolloquiums in Veszprém 2010- Proceedings of the 45 th Linguistics Colloquium, Veszprém 2010
This volume presents 34 papers delivered at the 45th Linguistics Colloquium in Veszprém (Hungary) from 16th to 18th September, 2010. The authors deal with older and newer fields of work in linguistics as well as their innovative results. The international composition of the participants and the various methodological positions and aspects of the academic activities in linguistics offer the possibility of a broad field of research. Apart from Applied Linguistics, Intercultural Linguistics, Pragmatics, Lexicology, Semantics, Contact Linguistics and Grammaticography, the book also presents Foreign Language Didactics. Moreover, the book suggests topics for future research.
Reinhard Rapp: Constructing a Distributional Thesaurus by Applying Latent Semantic Analysis to a Part-of-Speech-Tagged Corpus
Constructing a Distributional Thesaurus by Applying Latent Semantic Analysis to a Part-of-Speech-Tagged Corpus
The distributional hypothesis states that words which occur in the same contexts tend to have similar meanings (Harris 1954). It is related to Firth’s observation that words are characterized by their context (Firth 1957), and to the considerations on word sense disambiguation in the Weaver memorandum (1955). Although linguists first postulated the distributional hypothesis, it is also relevant to cognitive science, and has been investigated from this perspective e.g. by McDonald & Ramscar (2001). Assuming that the meaning of a word depends on its contexts, McDonald & Ramscar manipulated subjects’ contextual experience with marginally familiar and nonce words. Their experiments provided support for the role of distributional information in developing representations of word meaning. Another finding was that this kind of behavior can be simulated by looking at the contexts of words in large corpora.
Several researchers have successfully put this into practice. To our knowledge, among the first had been Ruge (1992) who in the context of finding search terms in information retrieval showed that the semantic similarity of two words can be computed by looking at the agreement of their lexical neighborhoods. For example, a certain degree of semantic similarity between the words red and blue can be derived from the fact that they both frequently co-occur with words like color, dress, flower, etc. (see Figure 1) although there are also other context words...
You are not authenticated to view the full text of this chapter or article.
This site requires a subscription or purchase to access the full text of books or journals.
Do you have any questions? Contact us.Or login to access all content.