|
|
Abstract:
The use of natural language information can improve decision-making. Darwinian considerations suggest that language may have developed because it leads to improved decision making and survival, justifying the study of language's contribution to decision making. The study of information-based decision making within the context of evolution provides a view of information use that allows us to both describe the phenomenon of information use as well as to explain why an information use occurs as it does. Increasing information retrieval performance using phrases and part-of-speech (POS) information is one example of a type of decision-making performance that is improved when using this linguistic informa-tion. By studying a set of phrases used in a text retrieval system, such as noun phrases, and phrases incorporating specific parts of speech such as nouns, verbs, determiners, etc., we are able to show the relative effectiveness of using multi-term phrases as opposed to individual terms, as well as the relative worth of POS tagged terms or phrases, as opposed to untagged terms or phrases. An explanation is suggested for why POS tags contribute less to higher order grammatical constructs. We propose a quantitative measure of word sense disambiguation (WSD) that can be addressed by tagging (part-of-speech disambiguation capabilities); some example terms are analyzed using this measure of ambiguity, and specific degrees of ambiguity are proposed, including unambiguous, minimally ambiguous, moderately ambiguous, and highly ambiguous.
Return to Losee home page at http://www.ils.unc.edu/~losee