A Generalized Method for Word Sense Disambiguation Based on Wikipedia
- File type: book
- Language: Persian
- Publisher: Springer
- Edition and year / country: 2011
Details
Related fields: computer engineering, software engineering, and artificial intelligence
Description
In this paper we propose a general framework for word sense disambiguation using knowledge latent in Wikipedia. Specifically, we exploit the rich and growing Wikipedia corpus to build a large and robust knowledge repository consisting of keyphrases and their associated candidate topics. Keyphrases are mainly derived from Wikipedia article titles and the anchor texts associated with wikilinks. The disambiguation of a given keyphrase is based on both the commonness of a candidate topic and its context-dependent relatedness, where unnecessary (and potentially noisy) context information is pruned. With extensive experimental evaluations using different relatedness measures, we show that the proposed technique achieves disambiguation accuracy comparable to state-of-the-art techniques while incurring orders of magnitude less computational cost.
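The scoring idea in the abstract (commonness of a candidate topic combined with relatedness to a pruned context) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the toy data in `ANCHOR_COUNTS` and `INLINKS`, the Jaccard-based `relatedness` function, the `weight` and `min_rel` parameters, and the linear combination of the two signals are all assumptions made for illustration only.

```python
from collections import Counter

# Hypothetical anchor statistics: keyphrase -> how often it links to each topic.
ANCHOR_COUNTS = {
    "jaguar": Counter({"Jaguar (animal)": 60, "Jaguar Cars": 40}),
}

# Hypothetical sets of article ids that link to each topic.
INLINKS = {
    "Jaguar (animal)": {1, 2, 3, 4},
    "Jaguar Cars": {5, 6, 7, 8},
    "Rainforest": {1, 2, 3, 9},
    "Engine": {5, 6, 10},
}


def commonness(keyphrase, topic):
    """Fraction of the keyphrase's anchor occurrences that point to this topic."""
    counts = ANCHOR_COUNTS[keyphrase]
    total = sum(counts.values())
    return counts[topic] / total if total else 0.0


def relatedness(topic_a, topic_b):
    """Jaccard overlap of inlink sets; a simple stand-in relatedness measure."""
    a = INLINKS.get(topic_a, set())
    b = INLINKS.get(topic_b, set())
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def disambiguate(keyphrase, context_topics, weight=0.5, min_rel=0.1):
    """Return the candidate topic with the best combined score.

    Context topics whose relatedness to a candidate falls below min_rel
    are dropped (assumed pruning rule) before averaging.
    """
    best_topic, best_score = None, float("-inf")
    for topic in ANCHOR_COUNTS[keyphrase]:
        rels = [relatedness(topic, c) for c in context_topics]
        kept = [r for r in rels if r >= min_rel]  # prune weak context
        context_score = sum(kept) / len(kept) if kept else 0.0
        score = weight * commonness(keyphrase, topic) + (1 - weight) * context_score
        if score > best_score:
            best_topic, best_score = topic, score
    return best_topic


if __name__ == "__main__":
    print(disambiguate("jaguar", ["Rainforest"]))
    print(disambiguate("jaguar", ["Engine"]))
    print(disambiguate("jaguar", []))
```

With no context, the sketch falls back to the most common topic; with context, a less common topic can win if it is sufficiently related to what surrounds the keyphrase. In a real system the statistics would be extracted from a Wikipedia dump rather than hard-coded.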