In this article we present a wsd algorithm based on random walks over large lexical knowledge bases lkb. We show that our algorithm performs better than other graph based methods when run on a graph built from wordnet and extended wordnet. Pdf a graph based approach to word sense disambiguation for. In graph based methods word senses are determined collectively by exploiting dependenciesacross senses, whereas in similarity based approaches each sense is determined for each word individually without considering the senses assigned to neighboring words. Request pdf graph based word sense disambiguation wordsense. An unsupervised method based on semantic relatedness. The solution to this problem impacts other computerrelated writing, such as discourse, improving relevance of search engines, anaphora resolution, coherence, and inference the human brain is quite proficient at word sense disambiguation. Two graphbased algorithms for stateoftheart wsd ehu. Improving machine translation using hybrid dictionary graph based word sense disambiguation with semantic and statistical methods. Word sense disambiguation based on word similarity. Combining knowledgebased methods and supervised learning for effective italian.
Experimental comparisons between the two algorithm types mihalcea, 2005. A graph based approach to word sense disambiguation for hindi language. In this paper, we approach entity linking by leveraging graph based methods. Graph connectivity measures for unsupervised word sense. An experimental study of graph connectivity for unsupervised. It is the process of identifying the actual meaning of the word based on the senses of the surrounding words of the.
In computational linguistics, word sense disambiguation wsd is an open problem concerned with identifying which sense of a word is used in a sentence. Unsupervised graphbasedword sense disambiguation using. Word sense disambiguation wsd systems automatically choose the intended meaning of a word in context. They found the best measures are degree and pagerank brin and page, 1998. Word sense disambiguation wsd is the task of automatically choosing the correct meaning. Index termsword sense disambiguation, graph connectivity, semantic networks, social network. Chinese word sense disambiguation with pagerank and hownet. Unsupervised largevocabulary word sense disambiguation with graph based algorithms for sequence data labeling. Named entity disambiguation, entity linking, wikification. Word sense disambiguation and namedentity disambiguation. Mihalcea 2005 and sinha and mihalcea 2007 construct a sentencewise graph, where, for each word every possible sense forms a vertex. There are many approaches for word sense disambiguation that in this paper proposes an algorithm based on weighted graph which has few parameters and does not require senseannotated data for. Embedding senses for efficient graphbased word sense. Random walks for knowledgebased word sense disambiguation.
Unsupervised methods, which is described in detail later. Unsupervised largevocabulary word sense disambiguation with. Word sense disambiguation is a basic problem in natural language processing. Corpus based word sense disambiguation was first implemented by. The algorithm is trying to disambiguate the senses of. The graphbased method selects the answer sense of the ambiguous word based on the semantic structure of lkbs. This paper proposed an unsupervised word sense disambiguation method based pagerank and hownet.
Word sense disambiguation is an open challenge in natural language processing. In the method, a free text is firstly represented as a sememe graph with sememes as vertices and relatedness of sememes as weighted edges based on hownet. We propose a simple graphbased method for word sense disambiguation wsd where sense and context embeddings are constructed. Knowledgebased word sense disambiguation using topic models. In proceedings of the conference on human language technology and empirical methods in natural language processing hlt05, pages 411418, morristown, nj. We first describe the graphbased method for word sense disambiguation, fol lowed by a description of the similarity measures and graph centrality algorithms. Then an iterative algorithm is applied to graph and the node having. How far can we go with current kbs and graphbased algorithms. A graphbased approach to word sense disambiguation.
1302 1200 667 143 686 1000 734 273 798 721 1237 184 1456 1474 80 1610 242 1031 1586 656 520 440 949 496 712 211 750 1397 1362 1535 481 431 484 481 1629 1204 335 211 1455 1007 249 1281 485