Faculty Publications

Paraphrase identification using weighted dependencies and word semantics

Mihai C. Lintean, University of Memphis
Vasile Rus, University of Memphis

Abstract

We present a novel approach to the task of paraphrase identification. The proposed approach quantifies both the similarity and the dissimilarity between two sentences. Similarity and dissimilarity are assessed based on lexico-semantic information, i.e., word semantics, and on syntactic information in the form of dependencies, which are explicit syntactic relations between words in a sentence. Word semantics requires mapping words onto concepts in a taxonomy and then using word-to-word similarity metrics to compute their semantic relatedness. Dependencies are obtained using state-of-the-art dependency parsers. One important aspect of our approach is the weighting of missing dependencies, i.e., dependencies present in one sentence but not in the other. We report experimental results on the Microsoft Paraphrase Corpus, a standard data set for evaluating approaches to paraphrase identification. The experiments show that the proposed approach offers state-of-the-art results. In particular, it offers better precision than other approaches.
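The abstract does not give the scoring formulas, so the following is only an illustrative sketch of the general idea, not the authors' method. It assumes dependencies are already available as (relation, head, modifier) triples, pairs them by exact match instead of using taxonomy-based word-to-word similarity, and decides by comparing a similarity-to-dissimilarity ratio against a threshold. The names `score_pair` and `is_paraphrase`, the weight table, and the threshold are all hypothetical.

```python
from typing import Dict, Set, Tuple

# A dependency is a (relation, head, modifier) triple, e.g. ("nsubj", "bought", "john").
Dependency = Tuple[str, str, str]

# Hypothetical weights per relation type (an assumption for illustration only):
# missing core relations count more than missing determiners.
DEFAULT_WEIGHTS: Dict[str, float] = {"nsubj": 1.0, "dobj": 1.0, "det": 0.1}


def score_pair(
    deps_a: Set[Dependency],
    deps_b: Set[Dependency],
    weights: Dict[str, float] = DEFAULT_WEIGHTS,
    default_weight: float = 0.5,
) -> Tuple[float, float]:
    """Return (similarity, dissimilarity) for two sets of dependencies."""
    common = deps_a & deps_b    # dependencies found in both sentences
    missing = deps_a ^ deps_b   # dependencies found in only one sentence
    similarity = float(len(common))
    # Each missing dependency contributes the weight of its relation type.
    dissimilarity = sum(weights.get(rel, default_weight) for rel, _, _ in missing)
    return similarity, dissimilarity


def is_paraphrase(
    deps_a: Set[Dependency],
    deps_b: Set[Dependency],
    threshold: float = 1.0,
) -> bool:
    """Label the pair a paraphrase when similarity / dissimilarity >= threshold."""
    similarity, dissimilarity = score_pair(deps_a, deps_b)
    if dissimilarity == 0:
        # Nothing is missing: a paraphrase as long as something is shared.
        return similarity > 0
    return similarity / dissimilarity >= threshold


a = {("nsubj", "bought", "john"), ("dobj", "bought", "car"), ("det", "car", "a")}
b = {("nsubj", "bought", "john"), ("dobj", "bought", "car"), ("det", "car", "the")}
c = {("nsubj", "sold", "mary"), ("dobj", "sold", "car"), ("det", "car", "a")}

print(is_paraphrase(a, b))  # differs only in a low-weight determiner
print(is_paraphrase(a, c))  # differs in subject and object relations
```

In this sketch, down-weighting relations such as determiners keeps small surface differences from dominating the dissimilarity score, which is one plausible reading of why weighting missing dependencies matters.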

Publication Title

Informatica (Ljubljana)

Recommended Citation

Lintean, M., & Rus, V. (2010). Paraphrase identification using weighted dependencies and word semantics. Informatica (Ljubljana), 34(1), 19-28. Retrieved from https://digitalcommons.memphis.edu/facpubs/3056
