International Journal of Advanced Computer Research (IJACR) ISSN (P): 2249-7277 ISSN (O): 2277-7970 Vol - 5, Issue - 21, December 2015
  1. 1
    Google Scholar
  2. 4
    Impact Factor
Improved Vector Space Model TF/IDF Using Lexical Relations

Minh Chau Huynh, Pham Duy Thanh Le and Trong Hai Duong

Abstract

Current vector space model, for instance TF/IDF, has not yet taken into account the relations between terms; it only combines the term frequency in a document and the inverse document frequency in whole database to identify importance-score (weight) of a term respect with the document. Here we discover lexical relations among terms in the document to improve the vector space model TF/IDF. The weight generated from TF/IDF for each term, which is improved by lexical relations among related terms in the document. We evaluate the proposed method using documents selected from Wikipedia. The result shown that the proposed method is significant effective.

Keyword

Sector space model, TF/IDF, Semantics, Information retrieval, Natural language processing.

Cite this article

Refference