Summary of Text Feature Selection Method Based on VSM

Shi CHEN, Xiao-Song WU

Abstract


The selection of text feature selection method directly affects the accuracy of subsequent in text classification and clustering. This paper reviews 4 kinds of feature selection methods based on VSM include: supervised feature selection method, unsupervised feature selection method, feature selection based on TF-IDF and improvement, improved method based on ICTCLAS built-in noun patterns. We evaluate their advantages and disadvantages.

Keywords


VSM, Feature Selection, Tfidf.


DOI
10.12783/dtssehs/icss2016/9199