Research and Implementation of Unlisted Word Discovery System
Abstract
Unlisted word is a problem in Chinese word segmentation. In this paper, an improved Apriori algorithm is proposed, which can quickly and accurately identify unlisted words. The improved algorithm applied a compressed database approach to reduce the number of transactions. Compared with the traditional n-tuple algorithm and NApriori algorithm, it is faster and more effective.
Keywords
Unlisted Word, Apriori Algorithm, Transaction Compression
DOI
10.12783/dtetr/icmca2017/12347
10.12783/dtetr/icmca2017/12347
Refbacks
- There are currently no refbacks.