Urheen: A Chinese/English Lexical Analysis Toolkit
Hits
680
Authors
Unit
License
BSD
Programming Language
Urheen is a toolkit for Chinese word segmentation, Chinese pos tagging, English tokenize, and English pos tagging. The Chinese word segmentation and pos tagging modules are trained with the Chinese Tree Bank 7.0. The English pos tagging module is trained with the WSJ English treebank(02-23).
Reviews (0)
Be the first to review this listing!

