Urheen: A Chinese/English Lexical Analysis Toolkit

Hits
6174
Authors
Unit
License
BSD
Programming Language
Operating System
Linux-desktop
Windows-desktop
Rating
★★★★★
1 vote
Urheen is a toolkit for Chinese word segmentation, Chinese pos tagging, English tokenize, and English pos tagging. The Chinese word segmentation and pos tagging modules are trained with the Chinese Tree Bank 7.0. The English pos tagging module is trained with the WSJ English treebank(02-23).
OpenPR - Open Pattern Recognition Project, Powered by National Laboratory of Pattern Recognition,Casia,P.R.C ;Joomla templates by SG web hosting;Customized by Jiang nan