Re: A Corpus Worker's Toolkit (语料库工具箱)
ACWT Updates!
-Updated August 18, 2005:
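* Added NEUCSP 东北大学自然语言实验室汉语分词器 (the Northeastern University NLP Lab
Chinese word segmenter) and ICTCLAS 中科院计算所词法分析系统 (the Chinese Academy of
Sciences Institute of Computing Technology lexical analysis system)
to the TxtUtils group.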
* Corrected some user guide inaccuracies.
* Added links to the relevant programs referenced in the clips.
ReadMe excerpts covering the additions:
6) NEUCSP 东北大学自然语言实验室汉语分词器 can be downloaded from
http://www.nlplab.cn/cipsdk.html
Install the program to directory
where neucsp.exe and all other system files should be stored.
This program produces part-of-speech (POS) tagged output for the currently
open file. (When run from a Windows DOS console, which is not how it is used
here, it can also handle multiple files.)
7) ICTCLAS 中科院计算所词法分析系统 can be downloaded from
http://www.nlp.org.cn/categories/default.php?cat_id=12
Install the program to C:\ictclas, so that ictclas.exe sits in that directory.
All other system files should be stored in the subdirectory C:\ictclas\data.
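
If ICTCLAS does not run after installation, it may help to confirm that the folder layout matches the description above. The following is only a rough sketch in Python, not part of ACWT or ICTCLAS; the function name missing_ictclas_items is made up for illustration, and the only things it assumes are the two paths named in the ReadMe (ictclas.exe and the data subdirectory).

```python
from pathlib import Path


def missing_ictclas_items(root=r"C:\ictclas"):
    """Return the expected ICTCLAS paths that are absent under `root`.

    Hypothetical helper: it checks only the two items the ReadMe mentions,
    the executable ictclas.exe and the `data` subdirectory.
    """
    root = Path(root)
    exe = root / "ictclas.exe"   # main program, expected directly in root
    data = root / "data"         # subdirectory for all other system files

    missing = []
    if not exe.is_file():
        missing.append(str(exe))
    if not data.is_dir():
        missing.append(str(data))
    return missing
```

Calling missing_ictclas_items() with no argument checks the default C:\ictclas location; an empty list means both items were found. It does not verify the contents of the data folder.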
___________________________________
For the latest information about ACWT, see page 1.