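A question about word segmentation with ICTCLAS

A question for the experts here: I used ICTCLAS2008 to segment a corpus downloaded from the Peking University Modern Chinese Corpus, but the output is all garbled characters. What should I do?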
 
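This is most likely a text-encoding problem.
By default ICTCLAS accepts ANSI-encoded text (on Chinese Windows that means GB2312/GBK, not plain ASCII, which cannot represent Chinese characters), and the texts you downloaded from the web are probably UTF-8. You need to convert the UTF-8 texts to ANSI (GB2312/GBK) before segmenting them.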
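
In case it helps, here is a minimal sketch of that conversion in Python. It assumes the downloaded file really is UTF-8 and that "ANSI" on your machine means GBK (a superset of GB2312); the function name `convert_utf8_to_gbk` and the file names in the usage note are made up for illustration and are not part of ICTCLAS.

```python
from pathlib import Path


def convert_utf8_to_gbk(src_path, dst_path, errors="replace"):
    """Re-encode a UTF-8 text file as GBK and return the number of characters converted.

    Assumption: the segmenter expects GBK/GB2312 ("ANSI" on Chinese Windows).
    """
    # Decode from raw bytes so line endings are preserved exactly;
    # "utf-8-sig" also drops a leading byte-order mark if one is present.
    text = Path(src_path).read_bytes().decode("utf-8-sig")
    # With errors="replace", characters that GBK cannot represent become "?".
    Path(dst_path).write_bytes(text.encode("gbk", errors=errors))
    return len(text)
```

For example, `convert_utf8_to_gbk("corpus_utf8.txt", "corpus_gbk.txt")` would write a GBK copy that can then be given to the segmenter. Passing `errors="strict"` instead raises an error on any character that GBK cannot represent, which may be preferable if silent replacement would be a problem.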
 
Reply: Re: A question about word segmentation with ICTCLAS

Thank you, Dr. Yu! You were right. After I converted the text to ANSI encoding, ICTCLAS works well.
 