What's wrong with my ParaConc? Chinese characters display as garbled text

patricx

Senior member
Is the default setting wrong? How can I fix this problem? Please look at the Chinese characters in the screenshot.
2005082418594356.jpg



[This post was edited by xujiajin on 2005-08-25 at 14:59:56]
 
The version of ParaConc you are using does not support Unicode. You must use the GB version of the Chinese data on a Chinese Windows system. The new release of ParaConc supports Unicode (UTF-8).
 
What's wrong with my ParaConc? Help!

Old version of ParaConc (working with GB2312 Chinese data)
2005082422432552.jpg


New version of ParaConc (working with UTF-8 Chinese data)
2005082422443867.jpg
 
You must keep your Chinese data in GB (and have a Chinese Windows system or language pack) to use the old version of ParaConc.
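
If your corpus files are currently saved as UTF-8, one way to get GB data for the old version is to re-encode them outside ParaConc. Below is a minimal Python sketch of that idea. The function names and file names are made up for illustration, and it assumes the source really is UTF-8 and that every character exists in the target encoding.

```python
from pathlib import Path


def utf8_bytes_to_gb(data: bytes, target: str = "gb2312") -> bytes:
    """Re-encode UTF-8 bytes into a GB encoding (GB2312 by default)."""
    # "utf-8-sig" decodes UTF-8 and drops a leading byte-order mark if present.
    text = data.decode("utf-8-sig")
    # Strict encoding: raises UnicodeEncodeError for characters that the
    # target encoding cannot represent, instead of silently corrupting them.
    return text.encode(target)


def convert_file(src: str, dst: str, target: str = "gb2312") -> None:
    """Read a UTF-8 file and write a GB-encoded copy (hypothetical helper)."""
    Path(dst).write_bytes(utf8_bytes_to_gb(Path(src).read_bytes(), target))


# Example call (file names are placeholders):
# convert_file("chinese_utf8.txt", "chinese_gb.txt")
```

If the strict conversion fails on some characters, Python's "gbk" or "gb18030" codecs cover more characters than "gb2312"; whether the old ParaConc displays those correctly is something you would have to test.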
 
Re: What's wrong with my ParaConc? Help!

Yes, but the problem is still there.
When I search for "of" with language = English, it works, but when I search for "也许" with language = Chinese, I get nothing. Please look at these two screenshots, respectively.
Searching for "of" with language = English works:
2005082423065693.jpg


But searching for "也许" with language = Chinese returns nothing:
2005082423073194.jpg
 
Re: What's wrong with my ParaConc? Help!

Quoting patricx's post of 2005-8-24 23:07:33:
Yes, but the problem is still there.
When I search for "of" with language = English, it works, but when I search for "也许" with language = Chinese, I get nothing. Please look at these two screenshots, respectively.
Searching for "of" with language = English works:
2005082423065693.jpg


But searching for "也许" with language = Chinese returns nothing:
2005082423073194.jpg

Your Chinese text was not tokenized (i.e. segmented with word boundaries),
and this kind of text tends to give trouble to English-based programs.
 
Re: What's wrong with my ParaConc? Help!

Quoting xiaoz's post of 2005-8-24 23:17:30:
Try to search for 也许_* if you are using my Babel corpus.

Right, Dr. Xiao. I pasted "也许_*" into the search window, and ParaConc works! But I can't type the two special characters "_*" into the search window. Will I have to use these two special characters whenever I search for other Chinese words? Do I just need to paste them again each time?
2005082423490673.jpg
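
A possible explanation of why "也许_*" matches while plain "也许" does not: in underscore-tagged text, each token is the word plus "_" plus a tag, so a whole-word search for "也许" never equals a full token. The Python sketch below only imitates that behaviour and is not ParaConc's actual search code. It assumes "*" stands for any run of non-space characters, and the sample line and its tags are invented for illustration.

```python
import re


def wildcard_to_regex(query: str) -> re.Pattern:
    """Turn a whole-token query with '*' wildcards into a regex."""
    # Escape the literal pieces, then let '*' match any non-space run.
    parts = [re.escape(piece) for piece in query.split("*")]
    body = r"\S*".join(parts)
    # The lookarounds require the match to span a complete
    # whitespace-delimited token.
    return re.compile(r"(?<!\S)" + body + r"(?!\S)")


# Invented sample in word_TAG format (the tags are placeholders).
tagged = "他_r 也许_d 会_v 来_v"

print(wildcard_to_regex("也许_*").findall(tagged))  # ['也许_d']
print(wildcard_to_regex("也许").findall(tagged))    # []
```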



[This post was edited by the author on 2005-08-24 at 23:49:10]
 
Re: What's wrong with my ParaConc? Help!

This version of ParaConc does not have a way to remove POS tags in the underscore format, but the new release does.

Quoting patricx's post of 2005-8-25 0:08:46:
And how can I remove these tags from the display?
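
With the old version, one workaround might be to make an untagged copy of the corpus before loading it. Here is a rough Python sketch under the assumption that every tag is a trailing "_TAG" at the end of a space-separated token. The function name and the sample tags are hypothetical.

```python
import re

# A tag is an underscore plus non-space, non-underscore characters,
# located at the very end of a token (before whitespace or end of line).
TAG_SUFFIX = re.compile(r"_[^\s_]+(?=\s|$)")


def strip_underscore_tags(line: str) -> str:
    """Remove trailing _TAG suffixes from each token in a line."""
    return TAG_SUFFIX.sub("", line)


print(strip_underscore_tags("他_r 也许_d 会_v"))  # 他 也许 会
```

Note that searching the untagged copy means you lose the tag information, so keep the tagged original as well.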
 
If you want to enter special characters such as *, you can first remove them from the reserve list (in the Options, I think).
 
Re: What's wrong with my ParaConc? Chinese characters display as garbled text

Quoting 刘语料's post of 2005-8-25 14:38:31:
Perhaps you should first segment the Chinese text with a POS tagger, or put spaces between the characters.

You're right. When we process Chinese texts, we must not forget to tokenise them first; otherwise we can do nothing with these concordancers.
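
For the simpler of the two options, putting spaces between characters, a short Python sketch could look like the one below. It is only a crude character-level split, not real word segmentation. The function name is made up, and it assumes the text uses the common CJK Unified Ideographs range (U+4E00 to U+9FFF).

```python
import re

# Common Chinese characters (CJK Unified Ideographs block).
HAN_CHAR = re.compile(r"([\u4e00-\u9fff])")


def space_out_han(line: str) -> str:
    """Put spaces around every Chinese character in one line of text."""
    # Surround each Chinese character with spaces, then collapse the
    # resulting runs of whitespace into single spaces.
    spaced = HAN_CHAR.sub(r" \1 ", line)
    return " ".join(spaced.split())


print(space_out_han("他也许会来。"))      # 他 也 许 会 来 。
print(space_out_han("我用ParaConc检索"))  # 我 用 ParaConc 检 索
```

Apply it line by line, since the whitespace clean-up would also swallow line breaks if you passed in a whole file at once.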
 