What software can analyse collocations of Chinese words?

rainbow

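Junior Member
#1
A question for the Chinese concordancing experts
What software can analyse key words in context (KWIC) for Chinese the way WordSmith does, retrieving collocates and counting how often they occur? Thanks in advance for any advice.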
 

armstrong

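Senior Member
#2
Re: What software can analyse collocations of Chinese words?

WordSmith 4.0 can do it, provided the text has been word-segmented first. AntConc and MonoConc Pro will also work.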
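
To make concrete what these tools compute once the text is segmented, here is a minimal Python sketch. It is not how WordSmith, AntConc or MonoConc Pro work internally; the sample sentence and the function names `kwic` and `collocates` are made up for illustration, and it assumes words are already separated by spaces.

```python
from collections import Counter

def kwic(tokens, node, span=2):
    """Return (left context, node, right context) for every hit of `node`."""
    lines = []
    for i, tok in enumerate(tokens):
        if tok == node:
            left = tokens[max(0, i - span):i]
            right = tokens[i + 1:i + 1 + span]
            lines.append((" ".join(left), tok, " ".join(right)))
    return lines

def collocates(tokens, node, span=2):
    """Count every word that occurs within `span` tokens of `node`."""
    counts = Counter()
    for i, tok in enumerate(tokens):
        if tok == node:
            counts.update(tokens[max(0, i - span):i])
            counts.update(tokens[i + 1:i + 1 + span])
    return counts

# Hypothetical sample: already segmented, one space between words.
segmented = "我 喜欢 学习 汉语 。 他 也 喜欢 学习 语言学 。"
tokens = segmented.split()

for left, node, right in kwic(tokens, "学习", span=1):
    print(f"{left}\t[{node}]\t{right}")

for word, freq in collocates(tokens, "学习", span=1).most_common():
    print(word, freq)
```

The sketch only works because the input is split on spaces, which is why segmentation has to come first: without it, each sentence would be treated as one long "word".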
 

rainbow

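Junior Member
#3
Re: What software can analyse collocations of Chinese words?

Thanks, armstrong. I added a space before every character, but WordSmith 4 reports that no entries were found. Could you show me exactly how to use it?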
 

xujiajin

Administrator
Staff member
#5
Re: What software can analyse collocations of Chinese words?

Chinese texts must be saved as Unicode before they are loaded into WordSmith.
 

xiaoz

Permanent Super Administrator
Staff member
#7
Re: What software can analyse collocations of Chinese words?

...and Xaira, which is free but requires the text to be marked up in XML.
 

rainbow

Junior Member
#8
Re: What software can analyse collocations of Chinese words?

Thanks, xiaoz, but I would like to analyse .txt files. Is there any way to do that if I use Xaira for the research?
 

xiaoz

Permanent Super Administrator
Staff member
#9
Re: What software can analyse collocations of Chinese words?

.txt is a file format (plain text), whereas XML is a mark-up format. An XML document can also be saved as a .txt file.

To analyse collocation in Chinese text using Xaira, you must first tokenise/segment the text using software such as ICTCLAS; the result will look like: word1/tag1 word2/tag2... Then you can convert the word-tag pairs from the slash style to XML style. I uploaded some Perl scripts for such jobs to the site last year (just search the site). It is easy to make the whole file XML-compliant: just add a tag at the beginning and end of the file, either by hand or using Preprocessing in the Xaira Indexer tool.
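
For anyone who cannot find the Perl scripts, the conversion step can be sketched in Python roughly as follows. This is not one of the uploaded scripts; the element and attribute names (`text`, `s`, `w`, `pos`), the `UNK` fallback tag and the function names are assumptions made for illustration, not a schema that Xaira requires.

```python
from xml.sax.saxutils import escape, quoteattr

def convert_line(line):
    """Turn 'word1/tag1 word2/tag2' into <s><w pos="tag1">word1</w>...</s>."""
    words = []
    for pair in line.split():
        # Split on the LAST slash so that a token such as '//w' keeps '/' as the word.
        word, sep, tag = pair.rpartition("/")
        if not sep:
            # No slash at all: keep the token and use a placeholder tag (assumption).
            word, tag = pair, "UNK"
        # Escape &, < and > so the output stays well-formed XML.
        words.append(f"<w pos={quoteattr(tag)}>{escape(word)}</w>")
    return "<s>" + "".join(words) + "</s>"

def convert_document(lines):
    """Wrap all converted lines in a single root element."""
    body = "\n".join(convert_line(l) for l in lines if l.strip())
    return "<text>\n" + body + "\n</text>\n"

# Hypothetical segmenter output, one sentence per line.
sample = ["我/r 喜欢/v 汉语/n 。/w", "A&B/n 很/d 好/a"]
print(convert_document(sample))
```

The single root element added by `convert_document` plays the role of the tag at the beginning and end of the file.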

After indexing your corpus, you can then use the Xaira client with your corpus.
 
#15
How to change a Chinese text in XML into Unicode?

Hi everybody,

Can anyone tell me how to convert a Chinese text in XML into Unicode format? I'd like to do word searches with WordSmith 5.

Thanks
 
#16
Re: How to get an entry in a Chinese file already encoded in Unicode using WordSmith 5

Hi everybody,

Can anyone tell me how to run a concordance search in WordSmith 5 on Chinese texts that are already in Unicode format?

I've already searched for Chinese characters in WordSmith 5, but nothing came up: only some strange symbols or the message "no entry found".

I'm desperate! I've tried many times but failed. Can anyone out there help me?

Thanks

Grace
 

xiaoz

Permanent Super Administrator
Staff member
#17
Re: How to get an entry in a Chinese file already encoded in Unicode using WordSmith 5

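You'll need to ensure your text is encoded in Unicode in the sense WordSmith means, i.e. UTF-16, not UTF-8. A file saved as UTF-8 is still Unicode text, but it will not be read correctly here.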
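
If re-saving each file by hand is impractical, the conversion can be scripted. Below is a minimal Python sketch, assuming the source files are UTF-8; the function names and the file names in the usage comment are hypothetical. It writes little-endian UTF-16 with a byte order mark, which is what Windows programs usually label simply "Unicode".

```python
import codecs
from pathlib import Path

def to_utf16_le_bytes(text):
    """Encode text as little-endian UTF-16, preceded by a byte order mark."""
    return codecs.BOM_UTF16_LE + text.encode("utf-16-le")

def convert_file(src, dst):
    """Read a UTF-8 file (with or without BOM) and write it out as UTF-16."""
    # 'utf-8-sig' silently drops a UTF-8 BOM if one is present.
    text = Path(src).read_text(encoding="utf-8-sig")
    Path(dst).write_bytes(to_utf16_le_bytes(text))

# Hypothetical usage:
# convert_file("corpus_utf8.txt", "corpus_utf16.txt")
```

If the source file is in a legacy encoding such as GB2312 rather than UTF-8, the `encoding` argument in `read_text` would need to be changed accordingly.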


 