[Discussion] Google As a Corpus Tool

Re: [Discussion] Google As a Corpus Tool

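The table formatting in the paper got scrambled; a readable version is at: http://blog.donews.com/dzhigner/archive/2008/05/11/1289013.aspx
Also, I think the most successful project in this area so far is Internet corpora (http://corpus.leeds.ac.uk/internet.html). Tools built on the "web as corpus" idea should eventually grow into something on the scale of Google, although developing one as a commercial project would probably not pay off.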

It has been a long time since we last heard from Moderator Ding. :)
 
Re: [Discussion] Google As a Corpus Tool

Representativeness is the central concern in corpus construction. In this light, the web cannot be regarded as an ideal corpus.
 