我在建一个小型英文语料库，不知什么建库软件比较好

散步的鱼 · 2010-09-25

我现在在建一个小型语料库，纯英文的，写论文用，但是不知道什么建库软件比较好，最好是免费的，谢谢帮助！

jeremy · 2010-12-03

回复: 求助，谢谢！

您好,

可能要先考慮研究問題是什麼, 再挑選適合的軟件.
免費當然 AntConc 為上選, 但 WordSmith v5.0 真的有許多有趣的功能, 而且價格還可以.

volfer · 2010-12-03

回复: 求助，谢谢！

AntConc和WordSmith都是检索工具，LZ是想问建库工具么？其实建库不需要什么工具。只要把一定数量的语料纯文本化，就可以称之为“语料库”了。

jeremy · 2010-12-03

回复: 求助，谢谢！

同意樓上前輩.

重點在純文本化.

concord world · 2010-12-06

回复: 我在建一个小型英文语料库，不知什么建库软件比较好

我也是这样想的，建库应该就是一个搜集和整理文件的过程，然后存为纯文本文件就行了，应该不用什么建库软件的，但如果想使用库，就得用一些检索软件了。

jeremy · 2010-12-06

回复: 我在建一个小型英文语料库，不知什么建库软件比较好

刚刚突然想到,WordSmith v5.0 有内建搭配搜寻引擎收集语料的功能: WebGetter.
Mick Scott老师真的很厉害.

以下为 WordSmith 内 WebGetter 的部份 manual
................................................................
WebGetter visits the search engine you specify and downloads the first 1000 sources or so. Basically it uses the search engine just as you do yourself, getting a list of useful references. Then it sends out a robot to visit each web address and download the web page in each case (not from the search engine's cache but from the original web-site). Quite a few robots may be out there searching for you at once -- the advantage of this is that one slow download doesn't hold all the others up.

After downloading a web page, that WebGetter robot checks it meets your requirements (in Settings) and cleans up the resulting text. If the page is big enough, a file with a name very similar to the web address will be saved to your hard disk.

When it runs out of references, WebGetter re-visits the search engine and gets some more.

jeremy · 2010-12-06

回复: 我在建一个小型英文语料库，不知什么建库软件比较好

我不知道 RMB 571.11 算不算贵.我前两个月用NT买的,可能刚好台币升值, 觉得价钱还好.

WordSmith Tools Single User Unit Price: RMB 571.11
Version: 5.0
Language: English

Drinksbeer · 2010-12-06

回复: 我在建一个小型英文语料库，不知什么建库软件比较好

如要建页面检索平台呢？

jeremy · 2010-12-06

回复: 我在建一个小型英文语料库，不知什么建库软件比较好

附上 WebGetter 操作说明, 请大家支持正版喔.

jeremy · 2010-12-06

回复: 我在建一个小型英文语料库，不知什么建库软件比较好

作者 Drinksbeer:
如要建页面检索平台呢？

不好意思,不太懂您问题的意思...

xujiajin · 2010-12-06

回复: 我在建一个小型英文语料库，不知什么建库软件比较好

Drinksbeer说的可能是online query interface。

那一般需要会编程才行，webgetter帮不了你建query system。

flycap · 2010-12-12

回复: 我在建一个小型英文语料库，不知什么建库软件比较好

如果字数不多的话，可以考虑从CLAWS的体验网站上获取标注语料。上面有C5，C7两种格式
http://ucrel.lancs.ac.uk/claws/trial.html

danway69 · 2012-05-24

回复: 我在建一个小型英文语料库，不知什么建库软件比较好

请问各位，大家听说过 CHINA ENGLISH CORPUS中国英语语料库吗？在网上怎么找到？

我在建一个小型英文语料库，不知什么建库软件比较好

散步的鱼

jeremy

volfer

Moderator

jeremy

concord world

jeremy

jeremy

Drinksbeer

jeremy

附件

jeremy

xujiajin

管理员

flycap

论坛混混

danway69