Quoting dzhigner, posted 2005-8-14 10:35:21:
Some must-downloads:
And the best part of it is the data of ICE-GB: first go to the official site of ICE-GB to download a sampler and refill the data folder with what you have got from the aforementioned FTP.
Thanks! These materials really are valuable, but I can't download them. Perhaps the owner has protected them well!
Quoting xujiajin, posted 2005-8-12 17:42:13:
ftp://*************************
Someone else put these online; we are not distributing them casually. Please use them for study purposes only.
**********
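To promote the protection of intellectual property, we have hidden the relevant links that appeared in the forum. We hope for your support, and we apologize for our earlier lapses in this regard.
[This post was edited by the author on 2005-08-15 17:58:42]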
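I've downloaded everything I should have, and left alone what I shouldn't, haha!
Quoting xujiajin, posted 2005-8-15 20:31:10:
Some users say they can download the files, others can't. I don't know what is going on. I can't download them from here either.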
I have done what you suggested for ICE-GB and got the full set of 500 texts, but fragment searching behaves the same as before: a search for "play", say, produces only three hits instead of the more than 300 indicated by "Getting Started". How can I get the full functionality of ICE-GB? Many thanks.
Quoting dzhigner, posted 2005-8-14 10:35:21:
Some must-downloads:
MonoConc Pro v2.0
WordPilot (I think this one is a nice and simple DDL tool)
WordSmith v3 (if you don't have it)
And the best part of it is the data of ICE-GB: first go to the official site of ICE-GB to download a sampler and refill the data folder with what you have got from the aforementioned FTP.
But I have full "data" and "index" subfolders, each containing 500 files...
Quoting xiaoz, posted 2005-8-16 22:14:39:
The additional files are not indexed....
Quoting xiaoz, posted 2005-8-17 8:41:02:
Also check the files in the following folders:
DATA: 500 corpus files
INDEX: 500 index files
LEXICAL: three files (ICE-GB.IDX 135 KB; ICE-GB.SID 948 KB; ICE-GB.SSI 3658 KB)
MARKUP: three files (ICE-GB.IDX 12 KB; ICE-GB.SID 10 KB; ICE-GB.SSI 639 KB)
NODAL: 7 files (ice-gb2.idx 42 kb; ice-gb.sid 3668 kb; ice-gb.ssi 3890 kb; ice-gb.sso 13160 kb; ice-gb.idx 30 kb; ice-gb.sid 277 kb; ice-gb.ssi 3790 kb)
VARS: 8 files
TEXT: the most important parameter files in this folder:
ICE-GB.txt 391 KB (description of the 500 files)
TEXT.txt 13 KB (giving the 200 written files?)
STEXT.txt 17 KB (giving the 300 spoken files)
...
Just downloading 500 data files will not help. These data files must be indexed properly for use with ICE-UP.
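As a quick way to run through the checklist above, here is a minimal Python sketch that counts the files in each folder and reports any folder whose count differs from the figures listed. It is not part of ICE-GB or ICE-UP. The function names and the default root folder "ICE-GB-S" are assumptions for illustration, the folder names are assumed to be spelled exactly as in the list, and the TEXT folder is skipped because its full file list is not given. A matching count does not prove the files are indexed correctly; it only catches missing files.

```python
from pathlib import Path

# Expected file counts per folder, copied from the checklist above.
# TEXT is left out because the post does not give its full file count.
EXPECTED_COUNTS = {
    "DATA": 500,
    "INDEX": 500,
    "LEXICAL": 3,
    "MARKUP": 3,
    "NODAL": 7,
    "VARS": 8,
}


def count_files(folder: Path) -> int:
    """Count regular files directly inside folder (0 if it does not exist)."""
    if not folder.is_dir():
        return 0
    return sum(1 for entry in folder.iterdir() if entry.is_file())


def check_layout(root: str) -> dict:
    """Return {folder: (expected, found)} for each folder whose count is off."""
    root_path = Path(root)
    problems = {}
    for name, expected in EXPECTED_COUNTS.items():
        found = count_files(root_path / name)
        if found != expected:
            problems[name] = (expected, found)
    return problems


if __name__ == "__main__":
    import sys

    # Hypothetical default: the install folder name mentioned in this thread.
    root = sys.argv[1] if len(sys.argv) > 1 else "ICE-GB-S"
    for name, (expected, found) in check_layout(root).items():
        print(f"{name}: expected {expected} files, found {found}")
```

An empty result from check_layout means every listed folder has the expected number of files; anything it reports is a folder worth comparing against the list by hand.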
Quoting dzhigner, posted 2005-8-17 10:47:15:
Don't just replace the "Data" folder; replace everything in the folder initially named "ICE-GB-S".