Release 1.14 Available
The major change in this version is the switch to index format 23, with these changes: (i) support for thread file in several pieces (absence of this support was limiting corpus size to 500M words);
(ii) support for large indexes, that is, indexes for corpora with more than 4G words. This involves widening a couple of index fields to 64 bits and
(surprising to say) has not yet been tested;
(iii) 20% reduction in uncompressed index size;
(iv) redesign to reduce working space required by indexer. We also fixed a bug that only affected corpora of 1M+ words but caused disastrous memory leaks after that many words had been processed. This version has been used to index a corpus of approaching 1G words.
Because of the change in index format all corpora must be reindexed.
See the Download section for the download link.
The major change in this version is the switch to index format 23, with these changes: (i) support for thread file in several pieces (absence of this support was limiting corpus size to 500M words);
(ii) support for large indexes, that is, indexes for corpora with more than 4G words. This involves widening a couple of index fields to 64 bits and
(surprising to say) has not yet been tested;
(iii) 20% reduction in uncompressed index size;
(iv) redesign to reduce working space required by indexer. We also fixed a bug that only affected corpora of 1M+ words but caused disastrous memory leaks after that many words had been processed. This version has been used to index a corpus of approaching 1G words.
Because of the change in index format all corpora must be reindexed.
See the Download section for the download link.
[本贴已被 作者 于 2005年06月13日 01时07分35秒 编辑过]