如何建BNC子语料库

回复: 如何建BNC子语料库

It's unclear what criteria you use in selecting the files for your subcorpus. If you already know which files to include, you can simply select those files by filenams when you index your subcorpus using Xaira.

A more lilely scenario is that you are interested in a specific genre but do not know which files belong to that genre. In this case, the BNC Indexer can help: http://ucrel.lancs.ac.uk/bncindex/form.html


请问我如何建BNC的子语料库,比如把A01到A08选取出来做个子语料库,并可用XAIRA分析

谢谢
 
回复: 如何建BNC子语料库

非常感谢两位的解读。但是我想进一步解释一下我的目的。我已经用XAIRA打开了BNC的整个语料库,显示所有的子文件。我怎么从这些子文件中挑选若干个组成一个新的子语料库?这些文件的挑选是随机的,任意的。
版主的回答 you can simply select those files by filenams when you index your subcorpus using Xaira.
但是问题在于我还没有建成SUBCORPUS。而用XAIRA是无法打开BNC安装后的TEXTS文件夹中的文件的,因为要求打开的文件类型应该为*.xcorpus。
如何通过XAIRA提取BNC中的若干文件组建自己的语料库呢?
谢谢
 
回复: 如何建BNC子语料库

非常感谢两位的解读。但是我想进一步解释一下我的目的。我已经用XAIRA打开了BNC的整个语料库,显示所有的子文件。我怎么从这些子文件中挑选若干个组成一个新的子语料库?这些文件的挑选是随机的,任意的。
版主的回答 you can simply select those files by filenams when you index your subcorpus using Xaira.
但是问题在于我还没有建成SUBCORPUS。而用XAIRA是无法打开BNC安装后的TEXTS文件夹中的文件的,因为要求打开的文件类型应该为*.xcorpus。
如何通过XAIRA提取BNC中的若干文件组建自己的语料库呢?
谢谢

Xaira tools is designed for indexing, while Xaira client is used for concorndancing.
 
回复: 如何建BNC子语料库

You can use Xaira client to create a subcorpus, which is called "partition" in Xaira. There are three ways to create subcorpora, or to define partitions in Xaira terms (in the main menu go to Texts - Define partition):

Create an empty partition with selected classes;

Create a partition based on values in a column - but you will first need to add selested vlaues to the column by selecting Texts - Column control; this approach is particularly useful in creating subcorpora from the BNC by using the values in XML elements;

Create a partition based on solutions to a query (i.e. all texts that contain the search term after you have made a query).
 
Back
顶部