在看Mc Enery的书Corpus Linguistics 中提到 “ Balanced corpus, also known as sample corpus, tries to represent a particular type of language over a specific span of time. In doing so, it tries to be balanced and representative within a particular sampling frame which defines the type of language, the population that we would like to characterize. ”
书后面的glossary给的定义是:balanced corpus:A corpus that contains texts from a wide range of different language genres and text domains, so that, for example, it may include both spoken and written, and public and private texts. Balanced corpus is sometimes referred to as reference, general or core corpora. Corpus which seeks balance and representativeness within a given sampling frame is a balanced corpus.
想询问balanced corpus 和sample corpus 真的可以完全划等号吗
书后面的glossary给的定义是:balanced corpus:A corpus that contains texts from a wide range of different language genres and text domains, so that, for example, it may include both spoken and written, and public and private texts. Balanced corpus is sometimes referred to as reference, general or core corpora. Corpus which seeks balance and representativeness within a given sampling frame is a balanced corpus.
想询问balanced corpus 和sample corpus 真的可以完全划等号吗