Help: problem computing lexical density with ACWT

melia · 2007-03-19

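Please allow me to ask again, because I need this data fairly urgently. The corpus is in English.
When running the statistics, I clicked the lexical density ure/stubb item, expecting it to compute LD automatically, but a dialog box popped up asking me to enter the number of content words. How do I obtain this content-word count? I would be grateful for urgent advice from the experts.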

melia · 2007-03-19

Re: Help: problem computing lexical density with ACWT

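Can range also be used to compute LD? Which item in its output statistics is the LD? (LD = content/grammatical.)
I am short on time and cannot study the software carefully. I would sincerely appreciate everyone's help.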

laohong · 2007-03-19

Re: Help: problem computing lexical density with ACWT

Read this paper.

How Variable May a Constant be? Measures of Lexical Richness in Perspective

Fiona J. Tweedie (1) and R. Harald Baayen (2)

(1) University of Glasgow, United Kingdom
(2) Max Planck Institute for Psycholinguistics, Nijmegen, The Netherlands

Abstract
A well-known problem in the domain of quantitative linguistics and stylistics concerns the evaluation of the lexical richness of texts. Since the most obvious measure of lexical richness, the vocabulary size (the number of different word types), depends heavily on the text length (measured in word tokens), a variety of alternative measures has been proposed which are claimed to be independent of the text length. This paper has a threefold aim. Firstly, we have investigated to what extent these alternative measures are truly textual constants. We have observed that in practice all measures vary substantially and systematically with the text length. We also show that in theory, only three of these measures are truly constant or nearly constant. Secondly, we have studied the extent to which these measures tap into different aspects of lexical structure. We have found that there are two main families of constants, one measuring lexical richness and one measuring lexical repetition. Thirdly, we have considered to what extent these measures can be used to investigate questions of textual similarity between and within authors. We propose to carry out such comparisons by means of the empirical trajectories of texts in the plane spanned by the dimensions of lexical richness and lexical repetition, and we provide a statistical technique for constructing confidence intervals around the empirical trajectories of texts. Our results suggest that the trajectories tap into a considerable amount of authorial structure without, however, guaranteeing that spatial separation implies a difference in authorship.

Keywords
lexical statistics - Monte Carlo methods - vocabulary richness

Download the full paper

melia · 2007-03-19

Re: Help: problem computing lexical density with ACWT

Thanks, you're always so helpful.

清风出袖 · 2007-03-22

Re: Help: problem computing lexical density with ACWT

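This is probably related mainly to the corpus size that ACWT supports: files that are too large cause problems. Try other tools; they work just as well.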
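
If a tool asks for the content-word count as input, one rough way to get it outside the tool is to count the tokens that are not on a function-word list. Below is a minimal Python sketch under stated assumptions: `FUNCTION_WORDS` and `lexical_density` are hypothetical names made up for illustration, the word list is deliberately short and incomplete, and none of this reflects how ACWT or range work internally. It reports density as content words over total tokens, as a percentage; the content/grammatical ratio mentioned earlier in the thread can be derived from the same two counts.

```python
import re

# Illustrative, incomplete function-word list (an assumption for this sketch,
# not the list used by ACWT or any other tool).
FUNCTION_WORDS = {
    "a", "an", "the", "and", "or", "but", "if", "of", "in", "on", "at",
    "to", "for", "with", "by", "from", "as", "is", "are", "was", "were",
    "be", "been", "am", "do", "does", "did", "have", "has", "had",
    "i", "you", "he", "she", "it", "we", "they", "this", "that", "not",
    "will", "would", "can", "could", "shall", "should", "may", "might", "must",
}


def lexical_density(text):
    """Return (content_word_count, total_tokens, density_percent)."""
    # Lowercase the text and keep alphabetic tokens, allowing one apostrophe
    # inside a word (e.g. "don't").
    tokens = re.findall(r"[a-z]+(?:'[a-z]+)?", text.lower())
    # Treat every token that is not on the function-word list as a content word.
    content = [t for t in tokens if t not in FUNCTION_WORDS]
    total = len(tokens)
    density = 100.0 * len(content) / total if total else 0.0
    return len(content), total, density


if __name__ == "__main__":
    content, total, density = lexical_density("The cat sat on the mat.")
    print(content, total, round(density, 1))
```

The first value returned is the figure a dialog asking for the number of content words would expect. A stop-list count like this is only an approximation; a part-of-speech tagger would give a more defensible content-word count, at the cost of an extra dependency.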
