关于Biber's MD analysis的讨论

xujiajin · 2011-02-07

下面corpora list上的讨论，说得非常好。转帖如下：

On 2/5/2011 5:14 AM, Andrea Nini wrote:
> 1) is it possible to calculate the factor scores of a new text using
> the factors that Biber used for his study?

Absolutely, but see below.

> 2) would it affect the results the fact that my texts have to be
> normalised to 100 words whereas Biber's texts were normalised to 1000
> words?

In principle yes, but see below.

> 3) when calculating the factor scores for my texts, what means should
> I consider? The ones taken from my dataset or the ones taken from
> Biber's study?

The ones from your dataset, absolutely, but...

Multidimensional analysis is exciting, but there are significant
problems. The main one that I found is that Biber did not use
per-choice frequencies, so the co-occurrences he identified could have
been due to grammar. In fact, you could interpret his Dimension 1 as
simply "nouns vs. verbs" and Dimension 2 as "past vs. present." I tried
to use the envelope of variation to counteract this, but I was not
successful. I discussed this more in a paper at the 2005 AACL:

http://www.grieve-smith.com/Academic/AAACL-grvsmth.060225.pdf

--
-Angus B. Grieve-Smith
Saint John's University
grvsmth@panix.com

volfer · 2011-02-07

回复: 关于Biber's MD analysis的讨论

谢谢分享。

qhdjason · 2012-09-23

回复: 关于Biber's MD analysis的讨论

Insightful ideas!

The link is broken now. The paper about envelope of variation has been included in the edited volume:

http://www.ingentaconnect.com/content/rodopi/lang/2006/00000060/00000001/art00003

关于Biber's MD analysis的讨论

xujiajin

管理员

volfer

Moderator

qhdjason