acuteknife
初级会员
最近阅读了刘鼎甲老师2021年发表的论文《新冠肺炎疫情中美国媒体涉华报道的语料库历时分析》,根据文献追踪到UGA方法,在操作时对于数据格式存在不解,想请教群里的大咖,以下数据是哪种软件操作得出的,数据格式是TXT。
https://cqpweb.lancs.ac.uk/eebov3/c...t=-|year~1608&t=-|year~1609&del=end&uT=yThere are 2,154 different words in your collocation database for "[lemma="(whore)_SUBST"%c]". (Your query "{whore/N}", restricted to texts meeting criteria "<em>year</em>: <em>1600</em> or <em>1601</em> or <em>1602</em> or <em>1603</em> or <em>1604</em> or <em>1605</em> or <em>1606</em> or <em>1607</em> or <em>1608</em> or <em>1609</em>", returned 1,215 matches in 292 different texts)
__________________
No. Word Total no. in whole corpus Expected collocate frequency Observed collocate frequency In no. of texts Mutual information value
1 dedle 6 0 2 1 14.288
2 Strumpett 8 0 2 2 14.288
3 panderly 13 0 1 1 13.288
4 permane 14 0 1 1 13.288
5 Cocatrise 5 0 1 1 13.288
6 seauen-headed 11 0 1 1 13.288
7 crosbitten 5 0 1 1 13.288
8 Curtesanes 7 0 1 1 13.288
9 brodell 12 0 1 1 13.288
10 swoundes 7 0 1 1 13.288
11 Shpheards 6 0 1 1 13.288
12 Bobylon 7 0 1 1 13.288
13 estreme 7 0 1 1 13.288
14 plaisure 6 0 1 1 13.288
15 huir 7 0 1 1 13.288
16 successon 13 0 1 1 13.288
17 Mezell 13 0 1 1 13.288
https://cqpweb.lancs.ac.uk/eebov3/c...t=-|year~1608&t=-|year~1609&del=end&uT=yThere are 2,154 different words in your collocation database for "[lemma="(whore)_SUBST"%c]". (Your query "{whore/N}", restricted to texts meeting criteria "<em>year</em>: <em>1600</em> or <em>1601</em> or <em>1602</em> or <em>1603</em> or <em>1604</em> or <em>1605</em> or <em>1606</em> or <em>1607</em> or <em>1608</em> or <em>1609</em>", returned 1,215 matches in 292 different texts)
__________________
No. Word Total no. in whole corpus Expected collocate frequency Observed collocate frequency In no. of texts Mutual information value
1 dedle 6 0 2 1 14.288
2 Strumpett 8 0 2 2 14.288
3 panderly 13 0 1 1 13.288
4 permane 14 0 1 1 13.288
5 Cocatrise 5 0 1 1 13.288
6 seauen-headed 11 0 1 1 13.288
7 crosbitten 5 0 1 1 13.288
8 Curtesanes 7 0 1 1 13.288
9 brodell 12 0 1 1 13.288
10 swoundes 7 0 1 1 13.288
11 Shpheards 6 0 1 1 13.288
12 Bobylon 7 0 1 1 13.288
13 estreme 7 0 1 1 13.288
14 plaisure 6 0 1 1 13.288
15 huir 7 0 1 1 13.288
16 successon 13 0 1 1 13.288
17 Mezell 13 0 1 1 13.288