Wenzhou Spoken Corpus

xiaoz

Permanent super administrator
Staff member
I am pleased to announce the availability of the Wenzhou Spoken Corpus, a new online searchable corpus of Wenzhou containing about 150,000 words:

http://corpora.tapor.ualberta.ca/wenzhou/

Users can select either a KWIC concordance display or a collocates list (either aggregated into one list or separated out into L1, R1, etc.). Speaker information (gender, age, education, etc.) is also available for each speaker. Feedback on the corpus and the search tools is welcome.


John Newman
Department of Linguistics
4-32 Assiniboia Hall, University of Alberta
Edmonton T6G 2E7
CANADA
Fax: (780) 492-0806, Tel: (780) 492-5500
Homepage: http://www.ualberta.ca/~johnnewm

[This post was edited by xujiajin on 2006-01-08 at 19:42:55]
 
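Corpora of regional dialects and varieties are extremely interesting, even more so than standard-language corpora.
That said, a 150,000-word spoken corpus is a bit small.
The data I used for my own thesis, a little over 8 hours of recordings, comes to roughly 140,000 characters.

If you set aside the Wenzhou News Commentary, Internet Chat, Interview and Wenzhou Songs components, the amount of casual face-to-face conversation is probably not very large.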
 