追忆JDEST

xujiajin

管理员
Staff member
作为中国大陆最早的计算机化的大型英语语料库,在中国语料库语言学史上的确值得书上一笔。只是有关这个由杨惠中教授在上个世纪80年代初创建的科技英语语料库的相关信息很少,不知道大家又没有相关的介绍文章和基于该语料库所作研究的报告。
JDEST: Jiao Da English for Science and Technology
An introductory paper:
Page 1:
http://www.corpus4u.org/upload/forum/2005062316021275.pdf

Page 2:
http://www.corpus4u.org/upload/forum/2005062316040184.pdf

Page 3:
http://www.corpus4u.org/upload/forum/2005062316072971.pdf
 

Haiyang Ai

Administrator
在杨老师主编的《语料库语言学导论》中有提及,上海外语教育出版社,2002
不知道有没有基于这个专门做研究的论文。

我觉得还是Richard说的对,语料库只有公开才能有所发展,这一点应该多学习。
 

xujiajin

管理员
Staff member
其实杨老师1985,1986年前后在Literary and Linguistic Computing 上面发过两篇基于JDEST做的研究的文章。
 

xiaoz

永远的超级管理员
Staff member
之所以要追忆,是因为它已成为历史。
Brown and LOB 不需要追忆,because they are still in currency.
Yang is great as a pioneer in corpus research at a time when corpora were not as popular as today. The JDEST corpus is now only mentioned in the chapter on the history of corpus linguistics in textbooks.
 

xujiajin

管理员
Staff member
---------------------------------------------------------------------
JDEST CORPUS (Jiao Tong University corpus for EST)
---------------------------------------------------------------------
Compiled by: Yang Huizhong
Compiled at: Jiao Tong University
Date of compilation:
Language (variety): English
Spoken/written: Written
Size: Approximately 1 million words
Details of material: Consists of English used in Science and
Technology.
Organisation: Divided into the following subject areas:
Computers
Metallurgy
Machine Building
Physics
Electrical Engineering
Civil Engineering
Chemical Engineering
Naval Architecture
Atomic Energy
Aircraft Manufacturing
How transcribed: Ordinary written text
How analysed: Frequency list has been produced
Use of corpus: To meet the needs of students of English used
in science and technology.
Availability:
Storage details:
Other:

---------------------------------------------------------------------
 

xujiajin

管理员
Staff member
大家再帮忙找一下看有没有相关的文章了,谢谢了。特别是看能不能找到Literary and Linguistic Computing上的那两篇文章。C友中有没有杨老师的学生,帮忙顶一下吧。
 

hhyjulia

初级会员
Actually, JDEST is not a history; it keeps enlarging all the time. As gradutes in Sjtu, we all have contributed to it. But it is not publicly available.:(
 

xujiajin

管理员
Staff member
Could you please tell us about her current scale and general information and her objective? Thanks.
 

hhyjulia

初级会员
The academic English corpus JDEST (Jiao Tong University Corpus of English for Science and Technology) developed by Shanghai Jiao Tong University, shows a great advantage in its capacity. It consists of 4,000,000 running words. Moreover, there is a good balance between different research areas, between different text types and between American, British and Canadian English as well.
 

xujiajin

管理员
Staff member
julia, thank you for the updates of JDEST. But do you have the detailed info about JDEST's current scale to share with us? Or would you please to ask for Prof. Yang or Prof. Wei Naixing for an official version of the "vital statistics" of JDEST? Thanks in advance.
 

hancunxin

Moderator
《语料库语言学导论》杨惠中 主编 PAGE 79 注3 到2000年,JDEST的库容量已扩充为4,082,369 个词。
现在应该不止这个数把。
 

xujiajin

管理员
Staff member
目前,看来得到的信息都是规模上的扩大,不知道标注、应用方面有哪些设想和进展?
 

marilei

初级会员
回复: 追忆JDEST

论学术英语中的情态模糊限制语《外语电化教学》 aug, 2006 p47 是基于jdest的
 

oliverban

初级会员
回复: 追忆JDEST

THE T-SERIES REPRESENTS WHAT? THERE ARE SO MANY OF THEM AND I FOUND IT LARGER THAN 4MILLION RUNNING WORDS? IS IT BECAUSE OF THE NUMBERS CONTAINED?
 
回复: 追忆JDEST

YANG HUIZHONG
A New Technique for Identifying Scientific/Technical Terms and Describing Science Texts: (An Interim Report)
Lit Linguist Computing 1986 1: 93-103; doi:10.1093/llc/1.2.93 [Abstract] [PDF] [Request Permissions]
 
顶部