help: corpus of English news text

I want to study noun+noun combinations in a corpus of English news text, such as the New York Times or Reuters. However, I am not sure whether it is worth doing, since I am a beginner in corpus work. If it is, how can I gain access to such a corpus?
 
Re: help: corpus of English news text

You can find plenty of news text in balanced corpora such as the BNC, ANC, and ICE, as well as in the Penn Treebank.

You may also want to build your own corpus of newspaper texts, since so many online newspapers are freely available.

As for Reuters: the Reuters Corpus, Volume 1 (English language, 1996-08-20 to 1997-08-19) was released in November 2000. It is distributed on two CDs and contains about 810,000 Reuters English-language news stories. The uncompressed files require about 2.5 GB of storage.

I have it but seldom use it. Anyway, you can get a copy of the Reuters Corpus free of charge at:
http://about.reuters.com/researchandstandards/corpus/
 
Re: help: corpus of English news text

Thanks. One more question: how do I search for noun+noun? I assume an online search won't work. Can AntConc do it?
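
A noun+noun search generally needs text that already carries part-of-speech tags. Here is a minimal sketch of the idea in Python, assuming the text is in `word_TAG` form with Penn Treebank noun tags (NN, NNS, NNP, NNPS). The function name `noun_noun_pairs` and the sample sentence are made up for illustration, and the tagging itself would have to be done beforehand with a tagger of your choice.

```python
from collections import Counter

# Assumption: Penn Treebank noun tags; adjust for a different tagset.
NOUN_TAGS = {"NN", "NNS", "NNP", "NNPS"}

def noun_noun_pairs(tagged_text):
    """Count adjacent noun+noun pairs in whitespace-separated word_TAG text."""
    tokens = []
    for item in tagged_text.split():
        # Split on the last underscore so words containing "_" still work.
        word, sep, tag = item.rpartition("_")
        if not sep:
            # Untagged item: keep it in place with an empty tag.
            word, tag = item, ""
        tokens.append((word, tag))

    pairs = Counter()
    # Walk over every pair of neighbouring tokens.
    for (w1, t1), (w2, t2) in zip(tokens, tokens[1:]):
        if t1 in NOUN_TAGS and t2 in NOUN_TAGS:
            pairs[(w1.lower(), w2.lower())] += 1
    return pairs

# Hypothetical tagged sentence, not taken from any real corpus.
sample = "Oil_NN prices_NNS rose_VBD after_IN the_DT stock_NN market_NN opened_VBD ._."

for (first, second), count in noun_noun_pairs(sample).most_common():
    print(first, second, count)
```

A run of three nouns is counted as two overlapping pairs here, which may or may not be what a given study wants.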
 