Business Letter Corpus Online KWIC Concordancer
1 MILLION WORDS BUSINESS LETTER CORPUS (US & UK) AND OTHER CORPORA
http://ysomeya.hp.infoseek.co.jp/
01 Business Letter Corpus (BLC, contains 1,020,060 word tokens of U.S. and U.K. samples, as of March 1, Y2K)
02 POS tagged BLC (A part-of-speech tagged version of the BLC. Click here for the list of POS tags).
03 Personal Letter Corpus (PLC, contains 113,522 word tokens of American samples, as of June 16, Y2K).
04 POS tagged PLC (A part-of-speech tagged version of the Personal Letter Corpus, as of March 11, 2001).
--- (Letters of Historic Figures)
05-09 Personal Letters by 19th Century Historical Figures (These four corpora contain personal and professional letters by 19th century celebrities. Click here for more details, as of June 15, Y2K)
10 Above 05 to 09 combined (contains 910,363 word tokens).
--- (Literature and Screenplays)
11 Alice's Adventures in Wonderland (Lewis Carroll, 1865: 26,949 word tokens)
12 Through the Looking Glass and What Alice Found There (Lewis Carroll, 1872: 29,888 word tokens).
13 The Adventures of Tom Sawyer (Mark Twain, 1876: 65,942 word tokens).
14 The Adventures of Huckleberry Finn (Mark Twain, 1884: 110,865 word tokens).
15 It's a Wonderful Life (Screenplay by Frank Capra, 1946: 17,066 word tokens)
16 REBECCA (Screenplay by A. Hitchcock, 1940: 16,062 word tokens)
--- (Under construction)
17 U.S. Journalistic Articles (2,102,749 word tokens of U.S. journalistic articles)
18 Learner BLC: WM98 (209,461 word tokens in 1,464 letters written by Japanese business people. All the linguistic surface errors contained in the original data remain as they are.)
1 MILLION WORDS BUSINESS LETTER CORPUS (US & UK) AND OTHER CORPORA
http://ysomeya.hp.infoseek.co.jp/
01 Business Letter Corpus (BLC, contains 1,020,060 word tokens of U.S. and U.K. samples, as of March 1, Y2K)
02 POS tagged BLC (A part-of-speech tagged version of the BLC. Click here for the list of POS tags).
03 Personal Letter Corpus (PLC, contains 113,522 word tokens of American samples, as of June 16, Y2K).
04 POS tagged PLC (A part-of-speech tagged version of the Personal Letter Corpus, as of March 11, 2001).
--- (Letters of Historic Figures)
05-09 Personal Letters by 19th Century Historical Figures (These four corpora contain personal and professional letters by 19th century celebrities. Click here for more details, as of June 15, Y2K)
10 Above 05 to 09 combined (contains 910,363 word tokens).
--- (Literature and Screenplays)
11 Alice's Adventures in Wonderland (Lewis Carroll, 1865: 26,949 word tokens)
12 Through the Looking Glass and What Alice Found There (Lewis Carroll, 1872: 29,888 word tokens).
13 The Adventures of Tom Sawyer (Mark Twain, 1876: 65,942 word tokens).
14 The Adventures of Huckleberry Finn (Mark Twain, 1884: 110,865 word tokens).
15 It's a Wonderful Life (Screenplay by Frank Capra, 1946: 17,066 word tokens)
16 REBECCA (Screenplay by A. Hitchcock, 1940: 16,062 word tokens)
--- (Under construction)
17 U.S. Journalistic Articles (2,102,749 word tokens of U.S. journalistic articles)
18 Learner BLC: WM98 (209,461 word tokens in 1,464 letters written by Japanese business people. All the linguistic surface errors contained in the original data remain as they are.)