Here are the frequency lists of the top 5000 Chinese words and the top 2000 Chinese characters covered in the just published frequency dictionary of Mandarin Chinese (http://www.routledge.com/books/A-Frequency-Dictionary-of-Mandarin-Chinese-isbn9780415455862). These lists are based on a balanced corpus of ca. 50 million words (or ca. 73 million chinese characters).
In addition to the normalised frequency (normalised to per million words / characters for the character list), I have included here the usage rate and dispersion rate for both word list and character list. (The published dictionary does not include such statistics for the character list). For a discussion of these concepts and the rationale behind them, for a discussion of the relationship between the lists and the HSK lexical syllabus, or for a presentation of the corpus data, please refer to the Introduction chapter of the book.
These lists can be referenced in your own research by citing the above Routledge frequency dictionary.
In addition to the normalised frequency (normalised to per million words / characters for the character list), I have included here the usage rate and dispersion rate for both word list and character list. (The published dictionary does not include such statistics for the character list). For a discussion of these concepts and the rationale behind them, for a discussion of the relationship between the lists and the HSK lexical syllabus, or for a presentation of the corpus data, please refer to the Introduction chapter of the book.
These lists can be referenced in your own research by citing the above Routledge frequency dictionary.