回复:Lexical coverage of spoken discourse
Afraid wordlists are not a solution in this case. In all registers and genres, function words such as the and of will sit on top of the frequency lists. The key lies in keywords
^^^^^^^
and key keywords (WordSmith).
^^^^^^^^^^^^
How such (key) key words be worked out in addition to the stop list of function words?