如何得到一个词汇表,不包括这些划线的单词

patricx

高级会员
如题:如何得到一个词汇表,不包括这些划线的单词。

2006052116213313.jpg
 
You've already got it right in the capture you uplaoded.

The words displayed with strikethroughs will not be counted as a "word" in the wordlist. You can save the wordlist as a plain text file or an Excel file.

What you did is called lemmatization, grouping different inflectional forms to one single base form entry--the so called "lemma".
 
you can also use a stoplist, that is to say, put what you don't need in the stoplist and when you make a wordlist ,these worsds in the stoplists will not occur.
 
Actually, the attached picture shows that a stoplist had already been used. patricx just did not want to see them in the result.
 
yes, thanks Dr.xu and 刘语料! i really don't want to see those words with strikethroughs on the list and those words are also counted by wordlist.
 
The frequencies of those crossed out words are added up to those of the base form words.
 
Back
顶部